Allow loading remote contents and showing images to get the best out of this email.AI/ML Weekly Newsletter, Kala, a FAUN Newsletter
 
🔗 View in your browser   |  ✍️ Publish on FAUN   |  🦄 Become a sponsor
 
Allow loading remote contents and showing images to get the best out of this email.
Kala
 
Curated AI/ML news, tutorials, tools and more!
 
 
 
 

Amid the whirlwind of rapid AI advancements, from EKS MCP's breakneck speeds to Hugging Face's budget-friendly robots, it's clear: AI is not only changing the way we code but also the gadgets we tinker with. Meanwhile, savvy developers are fine-tuning the delicate dance between innovation and security for a future that promises both power and peril. Dive in and untangle the threads of progress.


🚀 Accelerating application development with the Amazon EKS MCP server

🔍 Agentic AI Manifest: A Schema to Describe What Agents Do

🔑 AI agents have access to key data across the enterprise

🧩 Architecting Gen AI-Powered Microservices: The Unwritten Playbook

🔢 A visual introduction to vector embeddings

🛡️ GitHub MCP Exploited: Accessing private repositories via MCP

🤖 Human coders are still better than LLMs

🧗 It's not your imagination: AI is speeding up the pace of change

🖲️ New AI innovations that are redefining the future

🕵️ Using AI to outsmart AI-driven phishing scams


Read. Innovate. Secure. The AI frontier awaits.


Have a great week!
FAUN Team
 
 
⭐ Patrons
 
bytevibe.co bytevibe.co
 
Hydrate. Debug. Repeat. — In Style. 🍺
 
 
Our frosted pint glass isn't just for drinks — it's a badge of your developer lifestyle. With a smooth matte finish, crystal-clear print, and sleek design, this 16oz (473 ml) glass is perfect for beer, cold brew, or whatever keeps you coding past midnight.

Dishwasher safe. BPA-free. Nerd approved.

Grab yours now and sip like a real coder. 🍻
 
 

👉 Spread the word and help developers find you by promoting your projects on FAUN. Get in touch for more information.

 
ℹ️ News, Updates & Announcements
 
techcrunch.com techcrunch.com
 
It’s not your imagination: AI is speeding up the pace of change
 
 

AI takes a victory lap: Mary Meeker reveals ChatGPT snagged 800 million users in a brisk 17 months. Meanwhile, the bean counters cheer as inference costs nosedived 99% in just two years. Profitability? That's still a cliffhanger.

 
 
azure.microsoft.com azure.microsoft.com
 
New AI innovations that are redefining the future for software companies
 
 

Azure AI Foundry gives developers the power to masterfully control AI agent workflows and streamline decision-making through a single API and SDK. Agentic DevOps elevates AI agents beyond mere coding assistants, morphing GitHub Copilot into a formidable dev partner eager to wrestle with code reviews and testing.

 
 
legitsecurity.com legitsecurity.com
 
Remote Prompt Injection in GitLab Duo Leads to Source Code Theft
 
 

GitLab Duo, riding on Anthropic’s Claude, stumbled into a prompt injection blunder. Sneaky instructions nestled in projects allowed hackers to swipe private data. The culprit? Streaming markdown teamed up with shoddy sanitization. This opened a door for HTML injection and shined a spotlight on the double-edged sword of AI assistants: useful but also a tad too exploitable. GitLab scrambled to patch these loopholes, but the episode serves a stark reminder: AI insights need a fortress against crafty tampering.

 
 
cloud.google.com cloud.google.com
 
Text-to-Malware: How Cybercriminals Weaponize Fake AI-Themed Websites
 
 

UNC6032 swindled millions by spinning a tangled web of fake "AI video generator" sites. They slipped Python-based infostealers right under our noses, using social media ads as their Trojan horses. Meta’s ad transparency pulled back the curtain on over 30 malicious sites, yet the sneaky STARKVEIL malware continues to quietly plunder data.

 
 
helpnetsecurity.com helpnetsecurity.com
 
AI agents have access to key data across the enterprise
 
 

82% of organizations have AI agents on deck; a mere 44% bother with security policies. That leaves a lot of open doors. A staggering 96% of tech pros are side-eyeing these agents as ticking time bombs, yet 98% plan to unleash more. It's like setting out catnip for hackers. These agents wield power with scant supervision. Only 52% of companies track their data frolics. IT teams typically get it (71%), but compliance (47%) and executives (34%) are in the slow lane.

 
 
invariantlabs.ai invariantlabs.ai
 
GitHub MCP Exploited: Accessing private repositories via MCP
 
 

Invariant played detective and unearthed a gaping hole in GitHub MCP. This flaw lets sneaky attackers hijack agents using malicious GitHub issues, spilling private repo secrets all over the public domain. Fortify your agent systems: clamp down on access and deploy Invariant Guardrails along with MCP-scan. Keep those threats at bay, lightning-speed.

 
 
arstechnica.com arstechnica.com
 
Want a humanoid, open source robot for just $3,000? Hugging Face is on it.
 
 

Hugging Face just pulled the curtain back on HopeJR, a humanoid robot that swings 66 degrees of freedom—at just $3,000. This price tag shames the $16,000 slapped on Unitree's G1. Together with The Robot Studio, they've created this robot with a dash of Bender's charisma. The kicker? It's fully open-source. They're on a mission to democratize robotics for the rest of us.

 
 
helpnetsecurity.com helpnetsecurity.com
 
Using AI to outsmart AI-driven phishing scams
 
 

Phishing scams are growing craftier, employing AI like FraudGPT to weave through filters and masquerade as real emails, boosting scam rates by 70%. AI can unveil sneaky phishing patterns humans miss, but it loves a good panic. It often cries wolf with false alarms and needs a babysitter to adjust to ever-shifting threats.

 
 
 
🐾 From FAUNers
 
faun.dev faun.dev
 
Agentic AI Manifest – A Schema to Describe What Agents Do
 
 

Meet agent-manifest.org. It's a fresh schema designed to crack open the mysteries of AI agents: roles, requirements, trust levels. Think of it as a decoder ring for the robots taking API jobs.

 
 

👉 Create your FAUN Page if it's not done yet and start sharing your blog posts, news, and tools on FAUN Developer Community, collect badges and more!
 

 
🔗 Stories, Tutorials & Articles
 
aws.amazon.com aws.amazon.com
 
Accelerating application development with the Amazon EKS MCP server
 
 

The EKS MCP server hands AI code assistants, like Q Developer CLI, the keys to a streamlined Kubernetes kingdom. App development? Now lightning fast. With LLMs tapping into real-time context, AI flexes its muscles in the wild world of Kubernetes ops and troubleshooting.

 
 
blog.pocok.dev blog.pocok.dev
 
Building MCP Servers Like a Pro (With a Little Help from yfinance and LLMs)
 
 
Hook LLMs to real-time stock data with MCP + yfinance—see how to build, test, and deploy smarter with help from LLMs.
 
 
pmbanugo.me pmbanugo.me
 
Peer Programming with LLMs, For Senior+ Engineers
 
 

LLMs—the mysterious, fickle companions of coding. Senior engineers wade through it, extracting gold with tricks like "Second opinion" and "Throwaway debugging." Seth Godin rings the alarm: these clever machines aren't as clever as they look. First ask Claude, then call in a human.

 
 
antirez.com antirez.com
 
Human coders are still better than LLMs
 
 

Antirez recounted a story of working on Vector Sets for Redis, detailing a bug he encountered and his process of finding a solution through a creative approach involving LLM. He explored different methods to ensure link reciprocity and proposed a hashing solution that offered a balance between efficiency and protection against possible collisions. Overall, the experience highlighted the unique problem-solving capabilities of humans compared to LLMs.

 
 
techcommunity.microsoft.com techcommunity.microsoft.com
 
A visual introduction to vector embeddings   ✅
 
 

OpenAI's text-embedding-ada-002 often gets a peculiar itch at dimension 196—vectors peaking awkwardly there. Enter text-embedding-3-small, swooping in to smooth out the distribution. Now, onto similarity metrics. For unit vectors, the dot product is your fast friend. It's interchangeable with cosine similarity, minus the extra math homework. Vector compression can slim things down with quantization and dimension reduction, but watch out—it might cut corners. Innovative tactics for storage and search can clean up the mess.

 
 
turso.tech turso.tech
 
We rewrote large parts of our API in Go using AI: we are now ready to handle one billion databases
 
 

Turso overhauled its API with Go and AI, gunning for 1 billion databases. Think big, act smart. They squeezed every byte by adopting string interning. No more in-memory maps—they swapped them for a SQLite-backed LRU cache. The result? Leaner memory usage and hassle-free proxy bootstrapping.

 
 
modal.com modal.com
 
Linear Programming for Fun and Profit
 
 
Modal’s "resource solver" hacks cloud volatility. It taps into the simplex algorithmto snag cheap GPUs. Scale-ups? Lightning-fast. Savings? In the millions.
 
 
towardsdatascience.com towardsdatascience.com
 
Gaining Strategic Clarity in AI
 
 

AI Opportunity Tree welds cutting-edge tech to raw business value. Meanwhile, the AI System Blueprint knits tech tightly to stakeholder priorities. Lean models? They fuse teams, squash doubt, and thrust AI into action with exhilarating speed.

 
 
theregister.com theregister.com
 
Perplexity offers training wheels for building AI agents
 
 

Perplexity Labs is your quick-draw tool for crafting apps and digital delights, powered by LLMs like GPT-4 Omni. It’s a star where others stumble: fast, project-driven tasks. Expect example-heavy insights and real-world project demos. While competitors dawdle, it delivers. Need deep web browsing, code execution, and inventive results? Just dive into its user-friendly gallery of 20+ samples. You might not leave.

 
 
techcommunity.microsoft.com techcommunity.microsoft.com
 
From Zero to Hero: Build your first voice agent with Voice Live API
 
 

The Voice Live API ditches the clutter of juggling models. One API call, and voilà—real-time, natural-sounding bots. It’s harnessed over WebSocket, keeping everything sharp and efficient.

 
 
towardsdatascience.com towardsdatascience.com
 
LLM Optimization: LoRA and QLoRA
 
 

Learn how LoRA and QLoRA make it possible to fine-tune huge language models on modest hardware. Discover the adapter approach for scaling LLMs to new tasks—and why quantization is the next step in efficient model training.

 
 
infoworld.com infoworld.com
 
AI didn’t kill Stack Overflow
 
 

Stack Overflow once buzzed with collective brainpower. But then, it got too wrapped up in reputation points, a full-on leaderboard obsession. This detour dimmed its shine. It turns out, platforms flourish on real teamwork, not just gamified dick measuring contests. As AI sweeps through the coding world, developers are hungry for real connections. Let's face it—tech's true magic stems from humans, not soulless algorithms.

 
 
hackernoon.com hackernoon.com
 
LLMOps: DevOps Strategies for Deploying Large Language Models in Production
 
 

LLMOps shakes up the MLOps scene with tailor-made Kubernetes magic. It wrestles GPU scheduling, caching, and autoscaling for those behemoth LLM deployments. Keep an eye out for serverless endpoints and model meshes—smooth scaling and a wallet-friendly operation.

 
 
medium.com medium.com
 
Architecting Gen AI-Powered Microservices: The Unwritten Playbook
 
 

Plugging Gen AI into microservices isn't just a task. It's an adventure in tech wizardry. Get cozy with messaging queues, prompt caching, and the relentless art of watching in production.

 
 
medium.com medium.com
 
Why GCP Load Balancers Struggle with Stateful LLM Traffic — and How to Fix It
 
 

Deploying LLMs on GCP Load Balancers is like fitting a square peg in a round hole. These models aren't stateless, so skip HTTP, go straight for TCP Load Balancing. Toss in Redis to keep those sessions on a leash. Tweak load balancer settings to dodge mid-stream socket calamities. Embrace the power of GKE Autopilot or Compute Engine to boost streaming.

 
 
 
⚙️ Tools, Apps & Software
 
github.com github.com
 
stacklok/toolhive
 
 

ToolHive makes deploying MCP servers easy, secure and fun

 
 
github.com github.com
 
lark-parser/lark
 
 

Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.

 
 
github.com github.com
 
nccgroup/http-mcp-bridge
 
 

This project implements an HTTP server that acts as a bridge between HTTP/1.1 requests and Server-Sent Events (SSE) using the mcp python library

 
 
github.com github.com
 
chatmcp/mcpso
 
 

Directory for Awesome MCP Servers

 
 

👉 Spread the word and help developers find and follow your Open Source project by promoting it on FAUN. Get in touch for more information.

 
🤔 Did you know?
 
 
Did you know that GitHub deploys to production dozens of times per day using a sophisticated system they call ChatOps? Engineers initiate deployments directly from a chat interface, which integrates tools with their CI/CD pipelines. This rapid deployment frequency allows them to experiment continuously and roll back changes almost instantaneously if an issue arises. ChatOps also democratizes the deployment process, making it just as easy to deploy code as it is to send a message in chat.
 
 
😂 Meme of the week
 
 
 
 
🗣️ Quote of the week
 
 
"A well-written function is like a good joke—if you have to explain it, it’s not working."
— Sensei


(*) Sensei is a work-in-progress AI agent built by FAUN
 
 
❤️ Thanks for reading
 
 
👋 Keep in touch and follow us on social media:
- 💼LinkedIn
- 📝Medium
- 🐦Twitter
- 👥Facebook
- 📰Reddit
- 📸Instagram

👌 Was this newsletter helpful?
We'd really appreciate it if you could forward it to your friends!

🙏 Never miss an issue!
To receive our future emails in your inbox, don't forget to add community@faun.dev to your contacts.

🤩 Want to sponsor our newsletter?
Reach out to us at sponsors@faun.dev and we'll get back to you as soon as possible.
 

Kala #480: HuggingFace $3000 Humanoid, Text-to-Malware & DevOps Strategies for Deploying LLMs
Legend: ✅ = Editor's Choice / ♻️ = Old but Gold / ⭐ = Promoted / 🔰 = Beginner Friendly

You received this email because you are subscribed to FAUN.
We (🐾) help developers (👣) learn and grow by keeping them up with what matters.

You can manage your subscription options here (recommended) or use the old way here (legacy). If you have any problem, read this or reply to this email.