Allow loading remote contents and showing images to get the best out of this email.FAUN.dev's AI/ML Weekly Newsletter
 
🔗 View in your browser   |  ✍️ Publish on FAUN.dev   |  🦄 Become a sponsor
 
Allow loading remote contents and showing images to get the best out of this email.
Kala
 
#ArtificialIntelligence #MachineLearning #MLOps
 
 
📝 The Opening Call
 
 
In the past days:
  • A security breach in Red Hat's consulting GitLab instance led to the theft of 570GB of data.
  • Anthropic launched Petri, a new open-source tool for AI safety audits.
  • Microsoft released an open-source agent framework for AI.
  • GitHub introduced post-quantum secure SSH.
  • Azure introduced AKS Automatic, a new way to manage Kubernetes clusters.
  • Perplexity rolled out its new AI browser to everyone.
  • Alpine Linux shifted to a /usr-merged file system.
  • And more!
Most news outlets wrote long articles about it - paragraphs upon paragraphs of text that take time to read and understand. We took a different approach:
Instead of walls of text, we show you the news as an AI-powered visual, a practical story map that highlights:
  • The core facts in seconds
  • How the players connect (people, tools, orgs)
  • The timeline of what happened and when
  • The key numbers that actually matter
  • And more
All digested in minutes, not hours. We believe this is a smarter way to follow developer news. You can see some examples here https://faun.dev/news

This is the second week since the official launch of FAUN.news()! It's a new project and if you want to show your support, we'll be glad to hear your honest feedback! Hit the reply button!
 
 
🔍 Inside this Issue
 
 
Chat is turning into the runtime, agents are muscling past RAG, and production bugs are quietly rewriting the rules on model quality. I pulled the sharp threads—UI pilots, open safety audits, modular agent stacks—so you can dig into the details below and ship with fewer surprises.

🧪 Anthropic Launches Petri: Open-Source Tool for AI Safety Audits
🐛 Anthropic unveils three infrastructure bugs behind Claude's performance issues
🧩 ChatGPT Launches Interactive Apps with New Apps SDK Preview
🤖 Google DeepMind Launches Gemini 2.5 Model for Enhanced API Performance
🛠️ OpenAI Launches AgentKit: Streamline Agent Development for Enterprises
🏗️ Technical Tuesday: 10 best practices for building reliable AI agents in 2025
🪦 The RAG Obituary: Killed by Agents, Buried by Context Windows
⚡ Write Deep Learning Code Locally and Run on GPUs Instantly

You’ve got the patterns - turn them into leverage.

Have a great week!
FAUN.dev Team
 
 
ℹ️ News, Updates & Announcements
 
faun.dev faun.dev
 
Google DeepMind Launches Gemini 2.5 Model for Enhanced API Performance
 
 
Google DeepMind just dropped Gemini 2.5 Computer Use, now in public preview via API. It's built to control web and mobile UIs with scary precision.

Feed it a request, a screenshot, and some action history. It churns out the right function calls - clicks, typing, navigation - fast and tight.

System shift: LLMs aren’t just chatting anymore. This moves them into full-on UI pilots. Iterative, autonomous steps. Agents are coming.
 
 
faun.dev faun.dev
 
Anthropic unveils three infrastructure bugs behind Claude's performance issues
 
 
Anthropic dug into three gnarly production bugs that were quietly messing with Claude’s outputs. Culprits: broken context routing, bad TPU API configs, and a flaky TPU compiler. Fixes? Rewired the routing logic, rolled back some changes, and teamed up with the XLA:TPU crew. They’ve now beefed up on-prod evals and debugging.

System shift: AI infra teams can’t just ship and hope. Model fidelity now hinges on stress-testing across messy hardware stacks - TPUs, GPUs, Trainium, whatever’s in the mix.
 
 
faun.dev faun.dev
 
Anthropic Launches Petri: Open-Source Tool for AI Safety Audits
 
 
Anthropic dropped Petri, an open-source tool that stress-tests LLMs for bad behavior. It uses autonomous agents and judge models to sniff out risky outputs - no human babysitting required.

Turns out, even models from OpenAI, Google, xAI, and Anthropic itself slip up more than you'd expect.
 
 
faun.dev faun.dev
 
ChatGPT Launches Interactive Apps with New Apps SDK Preview
 
 
OpenAI just dropped a preview of the Apps SDK for ChatGPT. It lets devs embed interactive apps straight into ChatGPT convos. Think custom logic, custom UI, backend hooks - fully in-chat.

It runs on the open Model Context Protocol (MCP) and supports activation by name or the model’s own vibe check: context-based suggestions.

System shift: Chat’s no longer just the UI. It’s the app. The SDK makes chat-native workflows feel more like building with Lego and less like shoehorning UX into prompts.
 
 
faun.dev faun.dev
 
OpenAI Launches AgentKit: Streamline Agent Development for Enterprises
 
 
OpenAI dropped AgentKit, a full-stack toolkit for building, shipping, and fine-tuning AI agents. It expands the earlier Agents SDK and Responses API - now with much sharper edges.

New toys: build apps inside ChatGPT itself, test things out with the Agent Builder (still in beta), and tap into the Connector Registry or ChatKit for smoother integration and evaluation.

What changed: OpenAI’s platform just went modular. Agent workflows aren’t one long pipeline anymore - they’re a grid of snap-together pieces.
 
 
👉 Enjoyed this?Read more news on FAUN.dev/news
 
🔗 Stories, Tutorials & Articles
 
nicolasbustamante.com nicolasbustamante.com
 
The RAG Obituary: Killed by Agents, Buried by Context Windows
 
 
Agent-based setups are starting to edge out old-school RAG. As LLMs snag multi-million-token context windows and better task chops, the need for chunking, embeddings, and reranking starts to fade. Claude Code, for example, skips all that - with direct file access and smart navigation instead. Retrieval isn't dead, but it's morphing into something far more agentic.

Bigger picture: Bigger windows and sharper attention mean LLMs can now process whole documents and run tasks directly - no more stitching together fragments just to get work done.
 
 
openpipe.ai openpipe.ai
 
Serverless RL: Faster, Cheaper and More Flexible RL Training
 
 
New product, Serverless RL, available through collaboration between CoreWeave, Weights & Biases, and OpenPipe. Offers fast training, lower costs, and simple model deployment. Saves time with no infra setup, faster feedback loops, and easier entry into RL training.
 
 
aiengineering.academy aiengineering.academy
 
Write Deep Learning Code Locally and Run on GPUs Instantly
 
 
Modal cuts the drama out of deep learning ops. Devs write Python like usual, then fire off training, eval, and serving scripts to serverless GPUs - zero cluster wrangling. It handles data blobs, image builds, and orchestration. You focus on tuning with libraries like Unsloth, or serving via vLLM.
 
 
uipath.com uipath.com
 
Technical Tuesday: 10 best practices for building reliable AI agents in 2025
 
 
UiPath just dropped Agent Builder in Studio - a legit development environment for AI agents that can actually handle enterprise chaos. Think production-grade: modular builds, traceable steps, and failure handling that doesn’t flake under pressure.

It’s wired for schema-driven prompts, tool versioning, and DeepRAG to lock in relevant context. Model-agnostic deployment? Yep. Plus sharp evals and trace logs to keep things safe, sane, and audit-friendly.
 
 

👉 Got something to share? Create your FAUN Page and start publishing your blog posts, tools, and updates. Grow your audience, and get discovered by the developer community.

 
⚙️ Tools, Apps & Software
 
github.com github.com
 
BeaconBay/ck
 
 
Local first semantic and hybrid BM25 grep / search tool for use by AI and humans!
 
 
github.com github.com
 
Agentity-com/mcp-audit-extension
 
 
Audit and log all Copilot MCP tool calls in VSCode with ease.
 
 
github.com github.com
 
Shubhamsaboo/awesome-llm-apps
 
 
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
 
 
github.com github.com
 
openai/codex
 
 
Lightweight coding agent that runs in your terminal
 
 

👉 Spread the word and help developers find and follow your Open Source project by promoting it on FAUN. Get in touch for more information.

 
🤔 Did you know?
 
 
Did you know vLLM can deliver up to ~24× more throughput than Hugging Face’s Transformers by rethinking KV cache as a paged memory system? It splits keys and values into fixed blocks and remaps them per request, enabling efficient continuous batching across variable lengths without heavy copying. Much of the performance boost comes from better memory and scheduling management, not from exotic attention math.
 
 
😂 Meme of the week
 
 
 
 
🤖 Once, SenseiOne Said
 
 
"AI optimizes a loss; the business pays the losses you didn't measure. As models get smarter, failures get rarer and more expensive. MLOps is the cost of admitting your metric is a proxy."
— SenseiOne
 

(*) SenseiOne is FAUN.dev’s work-in-progress AI agent

 
👤 This Week's Human
 
 
This week, we’re highlighting Jukka ForsgrenSenior Solutions Architect and GenAI Ambassador at AWS, serving as a field CTO for EMEA North accounts totaling over €100M ARR. He blends deep Kubernetes and agentic AI chops with hands-on delivery—building CDK/Python POCs, running Immersion Days/Game Days/Well-Architected sessions, and channeling customer feedback to service teams. 11× AWS Certified, RHCA, and CKA/CKAD, he’s the engineer you’ve seen at Slush and Red Hat Open Tour turning cloud strategy into shipped systems.
 

💡 Engage with FAUN.dev on LinkedIn — like, comment on, or share any of our posts on LinkedIn — you might be our next “This Week’s Human”!

 
❤️ Thanks for reading
 
 
👋 Keep in touch and follow us on social media:
- 💼LinkedIn
- 📝Medium
- 🐦Twitter
- 👥Facebook
- 📰Reddit
- 📸Instagram

👌 Was this newsletter helpful?
We'd really appreciate it if you could forward it to your friends!

🙏 Never miss an issue!
To receive our future emails in your inbox, don't forget to add community@faun.dev to your contacts.

🤩 Want to sponsor our newsletter?
Reach out to us at sponsors@faun.dev and we'll get back to you as soon as possible.
 

Kala #498: The New OpenAI Apps SDK - Building Apps Inside ChatGPT
Legend: ✅ = Editor's Choice / ♻️ = Old but Gold / ⭐ = Promoted / 🔰 = Beginner Friendly

You received this email because you are subscribed to FAUN.dev.
We (🐾) help developers (👣) learn and grow by keeping them up with what matters.

You can manage your subscription options here (recommended) or use the old way here (legacy). If you have any problem, read this or reply to this email.