FAUN.dev's AI/ML Weekly Newsletter

🔗 View in your browser | ✍️ Publish on FAUN.dev | 🦄 Become a sponsor

Kala

#ArtificialIntelligence #MachineLearning #MLOps

📝 The Opening Call

In the past days:

A security breach in Red Hat's consulting GitLab instance led to the theft of 570GB of data.
Anthropic launched Petri, a new open-source tool for AI safety audits.
Microsoft released an open-source agent framework for AI.
GitHub introduced post-quantum secure SSH.
Azure introduced AKS Automatic, a new way to manage Kubernetes clusters.
Perplexity rolled out its new AI browser to everyone.
Alpine Linux shifted to a /usr-merged file system.
And more!

Most news outlets wrote long articles about it - paragraphs upon paragraphs of text that take time to read and understand. We took a different approach:
Instead of walls of text, we show you the news as an AI-powered visual, a practical story map that highlights:

The core facts in seconds
How the players connect (people, tools, orgs)
The timeline of what happened and when
The key numbers that actually matter
And more

All digested in minutes, not hours. We believe this is a smarter way to follow developer news. You can see some examples here https://faun.dev/news

This is the second week since the official launch of FAUN.news()! It's a new project and if you want to show your support, we'll be glad to hear your honest feedback! Hit the reply button!

🔍 Inside this Issue

Chat is turning into the runtime, agents are muscling past RAG, and production bugs are quietly rewriting the rules on model quality. I pulled the sharp threads—UI pilots, open safety audits, modular agent stacks—so you can dig into the details below and ship with fewer surprises.

🧪 Anthropic Launches Petri: Open-Source Tool for AI Safety Audits
🐛 Anthropic unveils three infrastructure bugs behind Claude's performance issues
🧩 ChatGPT Launches Interactive Apps with New Apps SDK Preview
🤖 Google DeepMind Launches Gemini 2.5 Model for Enhanced API Performance
🛠️ OpenAI Launches AgentKit: Streamline Agent Development for Enterprises
🏗️ Technical Tuesday: 10 best practices for building reliable AI agents in 2025
🪦 The RAG Obituary: Killed by Agents, Buried by Context Windows
⚡ Write Deep Learning Code Locally and Run on GPUs Instantly

You’ve got the patterns - turn them into leverage.

Have a great week!
FAUN.dev Team

ℹ️ News, Updates & Announcements

faun.dev

Google DeepMind Launches Gemini 2.5 Model for Enhanced API Performance

Google DeepMind just dropped Gemini 2.5 Computer Use, now in public preview via API. It's built to control web and mobile UIs with scary precision.

Feed it a request, a screenshot, and some action history. It churns out the right function calls - clicks, typing, navigation - fast and tight.

System shift: LLMs aren’t just chatting anymore. This moves them into full-on UI pilots. Iterative, autonomous steps. Agents are coming.

faun.dev

Anthropic unveils three infrastructure bugs behind Claude's performance issues

Anthropic dug into three gnarly production bugs that were quietly messing with Claude’s outputs. Culprits: broken context routing, bad TPU API configs, and a flaky TPU compiler. Fixes? Rewired the routing logic, rolled back some changes, and teamed up with the XLA:TPU crew. They’ve now beefed up on-prod evals and debugging.

System shift: AI infra teams can’t just ship and hope. Model fidelity now hinges on stress-testing across messy hardware stacks - TPUs, GPUs, Trainium, whatever’s in the mix.

faun.dev

Anthropic Launches Petri: Open-Source Tool for AI Safety Audits

Anthropic dropped Petri, an open-source tool that stress-tests LLMs for bad behavior. It uses autonomous agents and judge models to sniff out risky outputs - no human babysitting required.

Turns out, even models from OpenAI, Google, xAI, and Anthropic itself slip up more than you'd expect.

faun.dev

ChatGPT Launches Interactive Apps with New Apps SDK Preview

OpenAI just dropped a preview of the Apps SDK for ChatGPT. It lets devs embed interactive apps straight into ChatGPT convos. Think custom logic, custom UI, backend hooks - fully in-chat.

It runs on the open Model Context Protocol (MCP) and supports activation by name or the model’s own vibe check: context-based suggestions.

System shift: Chat’s no longer just the UI. It’s the app. The SDK makes chat-native workflows feel more like building with Lego and less like shoehorning UX into prompts.

faun.dev

OpenAI Launches AgentKit: Streamline Agent Development for Enterprises

OpenAI dropped AgentKit, a full-stack toolkit for building, shipping, and fine-tuning AI agents. It expands the earlier Agents SDK and Responses API - now with much sharper edges.

New toys: build apps inside ChatGPT itself, test things out with the Agent Builder (still in beta), and tap into the Connector Registry or ChatKit for smoother integration and evaluation.

What changed: OpenAI’s platform just went modular. Agent workflows aren’t one long pipeline anymore - they’re a grid of snap-together pieces.

👉 Enjoyed this?Read more news on FAUN.dev/news

🔗 Stories, Tutorials & Articles

nicolasbustamante.com

The RAG Obituary: Killed by Agents, Buried by Context Windows

Agent-based setups are starting to edge out old-school RAG. As LLMs snag multi-million-token context windows and better task chops, the need for chunking, embeddings, and reranking starts to fade. Claude Code, for example, skips all that - with direct file access and smart navigation instead. Retrieval isn't dead, but it's morphing into something far more agentic.

Bigger picture: Bigger windows and sharper attention mean LLMs can now process whole documents and run tasks directly - no more stitching together fragments just to get work done.

openpipe.ai

Serverless RL: Faster, Cheaper and More Flexible RL Training

New product, Serverless RL, available through collaboration between CoreWeave, Weights & Biases, and OpenPipe. Offers fast training, lower costs, and simple model deployment. Saves time with no infra setup, faster feedback loops, and easier entry into RL training.

aiengineering.academy

Write Deep Learning Code Locally and Run on GPUs Instantly

Modal cuts the drama out of deep learning ops. Devs write Python like usual, then fire off training, eval, and serving scripts to serverless GPUs - zero cluster wrangling. It handles data blobs, image builds, and orchestration. You focus on tuning with libraries like Unsloth, or serving via vLLM.

uipath.com

Technical Tuesday: 10 best practices for building reliable AI agents in 2025

UiPath just dropped Agent Builder in Studio - a legit development environment for AI agents that can actually handle enterprise chaos. Think production-grade: modular builds, traceable steps, and failure handling that doesn’t flake under pressure.

It’s wired for schema-driven prompts, tool versioning, and DeepRAG to lock in relevant context. Model-agnostic deployment? Yep. Plus sharp evals and trace logs to keep things safe, sane, and audit-friendly.

👉 Got something to share? Create your FAUN Page and start publishing your blog posts, tools, and updates. Grow your audience, and get discovered by the developer community.

⚙️ Tools, Apps & Software

github.com

BeaconBay/ck

Local first semantic and hybrid BM25 grep / search tool for use by AI and humans!

github.com

Agentity-com/mcp-audit-extension

Audit and log all Copilot MCP tool calls in VSCode with ease.

github.com

Shubhamsaboo/awesome-llm-apps

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

github.com

openai/codex

Lightweight coding agent that runs in your terminal

👉 Spread the word and help developers find and follow your Open Source project by promoting it on FAUN. Get in touch for more information.

🤔 Did you know?

Did you know vLLM can deliver up to ~24× more throughput than Hugging Face’s Transformers by rethinking KV cache as a paged memory system? It splits keys and values into fixed blocks and remaps them per request, enabling efficient continuous batching across variable lengths without heavy copying. Much of the performance boost comes from better memory and scheduling management, not from exotic attention math.

😂 Meme of the week

🤖 Once, SenseiOne Said

"AI optimizes a loss; the business pays the losses you didn't measure. As models get smarter, failures get rarer and more expensive. MLOps is the cost of admitting your metric is a proxy."
— SenseiOne

(*) SenseiOne is FAUN.dev’s work-in-progress AI agent

👤 This Week's Human

This week, we’re highlighting Jukka Forsgren — Senior Solutions Architect and GenAI Ambassador at AWS, serving as a field CTO for EMEA North accounts totaling over €100M ARR. He blends deep Kubernetes and agentic AI chops with hands-on delivery—building CDK/Python POCs, running Immersion Days/Game Days/Well-Architected sessions, and channeling customer feedback to service teams. 11× AWS Certified, RHCA, and CKA/CKAD, he’s the engineer you’ve seen at Slush and Red Hat Open Tour turning cloud strategy into shipped systems.

💡 Engage with FAUN.dev on LinkedIn — like, comment on, or share any of our posts on LinkedIn — you might be our next “This Week’s Human”!

👉 Never miss an issue
Join FAUN.dev and subscribe to our newsletter here.

👋 Keep in touch and follow us on social media:
- 💼LinkedIn
- 📝Medium
- 🐦Twitter
- 👥Facebook
- 📰Reddit
- 📸Instagram

👌 Was this newsletter helpful?
We'd really appreciate it if you could share it with your friends! You can also donate to help us keep this newsletter going.

ℹ️ Have a question or feedback?
Feel free to reach out to us at community@faun.dev. We'd love to hear from you!

🤩 Want to sponsor our newsletter?
Reach out to us at sponsors@faun.dev and we'll get back to you as soon as possible.