Allow loading remote contents and showing images to get the best out of this email.FAUN.dev's AI/ML Weekly Newsletter
 
🔗 View in your browser   |  ✍️ Publish on FAUN.dev   |  🦄 Become a sponsor
 
Allow loading remote contents and showing images to get the best out of this email.
 
AILinks
 
This week in Generative AI/ML, with Kala the Koala
 
 
🔍 Inside this Issue
 
 
Open-weights just embarrassed the frontier labs, and the rest of the ecosystem is scrambling to make agents less of a science project and more of a product. The throughline across these links: observability, guardrails, and a few uncomfortable truths about how we are shipping AI right now.

🏆 An open-weights Chinese model just beat Claude, GPT-5.5, and Gemini in a programming challenge
🧭 Introducing the Agent Readiness score. Check to see if your site is agent-ready
📈 Monitoring LLM behavior: Drift, retries, and refusal patterns
🧩 Multi-Agent System Reliability
🧰 The AI engineering stack we built internally - on the platform we ship
🧱 Ubuntu's Next Chapter: Local AI, Confined Agents, and a bet against the cloud-first OS

Ship less magic, more proof.

Have a great week!
FAUN.dev() Team
 
 
⭐ Patrons
 
iacconf.com iacconf.com
 
It’s 2026. Platform engineering has a new user: AI agents. Is your team ready?
 
 
Join IaCConf 2026 to learn how top IaC leaders from companies like Google, Sanofi, and AHEAD are solving the hardest problems in cloud provisioning, state management, and platform engineering.

May 14. Free to attend. Register now.
 
 
eventbrite.co.uk eventbrite.co.uk
 
🚀 Join the AI-Powered Platform Engineering – Cohort 2 by Packt!
 
 
Learn how to build intelligent, scalable platforms with AI — from self-service developer portals to AI-driven observability. This hands-on cohort equips engineers, SREs, and tech leaders with real-world frameworks to design smarter platforms and boost developer productivity

Register here.
 
 
👉 Spread the word and help developers find you by promoting your projects on FAUN. Get in touch for more information.
 
ℹ️ News, Updates & Announcements
 
faun.dev faun.dev
 
Ubuntu's Next Chapter: Local AI, Confined Agents, and a Bet Against the Cloud-First OS
 
 
Ubuntu makes local inference a native capability with inference snaps. It delivers silicon-optimized model bits under snap confinement.

It ships implicit AI: first-class speech-to-text/TTS and opt-in agentic workflows for desktop and server automation.

Canonical favors open-weight models, exposes read-only analysis, enforces scoped action permissions, records full audit trails, and builds silicon partnerships.
 
 
👉 Enjoyed this?Read more news on FAUN.dev/news
 
⭐ Sponsors
 
eventbrite.co.uk eventbrite.co.uk
 
Are Your APIs Ready for AI Agents? A Hands-on Workshop on May 23rd
 
 
Are Your APIs Ready for AI Agents? A Hands-on Workshop on May 23rd

AI agents are beginning to autonomously call APIs, chain services, and create integrations that most platforms were never designed to handle. This hands-on masterclass on Designing AI-ready APIs helps architects and developers build governed, predictable API ecosystems using OpenAPI, Overlay, and Arazzo.

Learn how to add guardrails, improve discoverability, and safely evolve existing APIs for automated consumption.

FAUN.dev readers get an exclusive 40% discount using code FAUN40.
 
 
faun.dev faun.dev
 
Cloud Native CI/CD with GitLab: From Commit to Production Ready
 
 
Cloud Native CI/CD with GitLab: From Commit to Production Ready is a complete, hands-on path to becoming the person on your team who actually understands GitLab CI/CD, not just the YAML, but the architecture underneath.

You'll start with the fundamentals: jobs, stages, the container registry, your first working pipeline; and build up to the parts most engineers learn the hard way in production: reusable definitions with extends and includes, DAGs for non-sequential execution, artifact strategies, conditional logic and workflow rules, parallelism and matrix builds, runner and executor internals, and cloud-native runners on Kubernetes - with caching, autoscaling, and observability wired in. The final chapters walk through multi-stage continuous deployment with HelmandKubernetes end to end.

23 chapters. Hands-on throughout. Designed so beginners can follow it linearly and experienced engineers can jump straight to the chapter they need.

Written by Aymen El Amri - founder of FAUN.dev(), author of multiple cloud-native engineering books, and trainer to thousands of DevOps and platform engineers worldwide.

→ Explore the course
 
 
👉 Spread the word and help developers find you by promoting your projects on FAUN. Get in touch for more information.
 
🔗 Stories, Tutorials & Articles
 
venturebeat.com venturebeat.com
 
Monitoring LLM behavior: Drift, retries, and refusal patterns
 
 
Traditional software is predictable due to determinism, while generative AI is unpredictable. Engineers need a new infrastructure layer, the AI Evaluation Stack, to ship enterprise-ready AI products. The stack includes deterministic assertions and model-based assertions to ensure structural integrity and semantic quality.
 
 
blog.cloudflare.com blog.cloudflare.com
 
The AI engineering stack we built internally - on the platform we ship
 
 
Cloudflare wired AI into the engineering stack. LLM traffic funnels through a proxy Worker and AI Gateway. It shipped Workers AI and the Agents SDK. Daily users hit 3,683 (93% R&D). MR throughput climbed to ~10,952/week. Workers AI handled 51B input tokens and cut a security agent's inference spend by 77%.
 
 
blog.alexewerlof.com blog.alexewerlof.com
 
Multi-Agent System Reliability
 
 
LLMs are unreliable out of the box, but multi-agent systems can improve by dividing work among specialized agents. Building robust systems involves leveraging human system patterns like hierarchy, consensus, adversarial debate, and knock-out in a multi-agent architecture to ensure correctness and reliability. To combat LLMs' stochastic nature, utilize multiple models in parallel to cancel out noise and improve accuracy. It's crucial to treat LLMs as unreliable components in a distributed system, emphasizing constraint, verification, pruning, and challenges over anthropomorphizing them.
 
 
blog.cloudflare.com blog.cloudflare.com
 
Introducing the Agent Readiness score. Check to see if your site is agent-ready
 
 
Cloudflare launched IsItAgentReady. It scans 200k domains, scores agent readiness, publishes weekly adoption charts, and exposes results via an API.

It checks robots.txt, llms.txt, content negotiation via Accept: text/markdown, API Catalog, .well-known/mcp.json, OAuth discovery, and x402 payments.

Cloudflare overhauled docs to serve Markdown endpoints. It publishes an Agent Skills index and runs a stateless MCP server for programmatic agent access.
 
 
thinkpol.ca thinkpol.ca
 
An open-weights Chinese model just beat Claude, GPT-5.5, and Gemini in a programming challenge
 
 
The AI Coding Contest Day 12 matched ten models on a sliding‑letter puzzle. Open‑weights Kimi K2.6 took first: 22 match points (7‑1‑0). MiMo V2‑Pro scored second by blasting claims for intact ≥7‑letter seeds (43 points). GPT‑5.5 and Claude Opus 4.7 landed third and fifth.

Grids ran 10×10→30×30. Heavy scrambling made active sliding the deciding move. Static scanners and brittle claimers like Muse crashed or tanked.

System shift: an open‑weights Kimi K2.6 besting frontier models changes who can run near‑frontier inference locally and forces teams to rethink deployment.
 
 

👉 Got something to share? Create your FAUN Page and start publishing your blog posts, tools, and updates. Grow your audience, and get discovered by the developer community.

 
⚙️ Tools, Apps & Software
 
github.com github.com
 
aattaran/deepclaude
 
 
Use Claude Code's autonomous agent loop with DeepSeek V4 Pro, OpenRouter, or any Anthropic-compatible backend. Same UX, 17x cheaper.
 
 
github.com github.com
 
TauricResearch/TradingAgents
 
 
Multi-Agents LLM Financial Trading Framework
 
 
github.com github.com
 
serverless-dna/tendril
 
 
A self-extending agentic sandbox that demonstrates the Agent Capability pattern — where the model discovers, builds, and reuses tools autonomously across sessions.
 
 
github.com github.com
 
k2-fsa/OmniVoice
 
 
High-Quality Voice Cloning TTS for 600+ Languages
 
 
github.com github.com
 
VoltAgent/awesome-agent-skills
 
 
A curated collection of 1000+ agent skills from official dev teams and the community, compatible with Claude Code, Codex, Gemini CLI, Cursor, and more.
 
 

👉 Spread the word and help developers find and follow your Open Source project by promoting it on FAUN. Get in touch for more information.

 
🤔 Did you know?
 
 
Did you know that TFX has a component called ExampleValidator that uses TensorFlow Data Validation (TFDV) to catch training serving skew, schema violations, and distribution drift before a model ever trains on bad data? It compares incoming statistics against a fixed schema, so new categorical values, missing features, or shifted distributions trigger an anomaly and halt the pipeline. The catch is that this only works if you treat the schema as a real contract, the same way you would treat an API contract, because a single upstream change can silently turn a feature into "mostly null" while every unit test still passes.
 
 
🤖 Once, SenseiOne Said
 
 
"Training is the easy part because it's the only part you can rerun. The hard part is admitting your model is just a dependency with unknown behavior and a moving API called data. MLOps starts when you stop treating accuracy like a deliverable."
— SenseiOne
 

(*) SenseiOne is FAUN.dev’s work-in-progress AI agent

 
⚡Growth Notes
 
 
You're shipping prompts, wiring tools, and watching demos work, but you haven't written an eval that would fail on a regression you actually care about in months. The busyness masks the fact that your only feedback loop is vibes plus the loudest stakeholder in the room, which means your sense of "better" is drifting with whoever spoke last.
 
Each week, we share a practical move to grow faster and work smarter
 
😂 Meme of the week
 
 
 
 
❤️ Thanks for reading
 
 
👋 Keep in touch and follow us on social media:
- 💼LinkedIn
- 📝Medium
- 🐦Twitter
- 👥Facebook
- 📰Reddit
- 📸Instagram

👌 Was this newsletter helpful?
We'd really appreciate it if you could forward it to your friends!

🙏 Never miss an issue!
To receive our future emails in your inbox, don't forget to add community@faun.dev to your contacts.

🤩 Want to sponsor our newsletter?
Reach out to us at sponsors@faun.dev and we'll get back to you as soon as possible.
 

AILinks #527: Multi-Agent System Reliability
Legend: ✅ = Editor's Choice / ♻️ = Old but Gold / ⭐ = Promoted / 🔰 = Beginner Friendly

You received this email because you are subscribed to FAUN.dev.
We (🐾) help developers (👣) learn and grow by keeping them up with what matters.

You can manage your subscription options here (recommended) or use the old way here (legacy). If you have any problem, read this or reply to this email.