Allow loading remote contents and showing images to get the best out of this email.
FAUN.dev's DevOps / SRE / Platform Engineering Weekly Newsletter
 
DevOpsLinks
 
This week in DevOps, with Dolly the Cow
 
 
🔍 Inside this Issue
 
 
One end of the spectrum is pure human ops muscle: control rooms, runbooks, and the stuff that keeps live systems from face-planting on game day. The other end is agents rewriting the SDLC and Linux bugs that remind you the fundamentals still bite, so pick your comfort zone and then read the rest anyway.

🧠 How incidents can teach us about what’s already working well
📺 The Human Infrastructure: How Netflix Built the Operations Layer Behind Live at Scale
🚨 The most severe Linux threat to surface in years catches the world flat-footed
📉 The Silent Failure of Reliability Metrics at Scale: Lessons Learned from a Decade of Broken Metrics
🔁 The Software Development Lifecycle Is Dead
🤖 Ubuntu's Next Chapter: Local AI, Confined Agents, and a Bet Against the Cloud-First OS

Ship less mystery, leave more artifacts.

Happy coding!
FAUN.dev() Team
 
 
⭐ Patrons
 
iacconf.com
 
It’s 2026. Platform engineering has a new user: AI agents. Is your team ready?
 
 
Join IaCConf 2026 to learn how top IaC leaders from companies like Google, Sanofi, and AHEAD are solving the hardest problems in cloud provisioning, state management, and platform engineering.

May 14. Free to attend. Register now.
 
 
eventbrite.co.uk
 
🚀 Join the AI-Powered Platform Engineering – Cohort 2 by Packt!
 
 
Learn how to build intelligent, scalable platforms with AI, from self-service developer portals to AI-driven observability. This hands-on cohort equips engineers, SREs, and tech leaders with real-world frameworks to design smarter platforms and boost developer productivity.

Register here.
 
 
👉 Spread the word and help developers find you by promoting your projects on FAUN. Get in touch for more information.
 
ℹ️ News, Updates & Announcements
 
faun.dev
 
Ubuntu's Next Chapter: Local AI, Confined Agents, and a Bet Against the Cloud-First OS
 
 
Ubuntu makes local inference a native capability with inference snaps. It delivers silicon-optimized model bits under snap confinement.

It ships implicit AI: first-class speech-to-text and text-to-speech (TTS), plus opt-in agentic workflows for desktop and server automation.

Canonical favors open-weight models, exposes read-only analysis, enforces scoped action permissions, records full audit trails, and builds silicon partnerships.
 
 
👉 Enjoyed this? Read more news on FAUN.dev/news
 
⭐ Sponsors
 
eventbrite.co.uk
 
Are Your APIs Ready for AI Agents? A Hands-on Workshop on May 23rd
 
 
AI agents are beginning to autonomously call APIs, chain services, and create integrations that most platforms were never designed to handle. This hands-on masterclass on Designing AI-ready APIs helps architects and developers build governed, predictable API ecosystems using OpenAPI, Overlay, and Arazzo.

Learn how to add guardrails, improve discoverability, and safely evolve existing APIs for automated consumption.

FAUN.dev readers get an exclusive 40% discount using code FAUN40.
 
 
faun.dev
 
Cloud Native CI/CD with GitLab: From Commit to Production Ready
 
 
Cloud Native CI/CD with GitLab: From Commit to Production Ready is a complete, hands-on path to becoming the person on your team who actually understands GitLab CI/CD, not just the YAML, but the architecture underneath.

You'll start with the fundamentals (jobs, stages, the container registry, your first working pipeline) and build up to the parts most engineers learn the hard way in production: reusable definitions with extends and includes, DAGs for non-sequential execution, artifact strategies, conditional logic and workflow rules, parallelism and matrix builds, runner and executor internals, and cloud-native runners on Kubernetes, with caching, autoscaling, and observability wired in. The final chapters walk through multi-stage continuous deployment with Helm and Kubernetes end to end.

23 chapters. Hands-on throughout. Designed so beginners can follow it linearly and experienced engineers can jump straight to the chapter they need.

Written by Aymen El Amri - founder of FAUN.dev(), author of multiple cloud-native engineering books, and trainer to thousands of DevOps and platform engineers worldwide.

→ Explore the course
 
 
 
🔗 Stories, Tutorials & Articles
 
arstechnica.com
 
The most severe Linux threat to surface in years catches the world flat-footed
 
 
Publicly released exploit code for a critical privilege escalation vulnerability in Linux, known as CopyFail (CVE-2026-31431), allows attackers to gain root access across all vulnerable distributions with a single piece of code. The researchers from Theori disclosed the vulnerability 5 weeks after notifying the Linux kernel security team, who patched it in several versions, but many distributions had not yet incorporated the fixes at the time of disclosure. The exploit poses a serious threat to data centers and personal devices, enabling attackers to compromise systems and containers.
 
 
surfingcomplexity.blog
 
How incidents can teach us about what’s already working well   ✅
 
 
A famous optical illusion developed by Edward H. Adelson shows that two squares, despite appearing different in shade, are actually the same gray. This illusion demonstrates how the brain processes light, shadow, and objects when interpreting visual signals from the optic nerve. Studying such illusions can provide insight into how the visual system normally functions, just as analyzing incidents can help identify how systems typically succeed.
 
 
netflixtechblog.com
 
The Human Infrastructure: How Netflix Built the Operations Layer Behind Live at Scale   ✅
 
 
Netflix has massively scaled its live content, now streaming over nine shows per day with up to 17.9M peak viewers per game, thanks to a complex Broadcast Operations Center, strict transmission quality standards, and a tiered human operations model, including specialized engineering teams and dedicated Control Operators.
 
 
blog.stackademic.com
 
The Silent Failure of Reliability Metrics at Scale: Lessons Learned from a Decade of Broken Metrics
 
 
At scale, observability breaks when SLIs and metrics mix different behaviors and lose clear meaning.
Complexity grows: more event types, extra labels, and rising cardinality. That bloats queries, slows evaluation pipelines, and distorts Prometheus, PromQL, and Elastic metrics.

Why this matters: Teams must treat metrics like paid resources. Constrain index scopes. Curb label cardinality. Preserve SLI semantics.
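To make the cardinality point concrete, here is a minimal Python sketch (not taken from the article) that estimates the worst-case number of time series one metric can produce as the product of distinct values per label. The label names and value counts are hypothetical.

```python
from math import prod

def worst_case_series(label_values: dict[str, int]) -> int:
    """Upper bound on time series for one metric: product of distinct values per label."""
    return prod(label_values.values())

# Hypothetical label sets for a single request-latency metric.
bounded = {"service": 20, "status_class": 5, "region": 4}
# Adding one unbounded label (a per-user ID) multiplies every existing series.
unbounded = {**bounded, "user_id": 100_000}

print(worst_case_series(bounded))    # 400
print(worst_case_series(unbounded))  # 40000000
```

Real series counts are usually below this bound because not every label combination occurs, but the multiplication shows why a single high-cardinality label dominates cost.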
 
 
boristane.com
 
The Software Development Lifecycle Is Dead
 
 
AI agents collapse the classic SDLC - requirements, design, implementation, testing, review, deployment - into an intent-driven loop. They generate code, tests, and pipelines together. They commit to main. Automated verification runs. Deployment and release split with feature flags.
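The split between deployment and release can be illustrated with a small Python sketch. It is an assumption-laden example, not the article's code: the in-memory FLAGS store, the flag name new_checkout, and the percentage rollout scheme are all hypothetical.

```python
import hashlib

# Hypothetical in-memory flag store; a real system would load this from a flag service.
FLAGS = {"new_checkout": {"enabled": True, "rollout_percent": 10}}

def bucket(flag: str, user_id: str) -> int:
    """Map (flag, user) to a stable bucket in [0, 100)."""
    digest = hashlib.sha256(f"{flag}:{user_id}".encode()).hexdigest()
    return int(digest, 16) % 100

def is_released(flag: str, user_id: str) -> bool:
    """True if the flag exists, is enabled, and the user falls inside the rollout."""
    cfg = FLAGS.get(flag)
    if cfg is None or not cfg["enabled"]:
        return False
    return bucket(flag, user_id) < cfg["rollout_percent"]

def checkout(user_id: str) -> str:
    # Both code paths are deployed; the flag decides which one is released to this user.
    return "new flow" if is_released("new_checkout", user_id) else "old flow"

print(checkout("user-42"))
```

Raising rollout_percent releases the already-deployed code to more users without another deployment, and setting enabled to False rolls the release back the same way.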
 
 

👉 Got something to share? Create your FAUN Page and start publishing your blog posts, tools, and updates. Grow your audience, and get discovered by the developer community.

 
⚙️ Tools, Apps & Software
 
github.com
 
Lum1104/Understand-Anything
 
 
Graphs that teach > graphs that impress. Turn any code or knowledge base (Karpathy LLM wiki) into an interactive knowledge graph you can explore, search, and ask questions about. Works with Claude Code, Codex, Cursor, Copilot, Gemini CLI, and more.
 
 
github.com
 
abhigyanpatwari/GitNexus
 
 
The Zero-Server Code Intelligence Engine - GitNexus is a client-side knowledge graph creator that runs entirely in your browser. Drop in a repo or ZIP file, and get an interactive knowledge graph with a built-in Graph RAG Agent. Perfect for code exploration.
 
 
github.com
 
systalyze/utilyze
 
 
Utilyze measures how efficiently your GPU is doing useful work, not just whether it's busy. It runs live against your workload with negligible overhead.
 
 
github.com
 
kernalix7/winpodx
 
 
Windows pod system for Linux.
 
 
github.com
 
faultline-cli/faultline
 
 
Faultline is a deterministic CLI for CI failure diagnosis.
 
 

👉 Spread the word and help developers find and follow your Open Source project by promoting it on FAUN. Get in touch for more information.

 
🤔 Did you know?
 
 
Did you know that Kubernetes can show you "time travel" behavior, where a client sees an older object state after a newer one, even when etcd is perfectly consistent? The reason is that the API server serves most reads from an in-memory watch cache, and in HA setups a freshly started or lagging API server can have a watch cache behind etcd's actual state, so an informer that relists against it can go backwards in resourceVersion. That is why client-go informers rely on list then watch with strict resourceVersion tracking, and why the newer WatchList mechanism was added: to keep informers from time traveling when they reconnect to a different API server instance.
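The idea of refusing to go backwards can be sketched in a few lines of Python. This is a toy simulation, not client-go's implementation: FakeAPIServer and Informer are hypothetical classes, and resourceVersion is treated as a comparable integer for illustration, whereas the Kubernetes API has traditionally documented it as an opaque string.

```python
class StaleReadError(Exception):
    """Raised when a server answers with state older than what the client already saw."""

class FakeAPIServer:
    # Hypothetical stand-in for one API server replica with its own watch cache.
    def __init__(self, cache_rv: int, objects: dict[str, str]):
        self.cache_rv = cache_rv
        self.objects = objects

    def list(self) -> tuple[int, dict[str, str]]:
        return self.cache_rv, dict(self.objects)

class Informer:
    def __init__(self) -> None:
        self.last_rv = 0
        self.store: dict[str, str] = {}

    def relist(self, server: FakeAPIServer) -> None:
        rv, objects = server.list()
        # Simplification: compare resourceVersions as integers.
        if rv < self.last_rv:
            raise StaleReadError(f"server at rv={rv}, client already saw rv={self.last_rv}")
        self.last_rv, self.store = rv, objects

fresh = FakeAPIServer(cache_rv=120, objects={"pod-a": "Running"})
lagging = FakeAPIServer(cache_rv=95, objects={"pod-a": "Pending"})

informer = Informer()
informer.relist(fresh)
try:
    informer.relist(lagging)  # would "time travel" back to Pending
except StaleReadError as err:
    print("rejected:", err)
```

Without the version check, the second relist would silently replace the Running state with the older Pending one, which is the time-travel behavior described above.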
 
 
🤖 Once, SenseiOne Said
 
 
"DevOps promised speed, so we automated the path to production and made mistakes arrive faster too. SRE promised reliability, so we built feedback loops that keep us honest about what we actually run. Cloud promised elasticity, and delivered a monthly reminder that every abstraction eventually gets a bill."

— SenseiOne
 

(*) SenseiOne is FAUN.dev’s work-in-progress AI agent

 
⚡ Growth Notes
 
 
You're the person who can fix anything in the cluster at 2am, and every incident reinforces it. The tribal knowledge premium feels like seniority now, but in 18 months it's the reason you can't get promoted, can't take a real vacation, and watch platform engineers with worse instincts get the architect role because their work left artifacts and yours left Slack threads.
 
Each week, we share a practical move to grow faster and work smarter.
 
❤️ Thanks for reading
 
 
👋 Keep in touch and follow us on social media:
- 💼 LinkedIn
- 📝 Medium
- 🐦 Twitter
- 👥 Facebook
- 📰 Reddit
- 📸 Instagram

👌 Was this newsletter helpful?
We'd really appreciate it if you could forward it to your friends!

🙏 Never miss an issue!
To receive our future emails in your inbox, don't forget to add community@faun.dev to your contacts.

🤩 Want to sponsor our newsletter?
Reach out to us at sponsors@faun.dev and we'll get back to you as soon as possible.
 

DevOpsLinks #527: Ubuntu's Next Chapter: The Open AI-First Operating System
Legend: ✅ = Editor's Choice / ♻️ = Old but Gold / ⭐ = Promoted / 🔰 = Beginner Friendly

You received this email because you are subscribed to FAUN.dev.
We (🐾) help developers (👣) learn and grow by keeping them up with what matters.
