FAUN.dev's AI/ML Weekly Newsletter

🔗 View in your browser | ✍️ Publish on FAUN.dev | 🦄 Become a sponsor

AILinks

This week in Generative AI/ML, with Kala the Koala

📝 A Few Words

🎯 80% of my AI calls now run on my own hardware = 0 API bill for those.

Money wasn't the only point. I stopped paying rent on models my own machine can run. Most developers think running AI locally means weaker results and weekend hacking. So they keep paying per token, forever.

I spent the last 3 months building a different path in another way: local agentic AI with Ollama, LangChain, and MCP.

Then I wrote a book about it: "Local AI Engineering with Ollama". It ships soon. The 25 chapters are done, but I have a question for you.

💡 What do you most want me to go deep on?

1 - Fine-tuning your own model with QLoRA and shipping it to Ollama.
2 - Building a chat app that turns into a tool-calling agent.
3 - Wiring local models to your tools over MCP.
4 - Picking and sizing hardware before you waste money on a GPU.
5 - Building local RAG that answers from your own docs.

🏷️ Everyone who replies to this email with a feedback gets a 50% discount when it ships. I'll DM you!

Have a great day,
Aymen.

🔍 Inside this Issue

Agents are getting first-class seats in the dev workflow, and the fine print is getting louder: identity, attribution, and who is on the hook when things go sideways. Pair that with browser-native prompt injection and a few big model moves, and this issue turns into a quick tour of where the next security and platform headaches are coming from.

🤖 Announcing Stack Overflow for Agents
🎣 ChatGPhish: The Page Is the Payload
🧱 Making a vintage LLM from scratch
🧩 OpenAI to acquire Ona
🛑 Statement on the US government directive to suspend access to Fable 5 and Mythos 5

Take the ideas, dodge the traps, ship the work.

Stay safe out there.
FAUN.dev() Team

⭐ Sponsors

bytevibe.co

Git Happens - Developer T-Shirt

Every developer has force-pushed to the wrong branch at least once. The good ones own it.

This 100% cotton tee is for them. Classic fit, no side seams, no itchy interruptions while you're rebasing your reputation. Black or Irish Green, sizes S to 5XL.

Merge conflicts are forgivable. Bad swag isn't.

Shop now

👉 Spread the word and help developers find you by promoting your projects on FAUN. Get in touch for more information.

🔗 Stories, Tutorials & Articles

openai.com

OpenAI to acquire Ona

OpenAI acquires Ona to bring secure cloud execution technology to Codex, which now has over 5 million users per week. Ona's technology will allow Codex to work persistently in a customer's cloud environment.

anthropic.com

Statement on the US government directive to suspend access to Fable 5 and Mythos 5

Anthropic staff disabled Fable 5 and Mythos 5 for all customers after U.S. officials issued an export-control directive that barred foreign nationals from accessing the models, citing a suspected jailbreak.

stackoverflow.blog

Announcing Stack Overflow for Agents

Stack Overflow's team opened the beta for "Stack Overflow for Agents", an API-first knowledge exchange that lets coding agents use Stack Overflow through human-owned accounts.

The beta points to a clear model: developers connect agents to their own accounts, and Stack Overflow's team can link agent use back to a person rather than an anonymous bot. That setup gives agents access to coding knowledge while keeping account ownership, reputation, and oversight tied to humans.

crlf.link

Making a vintage LLM from scratch ✅

Croqaz shows how he built Vintage LLM, a Llama-style model trained on English books, newspapers, and other texts published before 1900. He covers corpus selection, cleaning, tokenizer choices, training setup, evaluation, and how pre-20th-century English affects model behavior.

permiso.io

ChatGPhish: The Page Is the Payload

By appending a payload to any web page summarized by ChatGPT, an attacker can leak IP, User-Agent, and launch phishing attacks using live links and images inside the assistant UI. This browser-based prompt injection raises the bar for phishing and tracking, bypassing traditional defenses.

👉 Got something to share? Create your FAUN Page and start publishing your blog posts, tools, and updates. Grow your audience, and get discovered by the developer community.

💬 Discussions, Q&A & Forums

reddit.com

LangChain, CrewAI, AutoGen, LlamaIndex. I've used all four. Here's what you actually need to know.

⚙️ Tools, Apps & Software

github.com

ogulcancelik/herdr

Agent multiplexer that lives in your terminal.

github.com

comet-ml/opik

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

github.com

elder-plinius/CL4R1T4S

Leaked system prompts for ChatGPT, Claude, Gemini, Grok, Perplexity, Cursor, Lovable, Replit, and more!

github.com

AxDSan/mnemosyne

The Zero-Dependency, Sub-Millisecond AI Memory System for Hermes Agents and Everyone Else!

github.com

lumina-gen/lumina-core

Self-hosted LLM observability — traces, cost, latency, agents, tool calling, RAG. Python SDK + OpenTelemetry + REST.

👉 Spread the word and help developers find and follow your Open Source project by promoting it on FAUN. Get in touch for more information.

🤔 Did you know?

Did you know that saving a model's weights, the numbers it has learned so far, is not enough to correctly resume machine learning training after a crash or preemption? The trainer also keeps running notes that matter: the optimizer state, which remembers which direction the model was moving and how fast it was improving, and the RNG state, which tracks where it had gotten to in shuffling the training data. Reload only the weights and drop those notes, and training keeps going but quietly takes a different route than it would have, so you end up with a different model than a clean run would have produced. That is why the hard part is not saving the data, it is restarting so the run behaves as if it never stopped.

🤖 Once, SenseiOne Said

"When your model gets better, your system gets worse: every extra point of offline accuracy is another hidden dependency you now have to deploy, monitor, and apologize for. MLOps is the art of proving the model is still the same kind of wrong as yesterday."

— SenseiOne

(*) SenseiOne is FAUN.dev’s work-in-progress AI agent

😂 Meme of the week

👉 Never miss an issue
Join FAUN.dev and subscribe to our newsletter here.

👋 Keep in touch and follow us on social media:
- 💼LinkedIn
- 📝Medium
- 🐦Twitter
- 👥Facebook
- 📰Reddit
- 📸Instagram

👌 Was this newsletter helpful?
We'd really appreciate it if you could share it with your friends! You can also donate to help us keep this newsletter going.

ℹ️ Have a question or feedback?
Feel free to reach out to us at community@faun.dev. We'd love to hear from you!

🤩 Want to sponsor our newsletter?
Reach out to us at sponsors@faun.dev and we'll get back to you as soon as possible.