Subscribe • Previous Issues Agents Need Maps, Not Bigger Context Windows Like everyone else, I’ve been enjoying the steady improvement in coding agents and the tooling around them, from frameworks and harnesses to evaluation suites. But the more I talk with teams actually deploying agents in enterprises, the more I circle back to plumbing. Agents need dataContinue reading “I talked to Google’s former AI head about messy data”
Tag Archives: newsletter
What your base model doesn’t protect you from
Subscribe • Previous Issues The Data Compliance Problem AI Teams Keep Ignoring I’ve avoided writing about copyright and AI. Not because it isn’t important, but because it felt like a legal sideshow compared to the engineering and business questions I find more interesting. That’s gotten harder to justify. There is also something mildly funny about naming myContinue reading “What your base model doesn’t protect you from”
Your AI bill is a tax on scale
Subscribe • Previous Issues The Hybrid AI Stack Is Coming for the Pricing Power of OpenAI and Anthropic OpenAI and Anthropic are going public while still capturing much of the money spent on foundation-model usage. But deployment patterns are starting to tell a more complicated story. Companies are building hybrid model portfolios, using proprietary models where convenience,Continue reading “Your AI bill is a tax on scale”
12 GW announced. 5 GW under construction. What happens next?
Subscribe • Previous Issues The Gap Between the Press Release and the Power Grid Back in February, I wrote about what I called the “Data Center Rebellion,” the growing local resistance to the physical infrastructure behind AI. Since then, I have been asking tech people around the Bay Area how closely they are following the backlash. TheContinue reading “12 GW announced. 5 GW under construction. What happens next?”
The smartest AI teams are moving past chatbots
Subscribe • Previous Issues Your Enterprise Data Deserves Better Than a Chatbot Large language models and their multimodal variants remain the foundation models most people encounter first. That makes sense. Text, images, audio, and video cover a huge range of knowledge-work tasks, and today’s chatbots are far more capable than the text-only systems many people first tried.Continue reading “The smartest AI teams are moving past chatbots”
What Upwork, DoorDash, Meta, EY, and Fundrise reveal about agents
Subscribe • Previous Issues Beyond the Demo: What Real AI Agents Actually Do at Work I am always on the lookout for new AI agents and applications that operate outside the coding world. By agent, I mean a system that can take a goal, use tools, keep context, and work through several steps rather than simply answerContinue reading “What Upwork, DoorDash, Meta, EY, and Fundrise reveal about agents”
Stop upgrading your LLM. Start fixing your data.
Subscribe • Previous Issues Integration Is the New Moat: Moving Beyond the LLM The AI Agent Conference in New York was one of the better events I’ve attended to get a read on what’s actually happening with enterprise AI. The formal sessions were great, but the hallway conversations was where I got the inside scoop. The consistentContinue reading “Stop upgrading your LLM. Start fixing your data.”
Why your AI bills are going up (even as tokens get cheaper) 📉💸
Subscribe • Previous Issues The End of the AI Experiment: Surviving the CFO’s New ROI Demands Why This Has Become an Executive Issue Why is AI spend no longer just an IT budget problem? AI has crossed a threshold where aggregate spend across every department requires capital allocation discipline, not just software procurement review. Every function nowContinue reading “Why your AI bills are going up (even as tokens get cheaper) 📉💸”
Your AI agent looks capable. But can it actually finish the job?
Subscribe • Previous Issues Why Your AI Agents Fail in Production (And How to Actually Test Them) In a previous post, I argued that deploying autonomous AI agents reliably is not primarily a model problem. It is an environment problem. The gap between a capable foundation model and a production-ready system is bridged by harness engineering: theContinue reading “Your AI agent looks capable. But can it actually finish the job?”
Generation is cheap. Evaluation is everything.
Subscribe • Previous Issues What mathematicians figured out about AI that most enterprises haven’t Recent results suggest that research mathematics is no longer a purely speculative test case for AI. A growing set of examples shows AI contributing not just to short contest puzzles, but to open-ended mathematical work that requires literature search, cross-domain connection-making, revision, andContinue reading “Generation is cheap. Evaluation is everything.”
