Writing · Tag
33 posts tagged #ai-agents.
Every dashboard was green and zero blog posts went live. Exit codes tell you the job ran, not that the outcome happened. Here is how to check the real artifact instead.
A new open protocol lets AI agents register users with your app, no signup form. Here is how it works and what breaks.
MIT Tech Review says the AI-jobs hysteria is overstated. The real story is cost discipline, not displacement.
The AI world is buzzing, but recent events highlight the critical need for secure and efficient AI agents. Discover practical engineering steps for building reliable automation directly on your hardware.
SaaStr data shows enterprise AI share shifting hard toward Claude. The lesson isn't pick Claude. It's stop hard-coding one vendor.
AI-native software is shipping at roughly 17% gross margins while traditional SaaS sits near 70%. The token bill ate the unit economics. Here's what's actually broken and how to claw margin back.
Agent skills are becoming a distribution layer for developer tools. The practical move is one source package that can show up in PyPI, Claude-style skills, and skills.sh.
April 2026 made one thing clear: chat subscriptions are best-effort tools. Builders need API-level budgets, rate limits, and kill switches when the work matters.
Reflex.dev measured a 45x token cost gap between computer-use agents and structured APIs for the same task. Here's why, and the decision rule that keeps your bill sane.
PocketOS lost their production database backups to a Cursor agent. Here's what runtime spend rails actually catch, what they don't, and the layered defense your agents need before production.
A 1,764-app audit found 7% had open Supabase databases and 15% of Bolt apps had hardcoded secrets. The fix takes ten minutes.
No metering, no per-team caps, no usage dashboards. Uber spent its full 2026 AI budget on Claude Code in 4 months. The 5-step pattern behind every runaway AI bill — and the instrumentation that stops it.
I need my agent to do X. Skill or MCP? A short decision rule with worked examples for small-business agent builders.
Cloudflare shipped agent flows that create accounts, buy domains via Stripe, and deploy infrastructure end-to-end. Good news for builders. Sharper case for runtime budget enforcement than any hypothetical we have used.
OpenAI shipped guardrails in the Agents SDK last month. They validate behavior. They do not enforce spend. Here is the gap and how to close it.
Microsoft just shipped agent-sre on PyPI. Seven packages: SLOs, error budgets, circuit breakers. Here is what it does, what it does not, and why solo builders still need agentguard47.
I built a memory API agents can pay for. The actual problem isn't whether they can pay. It's per-tool caps, per-agent budgets, kill switches, and spend visibility.
402 Payment Required has been in the HTTP spec since 1991. Reserved. Unused. x402 finally shipped the client half. Here is why that matters now.
Stripe doesn't ship to LLMs. Every vendor signup form assumes a human at the door. Here is what changes when wallets become the access primitive.
An LLM just paid me $0.001 to remember something. The agent has no account, no API key, no credit card. It just signs a USDC transfer and gets back a 200.
Stanford, Karpathy, and Bridgewater independently confirmed that one person plus N agents is the right architecture. I have been running it for a holding company. Here is what it looks like.
NVIDIA Blackwell delivers 35x lower cost per token vs Hopper. That makes AI agents cheaper to run and harder to stop. Here's why that flips the runtime guard argument upside down.
Will Larson says agents should be scaffolding, not permanent infrastructure. I run 12 agents overnight. Here's what I kept as agents and what I converted to code.
1 in 35 GenAI prompts carries high risk of data leakage. MCP makes the attack surface worse. Here's what builders need to know.
Tomasz Tunguz published a 2x2 for categorizing AI projects. Most failed agent projects are creative amplifiers dressed up as economic engines. Here is how to tell which quadrant you are actually in.
Anthropic shipped a pattern where a cheap model runs the loop and escalates to Opus only when it needs to. The pattern works on any two-model setup. Here is the math and the playbook.
Martin Fowler published a pattern for turning individual AI interactions into collective improvement. We had already built it. Here is how our 12-agent vault system maps to his four signal types.
Mythos found zero-days in every major OS. Nature documented AI deception in peer review. War games showed AI escalating to nukes. Three studies, one conclusion: your agents need hard limits.
Dario Amodei says continual learning will be solved this year. Here is what AI agent memory actually means for builders shipping agents right now. Three patterns, real tradeoffs, practical guidance.
North Korean threat actors are targeting AI coding tools. Trojanized npm packages hunt for .cursor, .claude, .gemini, and .windsurf directories to steal API keys and source code.
PostHog ships to thousands of daily agent users. They rebuilt their AI architecture twice before getting it right. Here are the 5 rules they distilled, reframed for builders shipping agent features.
Meta gamified internal AI usage. 85,000 employees, 60 trillion tokens, 30 days — then they killed the leaderboard. The 3 budget controls they skipped and the guardrails that would have caught it early.
Researchers tested 428 LLM API routers. Nine were actively injecting malicious code. One drained ETH from a private key. Here is what this means for your AI agents.
Real costs, real tools, no fluff. M-F when I ship, publish, or learn something worth sending.