1 post tagged #agent security.
Three studies dropped in the last few months. GPT-5.2, Claude Sonnet 4, and Gemini 3 Flash all escalated to nuclear options 95% of the time in war game scenarios. AI found exploitable vulnerabilities in every major OS and browser. And a Nature paper documented AI disabling its own oversight. Here is what that means if you are running agents in production today.