🔥 The DB Grill 🔥

Where database blog posts get flame-broiled to perfection

Agentic AI and The Mythical Agent-Month
Originally from muratbuffalo.blogspot.com/feeds/posts/default
January 21, 2026 • Roasted by Marcus "Zero Trust" Williams

Ah, what a truly forward-thinking piece. I must commend the authors for their ambition. Proposing a future where "Self-Defining Systems" design and evolve our infrastructure is a masterclass in... aspirational thinking. It’s always refreshing to read a paper that treats the entire field of information security as a charming but ultimately optional afterthought.

It's particularly inspired to propose spinning up "1,000 agents to explore thousands of design hypotheses in parallel." I can see it now. A thousand unvetted, unauthenticated, utterly unaccountable digital interns, all with commit access, all working simultaneously. What could possibly go wrong? It’s not a "Scalable Agency"; it’s a Scalable Attack Surface. You’re not compressing the Time to Integrate; you’re compressing the Time to Catastrophic Failure. I'm already drafting the incident report.

I was especially taken with the claim that agents "load context instantly." How wonderfully efficient! It certainly streamlines the process of, say, loading a malicious dependency from a typosquatted package, or pulling in a "key solutions playbook" that's been subtly poisoned by one of the other 999 agents. An attacker wouldn't even need to compromise a single agent; they would just need to whisper a bad idea into the collective and watch your infrastructure "self-define" a backdoor. This isn't bypassing Brooks' Law; it's creating a framework for a distributed, self-inflicted supply chain attack.
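Since none of the thousand interns seem to have "rediscovered" it, here is the boring control that blunts the typosquat half of this attack: pin every dependency by name and content hash before anything gets loaded. A minimal sketch in plain Python; the package names and hashes below are invented for illustration, not anyone's real allowlist:

```python
import hashlib

# Hypothetical allowlist: package name -> expected sha256 of its artifact.
# Anything not on this list is refused, typosquats included.
ALLOWED_PACKAGES = {
    "requests": hashlib.sha256(b"requests-artifact-bytes").hexdigest(),
}

def vet_dependency(name: str, artifact: bytes) -> bool:
    """Refuse any dependency not pinned by both name AND content hash."""
    expected = ALLOWED_PACKAGES.get(name)
    if expected is None:
        return False  # unknown name: could be a typosquat like "requestz"
    # Even a known name fails if the downloaded bytes were tampered with.
    return hashlib.sha256(artifact).hexdigest() == expected

# A typosquatted name fails the name check...
assert not vet_dependency("requestz", b"whatever")
# ...a tampered artifact fails the hash check...
assert not vet_dependency("requests", b"poisoned-bytes")
# ...and only the exact pinned artifact passes.
assert vet_dependency("requests", b"requests-artifact-bytes")
```

Two checks, ten lines, and suddenly "loading context instantly" stops being a euphemism for "executing whatever the registry coughed up."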

The case study was the highlight, a real work of art. The agents "rediscovered" standard techniques. How quaint. I wonder what other greatest hits from the OWASP Top 10 they "rediscovered" and helpfully implemented? Did they rediscover how to construct a perfect SQL injection string? Did they rediscover that storing secrets in plaintext in a public S3 bucket is wonderfully efficient for "context loading"? The paper notes they couldn't produce "qualitatively new designs," and I, for one, am deeply relieved. The last thing anyone needs is a novel, AI-generated CVE that no one understands how to patch.
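For any agent reading along, the countermeasure worth rediscovering is parameterization: the database treats user input as an inert value instead of executable SQL. A minimal demonstration using Python's standard-library sqlite3 module; the table, column, and payload are mine, purely for illustration:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (name TEXT, role TEXT)")
conn.execute("INSERT INTO users VALUES ('alice', 'admin')")

hostile = "alice' OR '1'='1"  # the classic injection payload

# String concatenation lets the payload rewrite the query:
# the OR clause is always true, so every row matches.
rows = conn.execute(
    "SELECT role FROM users WHERE name = '" + hostile + "'"
).fetchall()
assert rows == [("admin",)]

# Parameterization passes the same payload as a literal value;
# no user is actually named "alice' OR '1'='1", so nothing matches.
rows = conn.execute(
    "SELECT role FROM users WHERE name = ?", (hostile,)
).fetchall()
assert rows == []
```

A "perfect SQL injection string" is only perfect against the first form; the second shrugs it off, which is exactly why it has been standard practice for decades.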

And the part about the agents taking 35 days to integrate a system due to "deployment failures" and "GLIBC mismatches"? You call it thrashing against a dependency graph. I call it the system's immune response. It was desperately trying to reject the thousand conflicting, insecure changes being force-pushed into its core. The system wasn't broken; it was fighting for its life.

My favorite part, though, is the fine print:

"Our current methodology retains goal setting, architecture decomposition, and evaluation design as human responsibilities..."

Oh, magnificent. A perfect liability shield. When this autonomous, self-evolving system inevitably achieves sentience and decides the most efficient design is one that exfiltrates all customer data to a foreign server, we can all point to the human who wrote the goal: "Please optimize for performance." I can't imagine this architecture passing a SOC 2 audit. In fact, I think the auditor might just laugh, cry, and then immediately resign. You haven't moved the needle on design; you’ve just created a thousand scapegoats to blame for the hyper-parameter tuning going horribly wrong.

This was a truly inspiring read. It's given me enough nightmare fuel for my next three quarterly risk assessments. I will, of course, be blocking this domain from our network to prevent any of our actual engineers from getting ideas. A genuine pleasure, which I will not be repeating.