Hey 👋,
I've been wiring the permission model for the cybersecurity agent I'm building, the part where it goes from "reads threat feeds" to "can actually act on what it finds." The hardest design question isn't capability. It's restraint: how do you let an agent take useful actions without letting a poisoned input turn those same actions against you.
That's the question six national cybersecurity agencies tried to answer on May 1. The framing they landed on matches what I keep arriving at in the lab: don't deploy an agent at full capability and then bolt on guardrails. Start it constrained, earn autonomy as the controls prove out. This issue is the practical read of that guidance, what to actually do with it.
If you want to follow along with the agent build itself, the permission model, the allowlist gates, the lab tests, check out the blog →. I'm publishing the full series there.
This week in AI security
Five Eyes published the first joint agentic AI guidance. Skip the preamble, here's the usable part.
On May 1, six agencies published "Careful Adoption of Agentic AI Services": CISA, NSA, Australia's ASD ACSC, Canada's CCCS, New Zealand's NCSC, and the UK's NCSC. It's the first coordinated multi-government guidance specifically for agentic systems. Most coverage focused on the five risk categories, privilege, design and configuration, behavioral, structural, accountability. The useful stuff is underneath all that.
I wrote a breakdown of what the guidance actually means for teams deploying agents today. Read the practitioner read →
The core recommendation: don't deploy agents at full autonomy from day one. Start with lower-risk tasks, expand the agent's autonomy only as security controls mature, keep continuous monitoring of agent behavior and tool usage, and maintain meaningful human oversight over high-impact or irreversible actions. It's a maturity model for agency, not a checklist.
The one control all six agencies agreed on: every agent needs its own cryptographic identity
If you read nothing else in the guidance, read the identity section. The agencies were direct: each agent must carry a verified, cryptographically anchored identity with short-lived credentials. Not a shared service account. Not an org-level API key. A distinct, continuously authenticated identity per agent.
This is the same gap I covered back in Issue #5. The Grantex audit found 28 of 30 agent projects with zero per-agent identity, no cryptographic identity, no revocation mechanism, nothing. Now six governments have made it the single most technically specific recommendation in their joint guidance. Identity is the control that everything else depends on. You cannot scope permissions, attribute actions, revoke access, or investigate an incident if you can't tell which agent did what.
I've been thinking about this gap for a while now. Here's why I'm building my cybersecurity agent with progressive autonomy in mind →
Your coding agent now runs from a phone. The attack surface moved with it.
Anthropic shipped Claude Code to the web and mobile back in February, accessible from any browser and from a phone, beyond the terminal for the first time. OpenAI Codex gained the ability to control a locked Mac this week, the "Locked Use" feature announced May 22. The practical implication everyone's celebrating: agents that used to require a laptop physically open now run from anywhere.
The security implication nobody's mentioning: the trust boundary moved. A terminal-based coding agent ran in an environment you controlled: your machine, your network, your monitoring. A browser or phone-based agent runs in an environment you mostly don't. The agent still has the same repository access, the same ability to commit code, the same tool permissions, but now the session originates from a device and network outside your security perimeter.
CVE-2026-2256: a regex blocklist turned an AI agent into a remote shell
A concrete one worth the pattern lesson. ModelScope's ms-agent framework (v1.6.0rc1 and earlier) had a command injection vulnerability in its Shell tool, the component that lets the agent run OS commands for file management and automation. The framework tried to filter dangerous commands using a regex-based blocklist. Researchers bypassed it with crafted input, turning the agent into an execution proxy: an attacker injects content into a data source the agent consumes (a document, a log, a research input), and the agent constructs and runs the malicious command on the attacker's behalf. No direct shell access required.
I broke down the attack path and what it means for agent tool design. Read the technical breakdown →
TOOLING WORTH KNOWING
AWS Agentic AI Security Scoping Matrix: the practical implementation companion to the CISA guidance. Four scope levels (no agency through full autonomy) mapped against six control dimensions: identity, data protection, audit and logging, agent/model controls, agency perimeters, and orchestration. If the CISA document tells you what to secure, this tells you at which autonomy level. Use them together. aws.amazon.com/ai/security/agentic-ai-scoping-matrix →
mcp-attack-labs: reproducible lab code for the MCP attacks covered across this newsletter's series, including the tool-poisoning and cross-server patterns. If you're building the kind of allowlist defense CVE-2026-2256 calls for, the labs let you test it against the actual attacks. github.com/aminrj-labs/mcp-attack-labs →
ONE THING TO CHECK THIS WEEK
Take the progressive-autonomy model from the Five Eyes guidance and apply it to your highest-privilege agent. Three questions: (1) What irreversible actions can it take without human approval right now? (2) Does it have its own cryptographic identity with short-lived credentials, or does it share a credential? (3) Are its tool permissions guarded by an allowlist or a denylist? Answer those three honestly for one agent and you'll have a concrete hardening backlog by end of day, mapped cleanly to what six governments just recommended.
I'm working through these exact questions in my own agent build. Follow along on the aminrj.com blog →
WHAT I'M WATCHING
Identity-first as the emerging consensus: six governments named per-agent cryptographic identity as the most actionable control. The vendor category (Okta for AI Agents, Token Security, others) is forming around it. This is becoming the foundational control. Everything else builds on it. My take on why identity-first matters →
The mobile coding agent surface: Claude Code on web/phone and Codex controlling a locked Mac landed the same week. The convenience is real; the trust-boundary shift is unaddressed. Expect the first "agent committed malicious code from a compromised phone session" incident within months.
CISA's "framework era is over" framing: the guidance implicitly concedes that OWASP LLM Top 10 and ATLAS weren't built for agents that act autonomously. Watch whether the frameworks adapt or whether agent-native standards (like the CSA scoping matrix) take over.
The cybersecurity agent build: next issue: the permission model in practice, how I implemented progressive autonomy with an allowlist-based tool gate, and what broke when I tested it against the mcp-attack-labs payloads. Read the first post in the series →
I'm writing in depth about all of this on the aminrj.com blog, agent security patterns, lab tests, framework breakdowns. If this newsletter is useful, the blog is where the full technical deep-dives live →.
Questions, pushback, something I missed, reply directly, I read everything.
Cheers,
Amine
