The Agentic Anarchy Problem: AI Agents and the Failure of Traditional IAM
Traditional Identity and Access Management (IAM) infrastructures were architected for predictable, deterministic actors: human users, static groups, and rigid service accounts. These frameworks fail to account for autonomous software that dynamically interprets objectives, selects its own execution pathways, and operates through delegated, probabilistic authority. This structural misalignment represents the primary security hurdle for contemporary agentic AI deployments.
While an AI agent might present as a standard application service, it functions as a distinct, high-entropy enterprise actor capable of non-linear interaction with documents, data, and tool-based APIs. Organizations lacking granular visibility into the authorization lineage of these agents—specifically the scope of their tool access and the mechanisms for power revocation—operate within a state of agentic anarchy.
This is not a failure of model intelligence; it is a systemic architectural deficiency where operational authority is granted to non-deterministic systems prior to the implementation of robust identity and authorization controls.
- Core Thesis
AI agents break traditional IAM because they act across user intent, application authority, and tool permissions. A secure agent program requires explicit identity, delegated authorization, scoped credentials, and policy enforcement that lives outside the model.
This article helps security architects and technical leaders turn vague concerns into practical controls. We aim to make risks visible and controls explicit. The goal is not to promise perfect safety but to ensure that risks are managed and uncertainty is documented.
Dangerous failures happen when teams confuse product features with security controls. A model that promises to follow a policy is not a policy engine. A "user approval" button is not informed authorization if the user cannot see the underlying tool arguments. Disciplined engineering is required because the boundary between a suggestion and an action is often blurry in agentic systems.
- Why This Matters
AI identity and authorization now sit at the center of business workflows. These processes involve sensitive customer data, proprietary code, and cloud infrastructure. When AI touches these workflows, it can amplify both productivity and risk.
The question is not whether AI can make mistakes—it can. The question is whether the system is designed so that mistakes are constrained, detected, and reversible. A useful AI security program assumes failure is possible and builds a control system around that assumption.
For leadership, this affects trust. Customers and partners will ask whether AI systems are governed. Strong answers require more than claims; they require evidence like inventories, data-flow diagrams, and test results.
For engineers, vague governance language becomes implementation debt. Someone must decide exactly what is allowed, what is blocked, and what happens when the system behaves unexpectedly.
- Failure Model
The most common failure is not a dramatic model rebellion. It is a quiet control gap where the system behaves as instructed but within a weak boundary.
Systems often ship with broad credentials because narrow permissions were inconvenient during development. Reviewers may approve AI-generated actions because the interface hides the actual tool arguments. A document might enter a RAG corpus without source authority metadata, leading to unauthorized retrieval. A support agent might send messages based on content that includes hostile instructions.
Common failure ingredients include:
- unclear ownership of agent identity;
- ambiguous authority between the user and the assistant;
- overbroad access to internal APIs and databases;
- insufficient separation between data and instructions;
- weak approval designs that favor speed over security;
- missing telemetry for tool-call arguments;
- untested incident response for autonomous actions;
- unsupported public claims about safety;
- weak source verification for retrieved context;
- overreliance on the model's ability to follow instructions.
AI deployment needs explicit control surfaces that can be tested and audited.
- Why Traditional IAM Fails
IAM depends on knowing who is acting and whether a policy allows the action. Agents complicate this because an action begins as a user goal, is interpreted by a model, and executed under a service identity. By the time the action reaches an API, the original intent may be several steps away.
A mature implementation begins by naming the risk. The team should document what the system can see, decide, and change. If these questions are unanswered, the system is not ready for high-trust use.
AI Security Engineering differs from generic AI enthusiasm because it asks how a capability is bounded and who is accountable when boundaries fail.
- The Agent Is Not the User
Do not treat an agent simply as an extension of the user. While this might be safe for read-only tasks, it becomes dangerous when agents can change state. A user asking for help does not automatically approve every downstream action an agent might take.
The agent needs its own identity context. The system must preserve the link between the user request, the agent's decision, and the final result. Design should separate human intent from model interpretation and application policy. When these layers collapse, authorization is hard to verify. Teams should be suspicious of any architecture where the model both proposes and approves a sensitive action.
- The Agent Is Not Just a Service Account
Conversely, do not treat an agent as a standard backend service. Traditional services follow deterministic logic. Agents handle uncertainty and may choose paths that developers did not encode. Assigning an agent a broad service account turns this ambiguity into dangerous authority.
This is where many prototypes become risky. The fastest way to make a demo work is often the least safe: using broad tokens, shared accounts, and permissive retrieval without an approval boundary. The demo proves the workflow is possible, but it does not prove it is governed.
Security teams should treat prototypes as evidence of capability, not readiness. Production requires a higher bar for identity isolation and credential scoping.
- Delegated Authorization
The safest pattern is usually delegated authorization. The agent helps the user, but any sensitive action requires proof that the user had authority and that the request matches an approved purpose. This requires scoped tokens and tool-call validation.
A useful pattern is to place enforcement in deterministic software around the model. The model can suggest actions, but enforcement should happen through hard-coded policy checks and runtime limits. This makes the system inspectable. Inspectability lets teams test, monitor, and improve the control.
- Policy Must Live Outside the Model
The model should not be the primary enforcement point for high-impact authorization. Natural language is not an access-control list. The policy engine, tool broker, or application server must decide whether an action is allowed based on structured rules.
The system should also preserve enough telemetry to reconstruct decisions. This includes trace IDs, user IDs, model versions, retrieved sources, and tool arguments. While privacy must be managed, the absence of evidence is a significant risk.
The right approach is to define what metadata is retained, what raw content is kept selectively, and how long it remains in the system.
- Memory Creates Identity Drift
Agent memory can change how an agent interprets future requests. If an attacker writes policy-like instructions into an agent's memory, the agent may later behave as if those instructions were authoritative. Memory should have provenance and expiration rules.
Frameworks can help organize this work. OWASP Top 10 for LLM Applications is useful for identifying failure modes like prompt injection. NIST AI RMF is useful for risk management. MITRE ATLAS tracks adversary behavior. CSA and ISO standards support management thinking, while SOC 2 language helps translate these controls into trust-service evidence.
No framework is a substitute for engineering judgment. The framework identifies risks, but the architecture determines whether those risks are controlled.
- Approval Is Not Automatically Control
An "Approve" button is not a security control unless the approver can see what will happen. They must see which data is accessed, what authority is used, and what evidence supports the suggestion. A blind approval is a delegated risk. Every approval must be logged with identity, timestamp, and outcome.
A complete operating model assigns clear ownership. AppSec may own the design review. Platform engineering manages the runtime controls. IAM teams define the identity patterns. GRC leads map controls to policies, and Privacy teams review data handling. The SOC manages detection, while Product teams own the user experience. No critical control should be ownerless.
- Revocation and Kill Switches
Agent permissions must be revocable at multiple levels: per user, tenant, agent, or tool. A kill switch that depends on the model obeying a "stop" instruction is not reliable. Enforcement must happen at the infrastructure layer.
Test these mechanisms before using them in production. Testing should include design reviews and log reconstruction drills. If a team cannot verify that a control works, they should not claim it is effective.
Verification must include negative test cases. What happens when an agent receives malicious instructions? What happens when a user lacks permissions? Responders must understand how the system behaves when boundaries are pushed.
- Evidence and Auditability
Agent IAM must produce durable evidence. Logs should capture the user request, agent identity, tool selected, and the result. Without this data, incident response is guesswork.
Build evidence into the workflow from the start. This includes diagrams, data-flow maps, and security eval results. Claim-readiness means every public statement about security is supported by evidence. It does not mean a system is risk-free; it means the company can explain its controls.
- Operating Model
Agent security is a shared responsibility. It requires collaboration between IAM, AppSec, platform engineering, and legal teams. Accountability must be clear.
This approach must extend to incident response. If a control fails, responders need to know what happened and which containment options exist. For AI systems, this requires reconstructing the event chain: prompts, retrieval results, and tool calls. Playbooks for high-risk workflows should be tested before launch.
- Practical Example
Consider a sales agent asked to prepare renewal outreach. In an unsafe design, the agent reads CRM records and sends emails directly through a broad token. If an attacker puts a malicious instruction in a CRM note, the agent might leak pricing data.
In a safer design, the agent drafts recommendations but cannot send them without approval. The email tool acts as a broker. It checks the user's authority and the recipient's domain before sending. Real-world incidents will look like normal workflows with weak boundaries. Practical engineering is essential.
- Tooling Guidance
Tooling should be selected based on the specific job. A tool may help with automated evals, tracing, policy management, or runtime monitoring. No single product is a complete solution for agent security.
When evaluating a new tool, ask:
- What layer of the stack does it protect and what risk does it reduce?
- What sensitive data does the tool itself store?
- Can it run within your required deployment model?
- Does it integrate with your existing systems?
- What are its logging capabilities?
- How does the tool fail?
- What specific claims can the tool support with evidence?
Avoid falling for product endorsement language. Mentioning a tool category is not a claim that any specific vendor's product is sufficient.
- Governance and Trust Caveats
Sponsor support does not influence our research, scoring systems, or findings. We maintain strict independence to ensure the credibility of our analysis.
Hiring signals are directional indicators of market trends, not proof of internal security maturity. Similarly, psychometric outputs provide evidence of role-based language use, not a clinical diagnosis.
Avoid using inflammatory language when discussing company-level risks. Use precise phrases like directional signal, aggregate benchmark, and claim-readiness. These caveats protect the integrity of the research and ensure the site remains a trusted resource.
-
Implementation Controls
-
Assign every agent a unique identity before it touches production data.
-
Maintain separation between user, agent, service, and tool identity.
-
Use delegated authorization for workflows that involve user-specific data.
-
Avoid broad, shared service accounts for autonomous agents.
-
Place policy enforcement in a tool broker or API gateway, not the model.
-
Scope all credentials by task, user, tenant, and tool.
-
Require explicit human approval for irreversible actions.
-
Log every tool call, approval event, and policy decision.
-
Apply provenance and expiration rules to all agent memory stores.
-
Verify that revocation mechanisms work as intended before launch.
-
Common Mistakes
-
Treating the language model as the primary enforcement layer.
-
Using broad, shared credentials that provide agents with excessive reach.
-
Hiding critical approval details from the human-in-the-loop.
-
Logging too little information to support incident response.
-
Logging excessive sensitive content without clear deletion rules.
-
Making public security claims before internal evidence exists.
-
Testing only the "happy paths" during security reviews.
-
Ignoring the risk of indirect prompt injection through retrieved documents.
-
Failing to define clear ownership for the agent's actions.
-
Forgetting to implement and test revocation and containment procedures.
-
Conclusion
The problem of agentic anarchy is a fundamental challenge for deploying trusted AI systems. As agents move to real-world workflows, the IAM model must handle autonomous, non-deterministic actors.
The mature response is engineering discipline: define boundaries, assign ownership, constrain authority, test behavior, monitor performance, and preserve evidence. AI systems become credible not when they are perfect, but when they can be built, observed, and improved through rigorous control.
Implementation Checklist
- Assign every agent a unique identity before production use.
- Separate user, agent, service, and tool identity in the architecture.
- Build delegated authorization for workflows that touch user data.
- Avoid broad, shared service accounts; use task-scoped credentials.
- Place policy enforcement in a dedicated tool broker or application layer.
- Scope all credentials by task, user, tenant, and tool to minimize blast radius.
- Require explicit approval for any irreversible or external actions.
- Log all tool calls, approval events, and policy decisions for audit.
- Apply provenance and expiration rules to all agent memory components.
- Test revocation mechanisms and kill switches before the production launch.
- Map these controls to industry frameworks like NIST AI RMF or OWASP.
- Define the evidence required to support any public security claims.
- Integrate agent security reviews into your standard product checks.
- Test failure cases and adversarial inputs, not just expected use.
- Reassess security controls after material changes to models or data sources.
Source Notes Needed
- NIST AI Risk Management Framework.
- NIST Generative AI Profile.
- OWASP Top 10 for LLM Applications.
- MITRE ATLAS.
- Cloud workload identity documentation.
- OPA or policy-engine documentation.
Framework Alignment
This practice is mapped to the Identity control objective within our AI security operating model.
Read Methodology →