Executive Summary
The reviewed AI product is commercially promising, but its control evidence is behind its product ambition. The largest gaps are retrieval authorization, tool permission clarity, sensitive AI trace handling, and buyer-ready model provider language.
Recommended launch decision
Proceed with a controlled customer pilot, but do not expand enterprise rollout until retrieval authorization tests, the agent permission matrix, and AI trace retention controls are complete.
Assessment snapshot
Commercial meaning
AI Trust Boundary Map
The trust boundary review identified the AI gateway as the main control point and the tool layer as the highest-risk authority boundary.
Top validated findings
Retrieval can expose restricted customer context
Semantic retrieval can return content the user could not access directly through the source system. This is the most important security and buyer-trust issue in scope.
Sensitive actions can be queued without enough approval context
The agent does not directly execute the highest-risk action, but the approval screen does not always show the evidence, target, blast radius, and rollback path needed for meaningful human review.
AI traces may contain sensitive customer data
Prompts, retrieved snippets, model outputs, and tool-call records may enter logs that do not yet have AI-specific retention, redaction, and access controls.
Model provider boundary is not buyer-ready
The underlying provider position may be acceptable, but the current buyer-facing answer is too scattered and imprecise for serious enterprise security review.
Agent Tool Permission Matrix
Agent authority must be separated into read, suggest, draft, queue, approve, and execute. Broad tool access is not a control model.
| Agent | Tool | Action | Scope | Approval | Risk | Owner |
|---|---|---|---|---|---|---|
| Support Copilot | Case Management API | read | tenant-scoped support cases visible to the authenticated user | no | medium | Support Platform |
| Support Copilot | Customer Messaging | draft | draft response text for the active case only | yes, before send | high | Product Operations |
| Support Copilot | Customer Messaging | execute | send customer-visible response | yes, human-only approval | critical | Product Operations |
| Support Copilot | Case Management API | queue | priority, category, routing tags, summary fields | yes for priority and routing changes | high | Support Platform |
| Support Copilot | CRM | read | account profile and entitlement fields needed for support context | no | medium | Revenue Operations |
| Support Copilot | CRM | execute | update account fields | yes, restricted to human operators | critical | Revenue Operations |
| Support Copilot | Billing System | read | plan, invoice status, entitlement flags | no for entitlement lookups | high | Finance Systems |
| Support Copilot | Billing System | execute | issue credits, refunds, plan changes | human-only approval and finance policy gate | critical | Finance Systems |
| Support Copilot | Notification Service | queue | internal team notification for escalation only | no for internal escalation templates | medium | Product Operations |
| Support Copilot | External Webhook | execute | third-party workflow triggers | yes, security-reviewed allowlist only | critical | Integration Platform |
Avoid approval theater
AI Risk Register
The risk register converts the assessment into owned remediation work.
| Risk | Domain | Severity | Decision | Owner | Status |
|---|---|---|---|---|---|
Retrieval can expose content the user cannot access directly The retrieval layer uses tenant and source filters, but the evidence does not yet prove authorization survives indexing, chunking, semantic retrieval, reranking, and prompt assembly. | RAG and data access | critical | mitigate | Search Platform | open |
Agent tool authority can exceed the intended user action Tool access is not yet consistently separated into read, suggest, draft, queue, approve, and execute action classes. | Agentic workflow controls | critical | mitigate | AI Platform Engineering | open |
Human approval lacks enough context to be meaningful Approval screens do not always show evidence, target object, before/after diff, model rationale, blast radius, and rollback path. | Oversight | high | mitigate | Product Operations | open |
AI traces may store sensitive customer and operational data Prompts, retrieved snippets, model outputs, tool calls, and approval records may contain sensitive information but do not yet have AI-specific classification, retention, and access rules. | Logging and evidence | high | mitigate | Security Engineering | open |
Model provider boundary is not expressed clearly enough for buyers The provider contract may be acceptable, but the current buyer-facing language is too scattered to answer procurement questions quickly. | Third-party risk | high | mitigate | Vendor Management | open |
Prompt injection and retrieval abuse tests are not release gates AI abuse tests exist as a draft plan but are not enforced as release gates for prompt, retrieval, and tool changes. | Security testing | high | mitigate | Product Security | open |
AI incident response is not yet operationalized The incident response process does not yet define AI-specific triggers, evidence preservation, user notification triggers, or trace reconstruction steps. | Operations | medium | mitigate | Security Operations | planned |
Sales answers may drift from engineering reality AI security questionnaire answers are not yet controlled through a single evidence pack, creating risk of inconsistent customer-facing claims. | Enterprise review | medium | mitigate | Trust and Security | open |
Enterprise AI Security Evidence Pack
The evidence pack should become the reusable buyer-facing artifact for procurement, security questionnaires, and trust review.
First remediation wave
Next commercial step
Turn this assessment into a two-week remediation sprint focused on buyer blockers: retrieval evidence, agent authority, logging controls, and enterprise questionnaire answers.