SR 11-7 Model Risk Management for AI Agents
Banks and federally regulated financial institutions should engage Model Risk Management when an AI agent produces or materially influences regulated decisions. SR 11-7 and OCC 2011-12 set the standard: development discipline, independent validation, ongoing monitoring, and effective challenge. Each pillar needs evidence the validators can sample.
Last updated: May 20, 2026
What is SR 11-7?
Federal Reserve SR Letter 11-7, jointly issued with OCC Bulletin 2011-12 on April 4, 2011, established Supervisory Guidance on Model Risk Management for federally regulated banks and bank holding companies. The FDIC adopted similar guidance in 2017 (FIL-22-2017). The framework defines a model as a quantitative method that processes input data into quantitative estimates, and requires three pillars: development and use, validation, and governance. The 2021 interagency RFI on AI risk management did not replace SR 11-7, but it kept AI and machine-learning systems inside the supervisory risk conversation.
Why it applies to AI agents
SR 11-7 was written for credit risk models, market risk models, and operational models. AI agents may be a model or a model-adjacent control problem with extra surface area: they consume data, use an LLM, and take action. Each step is a separate risk vector. The three pillars matter on the governed tool path, where errors can translate directly into financial, legal, or operational harm.
MRM teams at banks already maintain model inventories, validation cadences, and effective challenge procedures. Adding AI agent systems to that framework requires evidence the team can sample: what does the agent do, who approved it, how often does it fail, what changed last month. Runtime authorization is where the answer to each of those questions becomes documented fact rather than a self-report.
Control mapping: SR 11-7 pillars to Veto features
The table maps the three SR 11-7 pillars and their concrete sub-areas to runtime authorization features that produce sampling-ready evidence.
| Area | Requirement | Veto feature |
|---|---|---|
| Development | Documented purpose, design, theory, assumptions, limitations | Policy YAML with comments describing agent scope, intent, and limitations |
| Development | Testing prior to deployment with appropriate rigor | Policy tests with offline cases; validation gates can assert expected outcomes |
| Implementation | Controls around deployment, change management, and operational use | Environment-scoped policies (dev, staging, and prod); CODEOWNERS-required reviews |
| Use | Appropriate use with awareness of limitations | Per-tool allowed-argument constraints; out-of-scope queries denied at runtime |
| Validation | Conceptual soundness: review of theory, design, data, assumptions | Policy diffs in version control reviewable by independent validators |
| Validation | Ongoing monitoring: performance compared to expectations over time | Workspace with decision outcome trend lines; alert thresholds on policy violation rates |
| Validation | Outcomes analysis: actual vs. expected results | Reviewer rejection rates and reason taxonomy exported for outcomes analysis |
| Validation | Independence of validation function from development | Approval workflow assigns independent reviewers; reviewer ID logged per decision |
| Governance | Model inventory: complete, accurate, current | Policy file directory acts as machine-readable agent inventory; policy checks can flag drift |
| Governance | Roles and responsibilities for model owners, users, validators | CODEOWNERS on policy files; per-org reviewer assignments in approval workflows |
| Governance | Effective challenge with authority and competence | Reviewer can deny any agent action with documented justification; decision record preserves the challenge |
| Governance | Internal audit review of model risk management framework | Exportable evidence packages: policy history, decision records, approval records, incident timeline |
Evidence Veto provides
Model Risk Management functions speak in terms of validation reports, ongoing monitoring reports, and the annual model risk assessment. Veto outputs feed each of those reports.
Model inventory
Policy YAML files are the agent inventory: each agent has a documented scope, allowed tools, and intended workflow. Reviewable in policy review workflows.
Ongoing monitoring
Monthly decision record exports with decision outcome rates, approval latency, policy violation counts, and reviewer rejection trends feed into MRM ongoing monitoring reports.
Effective challenge records
Reviewer denials, comments, and structured rejection reasons demonstrate effective challenge in operation. Each preserved with reviewer ID and timestamp.
Change records
Git history of policy changes documents model adjustments. Each can be tied to a reviewer-approved pull request with validation evidence where configured.
Implementation timeline
Enforcement is through bank supervision. Weak model-risk controls can lead to supervisory findings, remediation programs, and constraints on model or agent use until issues are resolved.
Frequently asked questions
Are AI agents considered models under SR 11-7?
What are the three pillars of SR 11-7?
What is effective challenge and how does it apply to AI agents?
Which Veto outputs feed a model risk management framework?
Related evidence resources
MRM does not approve self-reports. Give validators sampling-ready evidence.