Glossary

Guardrails

Guardrails are the constraints that keep an AI system inside acceptable bounds — input/output filters, policy checks, allowed-action limits, and escalation rules — so it cannot take unsafe or non-compliant actions.

For agents, guardrails matter most because the system acts. They define what an agent may do automatically, what requires a human, and what is forbidden outright.

Guardrails work alongside human-in-the-loop checkpoints, evaluation, and an audit trail; together they make an agent safe to deploy in a regulated workflow.

Related terms

Human-in-the-Loop (HITL)
Model Risk Management (MRM)
AI agent

Learn more

Human-in-the-loop AI
Model risk management for agentic AI

Book a demo Back to glossary