← Glossary

Glossary

Guardrails

Guardrails are the constraints that keep an AI system inside acceptable bounds — input/output filters, policy checks, allowed-action limits, and escalation rules — so it cannot take unsafe or non-compliant actions.


For agents, guardrails matter most because the system acts. They define what an agent may do automatically, what requires a human, and what is forbidden outright.

Guardrails work alongside human-in-the-loop checkpoints, evaluation, and an audit trail; together they make an agent safe to deploy in a regulated workflow.

Related terms

Learn more