How Safety Works in Cascade
Safety in Cascade is policy-driven and signal-based. As an agent executes, Cascade’s DeepStream system captures detailed traces of its behavior. These traces are evaluated in real time against user-defined safety policies. The evaluation produces Safety Signals that indicate policy compliance, behavioral drift, and semantic classification of agent actions. These signals enable teams to monitor agent behavior, identify violations, and take appropriate action based on the configured enforcement mode.Safety Policies
Cascade supports three policy types, each designed to address a different class of agent behavior.Tool Policies
Control which tools an agent can use. Evaluated at invocation to prevent access to destructive or sensitive operations.
Categorization Policies
Classify agent outputs into semantic categories and block out-of-policy content like harmful or unsafe responses.
Semantic Policies
Define behavioral constraints using natural language. Enforce complex, context-dependent rules through semantic reasoning.
Safety Signals
Safety Signals are the runtime indicators Cascade uses to evaluate agent behavior. There are three primary signal types:- Policy Violation Signals: Indicate when agent behavior does not comply with active safety policies. These signals are generated in real time and drive enforcement actions.
- Drift Signals: Surface statistically significant changes in agent behavior across executions. These signals help teams understand how changes in prompts, tools, or models affect behavior over time.
- Classification Signals: Label agent behavior into semantic categories. These classifications provide structured inputs that categorization policies act upon.
Runtime Evaluation
Safety evaluation happens continuously while an agent runs. As traces are generated, Cascade:- Extracts relevant behavioral signals from tool calls, reasoning traces, and outputs
- Evaluates all active policies against the current execution context
- Classifies agent behavior into semantic categories
- Detects drift against historical baselines
- Produces safety findings tied to the trace
Who Safety Is For
The Safety system is designed for:- Developers building agent workflows
- Platform teams managing agent behavior at scale
- Product teams defining acceptable outcomes
- Compliance and security teams monitoring behavioral risk