What are traces?

A trace represents a single execution of your AI agent from start to finish. It captures the complete workflow of every LLM call, tool invocation, and function execution as a hierarchical tree of operations called spans.
[Image: Cascade trace visualization showing hierarchical execution flow]

Spans: The building blocks

A span is a single unit of work within a trace. Each span represents one operation in your agent’s execution.

  • LLM API call: Interaction with language models
  • Tool execution: Function and tool invocations
  • Reasoning step: Decision and extraction operations

Span types

Function spans

Created by the trace_run() context manager to mark the entry point of your agent execution.
  • Span name: Agent or function name
  • Custom metadata: Task IDs, user IDs, or any contextual data
  • Total execution duration: End-to-end timing
  • Success/error status: Whether the execution completed successfully
Example:
with trace_run("CustomerSupportAgent", metadata={"ticket_id": "TKT-789"}):
    # Everything inside becomes child spans
    ...
The trace_run() context manager creates the root span—all other operations inside become child spans automatically.

LLM spans

Created automatically by wrap_llm_client() to track every interaction with language models. Captured data:
  • Model name: e.g., claude-3-5-sonnet-20241022
  • Provider: e.g., anthropic, openai
  • Prompt text: Complete prompt and system messages
  • Completion: Full response text
  • Token counts: Input, output, and total tokens
  • Estimated cost: Calculated cost in USD
  • Latency: Response time in milliseconds
  • Streaming status: Whether the response was streamed
  • Extracted reasoning: Reasoning steps if present in the completion
LLM spans work with both messages.create() and messages.stream() methods automatically.
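A minimal sketch of how this fits together, assuming the SDK exposes trace_run() and wrap_llm_client() from a cascade module (the import path is illustrative) and that the wrapped client keeps the standard Anthropic interface:
from anthropic import Anthropic
from cascade import trace_run, wrap_llm_client  # illustrative import path

# Wrap the client once; every call made through it is recorded as an LLM span.
client = wrap_llm_client(Anthropic())

with trace_run("SummarizerAgent"):
    # Recorded with model, provider, prompt, completion, token counts, cost, and latency.
    response = client.messages.create(
        model="claude-3-5-sonnet-20241022",
        max_tokens=512,
        messages=[{"role": "user", "content": "Summarize this support ticket."}],
    )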

Tool spans

Created by the @tool decorator to track function and tool executions. Captured data:
  • Tool name: Function identifier
  • Tool description: Extracted from docstring
  • Serialized input parameters: All arguments passed to the function
  • Serialized output: Return value from the function
  • Execution duration: Time taken in milliseconds
  • Error details: Exception information if execution failed
Example:
@tool
def search_database(query: str, limit: int = 10) -> dict:
    """Search the database for matching records."""
    # Input: {"query": "...", "limit": 10}
    results = db.search(query, limit)
    # Output: {"results": [...], "count": 5}
    return results
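Calling the decorated function inside a trace then records the span automatically; a short sketch, with search_database and its db dependency as the illustrative pieces above:
with trace_run("SupportSearchAgent"):
    # Recorded as a tool span under the root span, with the serialized
    # input and output shown in the comments above.
    records = search_database("overdue invoices", limit=5)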

Trace context propagation

Cascade SDK uses OpenTelemetry’s context propagation to maintain parent-child relationships automatically. How it works:
1. Root span creation: When you call trace_run(), a root span is created for your agent execution.
2. Tool span nesting: When a @tool decorated function is called inside a trace, the tool span becomes a child of the root span.
3. LLM span nesting: When the tool makes an LLM call with a wrapped client, the LLM span becomes a child of the tool span.
4. Automatic propagation: Context propagates through both sync and async function calls, with no manual wiring needed.
The SDK handles context propagation automatically. You never need to pass context objects around manually.
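Putting these steps together, the nesting might look like this (same illustrative imports and wrapped client as above):
@tool
def answer_question(question: str) -> str:
    """Answer a question with a single LLM call."""
    # LLM span: child of the answer_question tool span.
    response = client.messages.create(
        model="claude-3-5-sonnet-20241022",
        max_tokens=256,
        messages=[{"role": "user", "content": question}],
    )
    return response.content[0].text

with trace_run("QAAgent"):               # root span
    answer_question("What is a trace?")  # tool span, with the LLM span nested beneath it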

Technical details

Data size limits

  • Text values truncated at 10,000 characters by default
  • Large objects serialized efficiently to JSON
  • Binary data excluded from capture

Async support

Full support for async/await patterns:
  • Async tool decorators propagate context correctly
  • Streaming LLM responses tracked incrementally
  • No blocking of async event loops
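As an illustration, an async tool under the same assumptions; the parent-child relationship holds across the await boundary without any manual context passing:
import asyncio

@tool
async def fetch_profile(user_id: str) -> dict:
    """Fetch a user profile from an async data source."""
    await asyncio.sleep(0.1)  # stand-in for awaited I/O
    return {"user_id": user_id, "plan": "pro"}

async def main():
    with trace_run("ProfileAgent"):
        # The async tool span is still recorded as a child of the root span.
        await fetch_profile("user-123")

asyncio.run(main())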

Error handling

When an operation fails, the span automatically captures detailed error information. For example:
import json

@tool
def risky_operation(data: str):
    result = json.loads(data)  # May raise json.JSONDecodeError
    return result
If this fails, the span will contain:
  • status_code: ERROR
  • tool.error: Exception message
  • Exception event with full stack trace
All exceptions are captured automatically, but capturing does not suppress them: the exception still propagates to your code, so handle errors as you normally would.
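Because the exception still propagates, wrap the call as you normally would; the span records the failure either way:
with trace_run("ImportAgent"):
    try:
        risky_operation("{not valid json")
    except json.JSONDecodeError:
        # The tool span is already marked ERROR with the exception event;
        # the application decides how to recover.
        ...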