Why self-host?
Self-hosting Cascade is designed for organizations with specific operational and security requirements.Data sovereignty
Keep all execution data within your infrastructure and geographic region
Network isolation
Run entirely within your private network or VPC
Custom integrations
Connect to internal monitoring systems and private infrastructure
Cost control
Scale based on usage patterns and leverage existing commitments
Latency optimization
Deploy in the same region as your AI agents
Compliance
Meet regulatory requirements for data handling
Architecture overview
Cascade’s self-hosted deployment consists of three main components.Backend services
The backend handles trace ingestion and storage.| Component | Purpose | Port |
|---|---|---|
| OTLP Receiver | Receives OpenTelemetry traces from SDK clients | gRPC 4317 |
| REST API | Serves trace data to dashboard and query endpoints | HTTP 8000 |
| Database | Stores span data, attributes, and events | - |
Dashboard
The web application provides trace visualization and analysis.- Static frontend: React application served via CDN or web server
- API client: Connects directly to your backend REST API
- Authentication: Integrates with your API key system
SDK integration
Your AI agents continue to use the Cascade SDK with minimal configuration changes.The SDK sends traces directly to your self-hosted backend via OTLP HTTP.
Data residency
When you self-host Cascade, all sensitive data remains in your infrastructure.| Data Type | Storage Location |
|---|---|
| LLM prompts and completions | Your backend database |
| Tool inputs and outputs | Your backend database |
| Reasoning extractions | Your backend database |
| Trace metadata and attributes | Your backend database |
| Span events and exceptions | Your backend database |
| API keys (hashed) | Your backend configuration |
| Project configurations | Your backend configuration |
Deployment options
Cascade provides flexible deployment options to match your infrastructure.- AWS
- GCP
- Azure
- Other
Deploy Cascade on Amazon Web Services using managed services.Recommended architecture:
- Compute: ECS Fargate or EKS for containerized backend services
- Database: RDS PostgreSQL for trace storage
- Load balancing: Application Load Balancer for API traffic
- Storage: S3 for dashboard static assets
- Networking: Deploy in private VPC with security groups
- Native integration with AWS CloudWatch
- Easy scaling with Auto Scaling Groups
- Compliance with AWS security standards
- Regional deployment for data residency
Getting started with self-hosting
1
Contact the Cascade team
Reach out to the team to discuss your self-hosting requirements and receive deployment packages.
2
Review architecture requirements
Evaluate your infrastructure needs, database requirements, and networking setup.
3
Deploy backend services
Set up the OTLP receiver, REST API, and database in your environment.
4
Deploy dashboard
Configure and deploy the dashboard web application.
5
Configure SDK endpoint
Update your AI agents to point to your self-hosted Cascade endpoint.
6
Verify connectivity
Test trace ingestion and dashboard functionality.
Self-hosting requires a separate license. Contact hello@runcascade.com for pricing and deployment support.