Control Plane

Control Plane is an open-source CI/CD and observability platform built specifically for AI agents. It gives teams the deployment pipelines, tracing, evaluation, and dataset management that agent development demands, without the vendor lock-in of hosted platforms. Think of Control Plane as the open-source alternative to LangSmith: self-hostable, GitOps-native, and designed to slot into the engineering workflows your team already uses.

What Control Plane does

Capability	Description
Pipeline CI/CD	Automated build, test, and deploy pipelines for agent projects
Observability	Distributed tracing for every LLM call, tool invocation, and agent step
Evaluation	Run eval suites on every commit; catches regressions before they reach production
Dataset management	Version-control your prompts, test cases, and golden sets alongside your code
GitHub Actions integration	First-class Actions workflows; triggers evals on PRs and blocks merges on failure
Self-hostable	Run entirely on your infrastructure; no data leaves your environment

Why Control Plane

Agent development breaks standard CI/CD. A unit test can pass while the agent produces worse answers. A deploy can succeed while latency doubles. Control Plane adds the agent-specific checks (LLM evaluation, trace comparison, cost budgets) as first-class pipeline steps.

git push → CI pipeline → eval suite → trace diff → deploy

Every PR gets an automated quality gate. Every deploy generates a trace baseline. Every rollback is one command.

Key features

GitOps-native: Your langship.yaml lives in the repo. Pipelines, eval configs, and deployment targets are all version-controlled. Trace-first observability: Every agent run produces an OpenTelemetry-compatible trace with LLM call details (model, tokens, latency, cost), tool calls, memory reads, and final outputs. Eval-in-CI: Define evaluators (LLM-as-judge, exact match, semantic similarity, custom Python) and run them on every push. Block merges when quality drops. Dataset versioning: Store prompt templates, test cases, and expected outputs in Control Plane’s dataset store. Pin evals to specific dataset versions so results are reproducible. Multi-environment deployments: Promote agent versions from dev → staging → production with environment-specific configs and automatic rollback on eval failure.

Architecture overview

Control Plane has three components:

Control Plane Server: Receives traces, stores datasets, runs evals, manages deployments. Self-hosted via Docker or Kubernetes.
Control Plane CLI: langship command for local development, running evals, and triggering deploys.
Control Plane SDK: Lightweight instrumentation library (pip install langship) that adds trace collection to any LangChain, LlamaIndex, or raw OpenAI agent.

Where to go next

Quickstart: running Control Plane Server and tracing your first agent in 10 minutes
How It Works: pipeline architecture and core data model
Deployment: Docker Compose, Kubernetes, and scaling
GitHub Actions: integrating Control Plane evals into your PR workflow

Getting Started

Guides

Control Plane

What Control Plane does

Why Control Plane

Key features

Architecture overview

Where to go next

​What Control Plane does

​Why Control Plane

​Key features

​Architecture overview

​Where to go next

What Control Plane does

Why Control Plane

Key features

Architecture overview

Where to go next