Prerequisites
- Docker and Docker Compose
- Python 3.9+
- An agent project (LangChain, LlamaIndex, or raw OpenAI)
Step 1: Start Control Plane Server
http://localhost:3000. Open it in a browser to confirm the dashboard loads.
Step 2: Install the SDK
Step 3: Instrument your agent
Add two lines to your agent code:Step 4: Run your agent
Step 5: View the trace
Openhttp://localhost:3000 → select your project → click the latest run.
You’ll see the full trace tree:
- Agent decision step
- LLM call with token count, cost, and latency
- Tool invocations with inputs and outputs
- Final response
Step 6: Add your first eval
Createlangship.yaml in your project root:
evals/golden-set.jsonl:
pass_threshold, the command exits with code 1, blocking CI.
What’s next
- How It Works: pipeline architecture and core data model
- Deployment: Docker Compose, Kubernetes, and scaling options
- GitHub Actions: run evals on every PR automatically
- CLI Reference: full command reference