Features
Everything obsrv ships, in one page.
The reliability and observability surface for AI runtimes — built for agents, multimodal systems, and computer-use workflows that need to ship to production.
Trace & replay
- Step-tree timelineAuto-parented steps with model, latency, tokens, and cost — collapsible like a stack trace.
- Live tailWatch traces stream in over Server-Sent Events as they ingest.
- Replay viewerReconstruct full sessions including computer-use desktop and browser flows.
- Trace filtersStatus, run type, model, tag, and arbitrary metadata JSON paths — all server-filtered.
Evaluations
- Synthetic metricsLLM-as-judge metrics for adherence, coherence, tool selection, and unsupported request handling.
- Observed signalsCapture refunds, escalations, thumbs-down, and any custom event you record.
- Per-release scoringTag traces by release and prompt version. Compare regressions in one click.
- Annotations & flagsMark runs for review, attach notes, and roll them into evals.
Cluster discovery
- Continuous embeddingEvery trace embedded as it lands. Clusters update with traffic shifts.
- No predefined categoriesPatterns surface from real behavior — not a template guess.
- Auto-labelledCluster names generated from representative behaviors via Claude.
- Drill to tracesJump from cluster to underlying runs and replay them in context.
Multimodal & computer-use
- Native renderingImage, audio, video, sensor, and file artifacts inline in the timeline.
- Browser/desktop replayReconstruct sessions from captured screenshots and action streams.
- Tenant-scoped storageObject keys follow orgs/{org}/projects/{project}/traces/{trace}.
- Browser-safe mediaStream artifacts through signed proxy URLs — never expose raw object keys.
Monitors & alerts
- Threshold monitorsWatch metric pass rates, latency, error rates, and cluster volumes.
- Webhook deliveryPush alerts to Slack, PagerDuty, or any endpoint that speaks JSON.
- State transitionsOpen, acknowledged, snoozed, resolved — with full history.
Developer experience
- Python & Node SDKsSymmetrical APIs, async ingest, fail-soft defaults.
- Provider integrationsOpenAI, Anthropic, LangChain, OpenClaw — drop-in wrappers.
- MCP serverFirst-class agent tooling for IDE workflows.
- OpenTelemetry exportPipe traces into your existing OTel collector.
Operations & governance
- Org & project isolationEvery API key scoped to a project. Storage paths follow tenancy.
- API key hashingargon2id at rest. SHA-256 fingerprint lookup.
- Retention controlsPer-project retention policies for traces and artifacts.
- Usage attributionPer-org and per-project ingest, storage, and query usage.
08 — DEPLOY
Ship the agent.
obsrv will record everything.
Drop the SDK in three lines. The recorder lights up the moment traffic starts flowing.
$pip install theta-obsrv·npm i @theta-lab/obsrv
FDR · OBS-1
REC
tr_01HXR4Z9CK
support_agent · 4.2s
✗ wrong_order
tr_01HXR4Z9CK
support_agent · 4.2s
✗ wrong_order