Get Orchid
Self-Hosted Proxy • 100% Free

Record, Inspect, Replay AI Agents

The agent in your app stack went off the rails again. Stop digging through logs. Orchid records the full run. Inspect and replay it locally, step by step.

Spin up the local demo

API_KEY="orchid_live_8675309" docker run --pull=always -d --name orchid-proxy-demo \ -p 4320:4320 -p 4321:4321 \ -e ORCHID_API_KEY=$API_KEY \ -e ORCHID_DEMO=1 \ ghcr.io/mario-guerra/orchid-proxy

Once running, open http://localhost:4321 in your browser and enter the API key orchid_live_8675309 to explore the pre-seeded demo trace.

localhost:4320/session/spacex_investment
provider:vertex status:2xx
#1 IntentSchema200
gemini-2.5-flash1.30s
#2 o3-mini200
o3-mini4.90s
#3 serpapi.com200
Google Search4.60s
#4 UncertaintyDecision200
gemini-2.5-flash2.70s
#5 serpapi.com200
Google Search17.90s
#6 UncertaintyDecision200
gemini-2.5-flash1.60s
Provider
serpapi
Status
200 OK
Latency
4.60s
Tokens
-- / --
{
"search_parameters": {
"engine": "google",
"q": "SpaceX IPO detailed investment analysis and risk factors expert opinions",
"location": "Austin, Texas",
"google_domain": "google.com"
},
"organic_results": [
{
"position": 1,
"title": "SpaceX Share Valuation & Investment Risks - CNBC",
"link": "https://www.cnbc.com/spacex-valuation-risks"
}
]
}
STREAMING ACTIVE

Meet Orchid, the Orchestration interactive debugger

Stop digging through logs when agents go off the rails. Orchid records every API call for step-by-step local inspection & replay.

Records Any API

OpenAI, Anthropic, and Gemini recognized natively. Any other HTTP API, search, vector stores, internal services, captured with one header.

MCP Native

AI assistants query your traces directly. Search payloads, triage failures, export fixtures — all through Model Context Protocol.

Deterministic Replay

Blocks outbound network traffic and serves recorded session fixtures during local development and testing. 100% offline, deterministic, and free.

Real-Time Cost Tracking

Every exchange gets USD cost attribution from a pricing engine you control. Roll-ups per session and execution step.

Web Visualizer

Interactive pipeline graph UI. Time-travel through events, inspect payloads, view latency profiles — all in the browser.

Local-First by Design

Single binary, local SQLite, no cloud backend, no phone-home. Secret-like headers and fields are redacted before anything touches disk.

Debugging AI shouldn't feel like archaeology.

When your LLM is buried deep inside a backend service, finding the root cause of an unexpected output means searching through logs and guessing what was sent over the wire.

Stop sifting through terminal dust.

Orchid captures the exact artifact. Route your traffic through the proxy, and instantly inspect every raw prompt, schema, and response in high-fidelity, without altering a single line of your core business logic.

The best part - everything is stored locally, no data ever leaves your infrastructure.

How it works

1

Proxy Traffic

Import the Orchid SDK. Point your app to the proxy and run in capture mode mode. Standard libraries route natively — zero code rewrites.

2

Store & Inspect

Every LLM request/response is captured into a local thread-safe SQLite database. Inspection runs via the Web Visualizer or MCP queries.

3

Replay & Test

Export portable JSON fixtures. Run offline test suites with replay mode — zero API cost, fully deterministic.

Your App
Capture Mode
Orchid ProxyPort 4320
LLM Providers
SQLite
Request
Response
Stored to disk
Get Orchid

Deterministic Testing

Kill flaky tests with replay mode

In replay mode, the proxy blocks all outbound traffic. Requests are matched against recorded responses by semantic hash and served instantly. No API calls, no costs, fully deterministic.

Without Orchid

  • Tests hit live APIs — slow, expensive, rate-limited
  • Non-deterministic outputs cause flaky assertions
  • Manual mock maintenance per model update
  • CI costs scale linearly with test count

With Orchid

  • Saved fixtures serve responses locally instantly
  • Identical response every run, fully deterministic
  • Re-record fixtures after making agent changes
  • CI LLM spend: $0.00
Your App
Replay Mode
Orchid ProxyX-Orchid-Mode: replay
LLM ProvidersOFFLINE
SQLiteRecorded fixtures
$0.00
API Cost
100%
Deterministic
Request (to proxy)
Response (from storage)
Live APIs blocked

Runs Local. Stays Local.

Orchid runs entirely inside your infrastructure. No telemetry, no external cloud.

The Architecture

Local-First

  • 100% Offline: Run via Docker on your local machine or private cloud.
  • Zero Data Leaks: SQLite session recordings never leave your network.
  • Open SDKs: Intercept and replay at the transport layer with zero vendor lock-in.
Get Orchid

The Builder

Mario Guerra

  • 18 years shipping developer tools and AI platforms.
  • Drove TypeSpec adoption across 30% of Azure at Microsoft.
  • Onboarded 150+ customers to AI hardware at Qualcomm.
Learn More

Ready to debug smarter?

Start recording and replaying your AI pipeline traffic locally in minutes.