Self-Hosted Proxy • 100% Free

Record, Inspect, Replay AI Agents

Name: Orchid
Author: Mario Guerra

The agent in your app stack went off the rails again. Stop digging through logs. Orchid records the full run. Inspect and replay it locally, step by step.

Get Orchid See How It Works

Spin up the local demo

API_KEY="orchid_live_8675309"
docker run --pull=always -d --name orchid-proxy-demo \
  -p 4320:4320 -p 4321:4321 \
  -e ORCHID_API_KEY=$API_KEY \
  -e ORCHID_DEMO=1 \
  ghcr.io/mario-guerra/orchid-proxy

Once running, open http://localhost:4321 in your browser and enter the API key orchid_live_8675309 to explore the pre-seeded demo trace.

localhost:4320/session/spacex_investment

provider:vertex status:2xx

#1 IntentSchema200

gemini-2.5-flash1.30s

#2 o3-mini200

o3-mini4.90s

#3 serpapi.com200

Google Search4.60s

#4 UncertaintyDecision200

gemini-2.5-flash2.70s

#5 serpapi.com200

Google Search17.90s

#6 UncertaintyDecision200

gemini-2.5-flash1.60s

Provider

serpapi

Status

200 OK

Latency

4.60s

Tokens

-- / --

{

"search_parameters": {

"engine": "google",

"q": "SpaceX IPO detailed investment analysis and risk factors expert opinions",

"location": "Austin, Texas",

"google_domain": "google.com"

"organic_results": [

{

"position": 1,

"title": "SpaceX Share Valuation & Investment Risks - CNBC",

"link": "https://www.cnbc.com/spacex-valuation-risks"

}

]

}

STREAMING ACTIVE

Meet Orchid, the Orchestration interactive debugger

Stop digging through logs when agents go off the rails. Orchid records every API call for step-by-step local inspection & replay.

Records Any API

OpenAI, Anthropic, and Gemini recognized natively. Any other HTTP API, search, vector stores, internal services, captured with one header.

MCP Native

AI assistants query your traces directly. Search payloads, triage failures, export fixtures — all through Model Context Protocol.

Deterministic Replay

Blocks outbound network traffic and serves recorded session fixtures during local development and testing. 100% offline, deterministic, and free.

Real-Time Cost Tracking

Every exchange gets USD cost attribution from a pricing engine you control. Roll-ups per session and execution step.

Web Visualizer

Interactive pipeline graph UI. Time-travel through events, inspect payloads, view latency profiles — all in the browser.

Local-First by Design

Single binary, local SQLite, no cloud backend, no phone-home. Secret-like headers and fields are redacted before anything touches disk.

Debugging AI shouldn't feel like archaeology.

When your LLM is buried deep inside a backend service, finding the root cause of an unexpected output means searching through logs and guessing what was sent over the wire.

Stop sifting through terminal dust.

Orchid captures the exact artifact. Route your traffic through the proxy, and instantly inspect every raw prompt, schema, and response in high-fidelity, without altering a single line of your core business logic.

The best part - everything is stored locally, no data ever leaves your infrastructure.

How it works

Proxy Traffic

Import the Orchid SDK. Point your app to the proxy and run in capture mode mode. Standard libraries route natively — zero code rewrites.

Store & Inspect

Every LLM request/response is captured into a local thread-safe SQLite database. Inspection runs via the Web Visualizer or MCP queries.

Replay & Test

Export portable JSON fixtures. Run offline test suites with replay mode — zero API cost, fully deterministic.

Your App

Capture Mode

Orchid ProxyPort 4320

LLM Providers

SQLite

Request

Response

Stored to disk

Get Orchid

Deterministic Testing

Kill flaky tests with replay mode

In replay mode, the proxy blocks all outbound traffic. Requests are matched against recorded responses by semantic hash and served instantly. No API calls, no costs, fully deterministic.

Without Orchid

→Tests hit live APIs — slow, expensive, rate-limited
→Non-deterministic outputs cause flaky assertions
→Manual mock maintenance per model update
→CI costs scale linearly with test count

With Orchid

→Saved fixtures serve responses locally instantly
→Identical response every run, fully deterministic
→Re-record fixtures after making agent changes
→CI LLM spend: $0.00

Your App

Replay Mode

Orchid ProxyX-Orchid-Mode: replay

LLM ProvidersOFFLINE

SQLiteRecorded fixtures

$0.00

API Cost

100%

Deterministic

Request (to proxy)

Response (from storage)

Live APIs blocked

Runs Local. Stays Local.

Orchid runs entirely inside your infrastructure. No telemetry, no external cloud.

The Architecture

Local-First

100% Offline: Run via Docker on your local machine or private cloud.
Zero Data Leaks: SQLite session recordings never leave your network.
Open SDKs: Intercept and replay at the transport layer with zero vendor lock-in.

Get Orchid

The Builder

Mario Guerra

18 years shipping developer tools and AI platforms.
Drove TypeSpec adoption across 30% of Azure at Microsoft.
Onboarded 150+ customers to AI hardware at Qualcomm.

Learn More

Ready to debug smarter?

Start recording and replaying your AI pipeline traffic locally in minutes.

Get Orchid