currai
{ pricing: "/pricing", blog: "/blog", docs: "/docs" }
Sign In

LLM observability

LLM observability for every prompt, token, and tool call

Currai captures the full path of an LLM request so engineering teams can debug failures, measure quality, and understand production cost without stitching together logs by hand.

Primary keyword

LLM observability

Currai covers

Traces, generations, spans, evals, prompt A/B tests, token usage, cost, latency, sessions, users, and OpenTelemetry ingestion.

See what the model saw and did

Traditional observability can tell you a request was slow or failed. Currai shows the prompt, completion, tool calls, retriever spans, tokens, cost, latency, user, session, tags, and metadata behind that request.

Each trace becomes a replayable timeline. You can inspect a single model call, follow a nested agent run, or group a multi-turn conversation by session and user.

  • Trace prompts, completions, spans, tools, and retrieval steps.
  • Roll up token usage and cost by trace, model, user, and day.
  • Filter production traces by environment, tag, metadata, user, and session.

Improve quality from production evidence

Currai connects observability to the improvement loop: run evals on real outputs, compare prompt versions, and watch cost and latency move as your app changes.

The goal is not just collecting traces. The goal is knowing why an LLM app behaved the way it did and which change made it better.

Questions about LLM observability

What is LLM observability?

LLM observability is the practice of tracing prompts, completions, tool calls, token usage, cost, latency, and quality signals so teams can debug and improve AI applications in production.

Does Currai require a collector?

No. Use the Python or TypeScript SDK, point Langfuse-compatible clients at Currai, or export OpenTelemetry traces directly to Currai.