Blog

Notes on LLM observability.

Practical posts on tracing, evals, prompt changes, token cost, and the production habits that keep AI products explainable.

Featured

Highlights from the Currai blog: the posts worth reading first.

Currai

Human-in-the-loop AI agent evaluation: a complete guide

Jul 15, 2026 The Currai team Product

Human-in-the-loop AI agent evaluation: a complete guide

Why AI agent evaluation still needs humans in 2026, where to put them in the loop, and how to combine human review with automated evals on production traces.

Currai

The best LLM evaluation tools in 2026

Jul 15, 2026 The Currai team Research

The best LLM evaluation tools in 2026

A practical field guide to LLM evaluation tools — what each category is good at, where they break down, and how to pick one that survives contact with production traffic.

Currai

Best AI observability tools in 2026

Jul 15, 2026 The Currai team Product

Best AI observability tools in 2026

The best AI observability tools in 2026 compared on evaluation depth, quality-aware alerting, drift detection, cost tracking, and the production-to-eval loop.

Latest posts

Browse implementation notes, observability guides, product decisions, and workflow ideas by topic.

All topics BEST PRACTICES COMPANY DEEP DIVE ENGINEERING GUIDE PRODUCT TUTORIAL RSS feed

Jul 15, 2026 The Currai team Product

GUIDE

Human-in-the-loop AI agent evaluation: a complete guide

Why AI agent evaluation still needs humans in 2026, where to put them in the loop, and how to combine human review with automated evals on production traces.

Jul 15, 2026 The Currai team Research

GUIDE

The best LLM evaluation tools in 2026

A practical field guide to LLM evaluation tools — what each category is good at, where they break down, and how to pick one that survives contact with production traffic.

Jul 15, 2026 The Currai team Product

GUIDE

Best AI observability tools in 2026

The best AI observability tools in 2026 compared on evaluation depth, quality-aware alerting, drift detection, cost tracking, and the production-to-eval loop.

Jul 15, 2026 The Currai team Engineering

GUIDE

AI agent observability: everything you need to know in 2026

A complete 2026 guide to AI agent observability — why agents are hard to observe, the trace data model, threads, cost, failure modes, and how to roll it out.

Jul 14, 2026 The Currai team Product

GUIDE

Zendesk vs Intercom (2026): Which customer support platform is better?

Compare Zendesk vs Intercom on pricing, ticketing, messaging, AI agents, automation, reporting, and the best fit for your support team.