AI Engineering Insights

RAG fails upstream

Why most RAG failures originate in the data preparation layer, and what to do about it.

7 min read · January 11, 2026
Embeddings for intent classification: architecture trade-offs

Practical guide to building intent classifiers with embeddings. When shallow classifiers beat fine-tuning, how to handle confidence thresholds, and what actually matters in production.

8 min read · January 04, 2026
Similarity metrics for embeddings

Why almost always cosine and what actually works?

8 min read · December 29, 2025
Tokenizers: production economics cheat-sheet

Compact reference for tokenizer selection, metrics, and failure modes in production LLM systems.

8 min read · December 25, 2025
The metric gap: bridging business outcomes and AI component optimization

Why high component scores often mask system failures. A methodology for using E2E evaluation to prioritize engineering work.

7 min read · December 19, 2025

Production AI systems, evaluation, and architecture