AI Engineering Insights

Hierarchical signal tuning: optimizing components before fusion

Fusion algorithms like linear combination or RRF cannot fix poor input signals. Effective hybrid search requires a bottom-up optimization strategy: tuning field weights within BM25 and embedding strategies within dense components before attempting to merge them.

4 min read · November 26, 2025
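To make the point about fusion concrete, here is a minimal sketch of reciprocal rank fusion (RRF). The document ids, the two input rankings, and the constant `k=60` (a commonly used default) are illustrative assumptions, not values from the article. Note that the function only ever sees ranks: if the BM25 or dense ranking is poor, the fused ranking inherits that weakness.

```python
from collections import defaultdict

def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several best-first ranked lists of document ids with RRF."""
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            # Each list contributes 1 / (k + rank) for every document it contains.
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by doc id for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))

# Hypothetical outputs of an already-tuned BM25 component and dense component.
bm25_ranking = ["d1", "d2", "d3"]
dense_ranking = ["d3", "d1", "d4"]

fused = reciprocal_rank_fusion([bm25_ranking, dense_ranking])
print(fused)
```

Because RRF discards the raw scores, any tuning of field weights or embedding strategy has to happen upstream, before the rankings reach this function.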

Jensen-Shannon divergence for meaningful clustering

Silhouette score validates geometry, not meaning. Using Jensen-Shannon divergence to quantify how much feature distributions differ bridges the gap between mathematical separation and interpretability.

5 min read · November 22, 2025
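As a rough illustration of the idea, the sketch below computes Jensen-Shannon divergence between two discrete distributions using only the standard library. It assumes the comparison is between one feature's histogram inside a cluster and the same feature's histogram over the rest of the data; that setup and the example histograms are hypothetical, not taken from the article.

```python
import math

def kl_divergence(p, q):
    """Kullback-Leibler divergence in bits; terms with p_i == 0 contribute 0."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def js_divergence(p, q):
    """Jensen-Shannon divergence in bits: symmetric and bounded in [0, 1]."""
    # The mixture m is non-zero wherever p or q is non-zero, so KL stays finite.
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)

# Hypothetical normalized histograms of one categorical feature.
cluster_hist = [0.7, 0.2, 0.1]   # feature distribution inside a cluster
rest_hist = [0.2, 0.3, 0.5]      # same feature over all other points

score = js_divergence(cluster_hist, rest_hist)
print(f"JS divergence: {score:.3f}")
```

A score near 0 means the feature looks the same inside and outside the cluster, so it does little to explain the cluster; a score near 1 means the feature is highly distinctive for it.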

Hybrid intent classification: compact-encoder-first routing for production systems

Production chatbots route most requests through fast compact encoder classifiers, escalating to LLMs only on low-confidence queries. This hybrid architecture mitigates the latency and cost overheads of monolithic LLM solutions, achieving significant speed gains while preserving high classification accuracy.

5 min read · November 19, 2025
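The routing pattern described here can be sketched as a confidence threshold in front of a fallback. Everything named below (`route_intent`, the toy classifiers, the intent labels, and the `0.8` threshold) is a hypothetical stand-in for illustration; a real system would plug in its own encoder model and LLM call.

```python
from typing import Callable, Dict, Tuple

def route_intent(
    query: str,
    fast_classifier: Callable[[str], Dict[str, float]],
    llm_classifier: Callable[[str], str],
    threshold: float = 0.8,
) -> Tuple[str, str]:
    """Return (intent, route). Escalate to the LLM only on low confidence."""
    probs = fast_classifier(query)
    intent, confidence = max(probs.items(), key=lambda kv: kv[1])
    if confidence >= threshold:
        return intent, "encoder"
    return llm_classifier(query), "llm"

# Toy stand-ins: a real encoder would return softmax probabilities.
def toy_encoder(query: str) -> Dict[str, float]:
    if "refund" in query.lower():
        return {"refund": 0.95, "billing": 0.03, "other": 0.02}
    return {"refund": 0.40, "billing": 0.35, "other": 0.25}

def toy_llm(query: str) -> str:
    return "other"

print(route_intent("I want a refund", toy_encoder, toy_llm))
print(route_intent("Something odd happened", toy_encoder, toy_llm))
```

The threshold is the main tuning knob: raising it sends more traffic to the slower, costlier LLM path, while lowering it trusts the compact encoder on more borderline queries.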

Few-shot prompt ordering: the impact of example position

An investigation of positional bias in few-shot prompting. "Lost in the Middle" suggests that content at the start and end of a prompt matters most, and beyond that, the specific ordering of examples remains an important factor for performance stability.

4 min read · November 15, 2025

Temporal for LLM pipelines: durable execution starter pack

LLM agents often crash, losing state and expensive API work. Temporal provides durable execution for LLM pipelines: automatic state recovery, configurable retries, and long-running orchestration at the cost of determinism constraints and ops overhead.

16 min read · November 10, 2025

Production AI systems, evaluation, and architecture
