AI Dev Tools Weekly

Issue #9

Guardrails - Pydantic-first schema enforcement and safety checks: coerce LLM outputs into typed objects with validators, retries, and fallbacks

TruLens - LLM evaluation library providing reference-free and ground-truth metrics for RAG/agents

RAGxplorer - Tiny library and Streamlit UI to visualize chunking, retrieval, reranking, and answers (helping explain RAG behavior)

Cloudflare AI Search - Edge-hosted RAG search that indexes PDFs/HTML/Office and exposes a simple query API with hybrid + vector retrieval

GuideLLM - CLI to generate realistic, configurable benchmark workloads against inference endpoints to test latency/quality

MCP Inspector - Interactive UI to connect to any MCP server and test resources, prompts, and tools with OAuth and transport diagnostics

Langfuse Experiment Runner SDK - Programmatically run and compare prompt versions with schema-enforced structured outputs and scoring of workflows