Daniel Zhang
About Experience Skills Projects Blog Contact
All Posts

Tag

LLM

3 posts

June 6, 2026

I Built RAG for a Domain Chatbot, Then Deleted It: When You Don't Need Vector Search

A build log for a customer-facing construction documentation assistant. How I started with client-side RAG, shipped a server-side RAG, and finally deleted all of it in favor of full-context injection plus prefix caching — and the rule of thumb that tells you which one you actually need.

April 18, 2026

The AI Engineering Landscape in Spring 2026: What You Need to Know

A comprehensive knowledge guide covering frontier model releases, agentic AI, MCP vs function calling, the evolution of RAG, edge AI with small language models, and what it all means for AI engineers.

April 18, 2026

Harness Engineering: The Discipline That Makes AI Agents Actually Work

A deep dive into harness engineering — the emerging discipline of designing systems, constraints, and feedback loops that make AI agents reliable in production. Covers core architecture, real-world case studies, and practical implementation.

© 2026 Bosheng (Daniel) Zhang. Built with Astro