Blog
Articles on applied AI, product engineering, search, and delivery. Writing about RAG systems, agentic AI, and building products that scale.
A quick run-through of how to deploy a leading OCR model in a serverless environment.
A quick exploration of building a coding agent in around 100 lines of code.
Stop guessing which GGUF to use. A robust mental model for model quantization, helping you balance VRAM, perplexity, and performance when exporting fine-tuned models for local inference.
Practical case study on fine-tuning a small open-weight LLM with LoRA to speak a nice Scottish dialect, from dataset creation and Unsloth training through evaluation to deployment with vLLM/Modal.
A modern Python starter kit with TypeScript-like developer experience I use for my experiments.
Nuanced is a local code analysis tool that gives engineers and LLMs an understanding of how code actually behaves.
Fetch Engines is an open source extraction toolkit that turns web pages into high-quality Markdown and structured JSON data.
A walkthrough of the architecture, content pipeline, styling, SEO, and tooling behind this personal site and blog.
A guide to recreating Claudette's ergonomic features (like tool loops and structured output) in TypeScript using the Vercel AI SDK.
This is where I write about my learning, experiments, and other things.