Reasonable UX
- production
- advisor pattern
AI-powered UX audit tool. A Playwright agent crawls a site and scores pages with Sonnet and Haiku executors; an Opus advisor gets consulted on ambiguous judgment calls.
Four things in active development. Three more in the lab. If you want to talk about any of them — [email protected].
featured
AI-powered UX audit tool. A Playwright agent crawls a site and scores pages with Sonnet and Haiku executors; an Opus advisor gets consulted on ambiguous judgment calls.
Automated UX quality audit agent built on Playwright + multi-model scoring. Cut running cost 84% by stripping images from conversation history and shipping screenshots at JPEG-40.
Private SvelteKit dashboard on Fly.io, reading a SQLite mirror of an Obsidian vault via Litestream. Modules for projects, daily notes, biometrics, and system health.
Self-paced curriculum on the operational side of LLM systems — telemetry, evals, cost routing, model gateways. Tracker reads phases, tasks, and progress live from the same SQLite mirror as everything else; the long-form writeups land through summer 2026.
also building
Built to understand what retrieval quality actually costs — chunking strategy, embedding model choice, and retrieval window size are the variables that matter. Voyage AI embeddings with H2-aware chunking over Obsidian markdown; ChromaDB for retrieval with SQLite mirror for query logs.
Built because sustained attention to maintenance is exactly what ADHD is worst at — so the checker had to be the thing that pays attention. Self-discovers every automation: launchd jobs, git hooks, OAuth tokens, vault integrity. Reports what to run when something breaks.
Specialist Claude agents for vault queries, hiring analysis, curriculum review, and automation design — each with typed inputs and narrow scope. An eval harness runs periodic trajectory checks to surface drift before it compounds. The architecture is LLMOps patterns made operational.