Cutting LLM costs 60% without dumbing your product down
Caching, routing, and right-sizing models. A practical playbook for keeping an AI feature cheap once real users show up.
Solo-founder SaaS lessons, AI & LLM engineering, and full-stack deep dives: the things I wish I'd read sooner.
The architecture calls I'd make again, the ones I'd undo, and why "boring" infrastructure was the best decision I made.
Middleware, wildcard DNS, and the data model that keeps tenant isolation honest. A field guide from production.
When data can't leave the building: a local retrieval pipeline that's good enough to ship, and where the cloud still wins.
Proration, failed payments, plan migrations, and the webhooks you actually need. The billing edge cases that bite in month three.
How I keep types flowing from the database to the button label and catch the breakage at compile time, not in support tickets.