-
On Evals and Agents
A common topic at Reddy lately has been how far can we go with LLM automation. Inference is cheap, so our primary target is getting reliable output without manual review. I want to address what makes this hard and my model for reasoning about it. Note: Most of these processes don’t need latency or cost…
-
A Technical Founder at a Tech Startup
-
Ruining January 1
-
Clean Web Apps Without an Opinionated JavaScript Framework
Note that this was originally posted for a tweet while I was still relatively early in our development stage. I’ve single-handedly shipped an app to production and this has worked great, but time will tell if it holds up as our development team and application size grows. I’ll be coming back to this article to…
-
This is why you can’t ship products quickly.
-
LLM-Based Education Will Change the World
Now… If you’re here from Twitter, you probably asking: What’s this graph about? We’ll get there, hang with me for some context. Disclaimer First, let’s be clear. This isn’t my idea, and this isn’t about me. I built something cool last night, but other’s have done exceedingly more in this space than I can even…
