Practical engineering and thoughtful research on the problems that sit between people and technology.
Research · Open Source · Fractional & Advisory
Current Work
pacabench
Benchmarking LLM agents shouldn't mean wrestling with brittle scripts and lost progress. pacabench is a local-first tool that provides reproducible, reliable benchmarks with isolated execution, persistent state, and built-in metrics tracking—no SDK required.
I work with companies at critical technical inflection points.
If you're facing a hard problem at the intersection of technology and organisation, let's talk.
Archive
context-store
Users expect full message history, but LLMs have hard limits. Most teams rebuild the same Redis + Postgres stack; context-store extracts that pattern into a reliable Elixir service with Raft consensus, horizontal scaling, and deterministic compaction.