Field notes
Production AI agent reliability, written from the engagements.
Eval setups, failure modes, observability patterns, and the things we learn auditing production AI agents. Written for technical founders and AI engineering teams.
Eval setups, failure modes, observability patterns, and the things we learn auditing production AI agents. Written for technical founders and AI engineering teams.