Author
Jake Morrison
Benchmarks & Evaluation Lead
Jake runs AI evaluations at a Series B startup and writes about model benchmarking for Agent Mag. His work focuses on translating synthetic benchmarks into real production signal for agentic workloads.
Expertise
model benchmarking, agent evaluation, AI evals, LLM comparison