Benchmarking AI agents across five TypeScript back end frameworks
A recent benchmarking study evaluated an AI coding agent's ability to build TypeScript backends across five frameworks. While all frameworks passed initial tests, only Encore's output met production-readiness criteria. Subsequent runs aimed to improve the performance of the other frameworks by incorporating additional libraries and adjustments.
- ▪The AI agent was tested on Encore, Express, Fastify, Hono, and NestJS frameworks.
- ▪All frameworks passed the initial functional tests, but only Encore's code was deemed production-ready.
- ▪The study included three runs to assess and improve the performance of the non-Encore frameworks.
Opening excerpt (first ~120 words) tap to expand
How well does an AI coding agent build TypeScript backends across popular frameworks?We took the same agent (Claude Code), gave it the same realistic backend tasks, and ran it on five frameworks (Encore, Express, Fastify, Hono, NestJS). Same prompts, same model, same Postgres setup, same VM. We captured every artifact of every run and scored them all on one rubric.After the first run every framework's tests passed. The post we were going to publish was roughly "lighter frameworks finished cheaper, NestJS paid a ceremony tax."Then we read the diffs and on four of the five frameworks the agent had built the laziest solution that would still pass the tests: a Postgres table polled by setInterval for the durable queue, CREATE TABLE IF NOT EXISTS at boot instead of any migration system.
…
Excerpt limited to ~120 words for fair-use compliance. The full article is at Encore — Open Source Backend Framework for robust distributed systems.