Webwright: A Terminal Is All You Need for Web Agents
Webwright replaces the usual click-by-click browser session with a terminal: the agent spawns browser sessions itself and completes web tasks by writing code. According to the authors, this minimal harness is surprisingly effective. The scripts it produces are reusable and can be packaged and shared across platforms.
- Webwright uses a minimal terminal setup to allow agents to spawn browser sessions and write code for web tasks.
- This approach enables agents to chain multiple web interactions within a single step, improving efficiency.
- The resulting scripts can be packaged as reusable CLI tools and shared across various platforms.
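To make the last two points concrete, here is a minimal sketch of what a small reusable CLI script for a web task might look like. It is not Webwright's code: the tool name `page_links`, the functions `extract_links` and `run`, and the `--contains` flag are all hypothetical, and it uses only the Python standard library in place of a real browser session. It chains three steps (fetch, parse, filter) in a single script rather than one action at a time.

```python
#!/usr/bin/env python3
"""Hypothetical 'page_links' CLI: fetch a page, parse it, and print its links."""
import argparse
import sys
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import urlopen


class LinkCollector(HTMLParser):
    """Collects the href of every <a> tag, resolved against a base URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(urljoin(self.base_url, href))


def extract_links(html, base_url):
    """Return absolute URLs for every link found in the HTML string."""
    parser = LinkCollector(base_url)
    parser.feed(html)
    return parser.links


def run(url, contains=""):
    """Chain three steps in one call: fetch the page, parse it, filter links."""
    with urlopen(url, timeout=10) as response:
        html = response.read().decode("utf-8", errors="replace")
    return [link for link in extract_links(html, url) if contains in link]


def main(argv=None):
    parser = argparse.ArgumentParser(description="List links on a web page.")
    parser.add_argument("url")
    parser.add_argument("--contains", default="",
                        help="keep only links containing this text")
    args = parser.parse_args(argv)
    for link in run(args.url, args.contains):
        print(link)
    return 0


# Only act as a CLI when arguments are given, so the module is safe to import.
if __name__ == "__main__" and len(sys.argv) > 1:
    sys.exit(main())
```

Saved as `page_links.py`, the script could be invoked as `python page_links.py https://example.com --contains docs`. Because the parsing logic lives in a plain function, the same file can be imported and reused by other scripts.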
Opening excerpt (first ~120 words)
AI Frontiers. Webwright: A Terminal Is All You Need For Web Agents. Published May 4, 2026. By Yadong Lu¹, Lingrui Xu², Chao Huang², Ahmed Awadallah¹ (¹Microsoft Research, ²The University of Hong Kong).
Instead of solving web tasks by predicting where to click one at a time, we only give the model a terminal where it has the full freedom to spawn browser sessions, and to explore websites through writing code. The final result was a reusable program to complete any web tasks. We found this minimal harness to be surprisingly effective in solving web tasks.
…
Excerpt limited to ~120 words for fair-use compliance. The full article is at Microsoft Research.