WeSearch

Zhengkid/AutoTTS: Agentic Discovery for Test-Time Scaling

·16 min read · 0 reactions · 0 comments · 15 views
#ai research#language models#test-time scaling#autotuning#machine learning
Zhengkid/AutoTTS: Agentic Discovery for Test-Time Scaling
⚡ TL;DR · AI summary

AutoTTS introduces an agentic approach to test-time scaling in large language models by automating the discovery of inference controllers through a replay-based environment. The method eliminates the need for hand-crafted heuristics and gradient updates, relying instead on a coding agent that iteratively refines code-defined controllers. Evaluated on AIME and HMMT benchmarks, the discovered Confidence Momentum Controller achieves competitive accuracy with significant token savings.

Key facts
Original article
GitHub
Read full at GitHub →
Opening excerpt (first ~120 words) tap to expand

AutoTTS LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling Tong Zheng, Haolin Liu, Chengsong Huang, Huiwen Bao, Sheng Zhang, Rui Liu, Runpeng Dai, Ruibo Chen, Chenxi Liu, Tianyi Xiong, Xidong Wu, Hongming Zhang, Heng Huang UMD · UVA · WUSTL · UNC · Google · Meta Project page AutoTTS reframes TTS strategy design from hand-crafting heuristics to environment-driven automatic search: humans only construct an offline replay environment (states, actions, feedback, objectives), and a coding agent iteratively proposes and refines code-defined controllers within it — code edits, no gradient updates. Cheap: 0 LLM calls, fully replay.

Excerpt limited to ~120 words for fair-use compliance. The full article is at GitHub.

Anonymous · no account needed
Share 𝕏 Facebook Reddit LinkedIn Threads WhatsApp Bluesky Mastodon Email

Discussion

0 comments

More from GitHub