WeSearch

Long-Context Reasoning Through Proxy-Based Chain-of-Thought Tuning

·2 min read · 0 reactions · 0 comments · 15 views
#artificial intelligence#machine learning#computation
Long-Context Reasoning Through Proxy-Based Chain-of-Thought Tuning
⚡ TL;DR · AI summary

The paper presents a novel training framework called ProxyCoT aimed at improving long-context reasoning in large language models. It highlights the performance disparity between proxy contexts and full contexts in reasoning tasks. The proposed method demonstrates improved efficiency and generalization in reasoning capabilities across various datasets.

Key facts
Original article
arXiv cs.AI
Read full at arXiv cs.AI →
Opening excerpt (first ~120 words) tap to expand

Computer Science > Computation and Language arXiv:2605.20201 (cs) [Submitted on 6 Apr 2026] Title:Long-Context Reasoning Through Proxy-Based Chain-of-Thought Tuning Authors:Miao Li, Irina Saparina, Alexander Gurung, Mirella Lapata View a PDF of the paper titled Long-Context Reasoning Through Proxy-Based Chain-of-Thought Tuning, by Miao Li and 3 other authors View PDF HTML (experimental) Abstract:Recent large language models support inputs of up to 10 million tokens, yet they perform poorly on long-context tasks that require complex reasoning. Such tasks can be solved using only a subset of the input -- a proxy context -- rather than the full sequence. Despite sharing the same underlying reasoning process, models exhibit a significant performance disparity between proxy and full contexts.

Excerpt limited to ~120 words for fair-use compliance. The full article is at arXiv cs.AI.

Anonymous · no account needed
Share 𝕏 Facebook Reddit LinkedIn Threads WhatsApp Bluesky Mastodon Email

Discussion

0 comments

More from arXiv cs.AI