
First-Passage Prediction of Grokking Delay: A Calibrated Law under AdamW with Causal Validation

#machine learning #optimization #artificial intelligence
⚡ TL;DR · AI summary

The article gives a quantitative prediction of grokking delay under the AdamW optimizer. It introduces a closed-form law that predicts the delay from several parameters and validates the law empirically. The findings suggest that the law holds for certain architectures, while its applicability to natural-language models remains uncertain.

Original article
arXiv cs.AI
Opening excerpt (first ~120 words)

arXiv:2605.18845 (cs), Computer Science > Machine Learning
Submitted on 13 May 2026
Title: First-Passage Prediction of Grokking Delay: A Calibrated Law under AdamW with Causal Validation
Authors: Truong Xuan Khanh, Truong Quynh Hoa, Luu Duc Trung, Phan Thanh Duc
Abstract: We give the first quantitative prediction of grokking delay under AdamW.

Excerpt limited to ~120 words for fair-use compliance. The full article is at arXiv cs.AI.
