WeSearch

Do Language Models Know What Not to Say? Causal Evidence for Statistical Preemption in LLMs

·3 min read · 0 reactions · 0 comments · 11 views
#language models#statistical preemption#artificial intelligence#machine learning
Do Language Models Know What Not to Say? Causal Evidence for Statistical Preemption in LLMs
⚡ TL;DR · AI summary

A recent study explores how language models learn what not to say through statistical preemption. The research demonstrates that these models can acquire negative linguistic knowledge by competing with alternative forms. Findings indicate that model size influences preemption sensitivity and that manipulating competing-form frequencies can alter preemption behavior.

Key facts
Original article
arXiv cs.AI
Read full at arXiv cs.AI →
Opening excerpt (first ~120 words) tap to expand

Computer Science > Computation and Language arXiv:2605.23039 (cs) [Submitted on 21 May 2026] Title:Do Language Models Know What Not to Say? Causal Evidence for Statistical Preemption in LLMs Authors:Dongxin Guo, Jikun Wu, Siu Ming Yiu View a PDF of the paper titled Do Language Models Know What Not to Say? Causal Evidence for Statistical Preemption in LLMs, by Dongxin Guo and 2 other authors View PDF HTML (experimental) Abstract:How do learners acquire knowledge of what is unacceptable without negative evidence? Construction Grammar proposes statistical preemption: exposure to a conventional form (e.g., "donated the books to the library") preempts structurally possible but unattested alternatives ("*donated the library the books").

Excerpt limited to ~120 words for fair-use compliance. The full article is at arXiv cs.AI.

Anonymous · no account needed
Share 𝕏 Facebook Reddit LinkedIn Threads WhatsApp Bluesky Mastodon Email

Discussion

0 comments

More from arXiv cs.AI