WeSearch

Do as I Say, Not as I Do: Instruction-Induction Conflict in LLMs

·3 min read · 0 reactions · 0 comments · 12 views
#artificial intelligence#language models#computation
Do as I Say, Not as I Do: Instruction-Induction Conflict in LLMs
⚡ TL;DR · AI summary

The paper explores the conflict between instruction-following and pattern-completion in language models. It examines how models respond to user instructions that conflict with hardcoded patterns. The findings indicate that instruction-following is inconsistent and highly dependent on the model and the nature of the instructions.

Key facts
Original article
arXiv cs.AI
Read full at arXiv cs.AI →
Opening excerpt (first ~120 words) tap to expand

Computer Science > Computation and Language arXiv:2605.20382 (cs) [Submitted on 19 May 2026] Title:Do as I Say, Not as I Do: Instruction-Induction Conflict in LLMs Authors:Carolina Camassa, Derek Shiller View a PDF of the paper titled Do as I Say, Not as I Do: Instruction-Induction Conflict in LLMs, by Carolina Camassa and 1 other authors View PDF HTML (experimental) Abstract:Language models are trained to follow instructions, but they are also powerful pattern completers. What happens when these two objectives conflict? We construct conversations in which a user instruction to behave in a target way T (e.g., always output a specific token, answer in a particular language, or adopt a persona) is opposed by N hardcoded assistant turns demonstrating a competing pattern P.

Excerpt limited to ~120 words for fair-use compliance. The full article is at arXiv cs.AI.

Anonymous · no account needed
Share 𝕏 Facebook Reddit LinkedIn Threads WhatsApp Bluesky Mastodon Email

Discussion

0 comments

More from arXiv cs.AI