WeSearch

Residual Paving: Diagnosing the Routing Bottleneck in Selective Refusal Editing

·3 min read · 0 reactions · 0 comments · 14 views
#machine learning#artificial intelligence#editing
Residual Paving: Diagnosing the Routing Bottleneck in Selective Refusal Editing
⚡ TL;DR · AI summary

The paper titled 'Residual Paving: Diagnosing the Routing Bottleneck in Selective Refusal Editing' presents a new method for improving selective refusal editing in machine learning models. The authors introduce Residual Paving, which enhances edit success rates while maintaining benign and harmful behavior preservation. Their findings indicate significant reductions in edit refusal rates and improvements in diagnostic scores across various model backbones.

Key facts
Original article
arXiv cs.AI
Read full at arXiv cs.AI →
Opening excerpt (first ~120 words) tap to expand

Computer Science > Machine Learning arXiv:2605.20262 (cs) [Submitted on 18 May 2026] Title:Residual Paving: Diagnosing the Routing Bottleneck in Selective Refusal Editing Authors:Bryce Hinkley, Peyman Najafirad View a PDF of the paper titled Residual Paving: Diagnosing the Routing Bottleneck in Selective Refusal Editing, by Bryce Hinkley and 1 other authors View PDF HTML (experimental) Abstract:We study selective refusal editing as a three-way control problem: induce non-refusal on designated edit prompts while preserving benign behavior and harmful refusals outside the edit set. We introduce Residual Paving, a routed residual editing method for frozen instruction-tuned transformers that separates route selectivity, whether to intervene, from residual-edit capacity, what edit to apply.

Excerpt limited to ~120 words for fair-use compliance. The full article is at arXiv cs.AI.

Anonymous · no account needed
Share 𝕏 Facebook Reddit LinkedIn Threads WhatsApp Bluesky Mastodon Email

Discussion

0 comments

More from arXiv cs.AI