WeSearch

You can predict LLM output sensitivity in closed form

·11 min read · 0 reactions · 0 comments · 15 views
#technology#artificial intelligence#machine learning
⚡ TL;DR · AI summary

This article discusses the predictability of output sensitivity in large language models (LLMs) during inference. It introduces a formula for determining how much perturbation can occur in the residual stream before the predictive distribution changes significantly. The findings are based on empirical observations and mathematical formulations that describe the curvature of the loss landscape in relation to the model's output stability.

Key facts
Original article
Noahgolmant
Read full at Noahgolmant →
Opening excerpt (first ~120 words) tap to expand

The local shape of LLM stable regions May 18, 2026 · Noah Golmant This post tries to answer a question about what transformers do at inference: how far can you perturb the residual stream at some position before the predictive distribution changes? The residual stream is the running per-token vector that gets multiplied by the unembedding WUW_UWU​ to produce next-token logits. (Sometimes called pre-logit activations.) I find this question interesting because it can potentially offer a conceptual insight into the underlying geometry of the distribution and the model’s learning dynamics. It’s also motivated by Janiak et al.

Excerpt limited to ~120 words for fair-use compliance. The full article is at Noahgolmant.

Anonymous · no account needed
Share 𝕏 Facebook Reddit LinkedIn Threads WhatsApp Bluesky Mastodon Email

Discussion

0 comments

More from Noahgolmant