WeSearch

The Special Token `<Think>` Problem/Bug of Latest DeepSeek LLM

liuyuancheng· ·8 min read · 0 reactions · 0 comments · 8 views
#ai#llm#bug
The Special Token `<Think>` Problem/Bug of Latest DeepSeek LLM
⚡ TL;DR · AI summary

The latest version of the DeepSeek LLM has been found to produce unstable responses when given specific incomplete or special tokens such as <think> or <think. The issue was discovered by users who reported abnormal behavior when interacting with the model, including unrelated responses and hidden reasoning traces. The cause of the problem is still being investigated, with possible explanations including tokenizer or prompt parser bugs, special token parsing issues, and GPU cache or load balancer cache leakage.

Key facts
Original article
PixelsTech · liuyuancheng
Read full at PixelsTech →
Opening excerpt (first ~120 words) tap to expand

Recently the users observed an issue/bug in the latest version of the DeepSeek LLM: during testing, it was found that when specific incomplete or special tokens such as <think> or <think are included in the user prompt, the model may produce highly unstable responses, severe hallucinations, abnormal reasoning outputs, or unexpected behavior.(adsbygoogle = window.adsbygoogle || []).push({}); The purpose of this article is to document the issue, demonstrate how the behavior can be reproduced, and discuss the verification experiments conducted to better understand the possible cause of the problem.

Excerpt limited to ~120 words for fair-use compliance. The full article is at PixelsTech.

Anonymous · no account needed
Share 𝕏 Facebook Reddit LinkedIn Threads WhatsApp Bluesky Mastodon Email

Discussion

0 comments

More from PixelsTech