Qwen 3.6 27B in Claude Code says it will do something then stops and prompts for user reply (not failing a tool call)

April 26, 2026 at 5:19 PM · 0 reactions · 0 comments · 2 views

I'm running Qwen/Qwen3.6-27B-FP8 via vLLM using this command: vllm serve Qwen/Qwen3.6-27B-FP8 --tensor-parallel-size 4 --gpu-memory-utilization 0.95 --max-num-seqs 8 \ --enable-auto-tool-choice --tool-call-parser qwen3_xml \ --enable-prefix-caching --attention-backend flashinfer It works pretty well in Claude Code, except fairly often it will announce its about to do something, then just stops and waits for a user response. E.g.: ``` Let me continue with the remaining edits. ✻ Brewed for 48s ```

Original article

Read full at Reddit →

Anonymous · no account needed

Discussion

0 comments

Qwen 3.6 27B in Claude Code says it will do something then stops and prompts for user reply (not failing a tool call)

Discussion

More from Reddit