WeSearch

Simple to use vLLM Docker Container for Qwen3.6 27b with Lorbus AutoRound INT4 quant and MTP speculative decoding - 118 tokens/second on 2x 3090s

Apr 27, 2026 · 1:19 PM UTC · 0 reactions · 0 comments · 4 views

via

LocalLlama

Original article

LocalLlama

Read full at LocalLlama →

Anonymous · no account needed

Discussion

0 comments

More from LocalLlama