Hub / social / r/LocalLLaMA

social · source

r/LocalLLaMA on WeSearch

Recent social headlines from r/LocalLLaMA.

Follow-up: Qwen3.6-27B on 1× RTX 3090 — pushing to ~218K context + ~50–66 TPS, tool calls now stable (PN12 fix)

Open Models - April 2026 - One of the best months of all time for Local LLMs?

New Stealth Model : Owl Alpha

DeepSeek Vision/Multimodal 👀

No, nothing special, just a tiny local language model playing a game it itself wrote.

I stumbled on a Gemma 4 chat template bug for tools and fixed it

MiMo-V2.5-GGUF (preview available)

Hipfire dev update: full AMD arch validation incoming (RDNA 1 thru 4, plus Strix Halo and bc250)

Deepseek v4 pricing is genuinely silly, did the math and now i am questioning my entire stack

100M tokens for $2.65 (Deepseek V4 Pro)

Why isn’t LLM reasoning done in vector space instead of natural language?

llama.cpp's Preliminary SM120 Native NVFP4 MMQ Is Merged

great work, Gemma

More social sources

r/programming r/webdev r/typescript r/javascript r/Python r/rust r/golang r/cpp r/csharp r/java r/elixir r/haskell r/ruby r/PHP r/reactjs r/vuejs r/sveltejs r/node

Visit r/LocalLLaMA directly →