WeSearch
Hub / social / r/LocalLLaMA
social · source

r/LocalLLaMA on WeSearch

Recent social headlines from r/LocalLLaMA.

R/LOCALLLAMA

Introducing Gemma 4 12B: a unified, encoder-free multimodal model

6/3/2026 · 19 views
R/LOCALLLAMA

Best way to index full Italian Wikipedia for 100% offline RAG in LM Studio?

6/3/2026 · 20 views
R/LOCALLLAMA

This day in LLM history….105 years ago today, Qwen 3.6 27b was released open source. /s

6/3/2026 · 25 views
R/LOCALLLAMA

Gemma 4 Unified is coming

6/3/2026 · 20 views
R/LOCALLLAMA

Take Three: What’s the rub on memory sessions?

6/3/2026 · 19 views
R/LOCALLLAMA

ui: Mermaid Diagrams in chat + interactive preview by allozaur · Pull Request #24032 · ggml-org/llama.cpp

6/3/2026 · 20 views
R/LOCALLLAMA

Gemma 4 is coming - No Vision Tower - No Audio Tower

6/3/2026 · 23 views
R/LOCALLLAMA

I developed a hard LLM Challenge

6/3/2026 · 11 views
R/LOCALLLAMA

lipsync possible on mac?

6/3/2026 · 13 views
R/LOCALLLAMA

Qwen 3.7 Plus just briefly appeared and then disappeared on OpenRouter.

6/3/2026 · 22 views
R/LOCALLLAMA

Half the top 10 trending GitHub repos right now are "skills" projects, not models

6/3/2026 · 16 views
R/LOCALLLAMA

Tensor split mode: CUDA error on latest llama.cpp with Qwen-3.6-27b

6/3/2026 · 12 views
R/LOCALLLAMA

Calling it now Microsoft is buying Unsloth.

6/3/2026 · 15 views
R/LOCALLLAMA

Helvete-nano

6/3/2026 · 13 views
R/LOCALLLAMA

Holo3.1 35B/9B/4B/0.8B (Qwen 3.5 finetunes)

6/3/2026 · 17 views
R/LOCALLLAMA

Mellum & Granite Embedding models are ready on llama.cpp

6/3/2026 · 17 views
R/LOCALLLAMA

Another shout out to llama.cpp build b9455 2x3090

6/3/2026 · 16 views
R/LOCALLLAMA

Microsoft Aion 1.0 Instruct and Aion 1.0 Plan models!

6/3/2026 · 16 views
R/LOCALLLAMA

Nous Research — Hermes Desktop

6/3/2026 · 30 views
R/LOCALLLAMA

Why do we benchmark quants on perplexity and prose but never on tool call validity?

6/3/2026 · 12 views
R/LOCALLLAMA

Someone out there likely needs this

5/30/2026 · 17 views
R/LOCALLLAMA

Everyone here self-hosts inference. Almost nobody self-hosts the tooling around it. That feels backwards to me.

5/30/2026 · 14 views
R/LOCALLLAMA

Cost Analysis of my $6.4k Local LLM Server

5/30/2026 · 16 views
R/LOCALLLAMA

Running Qwen 3.6 35b MoE With Zoo Code On M1 Max is Amazing! Fully local, battery-powered coding powerhouse!

5/30/2026 · 24 views
R/LOCALLLAMA

Would a MacBook M5 16/24/32GB be an upgrade, complement, or waste next to my RTX 4060 laptop?

5/30/2026 · 19 views
R/LOCALLLAMA

What features dramatically improved your custom memory system?

5/30/2026 · 16 views
R/LOCALLLAMA

For those creating personal assistants locally - how has short/long term memory impacted your experience?

5/30/2026 · 17 views
R/LOCALLLAMA

Parallax: Parameterized Local Linear Attention for Language Modeling

5/30/2026 · 18 views
R/LOCALLLAMA

nvidia/Qwen3.6-35B-A3B-NVFP4 · Hugging Face

5/30/2026 · 13 views
R/LOCALLLAMA

SupraLabs 50M Parameter Model Just Hit the Trending Page on Hugging Face 🤯

5/30/2026 · 16 views
R/LOCALLLAMA

Why does Thinking Output More Tokens Than a Response?

5/30/2026 · 20 views
R/LOCALLLAMA

[LLM analysis challenge] OPERATION: REVERSE ROBOTOMY. We need an LLM Neurosurgeon to extract a password from a fractured artificial mind.

5/30/2026 · 15 views
R/LOCALLLAMA

Can't get over 250TPS on RTX5090 with Qwen3.5-4B

5/30/2026 · 15 views
R/LOCALLLAMA

LFM2.5-8B-A1B release

5/30/2026 · 15 views
R/LOCALLLAMA

anybody got llama-swap working answering concurrent requests for a single model?

5/30/2026 · 17 views
R/LOCALLLAMA

STT -> LLM -> TTS pipeline

5/30/2026 · 19 views
R/LOCALLLAMA

Qwen 3.6 coding choice–27B vs 35B quants

5/30/2026 · 11 views
R/LOCALLLAMA

"What are you good at?"

5/30/2026 · 10 views
R/LOCALLLAMA

Fulloch V2: 100% Local Voice Assistant for Home Assistant & Obsidian (Runs on 16GB VRAM)

5/30/2026 · 19 views
R/LOCALLLAMA

MINISFORUM UM790 Pro

5/30/2026 · 12 views
R/LOCALLLAMA

Gryphe/Pantheon-Reasoning-27B · Hugging Face

5/30/2026 · 10 views
R/LOCALLLAMA

Open source : Turning vocal imitations into sound effects. (New UX for sound generation)

5/30/2026 · 15 views
R/LOCALLLAMA

Vidai Community is now available: one Rust binary for cost attribution, guardrails and multi-provider routing on every LLM call

5/30/2026 · 13 views
R/LOCALLLAMA

The best AI Model for Arabic dialects 🇪🇬🦅🧡

5/30/2026 · 12 views
R/LOCALLLAMA

made a local voice AI for windows you can talk to in any language. open source, bring your own key

5/30/2026 · 16 views
R/LOCALLLAMA

I have 2x PC's. One with a 5090 and one with a 4080. Is there an easy way to use both together networked?

5/30/2026 · 13 views
R/LOCALLLAMA

Keeping multi-GPU rigs cool?

5/30/2026 · 12 views
R/LOCALLLAMA

Breaking the music supply constraint

5/29/2026 · 14 views
R/LOCALLLAMA

Uploaded my Qwen3.6 27B based fine tune, after two years of experience fine tuning models

5/29/2026 · 15 views
R/LOCALLLAMA

Mutating Gemma 4 31B Dense in to a native Gemma 4 additive-MoE model

5/29/2026 · 11 views

More social sources

Visit r/LocalLLaMA directly →