11 stories tagged with #gguf, in publish-time order across the WeSearch catalog. Tag pages update as new stories ingest.
⌘ RSS feed for this tag → or search "Gguf"
RELATED TAGS
R/LOCALLLAMA
Qwen3.5 35B A3B uncensored heretic Native MTP Preserved is Out Now With the Full 785 MTPs Preserved and Retained, Available in Safetensors, GGUFs. NVFP4, NVFP4 GGUFs and GPTQ-Int4 Formats
GITHUB
Ollama v0.30.0-rc23: "directly support llama.cpp" & "compatibility with GGUF"
This version of Ollama will change the architecture to directly support llama.cpp instead of building on top of GGML, and allows for compatibility with GGUF file format. MLX is use…
R/LOCALLLAMA
What is the smallest amount of RAM sufficient to run any available on HF GGUF LLM model locally?
R/LOCALLLAMA
If one .gguf makes it past the great filter, humanity survives in some way.
R/LOCALLLAMA
Quick note on sudden performance loss when running GGUFs
R/STABLEDIFFUSION
ggufy: easy quantization for the GPU poor
R/LOCALLLAMA
GGUF with MTP vs MLX without. Is mlx still the way to go for mac users?
R/LOCALLLAMA
Jackrong/Qwopus3.5-9B-Coder-GGUF · Hugging Face
DEV.TO (TOP)
I’m not building “librarian AI.”
This is a submission for the Hermes Agent Challenge ARC-Neuron LLMBuilder: A Local-First...…
DEV.TO (TOP)
Running Local GGUF Models with Ollama (GPU Enabled)
1. Install & Start Ollama curl -fsSL https://ollama.com/install.sh | sh systemctl...…
R/LOCALLLAMA