#gemma — Tagged Stories | WeSearch Press

Every story in the WeSearch catalog tagged with #gemma, chronological, with view counts. Subscribe to the per-tag RSS feed to follow this topic in your reader of choice.

60 stories tagged with #gemma, in publish-time order across the WeSearch catalog. Tag pages update as new stories ingest.

⌘ RSS feed for this tag → or search "Gemma"

RELATED TAGS

#ai16 #technology7 #gemma-46 #ollama6 #llm4 #ml4 #gemma44 #qwen3 #android2 #immigration2 #open-source2 #google2

ARXIV.ORG

Still: Amortized KV Cache Compaction in a Single Forward Pass

The KV cache is the memory bottleneck of long-horizon language model deployment. Practically, a deployable compactor must be lightweight enough to call during inference, expressive…

9 views · Sun, 14 Jun 2026 22:37:33 GMT

#machine‑learning #natural‑language‑processing #model‑compression

MEDIUM

Gemma 4 for Telephony: From Two AI Models to One – Until I Switched to Chinese

Building a phone agent on a multimodal LLM: dropping faster-whisper and letting Gemma 4 hear the caller directly — a response-time and……

18 views · Sun, 14 Jun 2026 17:34:27 GMT

ARS TECHNICA - ALL CONTENT

Google's latest DiffusionGemma open AI model comes with a 4x speed boost

Diffusion AI is most common in image generation, but it can make text outputs much faster.…

15 views · Wed, 10 Jun 2026 20:03:29 GMT

GITHUB

TensorSharp: Open-Source Local LLM Inference Engine

A C# inference engine for running large language models (LLMs) locally using GGUF model files. TensorSharp provides a console application, a web-based chatbot interface, and Ollama…

Gemma coverage.

Still: Amortized KV Cache Compaction in a Single Forward Pass

Gemma 4 for Telephony: From Two AI Models to One – Until I Switched to Chinese

Google's latest DiffusionGemma open AI model comes with a 4x speed boost

TensorSharp: Open-Source Local LLM Inference Engine

You can run Gemma on just about anything!

Introducing Gemma 4 12B: a unified, encoder-free multimodal model

I ran Gemma 4 and Qwen 3.5 for the same local tasks, and one pulled miles ahead

Gemma 4 Unified is coming

Gemma 4 is coming - No Vision Tower - No Audio Tower

Structure: A Local-First Interview IDE Powered by Gemma 4

I use Claude Pro, Qwen 3-Coder, and Gemma 4 together, and it's the most cost-efficient AI workflow I've ever built

How the Community Trained Gemma to "Think" with Tunix and TPUs

Mutating Gemma 4 31B Dense in to a native Gemma 4 additive-MoE model

I put Google's Gemma 4 on my homelab and Tailscale on my phone — and cancelled Claude Pro

Google's free Gemma 4 model runs on hardware you probably already own

Using a Gemma4 Safetensor Already Downloaded Locally

Is something went wrong with those online free model, why I feel they worse than Gemma 4 26B A4B Q4_KM ??

Want Built a React-style looping agent with small LLMs (Qwen 3.5 9B / Gemma4) + LangGraph?

First Gemma 4 ExecuTorch Deployment on Raspberry Pi 5 — and Why It's 7.7 Slower Than llama.cpp

Gemma 4: A new, budget-focused model in Posit AI

I Downloaded Gemma4:e2b On My Macbook

Orakle: Turning Raw Blockchain Data into Intelligence with Gemma 4

OpenShift Virtualization Migration Advisor — Local-First, Powered by Gemma 4 26B MoE

Gemma4 Challenge

OptiLearn - Powered by Google Gemma 4

Aura — The Gemma 4 Powered Agentic Web Copilot & Self-Healing Accessibility Engine

I built a tool that catches misleading charts using Gemma 4 running locally

Worklog companion with Gemma4

Using Gemma4 2B to Assist Community Health Workers

Policy Storyteller: Turning Nepali Bills into Human Stories with Gemma 4

Choosing the Right Gemma 4 Model Matters More Than Choosing the Best One

Gemma 4 Made Me Rethink Local AI: Not Just Text, But Images Too

Aether: A local Android assistant built with Gemma 4

Gemma 4: The 128K Multimodal Powerhouse in Your Terminal

From Understanding Gemma 4 🧠 to Building SpeakUp 🎙️ — An AI English Coach 🤖

🔬 Direction 1 closure on JAMES — when the hypothesis fails but the data turns "7-tier monotonic natural-stop gradient"

Vestige: A Gemma 4 Brain Tracker That Won't Blow Smoke Up Your Ass

I built PROOFER - Privacy first Chrome extension that proofreads your texts using Gemma 4

I Built a Quotation Generator for Kenyan Street Welders Using Gemma 4's Vision

Gemma 4 Soft Tokens: The Rise and Fall of 16x16 Words ⚡👀

My Thoughts on Gemma 4

I built GemmaPod - A truly composable and portable AI agent solution powered by your local LLM

Gemma 4 E4B caught three planted fabrications in 50 seconds — on a laptop, no cloud

Running Gemma 4 on a Modest Machine: Unsloth vs LM Studio vs llama.cpp vs Ollama

Gemma 4 vs GPT-4o vs Llama 3: What Actually Works Locally?

The model is not the product: lessons from building with local Gemma 4

I Gave Gemma 4 150 Tools on Windows. Here's What Actually Happened.

I Ditched Cloud LLMs for Gemma 4 4B: A DevOps Engineer's 48-Hour Reality Check

RememberMe CareGrid: Local Gemma 4 for dementia memory and safety

Bootstrapping with AI: Why Gemma 4 is the Micro-SaaS Founder’s Best Friend

I built a local first AI CCTV assistant using Gemma 4 + Frigate

SparshAI: I Built an Offline AI Tutor for Students Using Gemma 4 — Here's What Happened

Building NeuroSense AI: A Human-Centered Stress Insight Assistant Powered by Gemma

CureNet AI: Decentralized Health Intelligence for India, Powered by Gemma 4 and ABHA Standardization

When Open-Weights AI Meets a Broken Healthcare System: Deploying Gemma 4 in Rural India

Bondmap: AI-Powered Relationship Network That Maps How You're Connected to Everyone Using Gemma 4

I Built an AI Second Brain with Gemma 4

🎙️Interview Coach AI — Practice Mock Interviews Locally with Gemma 4 + Jan

Basics of Gemma 4 with Google ADK

Gemma SlideCast: A 4-Agent Autonomous Presentation & Document Engine

Browse more