Shipping Gemma 4 speech recognition in a Windows .NET desktop app: a 5-variant model-selection tour
Maksim Demin has developed a voice-to-text desktop application called Parlotype using Gemma 4 speech recognition technology. The application allows users to dictate text directly into any app while ensuring that all processing occurs locally on their machine. The article discusses the selection of the best model variant from five available options to optimize performance and user experience.
- ▪Parlotype is built with .NET 10 and Avalonia UI, allowing for local speech recognition without cloud dependency.
- ▪Gemma 4 was released by Google in April 2026 and offers competitive performance compared to existing models like Whisper.
- ▪The application features a model selector that lets users choose between different variants of Gemma 4 based on their needs.
Opening excerpt (first ~120 words) tap to expand
try { if(localStorage) { let currentUser = localStorage.getItem('current_user'); if (currentUser) { currentUser = JSON.parse(currentUser); if (currentUser.id === 3910746) { document.getElementById('article-show-container').classList.add('current-user-is-article-author'); } } } } catch (e) { console.error(e); } Maksim Demin Posted on May 24 Shipping Gemma 4 speech recognition in a Windows .NET desktop app: a 5-variant model-selection tour #devchallenge #gemmachallenge #gemma #dotnet Gemma 4 Challenge: Build With Gemma 4 Submission This is a submission for the Gemma 4 Challenge: Build with Gemma 4 What I Built Parlotype is a voice-to-text desktop app for Windows. It is built with .NET 10 and Avalonia UI. You hold a global hotkey, speak, then release it.
…
Excerpt limited to ~120 words for fair-use compliance. The full article is at DEV.to (Top).