43 stories tagged with #latency, in publish-time order across the WeSearch catalog. Tag pages update as new stories ingest.
⌘ RSS feed for this tag → or search "Latency"
AMD says new EXPO ‘Ultra Low Latency’ DDR5 memory should be 'effectively the same price' as current kits — feature will work on existing chipsets, but will require new DIMMs
After a small mention in its Computex announcements, AMD’s David McAfee provides a bit more color on what exactly EXPO ULL is.…
Stop Using LLMs to Audit Other LLMs: You Are Bricking Your Production Latency
Look at your modern Agentic AI stack. An agent wants to execute a tool, trigger a deployment, access...…
New Silicon Motion SM2524XT chip brings 14 GB/s to mainstream SSDs — 6nm DRAMless controller boasts heavy AI PC optimization and slashes KV cache latency
Fast, furious, inexpensive.…
Why Your Website Can Be "Up" And Still Broken: A Deep Dive Into Latency Phases
Why Your Website Can Be "Up" And Still Broken Most uptime monitors tell you one thing: is...…
This Rewrite Isnt the Constraint: How a 300ms Tail Latency Hunt Led to a New Event Pipeline
We were burning 400ms in p99 tail latency on a core event-processing path in Veltrix. The upstream...…
OpenAI investigating ‘elevated latency’ issue affecting ChatGPT
Does ChatGPT seem slower than usual for you today? You’re not alone. OpenAI says it’s currently investigating an issue around...…
OpenAI investigating ‘elevated latency’ issue affecting ChatGPT - 9to5Mac
Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…
Automating Incident Response at the Network Edge with Low-Latency ML
Learn how to eliminate latency lag in incident response by deploying low-latency ML models at the network edge using HookProbe’s AI-native NAPSE engine.…
The Day the Treasure Hunt Engine Decided to Lie to Us About Latency
The Problem We Were Actually Solving The marketing department wanted treasure hunts to...…
Keyboard latency probe
The pause before the first token
There is a pause between sending a prompt to a language model and seeing the first token appear. Half...…
Local LLM Hosting in Switzerland: Real Costs, Latency & Compliance
When a Lausanne fintech burned through CHF 4,200 in a single day after a GDPR‑triggered API outage,...…
Your Treasure Hunt Engine Was Probably a Latency Minefield (And Heres the Postmortem)
We had just finished the first major traffic spike. Our Veltrix-based treasure hunt game ran...…
That 0.8 second P99 Latency Cliff in Production Wasnt Supposed to Happen
The Problem We Were Actually Solving We built the Treasure Hunt Engine to process millions...…
TC fanout latency
Windows 11's Low Latency Profile isn't innovation, it's Microsoft finally catching up to its rivals
It's not new, but it's still welcome…
How we optimized a local Llama 3 agent: From 15s latency and 68% accuracy to 4s and 100% (Full E2E Code & Guide)
Toward Reliable Design of LLM-Enabled Agentic Workflows: Optimizing Latency-Reliability-Cost Tradeoffs
Modern AI systems increasingly rely on workflows composed of multiple interacting agents, some powered by large language models (LLMs) and others by conventional computational modu…
Optimizing Signal Latency and Weight Allocations in Algorithmic Pipelines
In our latest commits to core/tools/buildinpublic.py and phases/phase4content.py, we minimized signal...…
ns scale ultra-low-latency fabric over shm and MMAP for IPC
Show HN: Impatience – measure event latency on peers using synchronised clocks
A library for instrumentation of event-to-event latency over a network - xfoa/Impatience…
The new cyber gap is response latency
Why cyber response now falters after detection…
Real-time video classification with PaliGemma: architecture patterns for low-latency VLM inference
In a previous article, we benchmarked three open-source Vision-Language Models on zero-shot object...…
Dataverse virtual tables on SQL: three latency patterns
Virtual tables surface external data inside Dataverse without copying it. The user experience matches any other table. The performance does not - and the gap is what decides whethe…
Latency on iPhone 17 pro max??
Cutting agent latency from 30s to 8s without model swap
A founder pinged us with a UX problem disguised as an engineering question. His team had launched an...…
The Myth of Low Latency: Why Event Meshes Make Your System Slow
The Problem We Were Actually Solving At Veltrix we had a simple monolithic service that...…
Latte: a single-header latency measurement for quick insights
Embedded acoustic AI with <16ms latency running on 8MB RAM
AI-powered acoustic intelligence for real-world voice devices. Hybrid AI + DSP for robotics, conferencing, and edge AI.…
Best way to control a laptop directly from a desktop PC with low/zero latency?
Rust's Low-Latency Conquest: Why We Ditched C++ for a Treasure Hunt Engine
The Problem We Were Actually Solving In our case, the problem wasn't just about serving a...…
Need Technical Advice: Broker changed my latency to 2000ms after a highly profitable run on XAUUSD
Why P95 Latency Is the Only Metric That Matters at 3 AM
If your checkout endpoint serves 10,000 requests per minute, a 5% latency spike means 500 users are...…
High E2E latency on fine-tuned Gemma 4 26B despite low TTFT [R]
The Feature Store: Consistency and Latency Are Both Non-Negotiable
Part 3 of 5 in the series: When Your AI Pipeline Grows Up In the previous post, we worked through...…
OpenAI Outlines WebRTC Architecture for Low-Latency Voice AI at Scale - infoq.com
Comprehensive up-to-date news coverage, aggregated from sources all over the world by Google News.…
Architecture advice: Real-time pipeline for YouTube Audio -> Whisper -> LLM -> SSE (Sub-10s latency) [D]
Open-Source "low_latency_layer" Brings Reflex & Anti-Lag 2 To AMD & Intel GPUs On Linux
Open-Source "low_latency_layer" Brings Reflex & Anti-Lag 2 To AMD & Intel GPUs on Linux
Open-Source "low_latency_layer" Brings Reflex & Anti-Lag 2 To AMD & Intel GPUs On Linux
A new open-source project called low_latency_layer is an implicit Vulkan layer that enables AMD Anti-Lag 2 and NVIDIA Reflex 2 to reportedly work in a hardware-agnostic manner so t…
Built a CLI tool that diagnoses your internet in 25s. Shows bufferbloat, jitter, DNS latency, and edge routing, not just download speed.
The Streaming Latency Tradeoff: Why Some TTS Models Lose Accuracy in Real Time
Streaming TTS loses 5-20x context vs batch processing, causing pronunciation failures on alphanumeric IDs. Learn when to choose batch over real-time synthesis.…
How we tracked down a mysterious latency issue in our AWS Lambda + RDS Proxy stack, and discovered Prisma was the culprit all along.
Our API Was Fine. Database Was Fine. So Why Were Queries Taking 16 Seconds? It started with a...…