23 stories tagged with #cuda, in publish-time order across the WeSearch catalog. Tag pages update as new stories ingest.
Tensor split mode: CUDA error on latest llama.cpp with Qwen-3.6-27b
Nvidia says RTX Spark offers up to 20 CPU cores and a Blackwell GPU with 6,144 CUDA cores, capable of "100 FPS 1440p gaming" or running 120B-parameter models (Jeffrey Kampman/Tom's Hardware)
Show HN: Tiny-vLLM – high performance LLM inference engine in C++ and CUDA
Build your own high performance LLM inference engine in C++ and CUDA - a smaller version of vLLM - jmaczan/tiny-vllm…
NVIDIA CUDA 13.3 Rolls Out CUDA Python 1.0, CUDA Tile For C++
NVIDIA Releases CUDA-Oxide 0.1 For Experimental Rust-To-CUDA Compiler
Towards Feedback-to-Plan Decisions for Self-Evolving LLM Agents in CUDA Kernel Generation
Large language models (LLMs) have shown strong empirical gains as self-evolving agents for CUDA kernel generation, driven by feedback-conditioned planning across generations. Howev…
CUDA: add fast walsh-hadamard transform by am17an · Pull Request #23615 · ggml-org/llama.cpp
Barracuda Defenseman Set to Return to Sweden for 2026-27 Season
One of the San Jose Sharks' pending unrestricted free agents has already gotten his plans for the 2026-27 season sorted out.…
The Microsecond Lie: Why your Go timers are lying about the GPU
TL;DR: I thought my CUDA kernel was running in 160 microseconds. I was wrong. Here is how I used CUDA...…
Profiling a CUDA Python Program with GPUFlight
In the previous post, I used a C++ CUDA example to look at memory coalescing and how memory access...…
chipStar 1.3 Released For Running HIP/CUDA Code On SPIR-V With OpenCL
A new release of chipStar is now available as the open-source tool for compiling and running HIP/CUDA code in a vendor-neutral manner with the SPIR-V intermediate representation on…
Deleting the 8.4GB Python Sidecar: Pure Go + CUDA with `CGO_ENABLED=0`
TL;DR: I built gocudrv so Go services can talk directly to NVIDIA GPUs — no cgo, no CUDA toolkit, no...…
Generated with Flux on AMD RX 580 (2017 GPU) — Vulkan only, no CUDA
Barracuda Cloud Email Archive - broken indexing
Army Orders 3,000 Container-Launched Barracuda-500M Cruise Missiles
Anduril will provide the U.S. Army with a minimum of 3,000 Surface-Launched Barracuda-500M, starting in 2027 with the first 1,000 along with the associated containerized launch sys…
Calling CUDA from Go without cgo
Go is great at infrastructure. It gives us fast builds, simple deployment, lightweight concurrency,...…
Programmers Spend Their Time – Probably Dance
I submitted a tiny patch to flash attention. The necessary typing for the change takes less than ten seconds, but the overall change took more than ten hours. So where does the time go? …
Molecular Dynamics on Apple M4
Molecular dynamics on Apple M4 — NEON intrinsics, SME2, Metal compute shaders, OpenMP. Pushing Apple Silicon to its limits. - vyasgiridhar/moleqular…
13 years later, the GTX Titan is still the most important GPU Nvidia ever made
Borrowing a $7,000 Tesla and selling it for $1,000 changed everything…
Asynchronicity in Continuous Batching
We’re on a journey to advance and democratize artificial intelligence through open source and open science.…
Evaluating CUDA Tile for AI Workloads on Hopper and Blackwell GPUs
NVIDIA's CUDA Tile (CuTile) introduces a Python-based, tile-centric abstraction for GPU kernel development that aims to simplify programming while retaining Tensor Core and Tensor …
'Why can't I?' launched Florida's first girls-duals wrestling team
"Why can't I?" Lily Calzadilla, 14, asked in 2017. Her persistence led to Florida's first all girls-duals wrestling team in Jensen Beach.…