WeSearch

Programmers Spend Their Time – Probably Dance

·9 min read · 0 reactions · 0 comments · 15 views
#software development#debugging#cuda#memory management#programming#Compute Sanitizer#PyTorch#flash-attention#CUDA#cudnn#Makefile#apt-get#Internet
Programmers Spend Their Time – Probably Dance
⚡ TL;DR · AI summary

A programmer details the lengthy debugging process behind a seemingly simple code change, highlighting the hidden complexities in modern software development environments. Despite the fix requiring only seconds of typing, the process took over ten hours due to tooling issues, environment constraints, and subtle memory bugs. The investigation revealed a use-after-free bug in the flash-attention library, which required building a custom version to confirm.

Key facts
Original article
Probably Dance
Read full at Probably Dance →
Opening excerpt (first ~120 words) tap to expand

I submitted a tiny patch to flash attention. The necessary typing for the change takes less ten seconds, but the overall change took more than ten hours So where does the time go? It started when coworker had a bug where cudnn attention would crash randomly. We looked at his unreleased changes and concluded that they couldn’t possibly cause this, so we suspected that we had a lingering bug that was exposed by making harmless changes to related code. Step 1, a few hours: My coworker tried to figure this out just by running the code repeatedly, trying out various theories. The bug was hard to reproduce so this took hours without much progress. Step 2, 1 hour: I thought this is a good reason to try out compute sanitizer.

Excerpt limited to ~120 words for fair-use compliance. The full article is at Probably Dance.

Anonymous · no account needed
Share 𝕏 Facebook Reddit LinkedIn Threads WhatsApp Bluesky Mastodon Email

Discussion

0 comments

More from Probably Dance