We added W8A8 activation quantization to MLX — prefill went from 2.84s to 2.52s on M5 Pro

May 25, 2026 · 8:16 AM UTC

via r/LocalLLaMA
