WeSearch

llama: avoid copying logits during prompt decode in MTP by am17an · Pull Request #23198 · ggml-org/llama.cpp

· 0 reactions · 0 comments · 16 views
Original article
r/LocalLLaMA
Read full at r/LocalLLaMA →
Anonymous · no account needed
Share 𝕏 Facebook Reddit LinkedIn Threads WhatsApp Bluesky Mastodon Email

Discussion

0 comments

More from r/LocalLLaMA