llama: avoid copying logits during prompt decode in MTP by am17an · Pull Request #23198 · ggml-org/llama.cpp
·
0 reactions
·
0 comments
·
16 views
Original article
r/LocalLLaMA
Anonymous · no account needed