Why do we benchmark quants on perplexity and prose but never on tool-call validity?
r/LocalLLaMA