I built a 103B-token Usenet corpus (1980–2013) — pre-web, human-only, zero AI contamination. Got strong traction on r/ML, thought this community would find it useful.
·
0 reactions
·
0 comments
·
15 views
Original article
r/LocalLLaMA
Anonymous · no account needed