Datacurve releases the DeepSWE coding benchmark, a 113-task test across 91 open-source repositories and five languages, and says GPT-5.5 is the leader at 70% (Michael Nuñez/VentureBeat)
Techmeme