ammaox 2 hours ago

A very large review of AI benchmarks that reveals a worrying trend in their effectiveness and scientific rigor.