News by tag "ai-benchmarking"

Researchers unveil an extremely difficult AI exam. Top models still score below 50%, far behind human experts, in this new AGI benchmark.