Introducing BigLaw Bench: Arena – Harvey
Presenting BigLaw Bench: Arena — our approach to head-to-head evaluation of AI systems on legal tasks.
Today, we’re introducing BigLaw Bench: Arena (BLB: Arena), our internal system for determining which models and systems experts actually prefer through scaled, pairwise comparison. Inspired by open research initiatives such as LMArena, this approach complements formal benchmarks and captures expert perspectives on not just what makes an AI system good, but also what makes it better than alternatives.
We’re also publishing recent results that compare the newly released version of Harvey Assistant against prior implementations and leading foundation models.