In Brief Last month, I wrote about Mercor’s new benchmark measuring AI agents’ capabilities on professional tasks like law and corporate analysis. At the time, the scores were pretty dismal, with every major lab scoring under 25%, so we concluded lawyers were safe from AI displacement, at least for now. …
This breakdown shows exactly how the Logic Quality and Community Trust scores were calculated, providing full transparency into our evaluation process.