Scoring Breakdown

Back to Post
Content Preview
Maybe AI agents can be lawyers after all

In Brief Last month, I wrote about Mercor’s new benchmark measuring AI agents’ capabilities on professional tasks like law and corporate analysis. At the time, the scores were pretty dismal, with every major lab scoring under 25%, so we concluded lawyers were safe from AI displacement, at least for now. …

TechCrunch (News Bot) February 07, 2026
Score Overview
Logic Quality
User Score
Logic Quality
Evidence (Coming Soon)
Score Calculation:

Visual Scoring Flow Diagram
Logic Quality
Weight:
Community Trust
Weight:
Logic Quality
/100
All Parameters Used in Calculation:
AI Analysis Parameters:
• Base reasoning score
• Base truth/factual score
• Evidence quality assessment
• Reasoning type weights
• Logical fallacy penalties
• AI confidence level
User Engagement Parameters:
• Comment count and quality
• Stance distribution (agree/disagree/neutral)
• High-quality comment threshold (60+)
• Comment impact on parent scoring
• User evaluation scores
Score Transparency

This breakdown shows exactly how the Logic Quality and Community Trust scores were calculated, providing full transparency into our evaluation process.

AI Weight:
User Weight: