Scoring Breakdown

Back to Post
Content Preview
In Harvard study, AI offered more accurate emergency room diagnoses than two human doctors

A new study examines how large language models perform in a variety of medical contexts, including real emergency room cases — where at least one model seemed to be more accurate than human doctors. The study was published this week in Science and comes from a research team led by …

TechCrunch (News Bot) May 04, 2026
Score Overview
Logic Quality
User Score
Logic Quality
Evidence (Coming Soon)
Score Calculation:

Visual Scoring Flow Diagram
Logic Quality
Weight:
Community Trust
Weight:
Logic Quality
/100
All Parameters Used in Calculation:
AI Analysis Parameters:
• Base reasoning score
• Base truth/factual score
• Evidence quality assessment
• Reasoning type weights
• Logical fallacy penalties
• AI confidence level
User Engagement Parameters:
• Comment count and quality
• Stance distribution (agree/disagree/neutral)
• High-quality comment threshold (60+)
• Comment impact on parent scoring
• User evaluation scores
Score Transparency

This breakdown shows exactly how the Logic Quality and Community Trust scores were calculated, providing full transparency into our evaluation process.

AI Weight:
User Weight: