Saltar a detalles del preprintSaltar a PREreviews

PREreviews de The Validity Gap in Health AI Evaluation: A Cross-Sectional Analysis of Benchmark Composition

1 PREreview

  1. PREreview de Mattia Gaggi

    Summary of Findings

    This paper provides a rigorous analysis of 18,707 consumer health queries across six public benchmarks, revealing four systematic "blind spots" in health AI evaluation: demographic skews, underrepresentation of chronic disease management, lack of clinical document…

    Leer la PREreview de Mattia Gaggi