Ir para detalhes do preprintIr para avaliações PREreview

Avaliações PREreview de The Validity Gap in Health AI Evaluation: A Cross-Sectional Analysis of Benchmark Composition

1 PREreview

  1. Avaliação PREreview de Mattia Gaggi

    Summary of Findings

    This paper provides a rigorous analysis of 18,707 consumer health queries across six public benchmarks, revealing four systematic "blind spots" in health AI evaluation: demographic skews, underrepresentation of chronic disease management, lack of clinical document…

    Ler a avaliação PREreview de Mattia Gaggi