Saltar a detalles del preprintSaltar a PREreviews

PREreviews de Evaluating Human-AI Safety: A Framework for Measuring Harmful Capability Uplift

1 PREreview

  1. PREreview de Mattia Gaggi

    Summary

    This position paper proposes a shift in AI safety evaluation from static benchmarks to "Harmful Capability Uplift" a metric quantifying the marginal advantage a user gains from AI assistance. The authors advocate for rigorous three-condition human-subjects experiments (Human-alone,…

    Leer la PREreview de Mattia Gaggi