
PREreviews of Evaluating Human-AI Safety: A Framework for Measuring Harmful Capability Uplift

1 PREreview

  1. PREreview by Mattia Gaggi

    Summary

    This position paper proposes a shift in AI safety evaluation from static benchmarks to "Harmful Capability Uplift," a metric quantifying the marginal advantage a user gains from AI assistance. The authors advocate for rigorous three-condition human-subjects experiments (Human-alone,…
