PREreviews of Evaluating Human-AI Safety: A Framework for Measuring Harmful Capability Uplift

1 PREreview

  1. PREreview by Mattia Gaggi

    Summary

    This position paper proposes a shift in AI safety evaluation from static benchmarks to "Harmful Capability Uplift," a metric quantifying the marginal advantage a user gains from AI assistance. The authors advocate for rigorous three-condition human-subjects experiments (Human-alone,…

    Read the PREreview by Mattia Gaggi