Saltar a detalles del preprintSaltar a PREreviews

PREreviews de Citation Hallucination Determines Success: An Empirical Comparison of Six Medical AI Research Systems

1 PREreview

  1. PREreview de Matt Spick

    Shi et al have written an interesting and timely piece on the reliability of large langauge models (LLMs) in producing medical research manuscripts. They introduce MedResearchBench, a benchmarking tool to assess the reliability of LLM outputs, and report on how different LLMs perform on their…

    Leer la PREreview de Matt Spick