Skip to main content

Write a PREreview

Retrieval-Augmented Generative AI Versus Human-Curated Expert Systems in Drug-Drug Interaction Screening

Posted
Server
Preprints.org
DOI
10.20944/preprints202605.1786.v1

Background: Clinical decision support systems are integral components of modern healthcare information technology, aiming to enhance medication safety by identifying potential drug–drug interactions (DDIs). Traditional expert-curated databases, such as LexiDrug, provide structured interaction data but may vary in coverage and clinical recommendations. Meanwhile, retrieval-augmented generative artificial intelligence tools such as ClinicalKey AI offer narrative-based DDI assessments with contextual explanations, yet their real-world performance and clinical relevance remain underexplored within electronic clinical workflows. Objective: To evaluate and compare the performance of a retrieval-augmented generative AI system (ClinicalKey AI) against a human-curated expert system (UpToDate LexiDrug) in the screening and characterization of clinically important DDIs, with emphasis on implications for clinical decision support and healthcare delivery. Methods: A reference set of 280 clinically relevant drug pairs was compiled. For each pair, data were extracted from LexiDrug, including interaction mechanism, severity category, recommended management actions, and evidence levels. CKAI was queried with standardized prompts to obtain equivalent outputs. Agreement between systems was measured for mechanism classification and severity ratings. Differences in management recommendations and supplemental information (e.g., onset, risk factors, alternative suggestions) were analyzed descriptively. The evaluation focused on clinical decision support relevance, information richness, and potential impacts on healthcare practice. Results: CKAI and the expert system agreed on interaction mechanism classification in over 99% of cases, supporting the AI’s ability to recognize basic pharmacological relationships. However, CKAI consistently assigned higher severity levels compared to LexiDrug and frequently recommended “avoid” actions rather than nuanced modifications. CKAI provided significantly more contextual information on interaction onset, patient risk factors, and alternative therapy options, enhancing interpretability for clinical users. Divergences in evidence categorization highlighted variability in underlying knowledge representation between generative AI and curated sources. Conclusion: Retrieval-augmented generative AI demonstrates high concordance with expert systems in identifying DDI mechanisms and yields rich contextual insights that may support clinician interpretation.

You can write a PREreview of Retrieval-Augmented Generative AI Versus Human-Curated Expert Systems in Drug-Drug Interaction Screening. A PREreview is a review of a preprint and can vary from a few sentences to a lengthy report, similar to a journal-organized peer-review report.

Before you start

We will ask you to log in with your ORCID iD. If you don’t have an iD, you can create one.

What is an ORCID iD?

An ORCID iD is a unique identifier that distinguishes you from everyone with the same or similar name.

Start now