Evaluation of a Multimodal Custom Finetuned LLM for Virtual Healthcare Consultations

Published
Server
OSF Preprints
DOI
10.31219/osf.io/p3wvs_v1

We present a modular, privacy-conscious prototype for multimodal agency with retrieval-augmented generation (RAG) for a virtual medical assistant in healthcare consultation. The system features a locally deployed LLaMA 3.2 11B model with 4-bit quantization to keep the model small yet efficient. The model directly accepts both images and text and has been fine-tuned on 50,000 image-label pairs taken from the MedTrinity dataset, which consists of a wide variety of medical image-text pairs. The fine-tuning aims to enhance multimodal query answering in medical contexts. Text, image, and speech inputs are all supported; speech is transcribed via the Assembly AI transcription API. For retrieval-augmented generation, ChromaDB stores semantically indexed medical documents sourced from the MedQuAD dataset, comprising 41,000 medicine-related question–answer pairs.

We evaluate the fine-tuned model by comparing it with the base model, each tested with and without RAG support. Responses are assessed with an LLM-as-a-judge approach using OpenAI's GPT-4.1, under both strict and nonstrict evaluation, against the MMMU benchmark. From the MMMU dataset, we select the fields of basic medical science, clinical medicine, and diagnostic & laboratory medicine. Each field was evaluated with 30 questions per LLM variant, with or without RAG support.
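The evaluation design described in the abstract can be sketched as a simple grid. This is a minimal illustration, not the authors' code: the names `MODELS`, `RAG_SETTINGS`, `FIELDS`, and `build_eval_grid` are hypothetical, and the sketch assumes that "30 questions per LLM variant with or without RAG support" means 30 questions for each model, RAG setting, and field. It does not model the strict versus nonstrict scoring, which the abstract does not define.

```python
from itertools import product

# Hypothetical labels for the configurations named in the abstract.
MODELS = ["base", "finetuned"]   # LLaMA 3.2 11B before and after fine-tuning
RAG_SETTINGS = [False, True]     # without / with retrieval support
FIELDS = [
    "Basic Medical Science",
    "Clinical Medicine",
    "Diagnostic & Laboratory Medicine",
]
QUESTIONS_PER_FIELD = 30         # per model variant and RAG setting (assumed reading)


def build_eval_grid():
    """Return one record per (model, RAG setting, field, question index)."""
    return [
        {"model": m, "rag": r, "field": f, "question": q}
        for m, r, f in product(MODELS, RAG_SETTINGS, FIELDS)
        for q in range(QUESTIONS_PER_FIELD)
    ]


grid = build_eval_grid()
print(len(grid))  # 2 models * 2 RAG settings * 3 fields * 30 questions = 360
```

Under this reading, each of the four model and RAG combinations produces 90 responses for the judge model to score, for 360 responses in total.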
