Evaluation of a Multimodal Custom Finetuned LLM for Virtual Healthcare Consultations

Server: OSF Preprints
DOI: 10.31219/osf.io/p3wvs_v1

We present a modular, privacy-conscious prototype for multimodal agency with retrieval-augmented generation (RAG) for a virtual medical assistant in healthcare consultation. The system features a locally deployed LLaMA 3.2 11B model with 4-bit quantization to keep the model small yet efficient. The model directly accepts both images and text and has been fine-tuned on 50,000 image-label pairs taken from the MedTrinity dataset, which consists of a wide variety of medical image-text pairs. The fine-tuning aims to enhance multimodal query answering in medical contexts. Text, image, and speech inputs are all supported; speech is transcribed via the AssemblyAI transcription API. For retrieval-augmented generation, ChromaDB stores semantically indexed medical documents sourced from the MedQuAD dataset, comprising 41,000 medicine-related question-answer pairs.

We evaluate the fine-tuned model by comparing it with the base model, each with and without RAG support. Responses are assessed with an LLM-as-a-judge approach using OpenAI's GPT-4.1, under both strict and non-strict evaluation, against the MMMU benchmark. From MMMU we select the fields of basic medical science, clinical medicine, and diagnostic & laboratory medicine. Each field was evaluated with 30 questions per LLM variant, with or without RAG support.
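To make the size of the evaluation design concrete, the following is a minimal Python sketch that enumerates the comparison grid described above (two model variants, with and without RAG, three MMMU fields, 30 questions each). The abstract does not define how strict and non-strict scoring differ, so the three verdict labels (`correct`, `partial`, `incorrect`) and the `accuracy` helper are purely hypothetical illustrations and not the authors' scoring rule.

```python
from itertools import product

# Factors taken from the abstract.
MODELS = ["base", "finetuned"]
RAG_SETTINGS = [False, True]
FIELDS = [
    "basic medical science",
    "clinical medicine",
    "diagnostic & laboratory medicine",
]
QUESTIONS_PER_FIELD = 30


def evaluation_grid():
    """Return one cell per (model, RAG setting, field) combination."""
    return [
        {"model": model, "rag": rag, "field": field}
        for model, rag, field in product(MODELS, RAG_SETTINGS, FIELDS)
    ]


def total_judged_responses():
    """Number of model responses the judge sees in one evaluation pass."""
    return len(evaluation_grid()) * QUESTIONS_PER_FIELD


def accuracy(verdicts, strict):
    """Fraction of accepted verdicts.

    ASSUMPTION: the judge emits 'correct', 'partial', or 'incorrect'.
    Strict mode accepts only 'correct'; non-strict also accepts 'partial'.
    This labelling scheme is hypothetical and not stated in the abstract.
    """
    accepted = {"correct"} if strict else {"correct", "partial"}
    return sum(v in accepted for v in verdicts) / len(verdicts)


if __name__ == "__main__":
    print(len(evaluation_grid()), "grid cells")
    print(total_judged_responses(), "judged responses per pass")
```

Under these assumptions the grid has 12 cells and 360 judged responses per pass, which indicates how small each per-cell sample (30 questions) is when reading the reported comparisons.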
