Logo do repositório
 
Publicação

LLM fine-tuning with biomedical open-source data

dc.contributor.authorAnaya, Christopher
dc.contributor.institutionFaculty of Sciences
dc.contributor.institutionDepartment of Informatics
dc.contributor.supervisorCouto, Francisco José Moreira
dc.contributor.supervisorFernandes, Maria Isabel Mou Sequeira
dc.date.accessioned2026-01-15T18:35:01Z
dc.date.available2026-01-15T18:35:01Z
dc.date.issued2025
dc.descriptionTese de Mestrado, Ciência de Dados, 2025, Universidade de Lisboa, Faculdade de Ciências
dc.description.abstractBiomedical question answering (QA) systems aim to support researchers and clinicians by providing accurate, context-aware answers to complex information needs. Recent advances in large language models (LLMs) have significantly improved QA performance across domains, yet challenges remain in the biomedical domain due to terminology complexity, limited data availability, and the risk of generating hallucinated content. This thesis investigates the application of parameter-efficient fine-tuning techniques to adapt LLMs for biomedical QA, focusing on the BioASQ challenge Task B Phase B, which includes yes/no, factoid, list, and ideal questions. A comprehensive review of biomedical QA datasets and LLM adaptations highlights the evolving landscape of knowledge-infused models. The thesis presents a fine-tuning pipeline based on QLoRA, a memory-efficient method for adapting the Mistral-7B-Instruct-v0.1 model using quantized weights. Domain-specific prompt templates were designed for each question type to optimize answer formatting and reduce hallucinations. The experimental setup included training on a curated dataset comprising the training dataset provided by BioASQ, Gene Ontology, DrugBank, and BiQA-derived examples. Results show that the proposed system achieves competitive performance across question types, particularly for yes/no questions, attaining F1 scores of 0.76, where structured JSON outputs enabled reliable automatic evaluation. For ideal (free-text) questions, the system demonstrated fluent but occasionally speculative responses, highlighting the trade-offs between informativeness and factual grounding. Evaluation metrics such as F1, MRR, and ROUGE were complemented by qualitative error analysis to assess system robustness. The study concludes that combining domain-adapted prompts with QLoRA fine-tuning offers a promising approach for deploying efficient and effective biomedical QA systems. Future work should explore retrieval-augmented generation, deeper integration of biomedical ontologies, and improved evaluation frameworks tailored to the nuances of clinical and research settings.en
dc.formatapplication/pdf
dc.identifier.tid204174821
dc.identifier.urihttp://hdl.handle.net/10400.5/116644
dc.language.isoeng
dc.subjectBiomedical Question Answering
dc.subjectLarge Language Models
dc.subjectParameter-Efficient Fine-Tuning
dc.subjectBioASQ
dc.subjectEvaluation Metrics
dc.titleLLM fine-tuning with biomedical open-source dataen
dc.typemaster thesis
dspace.entity.typePublication
rcaap.rightsopenAccess

Ficheiros

Principais
A mostrar 1 - 1 de 1
A carregar...
Miniatura
Nome:
TM_Christopher_Anaya.pdf
Tamanho:
322.64 KB
Formato:
Adobe Portable Document Format