Publication

Benchmarking Contrastive Learning for Multimodal Medical Imaging

datacite.subject.fos: Departamento de Informática
dc.contributor.advisor: Garcia, Nuno Cruz
dc.contributor.author: Silva, Martim Dourado da
dc.date.accessioned: 2025-07-24T08:51:38Z
dc.date.available: 2025-07-24T08:51:38Z
dc.date.issued: 2025
dc.date.submitted: 2025
dc.description: Master's thesis, Data Science, 2025, Universidade de Lisboa, Faculdade de Ciências
dc.description.abstract: Deep learning has achieved remarkable success in complex tasks, but its reliance on large, annotated datasets limits scalability in medical imaging, where expert labeling is costly and scarce. Contrastive learning, a self-supervised approach, offers a way to learn useful visual representations from unlabeled data by training models to distinguish between different images while aligning augmented views of the same instance. This thesis investigates the effectiveness of three state-of-the-art contrastive learning frameworks (SimCLR, MoCo, and BYOL) in generating transferable representations from medical images for two downstream tasks: multiclass classification and binary segmentation of breast tissue. It also examines whether combining ultrasound and mammography images during pretraining supports or hinders model generalization, reflecting real-world multimodal diagnostic workflows. Using seven public datasets, three modality-specific pretraining sets were constructed (ultrasound, mammography, and a balanced multimodal mix), each used to train the three frameworks. The resulting models were then fine-tuned for each downstream task. All networks shared a ResNet-18 backbone, and segmentation models used U-Net architectures with pretrained encoders. Results show that contrastive pretraining improves classification performance, particularly with ultrasound data and with BYOL or MoCo; these models outperformed randomly initialized baselines. For segmentation, however, random initialization yielded superior results, suggesting that standard contrastive objectives and augmentations do not capture the spatial precision needed for pixel-wise tasks. Mammography images posed further challenges due to small lesion size and detail loss from uniform resizing. This work underscores both the promise and the limitations of contrastive learning in clinical imaging: while effective for classification with limited labels, adapting contrastive methods for segmentation requires specialized, modality-aware pipelines.
dc.identifier.uri: http://hdl.handle.net/10400.5/102399
dc.language.iso: eng
dc.subject: Computer Vision
dc.subject: Contrastive Self-Supervised Learning
dc.subject: Medical Image Analysis
dc.subject: Breast Cancer Detection/Diagnosis
dc.subject: Master's theses - 2025
dc.title: Benchmarking Contrastive Learning for Multimodal Medical Imaging
dc.type: master thesis
dspace.entity.type: Publication
rcaap.rights: openAccess
rcaap.type: masterThesis
thesis.degree.name: Master's thesis in Data Science
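
The frameworks benchmarked in the abstract above share a common idea: pretrain an encoder so that two augmented views of the same image map to nearby representations while other images are pushed apart. As a concrete illustration, below is a minimal PyTorch sketch of the NT-Xent loss used in SimCLR-style pretraining; the function name, temperature value, and toy usage are illustrative assumptions and do not reproduce the thesis implementation (which also covers MoCo and BYOL, with a ResNet-18 encoder and projection head).

# Minimal NT-Xent (normalized temperature-scaled cross-entropy) loss,
# the contrastive objective used in SimCLR-style pretraining.
# Illustrative sketch only; not the thesis' training code.
import torch
import torch.nn.functional as F


def nt_xent_loss(z1: torch.Tensor, z2: torch.Tensor, temperature: float = 0.5) -> torch.Tensor:
    """Contrastive loss over two augmented views of the same batch.

    z1, z2: (N, D) projection-head outputs for the two views.
    Positive pairs are (z1[i], z2[i]); all other samples in the
    combined 2N batch act as negatives.
    """
    n = z1.size(0)
    z = F.normalize(torch.cat([z1, z2], dim=0), dim=1)   # (2N, D), unit-norm embeddings
    sim = torch.mm(z, z.t()) / temperature                # (2N, 2N) scaled cosine similarities
    sim.fill_diagonal_(float("-inf"))                     # exclude self-similarity from the softmax

    # Index of the positive for each row: row i pairs with row i + n, and vice versa
    targets = torch.cat([torch.arange(n, 2 * n), torch.arange(0, n)]).to(z.device)
    return F.cross_entropy(sim, targets)


if __name__ == "__main__":
    # Toy usage with random "projections"; in practice z1, z2 come from an
    # encoder plus projection head applied to two augmentations of the same batch.
    z1, z2 = torch.randn(8, 128), torch.randn(8, 128)
    print(nt_xent_loss(z1, z2).item())

Note that this loss applies to the SimCLR-style setting only: MoCo replaces the in-batch negatives with a queue of momentum-encoded keys, and BYOL drops explicit negatives entirely, relying on a predictor and a slowly updated target network.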

Files

Original bundle
Name: TM_Martim_Silva.pdf
Size: 32.12 MB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 1.2 KB
Format: Item-specific license agreed upon to submission