Logo do repositório
 
Publicação

Development of a Website for Creation of Vulnerability Datasets

datacite.subject.fosEngenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e Informáticapt_PT
dc.contributor.advisorNeves, Nuno Fuentecilla Maia Ferreira
dc.contributor.advisorMedeiros, Ibéria Vitória de Sousa
dc.contributor.authorFerreira, Miguel Pinto da Silva
dc.date.accessioned2024-02-16T17:23:14Z
dc.date.available2024-02-16T17:23:14Z
dc.date.issued2024
dc.date.submitted2023
dc.descriptionTese de Mestrado, Engenharia Informática, 2024, Universidade de Lisboa, Faculdade de Ciênciaspt_PT
dc.description.abstractWith the evolution of the digital era, guaranteeing the robustness and security of software has become a major concern. In order to address this subject, it is important to effectively not only detect, but also mitigate software vulnerabilities. Static Analysis Tools (SATs) present a cost-effective solution to this, being able to achieve a cheap and fast analysis, but often incur in a high percentage of false positives and negatives. Recent studies suggest that machine learning (ML) techniques could enhance the effectiveness of these tools, but this requires trustworthy and reliable datasets to train the ML models. This dissertation aims to provide a way of create the aforesaid datasets that can help with the development of ML models capable of identifying vulnerabilities in computer programs. To achieve this, we propose a novel approach to construct these datasets, which consists in collecting inputs from the crowd as a way of mitigating the false positives and negatives generated by the SATs, but at the same time leverage from their deterministic classifications. This approach is applied within the context of web vulnerabilities that appear in applications built with the PHP programming language. To facilitate crowdsourcing, we developed a user-friendly website called BugSpotting where users can classify PHP code snippets, indicating whether these are vulnerable (or not vulnerable) to a set of vulnerability classes. With the results obtained both from the crowd and the SATs, we are able to obtain a reliable and trustworthy dataset comprised of accurately classified PHP code snippets. We evaluated BugSpotting in terms of UI and UX and the results obtained were very satisfactory. Moreover, although we were not able to reach a consensus about the code snippet’s final label, we still manage to analyse the data we have collected until the moment, showing promising results.pt_PT
dc.identifier.tid203882067
dc.identifier.urihttp://hdl.handle.net/10451/62676
dc.language.isoengpt_PT
dc.relationLASIGE - Extreme Computing
dc.subjectVulnerabilidades em aplicações webpt_PT
dc.subjectDeteção de vulnerabilidadespt_PT
dc.subjectAnálise estáticapt_PT
dc.subjectAprendizagem automáticapt_PT
dc.subjectContribuição coletivapt_PT
dc.subjectTeses de mestrado - 2024pt_PT
dc.titleDevelopment of a Website for Creation of Vulnerability Datasetspt_PT
dc.typemaster thesis
dspace.entity.typePublication
oaire.awardTitleLASIGE - Extreme Computing
oaire.awardURIinfo:eu-repo/grantAgreement/FCT/6817 - DCRRNI ID/UIDB%2F00408%2F2020/PT
oaire.fundingStream6817 - DCRRNI ID
project.funder.identifierhttp://doi.org/10.13039/501100001871
project.funder.nameFundação para a Ciência e a Tecnologia
rcaap.rightsopenAccesspt_PT
rcaap.typemasterThesispt_PT
relation.isProjectOfPublicationb429b8f0-500f-4a0b-8e91-33e0a200ad1c
relation.isProjectOfPublication.latestForDiscoveryb429b8f0-500f-4a0b-8e91-33e0a200ad1c
thesis.degree.nameMestrado em Engenharia Informáticapt_PT

Ficheiros

Principais
A mostrar 1 - 1 de 1
A carregar...
Miniatura
Nome:
TM_Miguel_Ferreira.pdf
Tamanho:
794.84 KB
Formato:
Adobe Portable Document Format
Licença
A mostrar 1 - 1 de 1
Miniatura indisponível
Nome:
license.txt
Tamanho:
1.2 KB
Formato:
Item-specific license agreed upon to submission
Descrição: