Repository logo
 
Publication

Exploring nanopore long reads and whole-genome sequencing data to characterize long tandem repeats and chromosome structure in mammalian genomes

datacite.subject.fosDepartamento de Química e Bioquímicapt_PT
dc.contributor.advisorCarvalho, Margarida Henriques da Gama, 1972-
dc.contributor.advisorDias, Ricardo
dc.contributor.authorEleutério, Daniel Alexandre Varela Marques
dc.date.accessioned2024-02-02T15:01:37Z
dc.date.embargo2025-10-13
dc.date.issued2023
dc.date.submitted2023
dc.descriptionTese de mestrado, Bioquímica e Biomedicina, 2023, Universidade de Lisboa, Faculdade de Ciênciaspt_PT
dc.description.abstractLong tandem repeats (LTRs) sequences, namely satellite DNA (satDNA), play a critical role in genome organization and stability. Their detection and characterization still represent a challenge, which long-read Nanopore sequencing is helping to simplify. Thus, there is a growing interest in developing bioinformatic tools for the analysis of LTRs and in understanding their role in genome evolution. The main aim of this work was the characterization of LTRs in mammalian genomes, using WGS data from two model organisms: Peromyscus genus and Homo sapiens. In the first case, the focus was to characterize PMSat, an evolutionarily conserved satDNA. In the second case, the key objective was to perform a comparative assessment of LTRs between different human long-read genome datasets, to characterize in detail the human satellite HSat1A and its transcripts, and to establish methods to analyze satDNA involvement in Robertsonian translocations (specifically the t(14;21)). The analysis performed identified PMSat as the most abundant LTR of the Peromyscus genus, in accordance with previous cytogenetic studies, with GC-content, monomer and array length exhibiting little variation. Moreover, a tendency in the positioning in Peromyscus genus chromosomes was identified. In the study of the three WGS Nanopore human datasets, the LTRs ALR, BSR, HSat2 and HSat1A were identified as the most abundant. It was possible to demonstrate the potential of chromosome sorting to facilitate the analysis of translocated chromosomes. However, low sequencing yield and DNA fragmentation did not allow for characterization of the target region. Finally, LTRs of HSat1A were shown to have a predominance of 9-mer monomers, and to express polyadenylated transcripts of varying lengths, probably resulting from alternative polyadenylation. This study represents a relevant contribution to understanding the role of LTRs in genome organization and evolution.pt_PT
dc.identifier.tid203486994
dc.identifier.urihttp://hdl.handle.net/10451/62425
dc.language.isoengpt_PT
dc.subjectLTRpt_PT
dc.subjectDNA satélitept_PT
dc.subjectSequenciação Nanoporept_PT
dc.subjectPMSatpt_PT
dc.subjectHSat1Apt_PT
dc.subjectTeses de mestrado - 2023pt_PT
dc.titleExploring nanopore long reads and whole-genome sequencing data to characterize long tandem repeats and chromosome structure in mammalian genomespt_PT
dc.typemaster thesis
dspace.entity.typePublication
rcaap.rightsembargoedAccesspt_PT
rcaap.typemasterThesispt_PT
thesis.degree.nameTese de mestrado em Bioquímica e Biomedicinapt_PT

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
TM_Daniel_Eleutério.pdf
Size:
3.76 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.2 KB
Format:
Item-specific license agreed upon to submission
Description: