DSpace/Dipòsit Manakin

Machine learning methods in personalized medicine: application to genomic data in Alzheimer's disease

Registre simple

dc.contributor Universitat de Vic - Universitat Central de Catalunya. Facultat de Ciències i Tecnologia
dc.contributor Universitat de Vic - Universitat Central de Catalunya. Màster Universitari en Anàlisi de Dades Òmiques
dc.contributor.author Ballestà López, Mireia
dc.date.accessioned 2019-03-05T18:45:14Z
dc.date.available 2019-03-05T18:45:14Z
dc.date.created 2018-09
dc.date.issued 2018-09
dc.identifier.uri http://hdl.handle.net/10854/5729
dc.description Curs 2017-2018 es
dc.description.abstract The main goal of this project is to validate and compare machine learning methods to perform GWAS analysis. This study worked with genomic data on Alzheimer’s disease (AD). The data obtained was imputed by the Michigan Imputation Server and pre-processed by a quality control at both SNPs and individual’s level. In order to reduce the dimensionality, SNPs were filtered using different Linkage-Disequilibrium (LD) thresholds (0.2, 0.4 and 0.6). Filtered data was then analysed by five machine learning statistical methods: logistic regression, random forest, k-nearest neighbours, Gradient Boosting Machine and, deep neural networks. The model performance were compared using AUC, sensitivity, specificity and F-measure to evaluate the predictive capacity or reliability of the models. In addition, best models were validated using KEGG pathways. Our conclusion is that best results are obtained when applying a LD threshold of 0.2. From all the five algorithms performed, GBM with a LD threshold 0.2 was seen to be the best model to predict AD based on AUC, sensitivity, specificity, F-measure and validating the results with KEGG pathways. es
dc.format application/pdf es
dc.format.extent 39 p. es
dc.language.iso eng es
dc.rights Tots els drets reservats es
dc.subject.other Aprenentatge automàtic es
dc.subject.other Alzheimer, Malaltia d' es
dc.title Machine learning methods in personalized medicine: application to genomic data in Alzheimer's disease es
dc.type info:eu-repo/semantics/masterThesis es
dc.description.version Supervisor/a: Juan Ramón González
dc.description.version Director/a: Josep M. Serrat
dc.rights.accessRights info:eu-repo/semantics/openAccess es

Text complet d'aquest document

Registre simple

Buscar al RIUVic


Cerca avançada

Llistar per

Estadístiques