Please use this identifier to cite or link to this item: https://zone.biblio.laurentian.ca/handle/10219/3670
Full metadata record
DC FieldValueLanguage
dc.contributor.authorShi, ZhengXin-
dc.date.accessioned2021-05-13T19:01:30Z-
dc.date.available2021-05-13T19:01:30Z-
dc.date.issued2020-02-26-
dc.identifier.urihttps://zone.biblio.laurentian.ca/handle/10219/3670-
dc.description.abstractEpigenetics is the study of biological mechanisms that will switch genes on and off, its alterations are deeply involved in the change of gene expression among various diseases including cancers. Machine learning is frequently used in cancer diagnosis and detection. In this research, four types of data are used towards the correct prediction of lung cancer, including DNA Methylation data, Histone data, Human Genome data, and RNA-Seq data. Four feature selection methods - ReliefF, Gain Ratio (GR), Principle Component Analysis (PCA), Correlation-based feature selection (CFS) and seven different classifiers - Random Forest (RF), Support Vector Machine (SVM) with Gaussian Kernel function and Linear Kernel function, Logistic Regression (LR), Naive Bayes (NB), Artificial Neural Network, and Convolutional Neural Network (CNN) were implemented in this study. The processing of these data sets is done using custom R-script. The tools that were used for feature selection and classification in the presented work are Weka 3 and Python. With the help of machine learning and deep learning methods, we were able to improve the accuracy and area under the curve (AUC) of the lung cancer prediction from an earlier published work. It was observed that the CNN model overperformed the other six classification methods.en_US
dc.language.isoenen_US
dc.subjectEpigenomicsen_US
dc.subjectdeep learningen_US
dc.subjecthistone modificationen_US
dc.subjectDNA methylationen_US
dc.subjectRNA-sequencingen_US
dc.subjectfeature selectionen_US
dc.subjectclassificationen_US
dc.titleImproved prediction of gene expression of epigenomics data of lung cancer using machine learning and deep learning modelsen_US
dc.typeThesisen_US
dc.description.degreeMaster of Science (MSc) in Computational Sciencesen_US
dc.publisher.grantorLaurentian University of Sudburyen_US
Appears in Collections:Computational Sciences - Master's theses
Master's Theses

Files in This Item:
File Description SizeFormat 
Zhengxin Shi - Thesis FINAL.pdf1.56 MBAdobe PDFThumbnail
View/Open


Items in LU|ZONE|UL are protected by copyright, with all rights reserved, unless otherwise indicated.