Please use this identifier to cite or link to this item: https://zone.biblio.laurentian.ca/handle/10219/3855
Full metadata record
DC FieldValueLanguage
dc.contributor.authorVaghasia, Shreya-
dc.date.accessioned2022-04-19T14:11:33Z-
dc.date.available2022-04-19T14:11:33Z-
dc.date.issued2021-04-13-
dc.identifier.urihttps://zone.biblio.laurentian.ca/handle/10219/3855-
dc.description.abstractFor the perspective of the latest happing news or some events occurring around the world, social media is widely used. The reaction given by the people’s opinion comes in the way of raw natural data in different languages and environments. All those written views have some kind of unbalanced statement, i.e., some sensitive information or some slang words and uneven words. This makes the researcher or data analyst to extract information and pattern from the dataset available. This makes opinion mining and taking strategic decision useful in future market. For sentiment analysis, Natural Language Processing (NLP) and Data Mining techniques are used to structure an unbalanced data. Using machine learning techniques, the built method analyses Twitter data to detect sentiment of views from people all around the world. For research purposes of this study, the dataset was taken from Twitter for Citizenship Amendment Act 2019, India. Throughout that time many people had given their opinions, views about this new Citizenship Amendment Act. The sentiment polarity is measured using VADER (Valence Aware Dictionary and Sentiment Reasoner), which purifies and analyses the data using natural language processing techniques. The dataset was normalized and prepared using natural language processing techniques such as Word Tokenization, Stemming and lemmatization, Part of Speech Tagging in order to be used by machine learning algorithms. All the input variables are converted in the form of vectors by using “term frequency-inverse document frequency” (TF-IDF). The python programming language was used to implement this process. Classifiers such as Naïve Bayes, SVM (support vector machine), k-nearest neighbor (KNN), neural network, Logistic Regression, Random Forest, and a LSTM (Long-short Term Memory) based RNN (Recurrent Neural Network) deep learning method were used to obtain evaluation parameters such as accuracy, precision, recall and F-score. On the mean values of performance metrics, a One-way Analysis of Variance (ANOVA) test was performed on all the methods.en_US
dc.language.isoenen_US
dc.subjectNatural Language Processing (NLP)en_US
dc.subjectTwitteren_US
dc.subjectSentiment analysisen_US
dc.subjectCitizenship Amendment Acten_US
dc.subjectNaïve Bayesen_US
dc.subjectSVMen_US
dc.subjectRandom Foresten_US
dc.subjectKNNen_US
dc.subjectPythonen_US
dc.subjectMachine learningen_US
dc.subjectDeep learningen_US
dc.titleSentiment analysis on Citizenship Amendment Act of India 2019 using Twitter dataen_US
dc.typeThesisen_US
dc.description.degreeMaster of Science (MSc) in Computational Sciencesen_US
dc.publisher.grantorLaurentian University of Sudburyen_US
Appears in Collections:Computational Sciences - Master's theses

Files in This Item:
File Description SizeFormat 
Thesis FINAL - Shreya Vaghasia - 22-Apr-2021.pdf3.25 MBAdobe PDFThumbnail
View/Open


Items in LU|ZONE|UL are protected by copyright, with all rights reserved, unless otherwise indicated.