Show simple item record

dc.contributor.advisorSharif, Amer
dc.contributor.advisorAmalia
dc.contributor.authorMetta, Roshan Ram
dc.date.accessioned2024-01-15T04:49:38Z
dc.date.available2024-01-15T04:49:38Z
dc.date.issued2023
dc.identifier.urihttps://repositori.usu.ac.id/handle/123456789/90180
dc.description.abstractThis thesis aims to conduct research related to sentiment analysis of public opinion regarding the work program of the City of Medan during the administration of the Mayor and Deputy Mayor for the period 2020 to the present. This research uses the LSTM (Long Short Term Memory) algorithm and a weighting method using BERT (Bidirectional Encoder Representations From Transformers). The dataset used in this research was obtained from Twitter using hashtags such as #MedanBerkah, #MedanBersih, #MedanKolaborasi, and keywords such as Medan City, Medan Maju, and Medan Sejahtera. This dataset goes through a scraping process and then a pre-processing stage, including case folding, tokenizing, stopword removal, punctuation removal, and lemmatization. Data that has been cleaned is called clean data. Next, the data that has been cleaned will be labeled with sentiment, which is done automatically using the Lexicon Based method with a word piece approach. The total labeled data consists of 3642 tweets, with 1723 having positive sentiment, 1665 negative sentiment, and 256 neutral sentiment. The labeled data is ready for the model training process. Before training, researchers carried out a hyperparameter search using the Optuna library to get the best parameters. Optuna tries a combination of parameters that have been prepared and chooses the best parameters based on the calculation results. After getting the best hyperparameters, researchers trained the model using the LSTM algorithm and weighting using the BERT algorithm. The training results show the lowest accuracy of 75% and the highest accuracy of 76%. These results show that the model can classify public opinion about the Medan City work program well, although there is still room for further development, especially in the data labeling process. Constraints in this research include the quality of data labeling using the Lexicon Based method, the model not understanding some slang words in the dataset, and the need for a larger dataset for a more complex model. Additional features are also needed in the pre-processing process, such as data normalization, to improve the quality of sentiment analysisen_US
dc.language.isoiden_US
dc.publisherUniversitas Sumatera Utaraen_US
dc.subjectLSTMen_US
dc.subjectLong Short Term Memoryen_US
dc.subjectTwitteren_US
dc.subjectSDGsen_US
dc.titleAnalisis Sentimen terhadap Opini Masyarakat Mengenai Program Kerja Kota Medan Menggunakan LSTM (Long Short Term Memory) dengan Media Sosial Twitteren_US
dc.typeThesisen_US
dc.identifier.nimNIM191401107
dc.identifier.nidnNIDN0121106902
dc.identifier.nidnNIDN0121127801
dc.identifier.kodeprodiKODEPRODI59201#Teknologi Informasi
dc.description.pages73 Halamanen_US
dc.description.typeSkripsi Sarjanaen_US


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record