Analisis Sentimen Terhadap Film Indonesia dengan Pendekatan Bert

Fimoza, Dwi

Analisis Sentimen Terhadap Film Indonesia dengan Pendekatan Bert

dc.contributor.advisor	Amalia
dc.contributor.advisor	Harumy, T. Henny Febriana
dc.contributor.author	Fimoza, Dwi
dc.date.accessioned	2021-02-01T07:14:45Z
dc.date.available	2021-02-01T07:14:45Z
dc.date.issued	2021
dc.identifier.uri	http://repositori.usu.ac.id/handle/123456789/30445
dc.description.abstract	This study aims to analyze the sentiment in Indonesia Language towards the Gundala movie reviews on YouTube. However, sentiment analysis on YouTube comments are varying from positive, negative, and neutral comments which requires some automation in terms of classifying comments based on the polarity of sentiment. Sentiment analysis using traditional machine learning algorithms such as Naïve Bayes, SVM, etc cannot understand the context of comments in depth about the semantic of words because it only learns the given patters such as the frequency of occurrence of words. We need a transfer learning approach such as BERT (Bidirectional Encoder Representations from Transformers) which produces a bidirectional language model. The dataset used to do sentiment analysis goes through a pre-processing step which consists of case folding, data cleaning, tokenization, stop words removal, stemming, and normalization, using libraries from NLTK and Sastrawi. In this study, the hyperparameters used were 10 epochs, learning rate of 2e-5, and a batch size 16. In sentiment analysis, we will be using a multilingual-cased-model BERTBASE model and it was carried out with three experiments. During this experiment, the accuracy gained in first experiment is 66%, while the second experiment was 68%, and the third experiment was 66%. So, the average accuracy obtained is 66,7%.	en_US
dc.description.abstract	Penelitian ini bertujuan untuk analisis sentimen Bahasa Indonesia terhadap review film Gundala di YouTube. Namun, analisis sentimen pada komentar YouTube yang bervariasi dari komentar positif, negatif, maupun netral membutuhkan suatu otomatisasi dalam mengklasifikasikan komentar berdasarkan polaritas sentimennya. Analisis sentimen dengan penggunaan algoritma machine learning tradisional seperti Naïve Bayes, SVM, dan lain-lain tidak dapat memahami konteks dari komentar secara mendalam tentang semantik kata yang ada karena hanya mempelajari pola-pola yang diberikan seperti frekuensi kemunculan kata. Untuk itu dibutuhkan sebuah pendekatan transfer learning seperti BERT (Bidirectional Encoder Representations from Transformers) yang menghasilkan sebuah model bahasa dua arah (bidirectional). Dataset yang digunakan melalui tahap pre-processing yang terdiri dari case folding, data cleaning, tokenisasi, stopwords removal, stemming, dan normalisasi dengan library NLTK dan Sastrawi sebelum dilakukan analisis sentimen. Dalam penelitian ini hyperparameters yang digunakan adalah 10 epoch, learning rate 2e-5, dan batch size 16. Pengujian analisis sentimen menggunakan model BERTBASE multilingual-cased-model dan dilakukan dengan tiga kali percobaan. Nilai akurasi yang diperoleh pada percobaan pertama adalah 66%, sedangkan percobaan kedua adalah 68%, dan percobaan ketiga adalah 66%. Sehingga rata-rata nilai akurasi yang diperoleh adalah 66,7%.	en_US
dc.language.iso	id	en_US
dc.publisher	Universitas Sumatera Utara	en_US
dc.subject	Analisis Sentimen	en_US
dc.subject	Film Indonesia	en_US
dc.subject	YouTube	en_US
dc.subject	Gundala	en_US
dc.subject	Bidirectional Encoder Representations from Transformers	en_US
dc.subject	Deep Learning	en_US
dc.subject	Transformers	en_US
dc.title	Analisis Sentimen Terhadap Film Indonesia dengan Pendekatan Bert	en_US
dc.type	Thesis	en_US
dc.identifier.nim	NIM161401131
dc.description.pages	95 Halaman	en_US
dc.description.type	Skripsi Sarjana	en_US

Files in this item

Name:: 161401131.pdf
Size:: 5.146Mb
Format:: PDF
Description:: Fulltext

View/Open

This item appears in the following Collection(s)

Undergraduate Theses [1253]
Skripsi Sarjana

Show simple item record