Show simple item record

dc.contributor.advisorNababan, Erna Budhiarti
dc.contributor.advisorMawengkang, Herman
dc.contributor.authorSyahputra, Andika
dc.date.accessioned2024-11-06T06:50:07Z
dc.date.available2024-11-06T06:50:07Z
dc.date.issued2024
dc.identifier.urihttps://repositori.usu.ac.id/handle/123456789/98621
dc.description.abstractThis study examines the performance of GloVe word embedding on the Long Short-Term Memory (LSTM) model in translating from Batak to English. GloVe has the ability to capture semantic meaning from a wide context of words. This study includes training the GloVe model with various parameters as well as collecting and processing a unique parallel Batak - English dataset. LSTM is a derivative of Recurrent Neural Network (RNN) which has the ability to maintain long-term dependencies and handle sequential data. In machine translation, LSTM performance is influenced by the quality of the word embedding used, which produces vector representations of words, and captures their semantic and contextual relationships. In this study, the authors analyze the performance of GloVe word embedding on the LSTM model and compare it with Word2Vec. The dataset used is 28,420 Batak-English sentence pairs collected from various sources. With encoder and decoder components, the LSTM model is trained for several epochs and the results are evaluated using the Bilingual Evalution Understudy (BLEU) score. This metric evaluates n-grams of actual translations with predicted translations, which then gives a translation accuracy score. The results show that GloVe word embedding performs better than Word2Vec. Glove word embedding gets an average BLEU score of 0.9415, while Word2Vec gets an average BLEU score of 0.9346. GloVe's better performance is due to its ability to understand language in a larger dataset and understand the context of words in a wider contexten_US
dc.language.isoiden_US
dc.publisherUniversitas Sumatera Utaraen_US
dc.subjectGloVeen_US
dc.subjectWordVecen_US
dc.subjectLSTMen_US
dc.subjectMachine Translationen_US
dc.subjectBatak Language- Englishen_US
dc.titleAnalisis Kinerja Word Embedding Glove dalam Penerjemahan Bahasa Batak-Inggrisen_US
dc.title.alternativePerformance Analysis of Glove Word Embedding for Batak Language - English Translationen_US
dc.typeThesisen_US
dc.identifier.nimNIM207038002
dc.identifier.nidnNIDN0026106209
dc.identifier.nidnNIDN8859540017
dc.identifier.kodeprodiKODEPRODI55101#Teknik Informatika
dc.description.pages90 Pagesen_US
dc.description.typeTesis Magisteren_US
dc.subject.sdgsSDGs 4. Quality Educationen_US


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record