dc.contributor.advisor: Lydia, Maya Silvi
dc.contributor.advisor: Sihombing, Poltak
dc.contributor.author: Diba, Farah
dc.date.accessioned: 2023-08-08T06:49:45Z
dc.date.available: 2023-08-08T06:49:45Z
dc.date.issued: 2023
dc.identifier.uri: https://repositori.usu.ac.id/handle/123456789/86407
dc.description.abstract: High-dimensional data requires machine learning methods that can classify quickly and effectively. One algorithm that can handle complex data is Random Forest, which works by building several decision trees on random samples of the data and features. However, high-dimensional data requires more storage space and results in longer computation times. Principal Component Analysis (PCA) is a widely used dimension reduction method for representing high-dimensional data: it forms several principal components that retain the important information from the original data. The datasets used in this study come from the Kaggle repository and comprise three types: a water quality dataset (continuous), a stroke disease dataset (nominal), and an airline satisfaction dataset (ordinal). In the results, Random Forest with n_estimators = 9 and no dimension reduction achieves the best accuracy, 95.86%, on the airline satisfaction dataset. At n_estimators = 3, 5, 7, and 9, accuracy decreases when the data is first reduced with PCA. It can therefore be concluded that Random Forest without dimension reduction already provides the best accuracy when it builds 9 trees (n_estimators = 9). In these experiments, the more trees built on high-dimensional data, the better the resulting accuracy. [en_US]
dc.language.iso: id [en_US]
dc.publisher: Universitas Sumatera Utara [en_US]
dc.subject: Random Forest [en_US]
dc.subject: Principal Component Analysis [en_US]
dc.subject: Dimension Reduction [en_US]
dc.subject: SDGs [en_US]
dc.title: Analisis Akurasi Random Forest Menggunakan Principal Component Analysis (PCA) (English: Accuracy Analysis of Random Forest Using Principal Component Analysis (PCA)) [en_US]
dc.type: Thesis [en_US]
dc.identifier.nim: NIM187038067
dc.identifier.nidn: NIDN0027017403
dc.identifier.nidn: NIDN0017036205
dc.identifier.kodeprodi: KODEPRODI55101#Teknik Informatika
dc.description.pages: 96 pages [en_US]
dc.description.type: Master's thesis [en_US]
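
The comparison described in the abstract (Random Forest accuracy with and without PCA at n_estimators = 3, 5, 7, and 9) can be sketched roughly as follows. This is a minimal illustration using scikit-learn, not the thesis's actual code. The synthetic data from `make_classification`, the 80/20 split, the scaling step, the choice of `n_components=5`, and the helper name `compare_rf_with_and_without_pca` are all assumptions made for the example. The record does not state which settings the study used, so the numbers this prints are not expected to match the reported results.

```python
from sklearn.datasets import make_classification
from sklearn.decomposition import PCA
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler


def compare_rf_with_and_without_pca(X, y, tree_counts=(3, 5, 7, 9),
                                    n_components=5, seed=0):
    """Return {n_estimators: (accuracy_without_pca, accuracy_with_pca)}.

    Hypothetical helper: the split ratio, scaling, and n_components
    are illustrative assumptions, not settings taken from the thesis.
    """
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.2, random_state=seed, stratify=y
    )
    results = {}
    for n_trees in tree_counts:
        # Baseline: Random Forest trained on the original features.
        plain = RandomForestClassifier(n_estimators=n_trees, random_state=seed)
        plain.fit(X_train, y_train)

        # Reduced: standardize, project onto principal components, then classify.
        reduced = make_pipeline(
            StandardScaler(),
            PCA(n_components=n_components),
            RandomForestClassifier(n_estimators=n_trees, random_state=seed),
        )
        reduced.fit(X_train, y_train)

        # score() reports mean accuracy on the held-out test split.
        results[n_trees] = (
            plain.score(X_test, y_test),
            reduced.score(X_test, y_test),
        )
    return results


if __name__ == "__main__":
    # Synthetic stand-in for the Kaggle datasets named in the abstract.
    X, y = make_classification(
        n_samples=1000, n_features=20, n_informative=8, random_state=0
    )
    for n_trees, (acc_plain, acc_pca) in compare_rf_with_and_without_pca(X, y).items():
        print(f"n_estimators={n_trees}: no PCA={acc_plain:.4f}, PCA={acc_pca:.4f}")
```

Fitting PCA inside a pipeline means the components are learned from the training split only, which avoids leaking test data into the projection. To try this on one of the Kaggle datasets, the feature matrix and labels would be loaded in place of the synthetic `X` and `y`.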

