Optimasi Kinerja Algoritma K-Nearest Neighbor Menggunakan Seleksi Fitur Gain Ratio dan Local Mean Vector

Iswanto

Optimasi Kinerja Algoritma K-Nearest Neighbor Menggunakan Seleksi Fitur Gain Ratio dan Local Mean Vector

dc.contributor.advisor	Tulus
dc.contributor.advisor	Sihombing, Poltak
dc.contributor.author	Iswanto
dc.date.accessioned	2021-08-09T06:44:10Z
dc.date.available	2021-08-09T06:44:10Z
dc.date.issued	2021
dc.identifier.uri	http://repositori.usu.ac.id/handle/123456789/39684
dc.description.abstract	K-Nearest Neighbor (K-NN) is an algorithm that many used in the data classification method. But based on the results of some research, it shows that the performance of K-NN is still lower than other classification methods. The low K-NN accuracy value because of the influence of features (attributes) less relevant and the majority vote system method in determining the new class of test data. As a solution in this research, the authors use feature selection of gain ratio to select relevant features on the dataset and the local mean vector in determining a new class for the test data. These methods tested using datasets sourced from the UCI Machine Learning Repository and Kaggle Repository, namely Ionosphere, Breast Cancer, Voice genre, Hypothyroid. Based on the test results, the use of the gain ratio feature selection and local mean vector method is proven to be able to increase the accuracy value of the K-NN algorithm with an average increase in accuracy in each dataset, namely Voice Genre of 26.90%, Ionosphere of 4.86%, Breast Cancer by 3.57%, and Hypothyroid by 1.25%. Of all the datasets tested in this study, the average accuracy value increased by 9.14%.	en_US
dc.description.abstract	K-Nearest Neighbor (K-NN) merupakan salah satu algoritma yang banyak digunakan dalam metode klasifikasi data. Tetapi berdasarkan hasil dari beberapa penelitian menunjukkan bahwa kinerja K-NN masih lebih rendah dibandingkan dengan metode klasifikasi yang lain. Rendahnya nilai akurasi K-NN disebabkan oleh pengaruh fitur (atribut) yang kurang relevan pada dataset dan penggunaan sistem vote majority dalam penentuan kelas baru data uji. Sebagai solusinya pada penelitian ini penulis menggunakan metode seleksi fitur gain ratio untuk memilih fitur yang relevan pada dataset dan local mean vector dalam penentuan kelas baru bagi data yang diujikan. Metode tersebut diuji menggunakan dataset yang bersumber dari UCI Machine Learning Repository dan Kaggle Repository yaitu Ionosphere, Breast Cancer, Voice genre dan Hypothyroid. Berdasarkan hasil pengujian, penggunaan metode seleksi fitur gain ratio dan local mean vector terbukti mampu meningkatkan nilai akurasi pada algoritma K-NN dengan rata-rata peningkatan akurasi pada masing-masing dataset yaitu Voice Genre sebesar 26,90%, Ionosphere sebesar 4,86%, Breast Cancer sebesar 3,57%, dan Hypothyroid sebesar 1,25%. Dari seluruh dataset yang diujikan pada penelitian ini rata-rata peningkatan nilai akurasi sebesar 9,14%.	en_US
dc.language.iso	id	en_US
dc.publisher	Universitas Sumatera Utara	en_US
dc.subject	K-Nearest Neighbor	en_US
dc.subject	Gain Ratio	en_US
dc.subject	Local Mean Vector	en_US
dc.title	Optimasi Kinerja Algoritma K-Nearest Neighbor Menggunakan Seleksi Fitur Gain Ratio dan Local Mean Vector	en_US
dc.type	Thesis	en_US
dc.identifier.nim	NIM187038048
dc.description.pages	64 Halaman	en_US
dc.description.type	Tesis Magister	en_US

Files in this item

Name:: 187038048.pdf
Size:: 1.720Mb
Format:: PDF
Description:: Fulltext

View/Open

This item appears in the following Collection(s)

Master Theses [627]
Tesis Magister

Show simple item record