Kombinasi Metode Tomek-Links dan Random Undersampling untuk Identifikasi Single Nucleotide Polymorphism Menggunakan Artificial Neural Network pada Genom Kedelai

Pulungan, Aflah Mutsanni

dc.contributor.advisor	Arisandi, Dedy
dc.contributor.advisor	Nurhasanah, Rossy
dc.contributor.author	Pulungan, Aflah Mutsanni
dc.date.accessioned	2022-12-19T03:04:29Z
dc.date.available	2022-12-19T03:04:29Z
dc.date.issued	2022
dc.identifier.uri	https://repositori.usu.ac.id/handle/123456789/75116
dc.description.abstract	Next Generation Sequencing (NGS) is a technology that can read Single Nucleotide Polymorphisms (SNPs) in a genome, including the soybean genome used in this study. However, NGS machines have a high error rate, so SNP candidates caused by read errors outnumber the true SNP candidates. The data generated by NGS are also imbalanced: negative SNPs outnumber positive SNPs. To address this imbalance, the study uses Tomek Links and Random Undersampling, which aim to eliminate noisy data and form a new dataset. SNP identification is then performed with an Artificial Neural Network, a method that can classify large amounts of data. The resulting model uses the following Artificial Neural Network hyperparameters: 10 epochs, a Log Softmax activation function, and a batch size of 64. Random Undersampling uses a sampling strategy (balance ratio) of 0.4. In the evaluation, the model achieves a G-Mean of 93, from which it is concluded that the Random Undersampling and Artificial Neural Network methods used in this study can identify SNPs well.	en_US
dc.language.iso	id	en_US
dc.publisher	Universitas Sumatera Utara	en_US
dc.subject	Next Generation Sequencing	en_US
dc.subject	Single Nucleotide Polymorphism	en_US
dc.subject	soybean	en_US
dc.subject	imbalanced data	en_US
dc.subject	Tomek Links	en_US
dc.subject	Random Undersampling	en_US
dc.subject	Artificial Neural Network	en_US
dc.subject	hyperparameter	en_US
dc.subject	epoch	en_US
dc.subject	activation function	en_US
dc.subject	Log Softmax	en_US
dc.subject	batch size	en_US
dc.subject	sampling strategy	en_US
dc.title	Kombinasi Metode Tomek-Links dan Random Undersampling untuk Identifikasi Single Nucleotide Polymorphism Menggunakan Artificial Neural Network pada Genom Kedelai	en_US
dc.type	Thesis	en_US
dc.identifier.nim	NIM171402012
dc.identifier.nidn	NIDN0031087905
dc.identifier.nidn	NIDN0001078708
dc.identifier.kodeprodi	KODEPRODI59201#Teknologi Informasi
dc.description.pages	82 pages	en_US
dc.description.type	Undergraduate Thesis (Skripsi Sarjana)	en_US
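
The abstract names three techniques: Tomek Links, Random Undersampling with a balance ratio of 0.4, and the G-Mean metric. The sketch below illustrates them in pure Python on the assumption that the ratio means minority count divided by majority count after resampling, and that G-Mean is the geometric mean of sensitivity and specificity. All function names are hypothetical and chosen for illustration only. This record does not show the thesis's own implementation, so this is not a reproduction of it.

```python
import math
import random


def tomek_link_majority_indices(X, y, majority_label):
    """Return indices of majority samples that take part in a Tomek link."""
    def nearest(i):
        # Index of the closest other point (squared Euclidean distance).
        return min(
            (j for j in range(len(X)) if j != i),
            key=lambda j: sum((a - b) ** 2 for a, b in zip(X[i], X[j])),
        )

    nn = [nearest(i) for i in range(len(X))]
    to_drop = set()
    for i, j in enumerate(nn):
        # A Tomek link: mutual nearest neighbours with different labels.
        if nn[j] == i and y[i] != y[j]:
            to_drop.add(i if y[i] == majority_label else j)
    return to_drop


def random_undersample(X, y, majority_label, ratio, seed=0):
    """Randomly drop majority samples until minority/majority is about `ratio`."""
    rng = random.Random(seed)
    maj = [i for i, label in enumerate(y) if label == majority_label]
    mino = [i for i, label in enumerate(y) if label != majority_label]
    # Assumption: ratio = n_minority / n_majority after resampling.
    n_keep = min(len(maj), round(len(mino) / ratio))
    kept = sorted(rng.sample(maj, n_keep) + mino)
    return [X[i] for i in kept], [y[i] for i in kept]


def g_mean(y_true, y_pred, positive=1):
    """Geometric mean of sensitivity and specificity for binary labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return math.sqrt(sensitivity * specificity)
```

In a combined pipeline of this kind, the Tomek-link step would run first to remove majority samples that sit next to minority samples, and the random undersampling step would then reduce the remaining majority class to the target ratio. `g_mean` returns a value between 0 and 1.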

Files in this item

Name: 171402012.pdf
Size: 2.759Mb
Format: PDF
Description: Fulltext

This item appears in the following Collection(s)

Undergraduate Theses [795]
Skripsi Sarjana