Show simple item record

dc.contributor.advisorNababan, Erna Budhiarti
dc.contributor.advisorFahmi
dc.contributor.authorMuhibuddin, Muhibuddin
dc.date.accessioned2025-01-13T02:51:14Z
dc.date.available2025-01-13T02:51:14Z
dc.date.issued2024
dc.identifier.urihttps://repositori.usu.ac.id/handle/123456789/100060
dc.description.abstractCustomer segmentation is a critical component in developing effective marketing strategies, yet obtaining accurate and comprehensive customer datasets frequently poses significant challenges. This study investigates the feasibility of synthetic data generated through Time-series Generative Adversarial Networks (TimeGAN) to augment Length, Recency, Frequency, and Monetary (LRFM) analysis and K-Means clustering for customer segmentation. TimeGAN was selected for its capacity to capture temporal dependencies in sequential data, while the LRFM framework was employed to elucidate key behavioral dimensions of customers. K-Means served as the clustering algorithm, with Silhouette Coefficient and Davies-Bouldin Index as performance indicators. Experimental results reveal that the synthetic dataset achieved a Silhouette Coefficient of 0.42, marginally higher than the real dataset’s 0.41. Meanwhile, the Davies-Bouldin Index for the synthetic data was 0.90, closely paralleling the real data’s 0.89. These findings indicate that synthetic data effectively replicates the intrinsic patterns and distributions of actual datasets without compromising the integrity or quality of the clustering outcomes. By integrating synthetic data with LRFM analysis and K-Means clustering, this research provides valuable insights into customer segmentation, underscoring the potential of synthetic data as a reliable alternative when real-world data are scarce or unavailable. Consequently, the study highlights the broader practical implications of leveraging synthetic data, particularly in preserving data confidentiality and mitigating privacy concerns.en_US
dc.language.isoiden_US
dc.subjectData Syntheticen_US
dc.subjectTimeGanen_US
dc.subjectKmeans Clusteringen_US
dc.subjectCustomer Segmentationen_US
dc.subjectLRFMen_US
dc.titlePembangkitan Data Sintetik dalam Length Recency Frequency Monetary (LRFM) dan Klasterisasi K-Means untuk Segmentasi Pelangganen_US
dc.title.alternativeSynthetic Data Generation for Length, Recency, Frequency, Monetary (LRFM) Analysis and K-Means Clustering in Customer Segmentationen_US
dc.typeThesisen_US
dc.identifier.nimNIM217056006
dc.identifier.nidnNIDN0026106209
dc.identifier.nidnNIDN0009127608
dc.identifier.kodeprodiKODEPROD49302#Sains Data dan Kecerdasan Buatan
dc.description.pages66 Pagesen_US
dc.description.typeTesis Magisteren_US
dc.subject.sdgsSDGs 9. Industry Innovation And Infrastructureen_US


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record