• Login
    View Item 
    •   USU-IR Home
    • Faculty of Computer Science and Information Technology
    • Data Science and Artificial Intelligence
    • Master Theses
    • View Item
    •   USU-IR Home
    • Faculty of Computer Science and Information Technology
    • Data Science and Artificial Intelligence
    • Master Theses
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Pembangkitan Data Sintetik dalam Length Recency Frequency Monetary (LRFM) dan Klasterisasi K-Means untuk Segmentasi Pelanggan

    Synthetic Data Generation for Length, Recency, Frequency, Monetary (LRFM) Analysis and K-Means Clustering in Customer Segmentation

    Thumbnail
    View/Open
    Cover (406.0Kb)
    Fulltext (2.262Mb)
    Date
    2024
    Author
    Muhibuddin, Muhibuddin
    Advisor(s)
    Nababan, Erna Budhiarti
    Fahmi
    Metadata
    Show full item record
    Abstract
    Customer segmentation is a critical component in developing effective marketing strategies, yet obtaining accurate and comprehensive customer datasets frequently poses significant challenges. This study investigates the feasibility of synthetic data generated through Time-series Generative Adversarial Networks (TimeGAN) to augment Length, Recency, Frequency, and Monetary (LRFM) analysis and K-Means clustering for customer segmentation. TimeGAN was selected for its capacity to capture temporal dependencies in sequential data, while the LRFM framework was employed to elucidate key behavioral dimensions of customers. K-Means served as the clustering algorithm, with Silhouette Coefficient and Davies-Bouldin Index as performance indicators. Experimental results reveal that the synthetic dataset achieved a Silhouette Coefficient of 0.42, marginally higher than the real dataset’s 0.41. Meanwhile, the Davies-Bouldin Index for the synthetic data was 0.90, closely paralleling the real data’s 0.89. These findings indicate that synthetic data effectively replicates the intrinsic patterns and distributions of actual datasets without compromising the integrity or quality of the clustering outcomes. By integrating synthetic data with LRFM analysis and K-Means clustering, this research provides valuable insights into customer segmentation, underscoring the potential of synthetic data as a reliable alternative when real-world data are scarce or unavailable. Consequently, the study highlights the broader practical implications of leveraging synthetic data, particularly in preserving data confidentiality and mitigating privacy concerns.
    URI
    https://repositori.usu.ac.id/handle/123456789/100060
    Collections
    • Master Theses [13]

    Repositori Institusi Universitas Sumatera Utara (RI-USU)
    Universitas Sumatera Utara | Perpustakaan | Resource Guide | Katalog Perpustakaan
    DSpace software copyright © 2002-2016  DuraSpace
    Contact Us | Send Feedback
    Theme by 
    Atmire NV
     

     

    Browse

    All of USU-IRCommunities & CollectionsBy Issue DateTitlesAuthorsAdvisorsKeywordsTypesBy Submit DateThis CollectionBy Issue DateTitlesAuthorsAdvisorsKeywordsTypesBy Submit Date

    My Account

    LoginRegister

    Repositori Institusi Universitas Sumatera Utara (RI-USU)
    Universitas Sumatera Utara | Perpustakaan | Resource Guide | Katalog Perpustakaan
    DSpace software copyright © 2002-2016  DuraSpace
    Contact Us | Send Feedback
    Theme by 
    Atmire NV