ANALYSIS OF THE APPLICATION OF THE K-MEANS ALGORITHM TO THE CLUSTERING METHOD APPROACH FOR GROUPING CONSUMER PURCHASING TRENDS IN ONE OF THE TEXTILE COMPANIES

 

Debby Kurniawan1, Hidayat Anwari2, Christina Juliane3

Sekolah Tinggi Manajemen Informatika dan Komputer LIKMI, Bandung, Indonesia

 

[email protected]1, [email protected]2 , [email protected]3

 


ABSTRACT

In Indonesia, the regulation of sexual abuse crimes is a critical aspect of ensuring justice and protection for victims. However, challenges remain in the effectiveness and comprehensiveness of these regulations. This study aims to analyze and evaluate the current legal framework addressing sexual abuse in Indonesia, identifying gaps and proposing improvements to enhance legal protections for victims. The research employs a qualitative approach, utilizing legal analysis and case studies to assess the application of existing laws. Data collection involves reviewing legal documents, court cases, and expert interviews to gather comprehensive insights into the regulatory landscape. The findings indicate significant shortcomings in the legal framework, including inconsistencies in legal definitions, procedural delays, and inadequate victim support mechanisms. The study discusses the implications of these findings, emphasizing the need for a more cohesive and victim-centered approach in legal reforms. This research underscores the necessity for legislative improvements to address the identified gaps in the regulation of sexual abuse crimes. Recommendations include clearer legal definitions, expedited legal processes, and enhanced victim support services. These measures are essential for ensuring justice and effective protection for victims of sexual abuse in Indonesia.

 

Keywords: Clustering, K-Means Algorithm, Data Mining, Consumer Purchasing Trends, Textile Industry, Business Strategy.

 



Corresponding Author: Debby Kurniawan

E-mail: [email protected]

INTRODUCTION

In an era of increasingly free business climate development, companies face increasingly complex challenges and fierce competition (Mujiati, 2023). There are often obstacles for companies where demand or demand is enormous. However, the company cannot meet the needs of this demand because one of the factors causing this is having a large stock of goods (Eunike et al., 2021). However, the type or type differs from the demand (need) consumers need. Therefore, a stock of goods fills the storage area (Wijaya et al., 2016).

On the other hand, the company needs land to store or restock goods in high consumer demand. One of the ways to survive and thrive while changing market dynamics is for companies to sharpen their business strategies (Yunus, 2016). The right strategy is needed so that the company can provide added value to consumers and remain relevant in an increasingly competitive market (Fatyandri et al., 2023). In addition to a sense of trust, the company also needs to pay attention to a sense of comfort for consumers (Darmawan, 2017).

According to Rofaida et al. (2019), one effective strategic approach is to focus on product development that provides added value to consumers, both in terms of benefits and quality. However, challenges come not only from inter-company competition but also from the demands of society and the environment.

In this context, a company is not only faced with competition with business competitors who do the same business and produce similar products (Sanawiri & Iqbal, 2018). However, it must also consider its responsibilities to society and the environment to meet the rapidly changing market needs in this increasingly complex era (April, 2018). The products produced not only have to be of high quality by existing standards, but the service to customer satisfaction must also be considered and be environmentally friendly and pay attention to social aspects such as human rights by the provisions of the International Labor Organization (ILO) (Toriq & Martoatmodjo, 2014).

Therefore, this research aims to utilize data mining with a clustering approach to examine strategies that companies can apply in producing products that are not only from the quality aspect but also pay attention to aspects of trust and comfort for consumers and social commitment. Thus, this research is expected to contribute to understanding modern business dynamics and guide companies in developing sustainable and responsible business strategies. The existence of research can be adopted and become an evaluation for companies in other fields.

 

METHOD

This section will explain the research design, which should include the selection of data samples, data collection methods, and data analysis techniques. For example, this research may use a secondary data analysis approach by using historical data on textile companies' consumer purchases over time. In addition, it will explain how the clustering analysis process will be carried out by separating the data into groups with similar purchasing patterns using the K-Means algorithm.

Figure 1. Research Stage

Data collection will describe the data sources used in this research. The data source may come from the textile company's internal database or data on an analytics platform. The data collection process will be described in detail, including data preprocessing processes such as cleaning, removing missing values, and variable coding if required.

Algorithm implementation will explain how to implement the K-Means algorithm in the clustering analysis of consumer purchase trends. The explanation will include stages such as centroid initialization, distance calculation, data clustering, centroid update, and iteration until convergence. In addition, it will explain how this algorithm is applied to the collected consumer purchase dataset.

In this result evaluation, we will describe the metrics used to evaluate the quality of clustering produced by the K-Means algorithm (Sari et al., 2018). Examples may include metrics such as inertia, silhouette score, or Davies-Bouldin index. In addition, we will explain how the clustering results are evaluated and how the findings from the analysis are interpreted.

Data mining involves finding interesting patterns or information in selected data using specific techniques or methods (Mardi, 2017). Data mining is a process that involves discovering functional patterns, information, or knowledge from large and complex data sets. It involves analyzing data from multiple points of view and using various techniques or methods to uncover patterns hidden in the data (Juwita & Ali, 2024). Data mining mainly aims to gain valuable insights from existing data. One of the techniques used in data mining is classification. Classification divides data into predetermined categories or classes based on specific attributes (Haryati et al., 2015). The goal is to group data into the most appropriate class based on its characteristics.

One of the commonly used classification algorithms in data mining is K-Means. K-Means is a clustering algorithm used to group data into clusters based on the similarity of their characteristics. (Zuhal, 2022). This algorithm works by grouping data into k, called groups, where each data observation belongs to a group with the closest average. The stages of the K-means clustering algorithm, which will be discussed at a later stage in this study

 

RESULTS AND DISCUSSION

The data we use is data obtained from one of the companies that focuses on the textile category. For the dataset we use is as follows:

Table 1. Data Source

Customer�s

Name

Month

Motif Name

Sales

COGS

Qty (Mtr)

Price /M

Total

OGS/M

Total

TIO

Rayon

Jan-22

APTKT-2 K

1,25

45.000

56.250

39.000

48.750

TIO

Cotton

Jan-22

Cotton W.

12,00

45.000

540.000

38.500

462.000

TIO

Rayon

Jan-22

RAYON V

32,00

45.000

#######

39.000

#######

TIO

Cotton

Jan-22

Cotton W.

106,00

45.000

#######

38.500

#######

SYA

Silk

Jan-22

CARTAGE

52,82

42.500

#######

41.000

#######

SYA

Rayon

Jan-22

Pueblos

160,00

42.500

#######

39.000

#######

The data will be evaluated in rows and columns, producing 2 axes, the x-axis, and y-axis, which are used to group the data into how many parts.

Clustering

At this stage, the clustering process data is carried out by selecting the x and y axes with QTY (Mtr) combined with the type of fabric. It will produce the total number of QTY consumers need, and then we will group it into 3 clusters. The data mining clustering formula that we use is the clustering formula or clustering calculating the Euclidean distance between two data points XX and YY with coordinates X = (x1, x2, ..., xn) X = (x1, x2, ..., xn) and Y = (y1, y2, ...., yn). Euclidean distance =∑𝑖=1𝑛(𝑥𝑖-𝑦𝑖)2Euclidean distance=∑i=1n(xi-yi)2. The column or column data obtained from the following data will become a table or data shown in Figure 2 data grouping.

Table 2. Data grouping

No

Customer Name

Sales

Meter

Qty (Mtr)

Rayon

Cotton

Linen

Viscose

Silk

Tencel

Rami

1

TIO

1,25

1,25

-

-

-

-

-

-

2

TIO

12,00

-

12,00

-

-

-

-

-

3

TIO

32,00

32,00

-

-

-

-

-

-

4

TIO

106,00

-

106,00

-

-

-

-

-

5

SYA

52,82

-

-

-

-

52,82

-

-

6

SYA

160,00

160,00

-

-

-

-

-

-

7

SYA

120,00

120,00

-

-

-

-

-

-

8

SYA

160,00

160,00

-

-

-

-

-

-

9

SYA

123,00

123,00

-

-

-

-

-

-

10

SYA

2,00

-

-

2,00

-

-

-

-

11

SYA

2,00

-

-

2,00

-

-

-

-

12

SYA

49,65

-

-

-

-

49,65

-

-

The previous process then divided the class into 3 groupings so that a group will be formed, which we call priority one, priority 2, and priority 3. It can be explained that group one is the highest priority group for companies to have priority 2 fabric stock, which is a priority after priority one, and the least priority is priority three, which is to stock goods. The data is obtained by referring to the calculation after grouping through the calculation of the nearest average; on this occasion, we were assisted with orange tools. After using the orange tools, we use or implement the k-means algorithm, as seen in Figure 1.

Figure 1. Implement The K-Means Algorithm

Our data focus on fabric type, where we calculate the average order of the fabric. There are seven types of fabrics that we process, as can be seen in Figure 2.

Figure 2. Seven Types Of Fabrics

 

Figure 3. Cluster Results

CONCLUSION

Clustering analysis using k-means helps identify different customer segments based on fabric preferences, allowing companies to prioritize fabric inventory appropriately. This approach not only facilitates better inventory management and effectively meets customer demands, but also provides important implications for more targeted marketing strategies and product development tailored to diverse market needs.

 

REFERENCES

Aprizal, S. (2018). Orientasi Pasar dan Keunggulan Bersaing: Studi Kasus Penjualan Komputer (Vol. 1). CELEBES MEDIA PERKASA.

Darmawan, D. (2017). Hubungan Antara Inovasi Produk, Kepuasan dan Kepercayaan Pelanggan. Jurnal Ekonomi Dan Bisnis, 7(1), 36�42.

Eunike, A., Setyanto, N. W., Yuniarti, R., Hamdala, I., Lukodono, R. P., & Fanani, A. A. (2021). Perencanaan Produksi dan Pengendalian Persediaan: Edisi Revisi. Universitas Brawijaya Press.

Fatyandri, A. N., Fiona, F., Fernando, R., Wijaya, R. C., Alexandro, W., & Agustian, W. (2023). Meningkatkan Keunggulan Kompetitif Dengan Memilih Strategi Tingkat Bisnis Yang Tepat. Jurnal Mirai Management, 8(2), 230�236.

Haryati, S., Sudarsono, A., & Suryana, E. (2015). implementasi data mining untuk memprediksi masa studi mahasiswa menggunakan algoritma c4. 5 (studi kasus: universitas dehasen bengkulu). Jurnal Media Infotama, 11(2).

Juwita, I., & Ali, I. (2024). Penerapan Pola Penjualan Dengan Menggunakan Metode Algoritma Asosiasi Fp-Growth Bertujuan Untuk Meningkatkan Penjualan Kopi Di Point Coffee. JATI (Jurnal Mahasiswa Teknik Informatika), 8(2), 1600�1607.

Mardi, Y. (2017). Data mining: Klasifikasi menggunakan algoritma c4. 5. Jurnal Edik Informatika Penelitian Bidang Komputer Sains Dan Pendidikan Informatika, 2(2), 213�219.

Mujiati, N. W. (2023). Perkembangan, metode pendekatan dan tantangan manajemen sumber daya manusia. Forum Manajemen, 21(1), 68�86.

Rofaida, R., Aryanti, A. N., & Perdana, Y. (2019). Strategi inovasi pada industri kreatif digital: Upaya memperoleh keunggulan bersaing pada era Revolusi Industri 4.0. Jurnal Manajemen Dan Keuangan, 8(3), 402�414.

Sanawiri, B., & Iqbal, M. (2018). Kewirausahaan. Universitas Brawijaya Press.

 

Sari, V. N., Yupianti, Y., & Maharani, D. (2018). Penerapan Metode K-Means Clustering Dalam Menentukan Predikat Kelulusan Mahasiswa Untuk Menganalisa Kualitas Lulusan. JURTEKSI (Jurnal Teknologi Dan Sistem Informasi), 4(2), 133�140.

Toriq, M., & Martoatmodjo, S. (2014). Pengaruh fasilitas dan kualitas layanan terhadap kepuasan pelanggan pada SPBU Pertamina 54.612. 64 di Sidoarjo. Jurnal Ilmu & Riset Manajemen, 3(8).

Wijaya, D., Mandey, S., & Sumarauw, J. S. B. (2016). Analisis pengendalian persediaan bahan baku ikan pada PT. Celebes minapratama bitung. Jurnal EMBA: Jurnal Riset Ekonomi, Manajemen, Bisnis Dan Akuntansi, 4(2).

Yunus, E. (2016). Manajemen strategis. Penerbit Andi.

Zuhal, N. K. (2022). Study Comparison K-Means Clustering Dengan Algoritma Hierarchical Clustering: AHC, K-Means Clustering, Study Comparison. Seminar Nasional Teknologi & Sains, 1(1), 200�205.

 

� 2024 by the authors. This work has been submitted for possible open-access publication under the terms and conditions of the Creative Commons Attribution (CC BY SA) license (https://creativecommons.org/licenses/by-sa/4.0/).