K-means based method for handling unlabeled data
Abstract:
From the development achieved by the current information society, incalculable volumes of data are generated. The exponential growth of information significantly supports people’s decision making in their daily activities. In Ecuador there are many institutions that store the data of their processes, the tourism sector representing an example of this. However, the data generated exceeds the power of analysis and processing of human beings, sometimes relevant information is presented that is not visible to people. The present investigation proposes a solution to the described problem starting from the development of a method for the treatment of unlabeled data.The proposed method is based on the unsupervised k-means algorithm. The proposal has been implemented from the stored data set of the tourism sector in the City of Riobamba.
Año de publicación:
2021
Keywords:
- Information gain
- Roughsets
- entropy
- Machine learning
- Data Mining
Fuente:
Tipo de documento:
Article
Estado:
Acceso restringido
Áreas de conocimiento:
- Aprendizaje automático
- Algoritmo
Áreas temáticas:
- Ciencias de la computación