K-means based method for handling unlabeled data


Abstract:

From the development achieved by the current information society, incalculable volumes of data are generated. The exponential growth of information significantly supports people’s decision making in their daily activities. In Ecuador there are many institutions that store the data of their processes, the tourism sector representing an example of this. However, the data generated exceeds the power of analysis and processing of human beings, sometimes relevant information is presented that is not visible to people. The present investigation proposes a solution to the described problem starting from the development of a method for the treatment of unlabeled data.The proposed method is based on the unsupervised k-means algorithm. The proposal has been implemented from the stored data set of the tourism sector in the City of Riobamba.

Año de publicación:

2021

Keywords:

  • Information gain
  • Roughsets
  • entropy
  • Machine learning
  • Data Mining

Fuente:

scopusscopus

Tipo de documento:

Article

Estado:

Acceso restringido

Áreas de conocimiento:

  • Aprendizaje automático
  • Algoritmo

Áreas temáticas:

  • Ciencias de la computación