A Comparative Evaluation of Preprocessing Techniques for Short Texts in Spanish


Abstract:

Natural Language Processing (NLP) is used to identify key information, generating pbkp_redictive models, and explaining global events or trends. Also, NLP is supported during the process to create knowledge. Therefore, it is important to apply refinement techniques in major stages such as preprocessing, when data is frequently produced and processed with poor results. This document analyzes and measures the impact of combinations of preprocessing techniques and libraries for short texts that have been written in Spanish. These techniques were applied in tweets for analysis of sentiments considering evaluation parameters in its analysis, the processing time and characteristics of the techniques for each library. The performed experimentation provides readers insights for choosing the appropriate combination of techniques during preprocessing. The results show improvement of up to 5% to 9% in the performance of the classification.

Año de publicación:

2020

Keywords:

  • Natural Language processing
  • sentiment analysis
  • TEXT MINING
  • Twitter
  • Preprocessing

Fuente:

scopusscopus

Tipo de documento:

Conference Object

Estado:

Acceso restringido

Áreas de conocimiento:

  • Ciencias de la computación

Áreas temáticas:

  • Funcionamiento de bibliotecas y archivos