Development and Verification of a Verbal Corpus Based on Natural Language for Ecuadorian Dialect


Abstract:

The use of the corpus becomes essential in the development of applications based on natural language processing (NLP). In Ecuador, these applications are incompatible because in each region use words outside the context of Spanish. This article presents the development of a corpus compatible with Ecuadorian natural language words. We applied a identification algorithm to take advantage of local literature and power a new data base. The corpus mounted is verified by a quantitative and qualitative comparison with an open access corpus. The result is the first corpus in this country with high scalability and great versatility.

Año de publicación:

2017

Keywords:

  • Database
  • Computational linguistics
  • Text processing
  • Natural Language processing

Fuente:

googlegoogle
scopusscopus

Tipo de documento:

Conference Object

Estado:

Acceso restringido

Áreas de conocimiento:

    Áreas temáticas:

    • Lengua
    • Lingüística
    • Otras lenguas