TAID-LCA: Segmentation Algorithm Based on Ternary Trees
Abstract:
In this work, a statistical method for the segmentation of samples and/or populations is presented, which is based on a ternary tree structure. This approach overcomes known limitations of other segmentation methods such as CHAID, concerning the multivariate response and the non-symmetric relationship between explanatory and response variables. The multivariate response segmentation problem is handled through latent class models, while the factorial decomposition of the explanatory capability of variables is based on the Non-Symmetrical Correspondence Analysis. Stop criteria based on the CATANOVA index and impurity measures are proposed. A Simulated Annealing based post-pruning strategy is considered to avoid over-fitting relative to the training set and guarantee a better generalization capability for the method.
Año de publicación:
2022
Keywords:
- CHAID algorithm
- Impurity measures
- τ index
- Simulate annealing
- latent class analysis
- CATANOVA
Fuente:
Tipo de documento:
Article
Estado:
Acceso abierto
Áreas de conocimiento:
- Algoritmo
- Algoritmo
- Algoritmo
Áreas temáticas:
- Métodos informáticos especiales
- Programación informática, programas, datos, seguridad
- Ciencias de la computación