Comparison of statistical methods commonly used in predictive modelling


Abstract:

Logistic Multiple Regression, Principal Component Regression and Classification and Regression Tree Analysis (CART), all commonly used in GIS-based ecological modelling, are compared with a relatively new statistical technique, Multivariate Adaptive Regression Splines (MARS), to test their accuracy, reliability, implementation within GIS and ease of use. All were applied to the same two data sets, covering a wide range of conditions common in predictive modelling, namely geographical range, scale, nature of the predictors and sampling method. We ran two series of analyses to determine whether model validation by an independent data set was required or whether cross-validation on a learning data set sufficed. Results show that validation by independent data sets is needed. Model accuracy was evaluated using the area under the Receiver Operating Characteristic curve (AUC). This measure was used because it summarizes performance across all possible thresholds and is independent of the balance between classes. MARS and Regression Tree Analysis achieved the best prediction success, although the CART model was difficult to use for cartographic purposes due to its high model complexity.
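As an illustration of the accuracy measure named in the abstract, the following is a minimal sketch of AUC computed as the fraction of presence/absence pairs that a model ranks correctly. It is not the authors' code: the function name `auc` and the toy labels and scores are assumptions made for this example only.

```python
def auc(labels, scores):
    """Area under the ROC curve via pairwise ranking (illustrative sketch).

    labels: 1 for presence, 0 for absence (hypothetical toy data).
    scores: model output, higher meaning presence is more likely.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one presence and one absence")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # presence ranked above absence
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(pos) * len(neg))


# Toy example: four sites, two presences and two absences.
labels = [1, 1, 0, 0]
scores = [0.9, 0.4, 0.6, 0.1]
print(auc(labels, scores))  # 0.75

# Duplicating the absences changes the class balance but not the AUC.
print(auc(labels + [0, 0], scores + [0.6, 0.1]))  # 0.75
```

No threshold appears anywhere in the calculation, and the second call shows why the measure does not depend on the ratio of presences to absences.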

Year of publication:

2004

Keywords:

  • Multivariate Adaptive Regression Splines
  • Grimmia
  • Fagus
  • Regression Tree Analysis
  • Classification and Regression Tree
  • logistic regression

Source:

Scopus

Document type:

Article

Status:

Restricted access

Knowledge areas:

  • Data analysis
  • Computer science

Subject areas:

  • Systems