Assessing methods for multiple imputation of systematic missing data in marine fisheries time series with a new validation algorithm
Abstract:
Time series from fisheries often contain multiple missing data. This is a severe limitation that prevents using the data for research on population dynamics, stock assessment, forecasting, and, hence, decision-making around marine resources. Several methods have been proposed to impute missing data in univariate time series. Still, their performances depend not only on the amount of missing data but also on the data structure. This study compares the performance of twelve imputation methods on the time series of marine fishery landings for six species in the Colombian Pacific Ocean. Unlike other studies, we validate the precision of the imputations in the same target time series that include missing data, using the Known Sub-Sequence Algorithm (KSSA), a novelty validation approach that simulates missing data in known sub-sequences of the target time series. The results showed that the best methods for imputation are Seasonal Decomposition with Kalman filters and Structural Models with Kalman filters fitted by maximum likelihood. Results also show that validating the imputation methods with other time series different to the target time series, leads to wrong imputation methods choices. It is noteworthy that these methods and also the validation framework are mainly suited to time series with non-random distribution of missing data, this is, missing data produced systematically in chunks or clusters with pbkp_redictable frequency, which are common in marine sciences.
Año de publicación:
2023
Keywords:
- Time series imputation
- Fish populations
- Artisanal fishing
- Data gaps
Fuente:
Tipo de documento:
Article
Estado:
Acceso abierto
Áreas de conocimiento:
- Estadísticas
- Análisis de datos
Áreas temáticas:
- Caza, pesca y conservación