| Nome: | Descrição: | Tamanho: | Formato: | |
|---|---|---|---|---|
| 1.11 MB | Adobe PDF |
Orientador(es)
Resumo(s)
The Analog Ensemble (AnEn) method allows to reconstruct incomplete time series, based on correlated series with full records. It has been extensively applied to meteorological data, which may involve many variables and stations, and span many years, slowing down the reconstruction. To accelerate this process, Principal Component Analysis (PCA) may be employed, to combine several input series into a few principal components (PCs), over which AnEn is then applied for a faster data reconstruction. The integration of PCA with K-means clustering further amplifies efficiency. In this paper, multicore-based implementations of non-PCA and PCA-based methods, in MATLAB, R, and Python, were scrutinized to ascertain numerical consistency and evaluate relative performance. The experiments show that when using PCA, accuracy is kept, or even improved, despite lowering the number of input time series. Performance-wise, the experiments revealed a distinct edge for the Python code, for which the benefits of parallel processing were most evident. Preliminary results are also shown for a Python variant that exploits GPUs for the analog search, with very promising speedups.
Descrição
Palavras-chave
Analog ensemble CUDA K-means clustering MATLAB Meteorological records reconstruction PCA Python R
Contexto Educativo
Citação
Breve, Murilo Montanini; Camargos, Ana Flavia P.; Rufino, José; Balsa, Carlos (2025). Computational performance analysis of PCA enhanced AnEn method. In Breve, Murilo Montanini; Camargos, Ana Flavia P.; Rufino, José; Balsa, Carlos (2025). Computational performance analysis of PCA enhanced AnEn method. In ARTIIS 2024 International Workshops. Chile. p. 18-33. ISBN 978-3-031-83435-6
Editora
Springer Nature
