Advances and Applications in Statistics

The Advances and Applications in Statistics is an internationally recognized journal indexed in the Emerging Sources Citation Index (ESCI). It provides a platform for original research papers and survey articles in all areas of statistics, both computational and experimental in nature.

Submit Article

SELECTION OF PREDICTION MODELS BASED ON POSSIBILITY FUNCTION OF BINARY CONNECTION NUMBERS

Authors

  • Huali Zhou
  • Huayou Chen

Keywords:

Time series decomposition; Boostrap; Binary connection number; Model selection, PM2.5

DOI:

https://doi.org/10.17654/0972361724020

Abstract

Choice of a suitable prediction model is conducive to improve the efficiency of prediction, while the selection of prediction models can be regarded as a problem of ranking predictive performance. On this basis, it is proposed to screen the method of prediction based on the possibility function of binary connection number in this study. First of all, the original time series are transformed by Box-Cox for conformance to the normal distribution of the sequence. Then,  the residual components of the decomposed time series are further bootstrapped, and the generated sequences are integrated with the other sub-items. Next, the integrated sequences are transformed by InvBox-Cox to create the training pool. The performance-related index of each prediction model is determined by the training pool. Finally, the possibility function of binary connection number is applied to describe the relative size of the interval number, for comparison of  the deterministic and non-deterministic information. The models are sorted to find the one that can be generalized. The method proposed in this study is empirically verified using the PM2.5 data collected in Hefei, China, which shows the rationality and effectiveness of the model.

Received: November 6, 2023
Accepted: January 8, 2024

References

C. Bergmeir, R. J. Hyndman and J. M. Benítez, Bagging exponential smoothing methods using STL decomposition and Box-Cox transformation, International Journal of Forecasting 32(2) (2016), 303-312.

B. Butcher and B. J. Smith, Feature engineering and selection: a practical approach for predictive models, Amer. Statist. 74(3) (2020), 308-309.

T. A. Kuffner, S. M. S. Lee and G. A. Young, Consistency of a hybrid block bootstrap for distribution and variance estimation for sample quantiles of weakly dependent sequences, Aust. N. Z. J. Stat. 60(1) (2018), 103-114.

J. B. Lan, L. J. Cao and J. Lin, Method for ranking interval numbers based on two-dimensional priority degree, Journal of Chongqing Institute of Technology 10(21) (2007), 63-67.

D. Q. Li and Y. D. Gu, Method for ranking interval numbers based on possibility degree, Journal of Systems Engineering 2(23) (2008), 243-246.

W. Li and X. Jiang, Prediction of air pollutant concentrations based on TCN-BiLSTM-DMAttention with STL decomposition, Scientific Reports 13 (2023), Article number: 4665.

P. Montero-Manso, G. Athanasopoulos, R. J. Hyndman and T. S. Talagala, FFORMA: feature-based forecast model averaging, International Journal of Forecasting 36(1) (2020), 86-92.

N. Moran, B. Stringer, B. Lin and M. T. Hoque, Machine learning model selection for predicting bathymetry, Deep Sea Research Part I: Oceanographic Research Papers 185 (2022), Article number 103788.

Y. Nakahara, M. Sasaki and M. Gen, On the linear programming problems with interval coefficients, Computers and Industrial Engineering 23(1-4) (1992), 301-304.

J. Piironen and A. Vehtari, Comparison of Bayesian predictive methods for model selection, Stat. Comput. 27(3) (2017), 711-735.

D. Pilavakis, E. Paparoditis and T. Sapatinas, Moving Block and tapered block bootstrap for functional time series with an application to the K-sample mean problem, Bernoulli 25(4B) (2019), 3496-3526.

X. Shao and D. N. Politis, Fixed-b Subsampling and block bootstrap: improved confidence sets based on p-value calibration, J. R. Stat. Soc. Ser. B Stat. Methodol. 75 (2013), 161-184.

B. S. Stevens and C. J. Conway, Predicting species distributions: unifying model selection and scale optimization for multi-scale occupancy models, Ecosphere 10(5) (2019), e02748.

R. Synowiecki, Consistency and application of moving block bootstrap for non-stationary time series with periodic and almost periodic structure, Bernoulli 13(4) (2007), 1151-1178.

M. Theodosiou, Forecasting monthly and quarterly time series using STL decomposition, International Journal of Forecasting 27(4) (2011), 1178-1195.

M. E. Thomson, A. C. Pollock, D. Önkal and M. S. Gönül, Combining forecasts: performance and coherence, International Journal of Forecasting 35(2) (2019), 474-484.

S. Yang, Z. Deng, X. Li, C. Zheng, L. Xi, J. Zhuang, Z. Zhang and Z. Zhang, A novel hybrid model based on STL decomposition and one-dimensional convolutional neural networks with positional encoding for significant wave height forecast, Renewable Energy 173 (2021), 531-543.

H. Yin, D. Jin, Y. H. Gu, C. J. Park, S. K. Han and S. J. Yoo, STL-ATTLSTM: vegetable price forecasting using STL and attention mechanism-based LSTM, Agriculture 10(12) (2020), 612.

H. Zhu, R. Xu and H. Deng, A novel STL-based hybrid model for forecasting hog price in China, Computers and Electronics in Agriculture 198 (2022), Article number: 107068.

Published

12-02-2024

Issue

Section

Articles

How to Cite

SELECTION OF PREDICTION MODELS BASED ON POSSIBILITY FUNCTION OF BINARY CONNECTION NUMBERS. (2024). Advances and Applications in Statistics , 91(3), 371-392. https://doi.org/10.17654/0972361724020

Similar Articles

1-10 of 225

You may also start an advanced similarity search for this article.