[go: up one dir, main page]

IDEAS home Printed from https://ideas.repec.org/a/wly/jforec/v41y2022i8p1701-1724.html
   My bibliography  Save this article

Making the whole greater than the sum of its parts: A literature review of ensemble methods for financial time series forecasting

Author

Listed:
  • Pedro Henrique Melo Albuquerque
  • Yaohao Peng
  • João Pedro Fontoura da Silva
Abstract
This paper discusses the application of ensemble techniques for the prediction of time series, presenting an in‐depth review of the main techniques and algorithms used by the recent literature, with emphasis on the bootstrap aggregation (bagging) and boosting approaches. We also discuss the theoretical foundations of the ensemble‐based models, presenting measures of model stability and the main aggregation methods to combine the forecasts of the individual models, as well as recommendations for future developments for related research agendas.

Suggested Citation

  • Pedro Henrique Melo Albuquerque & Yaohao Peng & João Pedro Fontoura da Silva, 2022. "Making the whole greater than the sum of its parts: A literature review of ensemble methods for financial time series forecasting," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(8), pages 1701-1724, December.
  • Handle: RePEc:wly:jforec:v:41:y:2022:i:8:p:1701-1724
    DOI: 10.1002/for.2894
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/for.2894
    Download Restriction: no

    File URL: https://libkey.io/10.1002/for.2894?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Zou, Hui & Yang, Yuhong, 2004. "Combining time series models for forecasting," International Journal of Forecasting, Elsevier, vol. 20(1), pages 69-84.
    2. David Rapach & Jack Strauss, 2010. "Bagging or Combining (or Both)? An Analysis Based on Forecasting U.S. Employment Growth," Econometric Reviews, Taylor & Francis Journals, vol. 29(5-6), pages 511-533.
    3. James E. Matheson & Robert L. Winkler, 1976. "Scoring Rules for Continuous Probability Distributions," Management Science, INFORMS, vol. 22(10), pages 1087-1096, June.
    4. Jushan Bai & Serena Ng, 2002. "Determining the Number of Factors in Approximate Factor Models," Econometrica, Econometric Society, vol. 70(1), pages 191-221, January.
    5. Inoue, Atsushi & Kilian, Lutz, 2008. "How Useful Is Bagging in Forecasting Economic Time Series? A Case Study of U.S. Consumer Price Inflation," Journal of the American Statistical Association, American Statistical Association, vol. 103, pages 511-522, June.
    6. de Menezes, Lilian M. & W. Bunn, Derek & Taylor, James W., 2000. "Review of guidelines for the use of combined forecasts," European Journal of Operational Research, Elsevier, vol. 120(1), pages 190-204, January.
    7. Jordan, Steven J. & Vivian, Andrew & Wohar, Mark E., 2017. "Forecasting market returns: bagging or combining?," International Journal of Forecasting, Elsevier, vol. 33(1), pages 102-120.
    8. Jonathan H. Wright, 2009. "Forecasting US inflation by Bayesian model averaging," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 28(2), pages 131-144.
    9. Bou-Hamad, Imad & Jamali, Ibrahim, 2020. "Forecasting financial time-series using data mining models: A simulation study," Research in International Business and Finance, Elsevier, vol. 51(C).
    10. Bunn, Derek W., 1985. "Statistical efficiency in the linear combination of forecasts," International Journal of Forecasting, Elsevier, vol. 1(2), pages 151-163.
    11. Anthony Niblett, 2018. "Regulatory Reform in Ontario: Machine Learning and Regulation," C.D. Howe Institute Commentary, C.D. Howe Institute, issue 507, March.
    12. Kourentzes, Nikolaos & Petropoulos, Fotios & Trapero, Juan R., 2014. "Improving forecasting by estimating time series structural components across multiple frequencies," International Journal of Forecasting, Elsevier, vol. 30(2), pages 291-302.
    13. Nii Ayi Armah & Norman Swanson, 2010. "Seeing Inside the Black Box: Using Diffusion Index Methodology to Construct Factor Proxies in Large Scale Macroeconomic Time Series Environments," Econometric Reviews, Taylor & Francis Journals, vol. 29(5-6), pages 476-510.
    14. James H. Stock & Mark W. Watson, 2012. "Generalized Shrinkage Methods for Forecasting Using Many Predictors," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 30(4), pages 481-493, June.
    15. Graham Elliott & Ivana Komunjer & Allan Timmermann, 2008. "Biases in Macroeconomic Forecasts: Irrationality or Asymmetric Loss?," Journal of the European Economic Association, MIT Press, vol. 6(1), pages 122-157, March.
    16. Jiang, Minqi & Liu, Jiapeng & Zhang, Lu & Liu, Chunyu, 2020. "An improved Stacking framework for stock index prediction by leveraging tree-based ensemble models and deep learning algorithms," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 541(C).
    17. Lee, Tae-Hwy & Yang, Yang, 2006. "Bagging binary and quantile predictors for time series," Journal of Econometrics, Elsevier, vol. 135(1-2), pages 465-497.
    18. Bergmeir, Christoph & Hyndman, Rob J. & Benítez, José M., 2016. "Bagging exponential smoothing methods using STL decomposition and Box–Cox transformation," International Journal of Forecasting, Elsevier, vol. 32(2), pages 303-312.
    19. Yin, Anwen, 2020. "Equity premium prediction and optimal portfolio decision with Bagging," The North American Journal of Economics and Finance, Elsevier, vol. 54(C).
    20. Yang, Yuhong, 2004. "Combining Forecasting Procedures: Some Theoretical Results," Econometric Theory, Cambridge University Press, vol. 20(1), pages 176-222, February.
    21. Susan Athey & Guido W. Imbens, 2017. "The State of Applied Econometrics: Causality and Policy Evaluation," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 3-32, Spring.
    22. Min, Chung-ki & Zellner, Arnold, 1993. "Bayesian and non-Bayesian methods for combining models and forecasts with applications to forecasting international growth rates," Journal of Econometrics, Elsevier, vol. 56(1-2), pages 89-118, March.
    23. Peter F. Christoffersen & Francis X. Diebold, 2006. "Financial Asset Returns, Direction-of-Change Forecasting, and Volatility Dynamics," Management Science, INFORMS, vol. 52(8), pages 1273-1287, August.
    24. Bai, Jushan & Ng, Serena, 2006. "Evaluating latent and observed factors in macroeconomics and finance," Journal of Econometrics, Elsevier, vol. 131(1-2), pages 507-537.
    25. Clements,Michael & Hendry,David, 1998. "Forecasting Economic Time Series," Cambridge Books, Cambridge University Press, number 9780521634809.
    26. M. Hashem Pesaran & Davide Pettenuzzo & Allan Timmermann, 2006. "Forecasting Time Series Subject to Multiple Structural Breaks," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 73(4), pages 1057-1084.
    27. G. Elliott & C. Granger & A. Timmermann (ed.), 2006. "Handbook of Economic Forecasting," Handbook of Economic Forecasting, Elsevier, edition 1, volume 1, number 1.
    28. Clive W. J. Granger, 2002. "Some comments on risk," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 17(5), pages 447-456.
    29. Lutz Kilian & Atsushi Inoue, 2004. "Bagging Time Series Models," Econometric Society 2004 North American Summer Meetings 110, Econometric Society.
    30. Brock, William & Lakonishok, Josef & LeBaron, Blake, 1992. "Simple Technical Trading Rules and the Stochastic Properties of Stock Returns," Journal of Finance, American Finance Association, vol. 47(5), pages 1731-1764, December.
    31. Jose, Victor Richmond R. & Winkler, Robert L., 2008. "Simple robust averages of forecasts: Some empirical results," International Journal of Forecasting, Elsevier, vol. 24(1), pages 163-169.
    32. Day, Richard H. & Huang, Weihong, 1990. "Bulls, bears and market sheep," Journal of Economic Behavior & Organization, Elsevier, vol. 14(3), pages 299-329, December.
    33. Jon Kleinberg & Jens Ludwig & Sendhil Mullainathan & Ziad Obermeyer, 2015. "Prediction Policy Problems," American Economic Review, American Economic Association, vol. 105(5), pages 491-495, May.
    34. Liang, Deron & Tsai, Chih-Fong & Lu, Hung-Yuan (Richard) & Chang, Li-Shin, 2020. "Combining corporate governance indicators with stacking ensembles for financial distress prediction," Journal of Business Research, Elsevier, vol. 120(C), pages 137-146.
    35. Yang, Ke & Tian, Fengping & Chen, Langnan & Li, Steven, 2017. "Realized volatility forecast of agricultural futures using the HAR models with bagging and combination approaches," International Review of Economics & Finance, Elsevier, vol. 49(C), pages 276-291.
    36. Sami Ben Jabeur & Salma Mefteh-Wali & Jean-Laurent Viviani, 2021. "Forecasting gold price with the XGBoost algorithm and SHAP interaction values," Post-Print hal-03331805, HAL.
    37. Fitzenberger, Bernd, 1998. "The moving blocks bootstrap and robust inference for linear least squares and quantile regressions," Journal of Econometrics, Elsevier, vol. 82(2), pages 235-287, February.
    38. Jialin Liu & Chih-Min Lin & Fei Chao, 2019. "Gradient Boost with Convolution Neural Network for Stock Forecast," Papers 1909.09563, arXiv.org.
    39. Shao, Xiaofeng, 2010. "The Dependent Wild Bootstrap," Journal of the American Statistical Association, American Statistical Association, vol. 105(489), pages 218-235.
    40. Huiyu Huang & Tae-Hwy Lee, 2010. "To Combine Forecasts or to Combine Information?," Econometric Reviews, Taylor & Francis Journals, vol. 29(5-6), pages 534-570.
    41. Clive W.J. Granger, 1999. "Outline of forecast theory using generalized cost functions," Spanish Economic Review, Springer;Spanish Economic Association, vol. 1(2), pages 161-173.
    42. Wright, Jonathan H., 2008. "Bayesian Model Averaging and exchange rate forecasts," Journal of Econometrics, Elsevier, vol. 146(2), pages 329-341, October.
    43. Barrow, Devon K. & Crone, Sven F., 2016. "A comparison of AdaBoost algorithms for time series forecast combination," International Journal of Forecasting, Elsevier, vol. 32(4), pages 1103-1119.
    44. Timmermann, Allan & Elliott, Graham & Komunjer, Ivana, 2003. "Estimating Loss Function Parameters," CEPR Discussion Papers 3821, C.E.P.R. Discussion Papers.
    45. Davis, Richard A. & Drees, Holger & Segers, Johan & WarchoÅ‚, MichaÅ‚, 2018. "Inference on the tail process with application to financial time series modelling," LIDAM Reprints ISBA 2018022, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    46. James H. Stock & Mark W.Watson, 2003. "Forecasting Output and Inflation: The Role of Asset Prices," Journal of Economic Literature, American Economic Association, vol. 41(3), pages 788-829, September.
    47. Gary Koop & Simon Potter, 2004. "Forecasting in dynamic factor models using Bayesian model averaging," Econometrics Journal, Royal Economic Society, vol. 7(2), pages 550-565, December.
    48. Aksu, Celal & Gunter, Sevket I., 1992. "An empirical analysis of the accuracy of SA, OLS, ERLS and NRLS combination forecasts," International Journal of Forecasting, Elsevier, vol. 8(1), pages 27-43, June.
    49. Sainan Jin & Liangjun Su & Aman Ullah, 2014. "Robustify Financial Time Series Forecasting with Bagging," Econometric Reviews, Taylor & Francis Journals, vol. 33(5-6), pages 575-605, August.
    50. Davis, Richard A. & Drees, Holger & Segers, Johan & Warchoł, Michał, 2018. "Inference on the tail process with application to financial time series modeling," Journal of Econometrics, Elsevier, vol. 205(2), pages 508-525.
    51. Jensen, Michael C & Bennington, George A, 1970. "Random Walks and Technical Theories: Some Additional Evidence," Journal of Finance, American Finance Association, vol. 25(2), pages 469-482, May.
    52. Malki, Zohair & Atlam, El-Sayed & Hassanien, Aboul Ella & Dagnew, Guesh & Elhosseini, Mostafa A. & Gad, Ibrahim, 2020. "Association between weather data and COVID-19 pandemic predicting mortality rate: Machine learning approaches," Chaos, Solitons & Fractals, Elsevier, vol. 138(C).
    53. Petropoulos, Fotios & Hyndman, Rob J. & Bergmeir, Christoph, 2018. "Exploring the sources of uncertainty: Why does bagging for time series forecasting work?," European Journal of Operational Research, Elsevier, vol. 268(2), pages 545-554.
    54. Sendhil Mullainathan & Jann Spiess, 2017. "Machine Learning: An Applied Econometric Approach," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 87-106, Spring.
    55. Yang Y., 2001. "Adaptive Regression by Mixing," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 574-588, June.
    56. R. Cont, 2001. "Empirical properties of asset returns: stylized facts and statistical issues," Quantitative Finance, Taylor & Francis Journals, vol. 1(2), pages 223-236.
    57. Stock, James H & Watson, Mark W, 2002. "Macroeconomic Forecasting Using Diffusion Indexes," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(2), pages 147-162, April.
    58. Yeung Lewis Chan & James H. Stock & Mark W. Watson, 1999. "A dynamic factor model framework for forecast combination," Spanish Economic Review, Springer;Spanish Economic Association, vol. 1(2), pages 91-121.
    59. Davis, Richard & Drees, Holger & Segers, Johan & Warchol, Michal, 2018. "Inference on the tail process with application to financial time series modelling," LIDAM Discussion Papers ISBA 2018002, Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA).
    60. Aiolfi, Marco & Timmermann, Allan, 2006. "Persistence in forecasting performance and conditional combination strategies," Journal of Econometrics, Elsevier, vol. 135(1-2), pages 31-53.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jiaju Miao & Pawel Polak, 2023. "Online Ensemble of Models for Optimal Predictive Performance with Applications to Sector Rotation Strategy," Papers 2304.09947, arXiv.org.
    2. Hafiz Muhammad Abdullah & Sanghyoun Park & Kwanjae Seong & Sangyong Lee, 2023. "Hybrid Renewable Energy System Design: A Machine Learning Approach for Optimal Sizing with Net-Metering Costs," Sustainability, MDPI, vol. 15(11), pages 1-37, May.
    3. Wang, Lu & Wu, Rui & Ma, WeiChun & Xu, Weiju, 2023. "Examining the volatility of soybean market in the MIDAS framework: The importance of bagging-based weather information," International Review of Financial Analysis, Elsevier, vol. 89(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    2. Wang, Xiaoqian & Hyndman, Rob J. & Li, Feng & Kang, Yanfei, 2023. "Forecast combinations: An over 50-year review," International Journal of Forecasting, Elsevier, vol. 39(4), pages 1518-1547.
    3. Kim, Hyun Hak & Swanson, Norman R., 2018. "Mining big data using parsimonious factor, machine learning, variable selection and shrinkage methods," International Journal of Forecasting, Elsevier, vol. 34(2), pages 339-354.
    4. Marine Carrasco & Barbara Rossi, 2016. "In-Sample Inference and Forecasting in Misspecified Factor Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(3), pages 313-338, July.
    5. David Rapach & Jack Strauss, 2010. "Bagging or Combining (or Both)? An Analysis Based on Forecasting U.S. Employment Growth," Econometric Reviews, Taylor & Francis Journals, vol. 29(5-6), pages 511-533.
    6. Kim, Hyun Hak & Swanson, Norman R., 2014. "Forecasting financial and macroeconomic variables using data reduction methods: New empirical evidence," Journal of Econometrics, Elsevier, vol. 178(P2), pages 352-367.
    7. Özen, Kadir & Yıldırım, Dilem, 2021. "Application of bagging in day-ahead electricity price forecasting and factor augmentation," Energy Economics, Elsevier, vol. 103(C).
    8. Lee, Tae-Hwy & Yang, Yang, 2006. "Bagging binary and quantile predictors for time series," Journal of Econometrics, Elsevier, vol. 135(1-2), pages 465-497.
    9. Timmermann, Allan, 2006. "Forecast Combinations," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 1, chapter 4, pages 135-196, Elsevier.
    10. Yang, Ke & Tian, Fengping & Chen, Langnan & Li, Steven, 2017. "Realized volatility forecast of agricultural futures using the HAR models with bagging and combination approaches," International Review of Economics & Finance, Elsevier, vol. 49(C), pages 276-291.
    11. Rossi, Barbara, 2013. "Advances in Forecasting under Instability," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 1203-1324, Elsevier.
    12. Weron, Rafał, 2014. "Electricity price forecasting: A review of the state-of-the-art with a look into the future," International Journal of Forecasting, Elsevier, vol. 30(4), pages 1030-1081.
    13. Wei, Xiaoqiao & Yang, Yuhong, 2012. "Robust forecast combinations," Journal of Econometrics, Elsevier, vol. 166(2), pages 224-236.
    14. Jordan, Steven J. & Vivian, Andrew & Wohar, Mark E., 2017. "Forecasting market returns: bagging or combining?," International Journal of Forecasting, Elsevier, vol. 33(1), pages 102-120.
    15. Eric Hillebrand & Huiyu Huang & Tae-Hwy Lee & Canlin Li, 2018. "Using the Entire Yield Curve in Forecasting Output and Inflation," Econometrics, MDPI, vol. 6(3), pages 1-27, August.
    16. Meira, Erick & Cyrino Oliveira, Fernando Luiz & Jeon, Jooyoung, 2021. "Treating and Pruning: New approaches to forecasting model selection and combination using prediction intervals," International Journal of Forecasting, Elsevier, vol. 37(2), pages 547-568.
    17. Jordan, Steven J. & Vivian, Andrew & Wohar, Mark E., 2016. "Can commodity returns forecast Canadian sector stock returns?," International Review of Economics & Finance, Elsevier, vol. 41(C), pages 172-188.
    18. Dantas, Tiago Mendes & Cyrino Oliveira, Fernando Luiz, 2018. "Improving time series forecasting: An approach combining bootstrap aggregation, clusters and exponential smoothing," International Journal of Forecasting, Elsevier, vol. 34(4), pages 748-761.
    19. Cheng, Xu & Hansen, Bruce E., 2015. "Forecasting with factor-augmented regression: A frequentist model averaging approach," Journal of Econometrics, Elsevier, vol. 186(2), pages 280-293.
    20. Kadir Özen & Dilem Yıldırım, 2021. "Application of Bagging in Day-Ahead Electricity Price Forecasting and Factor Augmentation," ERC Working Papers 2101, ERC - Economic Research Center, Middle East Technical University, revised Apr 2021.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:jforec:v:41:y:2022:i:8:p:1701-1724. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www3.interscience.wiley.com/cgi-bin/jhome/2966 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.