Forecasting Daily Stock Market Return using Dimensionality Reduction
Abstract
In financial markets, it is both important and challenging to forecast the daily direction of the stock market return. Among the few studies that focus on predicting daily stock market returns, the data mining procedures utilized are either incomplete or inefficient, especially when a large amount of features are involved. This paper presents a complete and efficient data mining process to forecast the daily direction of the S&P 500 Index ETF (SPY) return based on 60 financial and economic features. Three mature dimensionality reduction techniques, including principal component analysis (PCA), fuzzy robust principal component analysis (FRPCA), and kernel-based principal component analysis (KPCA) are applied to the whole data set to simplify and rearrange the original data structure. Corresponding to different levels of the dimensionality reduction, twelve new data sets are generated from the entire cleaned data using each of the three different dimensionality reduction methods. Artificial neural networks (ANNs) are then used with the thirty-six transformed data sets for classification to forecast the daily direction of future market returns. Moreover, the three different dimensionality reduction methods are compared with respect to the natural data set. A group of hypothesis tests are then performed over the classification and simulation results to show that combining the ANNs with the PCA gives slightly higher classification accuracy than the other two combinations, and that the trading strategies guided by the comprehensive classification mining procedures based on PCA and ANNs gain significantly higher risk-adjusted profits than the comparison benchmarks, while also being slightly higher than those strategies guided by the forecasts based on the FRPCA and KPCA models.
Recommended Citation
X. Zhong and D. L. Enke, "Forecasting Daily Stock Market Return using Dimensionality Reduction," Expert Systems with Applications, vol. 67, pp. 126 - 139, Elsevier, Jan 2017.
The definitive version is available at https://doi.org/10.1016/j.eswa.2016.09.027
Department(s)
Engineering Management and Systems Engineering
Research Center/Lab(s)
Intelligent Systems Center
Keywords and Phrases
Classification (of information); Commerce; Data mining; Data reduction; Economic analysis; Electronic trading; Finance; Financial data processing; Financial markets; Forecasting; Fuzzy neural networks; Investments; Neural networks; Robust control; Classification accuracy; Dimensionality reduction; Dimensionality reduction method; Dimensionality reduction techniques; Fuzzy robust; Kernel based principal component analysis; Stock return forecasting; Trading strategies; Principal component analysis (PCA); Artificial neural networks (ANNs); Daily stock return forecasting; Fuzzy robust principal component analysis (FRPCA); Kernel-based principal component analysis (KPCA)
International Standard Serial Number (ISSN)
0957-4174
Document Type
Article - Journal
Document Version
Citation
File Type
text
Language(s)
English
Rights
© 2017 Elsevier, All rights reserved.
Publication Date
01 Jan 2017