×

Inductive representation learning on dynamic stock co-movement graphs for stock predictions. (English) Zbl 07587550

Summary: Co-movement among individual firms’ stock prices can reflect complex interfirm relationships. This paper proposes a novel method to leverage such relationships for stock price predictions by adopting inductive graph representation learning on dynamic stock graphs constructed based on historical stock price co-movement. To learn node representations from such dynamic graphs for better stock predictions, we propose the hybrid-attention dynamic graph neural network, an inductive graph representation learning method. We also extended mini-batch gradient descent to inductive representation learning on dynamic stock graphs so that the model can update parameters over mini-batch stock graphs with higher training efficiency. Extensive experiments on stocks from different markets and trading simulations demonstrate that the proposed method significantly improves stock predictions. The proposed method can have important implications for the management of financial portfolios and investment risk.
Summary of Contribution: Accurate predictions of stock prices have important implications for financial decisions. In today’s economy, individual firms are increasingly connected via different types of relationships. As a result, firms’ stock prices often feature synchronous co-movement patterns. This paper represents the first effort to leverage such phenomena to construct dynamic stock graphs for stock predictions. We develop hybrid-attention dynamic graph neural network (HAD-GNN), an inductive graph representation learning framework for dynamic stock graphs to incorporate temporal and graph attention mechanisms. To improve the learning efficiency of HAD-GNN, we also extend the mini-batch gradient descent to inductive representation learning on such dynamic graphs and adopt a t-batch training mechanism (t-BTM). We demonstrate the effectiveness of our new approach via experiments based on real-world data and simulations.

MSC:

90Cxx Mathematical programming
Full Text: DOI

References:

[1] Agarwal A, Leung ACM, Konana P, Kumar A (2017) Cosearch attention and stock return predictability in supply chains. Inform. Systems Res. 28(2):265-288.Link, Google Scholar
[2] Aggarwal CC, Hinneburg A, Keim DA (2001) On the surprising behavior of distance metrics in high dimensional space. Van den Bussche J, Vianu V, eds. Proc. Internat. Conf. Database Theory (Springer, Cham, Switzerland), 420-434.Crossref, Google Scholar · Zbl 1047.68038 · doi:10.1007/3-540-44503-X_27
[3] Bali TG, Bodnaruk A, Scherbina A, Tang Y (2018) Unusual news flow and the cross section of stock returns. Management Sci. 64(9):4137-4155.Link, Google Scholar
[4] Ban GY, El Karoui N, Lim AE (2016) Machine learning and portfolio optimization. Management Sci. 64(3):1136-1154.Link, Google Scholar
[5] Bertsimas D, Cory-Wright R (2022) A scalable algorithm for sparse portfolio selection. INFORMS J. Comput., ePub ahead of print January 14, https://doi.org/10.1287/ijoc.2021.1127.Link, Google Scholar · Zbl 07552219
[6] Bonanno G, Lillo F, Mantegna R (2001) High-frequency cross-correlation in a set of stocks. Quantitative Finance 1(1):96-104.Crossref, Google Scholar · Zbl 1405.91538 · doi:10.1080/713665554
[7] Brown RG (2004) Smoothing, Forecasting and Prediction of Discrete Time Series (Dover Publications, Mineola, NY).Google Scholar
[8] Bruna J, Zaremba W, Szlam A, Lecun Y (2014) Spectral networks and locally connected networks on graphs. Proc. 2nd Internat. Conf. Learn. Representations.Google Scholar
[9] Campbell JY, Mackinlay AC, Lo AW, Press PU (1997) The econometrics of financial markets. J. Empirical Finance 3(1):15-102.Google Scholar · Zbl 0927.62113
[10] Chan LKC, Lakonishok J, Swaminathan B (2007) Industry classifications and return co-movement. Financial Anal. J. 63(6):56-70.Crossref, Google Scholar · doi:10.2469/faj.v63.n6.4927
[11] Chi KT, Liu J, Lau FC (2010) A network perspective of the stock market. J. Empirical Finance 17(4):659-667.Crossref, Google Scholar · doi:10.1016/j.jempfin.2010.04.008
[12] Defferrard M, Bresson X, Vandergheynst P (2016) Convolutional neural networks on graphs with fast localized spectral filtering. Lee D, Sugiyama M, Luxburg U, Guyon I, Garnett R, eds. Proc. 29th Advances in Neural Information Processing Systems, vol. 27 (Curran Associates, Red Hook, NY), 3844-3852.Google Scholar
[13] Dhar V, Geva T, Oestreicher-Singer G, Sundararajan A (2014) Prediction in economic networks. Inform. Systems Res. 25(2):264-284.Link, Google Scholar
[14] Ding X, Zhang Y, Liu T, Duan J (2015) Deep learning for event-driven stock prediction. Proc. 24th Internat. Joint Conf. Artificial Intelligence (AAAI Press, Palo Alto, CA), 2327-2333.Google Scholar
[15] Ehling P, Heyerdahl-Larsen C (2017) Correlations. Management Sci. 63(6):1919-1937.Link, Google Scholar
[16] Elfeky MG, Aref WG, Elmagarmid AK (2005) Periodicity detection in time series databases. IEEE Trans. Knowledge Data Engrg. 17(7):875-887.Crossref, Google Scholar · doi:10.1109/TKDE.2005.114
[17] Elliott RJ, Van Der Hoek J, Malcolm WP (2005) Pairs trading. Quantitative Finance 5(3):271-276.Crossref, Google Scholar · Zbl 1134.91415 · doi:10.1080/14697680500149370
[18] Fan X, Li B, Li C, Isson S, Chen L (2019) Scalable deep generative relational model with high-order node dependence. Wallach H, Larochelle H, Beygelzimer A, d’Alché-Buc F, Fox E, Garnett R, eds. Proc. 32nd Advances in Neural Information Processing Systems, vol. 32 (Curran Associates, Red Hook, NY), 12658-12668.Google Scholar
[19] Feng F, He X, Wang X, Luo C, Liu Y, Chua TS (2019) Temporal relational ranking for stock prediction. ACM Trans. Inform. Systems 37(2):27.Crossref, Google Scholar · doi:10.1145/3309547
[20] Goodfellow I, Bengio Y, Courville A, Bengio Y (2016) Deep Learning (MIT Press, Cambridge, MA).Google Scholar
[21] Grover A, Leskovec J (2016) node2vec: Scalable feature learning for networks. Krishnapuram B, Shah M, Smola A, Aggarwal C, Shen D, Rastogi R, eds. Proc. 22nd ACM SIGKDD Internat. Conf. Knowledge Discovery and Data Mining (ACM, New York), 855-864.Google Scholar
[22] Hamilton W, Ying Z, Leskovec J (2017) Inductive representation learning on large graphs. Guyon I, Luxburg UV, Bengio S, Wallach H, Fergus R. Vishwanathan S, Garnett R, eds. Proc. 30th Advances in Neural Information Processing Systems, vol. 30 (Curran Associates, Red Hook, NY), 1025-1035.Google Scholar
[23] Hoberg G, Phillips G (2016) Text-based network industries and endogenous product differentiation. J. Political Econom. 124(5):1423-1465.Crossref, Google Scholar · doi:10.1086/688176
[24] Hu Z, Liu W, Bian J, Liu X, Liu TY (2018) Listening to chaotic whispers: A deep learning framework for news-oriented stock trend prediction. Chang Y, Zhai CX, Liu Y, Maarek Y, eds. Proc. 11th ACM Internat. Conf. Web Search and Data Mining (ACM, New York), 261-269.Google Scholar
[25] Hudson RS, Gregoriou A (2015) Calculating and comparing security returns is harder than you think: A comparison between logarithmic and simple returns. Internat. Rev. Financial Anal. 38:151-162.Crossref, Google Scholar · doi:10.1016/j.irfa.2014.10.008
[26] Kim D, Qi Y (2010) Accruals quality, stock returns, and macroeconomic conditions. Accounting Rev. 85(3):937-978.Crossref, Google Scholar · doi:10.2308/accr.2010.85.3.937
[27] Kingma DP, Ba J (2015) Adam: A method for stochastic optimization. Proc. 3rd Internat. Conf. Learn. Representations.Google Scholar
[28] Kipf TN, Welling M (2017) Semi-supervised classification with graph convolutional networks. Proc. 5th Internat. Conf. Learn. Representations.Google Scholar
[29] Kock CJ (2005) When the market misleads: Stock prices, firm behavior, and industry evolution. Organ. Sci. 16(6):637-660.Link, Google Scholar
[30] Kraus M, Feuerriegel S (2017) Decision support from financial disclosures with deep neural networks and transfer learning. Decision Support Systems 104:38-48.Crossref, Google Scholar · doi:10.1016/j.dss.2017.10.001
[31] Krawczyk B (2016) Learning from imbalanced data: Open challenges and future directions. Progress Artificial Intelligence 5(4):221-232.Crossref, Google Scholar · doi:10.1007/s13748-016-0094-0
[32] Kumar S, Zhang X, Leskovec J (2019) Predicting dynamic embedding trajectory in temporal interaction networks. Teredesai A, Kumar V, Li Y, Rosales R, Terzi E, Karypis G, eds. Proc. 25th ACM SIGKDD Internat. Conf. Knowledge Discovery & Data Mining (ACM, New York), 1269-1278.Google Scholar
[33] Kwan SH (1996) Firm-specific information and the correlation between individual stocks and bonds. J. Financial Econom. 40(1):63-80.Crossref, Google Scholar · doi:10.1016/0304-405X(95)00836-4
[34] Li Q, Chen Y, Jiang LL, Li P, Chen H (2016a) A tensor-based information framework for predicting the stock market. ACM Trans. Inform. Systems 34(2):11.Crossref, Google Scholar · doi:10.1145/2838731
[35] Li X, Chen K, Sun SX, Fung T, Wang H, Zeng DD (2016b) A commonsense knowledge-enabled textual analysis approach for financial market surveillance. INFORMS J. Comput. 28(2):278-294.Link, Google Scholar · Zbl 1415.91336
[36] Maas AL, Hannun AY, Ng AY (2013) Rectifier nonlinearities improve neural network acoustic models. Dasgupta S, McAllester D, eds. Proc. ICML Workshop on Deep Learn. for Audio, Speech, and Language Processing.Google Scholar
[37] Mao MQ, Wei KCJ (2016) Cash-flow news and the investment effect in the cross section of stock returns. Management Sci. 62(9):2504-2519.Link, Google Scholar
[38] Markowitz H (1968) Portfolio Selection: Efficient Diversification of Investments (Yale University Press, New Haven, CT).Google Scholar
[39] Nair V, Hinton GE (2010) Rectified linear units improve restricted Boltzmann machines. Daumé H, ed. Proc. 27th Internat. Conf. Machine Learn. (PMLR), 807-814.Google Scholar
[40] Perozzi B, Al-Rfou R, Skiena S (2014) Deepwalk: Online learning of social representations. Macskassy S, Perlich C, Leskovec J, Wang W, Ghani R, eds. Proc. 20th ACM SIGKDD Internat. Conf. Knowledge Discovery and Data Mining (ACM, New York), 701-710.Google Scholar
[41] Qin Y, Song D, Cheng H, Cheng W, Jiang G, Cottrell GW (2017) A dual-stage attention-based recurrent neural network for time series prediction. Proc. 26th Internat. Joint Conf. Artificial Intelligence (AAAI Press, Palo Alto, CA), 2627-2633.Google Scholar
[42] Raffel C, Ellis DP (2016) Feed-forward networks with attention can solve some long-term memory problems. Proc. 4th Internat. Conf. Learn. Representations Workshop Track.Google Scholar
[43] Rios J, Zhao K, Street WN, Blackhurst J (2021) Stock price movement cross-predictability in supply chain networks. Preprint, submitted January 23, https://dx.doi.org/10.2139/ssrn.3737658.Google Scholar
[44] Rosenberg B, Rudd A (1982) Factor-related and specific returns of common stocks: Serial correlation and market inefficiency. J. Finance 37(2):543-554.Crossref, Google Scholar · doi:10.1111/j.1540-6261.1982.tb03575.x
[45] Ruder S (2016) An overview of gradient descent optimization algorithms. Preprint, submitted June 5, 2017, https://arxiv.org/abs/1609.04747.Google Scholar
[46] Schwert GW (1989) Why does stock market volatility change over time? J. Finance 44(5):1115-1153.Crossref, Google Scholar · doi:10.1111/j.1540-6261.1989.tb02647.x
[47] Sharpe WF (1966) Mutual fund performance. J. Bus. 39(1):119-138.Crossref, Google Scholar · doi:10.1086/294846
[48] Si J, Mukherjee A, Liu B, Pan SJ, Li Q, Li H (2014) Exploiting social relations and sentiment for stock prediction. Moschitti A, Pang B, Daelemans W, eds. Proc. 2014 Conf. Empirical Methods in Natural Language Processing, (Association for Computational Linguistics, Stroudsburg), 1139-1145.Google Scholar
[49] Singleton JC, Wingender J (1986) Skewness persistence in common stock returns. J. Financial Quantitative Anal. 21(3):335-341.Crossref, Google Scholar · doi:10.2307/2331046
[50] Tian H, Zheng X, Zeng DD (2019) Analyzing the dynamic sectoral influence in Chinese and American stock markets. Phys. A 536:120922.Crossref, Google Scholar · doi:10.1016/j.physa.2019.04.158
[51] Tütüncü RH, Koenig M (2004) Robust asset allocation. Ann. Oper. Res. 132(1):157-187.Crossref, Google Scholar · Zbl 1090.90125 · doi:10.1023/B:ANOR.0000045281.41041.ed
[52] Veličković P, Cucurull G, Casanova A, Romero A, Liò P, Bengio Y (2018) Graph attention networks. Proc. 6th Internat. Conf. Learn. Representations.Google Scholar
[53] Wu H, Zhang W, Shen W, Wang J (2018) Hybrid deep sequential modeling for social text-driven stock prediction. Paton NW, Srivastava D, Agrawal R, Broder A, eds. Proc. 27th ACM Internat. Conf. Inform. and Knowledge Management (ACM, New York), 1627-1630.Google Scholar
[54] Wu Z, Pan S, Chen F, Long G, Zhang C, Yu PS (2019) A comprehensive survey on graph neural networks. IEEE Trans. Neural Network Learn. Systems 32(1):4-24.Crossref, Google Scholar · doi:10.1109/TNNLS.2020.2978386
[55] Xie J, Qiu Z (2007) The effect of imbalanced data sets on LDA: A theoretical and empirical analysis. Pattern Recognition 40(2):557-562.Crossref, Google Scholar · Zbl 1118.68129 · doi:10.1016/j.patcog.2006.01.009
[56] Xu Y, Cohen SB (2018) Stock movement prediction from tweets and historical prices. Gurevych I, Miyao Y, eds. Proc. 56th Annual Meeting Assoc. for Computational Linguistics, vol. 1 (ACL, Melbourne, Australia), 1970-1979.Google Scholar
[57] Xu B, Wang N, Chen T, Li M (2015) Empirical evaluation of rectified activations in convolutional network. Preprint, submitted November 27, https://arxiv.org/abs/1505.00853.Google Scholar
[58] Yang Y, Padmanabhan B, Liu H, Wang X (2012) Discovery of periodic patterns in sequence data: a variance-based approach. INFORMS J. Comput. 24(3):372-386.Link, Google Scholar · Zbl 1465.62140
[59] You J, Ying R, Ren X, Hamilton WL, Leskovec J (2018) GraphRNN: Generating realistic graphs with deep auto-regressive models. Proc. 27th Internat. Conf. Machine Learn., vol. 80 (PMLR), 5708-5717.Google Scholar
[60] Zhang L, Aggarwal C, Qi GJ (2017a) Stock price prediction via discovering multi-frequency trading patterns. Matwin S, Yu SP, Farooq F, eds. Proc. 23rd ACM SIGKDD Internat. Conf. Knowledge Discovery and Data Mining (ACM, New York), 2141-2149.Google Scholar
[61] Zhang X, Zou Y, Shi W (2017b) Dilated convolution neural network with LeakyReLU for environmental sound classification. Proc. 22nd Internat. Conf. Digital Signal Processing (IEEE, New York), 1-5.Google Scholar
[62] Zhao K, Yen J, Ngamassi LM, Maitland C, Tapia AH (2012) Simulating inter-organizational collaboration network: a multi-relational and event-based approach. Simulation 88(5):617-633.Crossref, Google Scholar · doi:10.1177/0037549711421942
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.