1 Introduction
-
We suggest the use of two types of stock networks; (1) a price-based network using the stock price correlations and (2) a text-based network using the 10-K and 10-Q document embeddings. To build the text-based network, we develop and propose a novel NLP-based technique to fuse multiple document embedding techniques.
-
It is analysed that those two networks have exclusive characteristics that the other does not have. As such, using both networks simultaneously, we construct Combined Network that identify peers that have strong business similarity yet have low stock price correlation.
-
Using the Combined Network, a novel industry momentum portfolio, Hidden Neighbours portfolio, is proposed, delivering a Sharpe ratio of 0.85 between 2013 and 2022, while the SIC-based industry momentum benchmark delivered a Sharpe ratio of 0.55.
2 Background and related works
2.1 Momentum in stock returns
2.2 Industry momentum
2.3 Network of stocks
2.4 Portfolio construction leveraging text information
3 Constructing text-based and price-based networks
3.1 Price-based network
3.2 Text-based network
3.2.1 Text data collection
3.2.2 NLP models
3.3 Network backboning
4 Constructing combined network
4.1 Network analysis
Period | Text-based network | Price-based network |
---|---|---|
2012–2013 | 0.292 | 0.173 |
2013–2014 | 0.279 | 0.214 |
2014–2015 | 0.276 | 0.229 |
2015–2016 | 0.251 | 0.195 |
2016–2017 | 0.276 | 0.191 |
2017–2018 | 0.268 | 0.202 |
2018–2019 | 0.230 | 0.206 |
2019–2020 | 0.241 | 0.217 |
2020–2021 | 0.237 | 0.237 |
4.2 Combined network
5 Hidden neighbours industry momentum portfolio
5.1 Hidden neighbours portfolio
5.2 Benchmarks
-
Standard Momentum (Jegadeesh and Titman 1993) Standard Momentum benchmark is computed by selecting the top 50 stocks in the S&P500 with the highest total returns, with a \(6-1\) look-back period. We hold this portfolio for 6 months, resulting in \(J=6, K=6\) equal-weighted momentum portfolio with 1 month skipping.
-
SIC Industry Momentum (Moskowitz and Grinblatt 1999) Specifically, we first identify momentum stocks as stocks with top 30th percentile total returns. Then, industry momentum is extracted using the first 2 digits of a company’s SIC code, which appear among the momentum stocks. Finally, all S&P 500 companies with the respective momentum-experiencing SIC codes are selected, and we equally weight the top 50 stocks with the highest total return in a \(6-1\) look-back period. The holding period is set to 6 months.
-
S&P 500 Index We compare our returns to the market portfolio, which was inferred from the total return of the S&P 500 index.
5.3 Sharpe ratio and cumulative returns
Strategy | Sharpe ratio | Annualised returns | Cumulative returns |
---|---|---|---|
Hidden Neighbours | 0.85 | 18.16 | 457 |
Standard Momentum | 0.73 | 17.06 | 393 |
SIC Industry Momentum | 0.55 | 13.72 | 290 |
S&P 500 | 0.57 | 12.02 | 268 |
5.4 Factor decomposition
Cahart’s factors | Hidden Neighbours | SIC Industry Momentum | Standard Momentum |
---|---|---|---|
\(\alpha\) | 0.06 (0.024) \(**\) | 0.05 (0.027) | 0.06 (0.030) |
\(R_m - R_f\) | 0.10 (0.055) | 0.10 (0.064) | 0.11 (0.057) \(**\) |
SMB | 0.11 (0.064) | 0.12 (0.072) | 0.12 (0.067) |
HML | 0.01 (0.054) | −0.01 (0.063) | 0.12 (0.067) |
UMD | 0.04 (0.030) | 0.05 (0.036) | 0.06 (0.026) \(**\) |
6 Further discussion
6.1 Identifying hidden neighbours with combined network
SIC | NAICS | Price-based network | Text-based network | Combined network |
---|---|---|---|---|
State Farm | AIG | Bank of America | W.R. Berkley | The Hartford |
Travelers | Progressive | MetLife | Cincinnati Financial | DR Horton |
The Hartford | Travelers | Prudential | Allstate | CVS Health |
MetLife | Liberty Mutual | J.P. Morgan | AIG | Cardinal Health |
Prudential | Loew’s | Loew’s | Norfolk Southern | J.M. Smucker |
6.2 Maximum drawdown
Strategy | 2020 | 2022 |
---|---|---|
Hidden Neighbours | −34.35 | −28.20 |
Standard Momentum | −39.37 | −21.20 |
SIC Industry Momentum | −42.68 | −28.51 |
S&P 500 | −33.92 | −25.43 |
6.3 Trading cost and portfolio turnover
6.4 Embedding model
Embedding method | Sharpe ratio |
---|---|
Doc2Vec | 0.75 |
FinBERT | 0.63 |
FinBERT w/o fine-tuning | 0.56 |
Doc2Vec+FinBERT (Proposed) | 0.85 |