Image Source: PexelsWhile portfolio optimization typically focuses on risk and return prediction, implementation costs critically matter. Unfortunately, predicting trading costs is challenging because the largest component for a large investor is price impact, which depends on the size of the trade, the amount traded by other traders in that security, and the identity of the trader, thus, impeding a generic solution. To address the problem, Rusian Goyenko, Bryan Kelly, Tobias Moskowitz, Yinan Su, and Chao Zhang, authors of the August 2024 study “Trading Volume Alpha,” hypothesized that trading volume is a valuable source of information for estimating trading costs. Assuming a fixed trade size, trading costs should be declining in trading volume. To test the hypothesis, they developed a machine learning model that can accurately predict trading volume for individual stocks. They then demonstrated how this model can be used to construct a portfolio that outperforms a traditional market-cap weighted portfolio.They began by noting that prior research (Frazzini, Israel, and Moskowitz,2018) has shown that trade size divided by daily trading volume– termed the market participation rate in a stock– is the key driver of price impact costs and that price impact is an increasing function of participation rate. Holding trade size constant, the less trading volume in the stock, the greater the trader’s price impact will be. Thus, all else equal, a higher predicted volume allows the trader to trade larger amounts because price impact per dollar traded will be lower. Conversely, a lower predicted volume causes the trader to trade less aggressively, scaling back trading (perhaps even to zero) because the price impact per dollar will be higher. Thus, their strategy was to forecast trading volume for each security as a proxy for expected trading costs, and then use this forecast to optimize portfolios net of those costs and quantify its benefits. Their analysis assumed that the trade size was fixed. Note that in practice, intuitively, a fund might attempt to trade when the cost to trade was cheap and less when more expensive.The authors then presented their machine learning model for predicting trading volume. Their model is based on a recurrent neural network. To predict volume they used technical signals, such as lagged returns and lagged trading volume, as well as firm characteristics that the literature finds capture return anomalies. They then added indicators for various market-wide or firm-level events associated with volume fluctuation, including upcoming and past earnings releases and analyzed both linear and non-linear prediction methods using various neural networks designed to maximize out-of-sample predictability. Finally, they altered the objective/loss function of the neural network to consider the portfolio problem’s economic objective when predicting volume.The authors then demonstrated how their model can be used to construct a portfolio.
“To quantify the economic magnitude of volume prediction, we incorporate volume prediction into a portfolio theory problem. We model a portfolio framework that seeks to maximize the net-of-cost performance of the portfolio using a mean-variance utility function, where the cost of transacting scales linearly with participation rate (motivated by theory and empirical work from the literature). The optimization trades off the cost of trading versus the (opportunity) cost of not trading– minimizing trading costs versus minimizing tracking error to the before-cost optimal portfolio– where trading costs and tracking error are endogenously negatively related. In the model, we take the first and second moments of security returns as given and focus solely on the tradeoff between trading costs and tracking error to the pre-cost optimal portfolio.”
Again, note that in practice, when a stock is expensive to trade another option would be to trade something else instead. For example, if stock A gets too expensive to trade, a fund could consider a similar stock B which is cheaper to trade. That may be better than not trading because there are often a number of stocks with similar characteristics, and thus similar expected returns, and every so often a fund needs to trade (for example, due to cash flows).Their sample period was 2018 to 2022, or 1,258 days. The cross-section covered around 4,700 stocks, with an average of 3,500 stocks per day, or 4,400,000 observations in total. Following is a summary of some of their key findings:Their model was able to accurately predict trading volume for a wide range of stocks.
The results are hypothetical results and are NOT an indicator of future results and do NOT represent returns that any investor actually attained. Indexes are unmanaged and do not reflect management or trading fees, and one cannot invest directly in an index.
Their findings led the authors to conclude:
“Volume is highly predictable, especially when using machine learning techniques, large data signals, and exploiting the virtueof complexity in prediction. We find that volume prediction can be as valuable as return prediction in achieving optimal mean-variance portfolios net of trading costs. We find that incorporating an economic objective function directly into machine learning is even more effective for obtaining useful predictions. This feature may be general to many finance applications of machine learning, where incorporating the economic objective directly may dominate a two-step process that first satisfies some statistical objective and then incorporates that statistical object into an economic framework. For volume prediction, the asymmetric cost of overestimating versus underestimating volume is captured (ignored) by an economic (statistical) objective, and delivers sizeable economic impact.” They added: “Refining the prediction methods and deepening the prediction models could add substantially to these improvements, generating even larger trading volume alpha.”
SummaryThe paper Trading Volume Alpha makes a significant contribution to the literature on trading volume. The authors provide a clear and concise overview of the existing literature. They also presented a novel machine-learning model for predicting trading volume. Their findings have the potential to revolutionize the way funds are constructed and strategies implemented. In the conclusion, the authors stated:
“A more exhaustive search for prediction variables and models that forecast volume more accurately could translate into even larger economic benefits than we show here. Some promising candidates for additional features and methods are lead-lag volume relations across stocks, more seasonal indicators, other market microstructure variables, and more complex nn and rnn models.”
The paper is a must-read for anyone interested in trading volume or machine learning.More By This Author:Artificial Intelligence, Textual Analysis And Hedge Fund PerformanceThe Hidden Cost Of Index Replication Data-Driven Approach To Clustering Similar Macroeconomic Regimes