Quantitative Investment in China
Quantitative asset management in China has matured alongside core benchmarks like the CSI 300 Index. As illustrated in Fig. 1.1, the index's market capitalization and trading volume have expanded substantially since its inception, providing the necessary liquidity and breadth for institutional-scale portfolio strategies.
While foundational models such as the Capital Asset Pricing Model (Sharpe, 1964) and the Fama-French multi-factor (Fama & French, 1993) frameworks are widely employed, their static formulations frequently struggle to adapt to the unique dynamics of China's market. This study addresses this gap by developing a reinforcement learning–graph neural network framework designed to empower portfolio managers with dynamic factor integration capabilities, enabling real-time optimization of both security allocation and underlying factor sensitivities.
AI/ML Applications in Asset Management
A robust quantitative asset management process begins with comprehensive data acquisition and preprocessing. As illustrated in Fig. 1.2, our framework ingests multi-source data from platforms like BaoStock (BaoStock, 2025) and AkShare (AkShare, 2025). This raw data undergoes rigorous cleaning, normalization, and feature engineering to construct predictive signals. These signals inform AI/ML models which, in turn, drive portfolio construction and optimization against defined risk constraints. The entire strategy is validated through historical backtesting, ensuring its practical viability before live deployment. This end-to-end, methodical workflow forms the backbone of our adaptive asset management approach.
Research Contributions
This paper makes three primary contributions to asset management practice:
Methodological: We propose a novel, three-component AI architecture (GBDT + RL + GNN) for dynamic, multi-factor portfolio optimization, explicitly incorporating event-study signals.
Empirical: We provide extensive backtesting and event-study evidence from the Chinese A-share market (2021–2025), demonstrating significant outperformance and effective risk management.
Practical: We detail a fully implementable pipeline using publicly available data and models, offering a reproducible blueprint for asset managers and researchers.
Machine Learning for Asset Management Forecasting
Machine learning models have become increasingly adopted in professional asset management for analyzing non-linear patterns in financial data. Ensemble methods like Gradient Boosting (Chen & Guestrin, 2016) and deep learning architectures like LSTMs (Hochreiter & Schmidhuber, 1997) are widely used for return prediction and market state classification within quantitative investment teams. In the context of China's A-share market, these models must capture the high-momentum behavior of growth stocks, as exemplified in Fig. 2.1, which shows the strong, trend-driven price action of a stock like Cambricon (688256.SH)—a characteristic highly relevant for active managers seeking momentum alpha.
Reinforcement Learning and Graph-Based Models in Portfolio Management
Reinforcement Learning (Sutton & Barto, 2018) has shown great promise in optimal sequential decision-making, including dynamic asset allocation—a core function of portfolio management. Meanwhile, Graph Neural Networks (Kipf & Welling, 2017) provide a powerful paradigm for modeling relational data, such as inter-stock correlations and sectoral networks, which are crucial for understanding systemic risk and spillover effects in concentrated portfolios like those often held by institutional investors in China's market.
Event Study Applications in Active Management
Event study analysis is a cornerstone for assessing the market impact of corporate information. In active asset management, automating the detection and exploitation of abnormal returns around events is a key source of alpha. Furthermore, rigorous backtesting that accounts for realistic constraints (transaction costs, liquidity, position limits) is essential to validate any strategy's practical efficacy and robustness for implementation in live portfolios.
Synthesis and Research Positioning in Asset Management
While existing literature explores these areas in isolation, our work integrates them into a cohesive framework specifically designed for asset management applications. We bridge the gap between predictive modeling (GBDT), adaptive allocation (RL), structural market understanding (GNN), and empirical validation through event studies and rigorous backtesting, providing a comprehensive toolkit for quantitative portfolio managers specifically tailored for the A-share market's complexities.
Methodology
Data and Feature Engineering for Asset Management
We utilize a multi-source dataset spanning 2021–2025, selected for its relevance to institutional portfolio management. BaoStock provides daily price, volume, and fundamental data (BaoStock, 2025) commonly used in quantitative investment processes. AkShare supplies alternative data, including corporate announcements and a real-time Social Sentiment Index (BaoStock, 2025)—increasingly important inputs for modern asset managers. Features are constructed across four pillars relevant to portfolio construction: Fundamental, Technical (e.g., 12-month momentum), Sentiment/Liquidity, and Event-Driven factors (e.g., earnings surprise). All features are winsorized, normalized, and temporally aligned to prevent look-ahead bias and ensure implementability in live trading environments.
Reinforcement Learning Strategies for Portfolio Management
Our framework employs a tripartite AI model structure designed for systematic asset management:
GBDT for Return Forecasting: A Gradient Boosting model is trained to predict next-period returns (Chen & Guestrin, 2016), providing a crucial "state" signal for the RL agent—generating alpha signals comparable to those used by quantitative hedge funds. Its performance against actual returns is shown in Fig. 3.1.
A
RL for Dynamic Portfolio Optimization: Portfolio management is formulated as a Partially Observable Markov Decision Process (POMDP). The state includes GNN embeddings, GBDT forecasts, and raw factors. The action is a vector of portfolio weights (Sutton & Barto,
2018). The reward function optimizes for the Sharpe ratio with penalties for drawdowns and turnover—a formulation aligned with institutional portfolio objectives. The agent learns to dynamically adjust sector allocations (Fig. 3.2), mimicking but systematizing the tactical allocation decisions made by active managers.
Figure 3.2 Sector Allocation Over Time (RL Portfolio)
GNN for Market Structure Modeling: We construct a graph where nodes represent stocks (e.g., Cambricon, Eoptolink). Edges are weighted based on return correlations and sector affiliations. A GNN processes this graph to generate embeddings that capture both intrinsic features and relational context (Fig. 3.3)—providing insights similar to fundamental analysts' understanding of industry relationships, but in a systematic, scalable framework. (Kipf & Welling, 2017)
Portfolio Construction & Backtesting for Asset Management
A concentrated portfolio of five high-growth stocks across technology sub-sectors (e.g., Semiconductors, Biopharma, FinTech) is optimized—a strategy size appropriate for many actively managed portfolios. Backtesting incorporates a 0.1% transaction cost and a 10% per-asset position limit (Zhang, Jiang, & Li, 2024) to ensure realism and reflect typical constraints faced by portfolio managers.
Feature Engineering: The Manager’s Signal Toolkit
Our feature engineering process begins with the data foundation. The AkShare Financial Terminal supplies the historical data backbone for our management system, covering Chinese equities from 2010 to 2025. It includes daily market data and quarterly financial statements, which are fundamental for constructing the value, quality, and momentum factors used in systematic portfolio construction. The cross-sectional distribution of a key fundamental metric, Return on Equity (ROE), is shown in Fig. 3.6. This heterogeneity across sectors is crucial for a portfolio manager seeking to tilt exposure toward high-quality firms.
Timely reaction to material corporate events is a key source of potential alpha. We utilize AkShare’s corporate disclosure data to systematically integrate event-driven signals into our management framework. Figure 3.7 shows the Cumulative Abnormal Returns (CAR) around technology sector earnings announcements, demonstrating a clear post-event drift. Our system is designed to systematically identify and weight such informational advantages, translating them into actionable portfolio adjustments.
Integrating Corporate Events into the Management Performance
Timely reaction to material corporate events is a key source of potential alpha. We utilize AkShare’s corporate disclosure data to systematically integrate event-driven signals into our management framework. Figure 3.7 shows the Cumulative Abnormal Returns (CAR) around technology sector earnings announcements, demonstrating a clear post-event drift. Our system is designed to systematically identify and weight such informational advantages, translating them into actionable portfolio adjustments.
Alternative Data: Sentiment and Liquidity
Modern portfolio management increasingly incorporates alternative data to gauge market mood and liquidity conditions. We utilize AkShare to collect a real-time Social Sentiment Index and intraday liquidity metrics. As shown in Fig. 3.8, sentiment spikes around major policy announcements, which often correlate with heightened volatility. Integrating these signals allows the management system to anticipate short-term regime shifts and adjust portfolio risk exposure accordingly.
Data Preprocessing, Cleaning, and Normalization
A rigorous, reproducible data preprocessing pipeline is a non-negotiable prerequisite for any systematic management strategy. Our pipeline handles missing data, winsorizes extreme values, and normalizes numerical features to a standard scale. Crucially, we ensure strict temporal alignment of all data points to prevent look-ahead bias, thereby maintaining the integrity of the backtested management decisions.
The Manager’s Toolkit: Feature and Factor Construction
Post-preprocessing, we engineer a multi-dimensional feature set designed to capture the key return drivers relevant to A-share portfolio management:
Fundamental Factors: e.g., ROE growth, earnings surprise (derived from financial statements).
Technical/Momentum Factors: e.g., 12-month price momentum, volatility (calculated from price/volume series).
Sentiment & Event-Driven Factors: e.g., the Social Sentiment Index, Cumulative Abnormal Returns (CAR) around corporate events.
A
A
The distributions of two representative signals—normalized ROE (a fundamental factor) and 12-month momentum (a technical factor)—are shown in Figs.
3.9 and 3.10, respectively. This cross-sectional variation is the raw material from which the management system generates its allocation signals
.
Core AI Portfolio Management Architecture
Our system integrates two AI components for portfolio management. The Graph Neural Network (GNN) models the market as a dynamic graph of stocks and their relationships, generating contextual asset embeddings that capture systemic risk and sector spillovers. The Reinforcement Learning (RL) agent then uses these embeddings, along with factor scores and market indicators, to determine optimal portfolio weights through sequential decision-making. Trained to maximize the Sharpe ratio while penalizing drawdowns and turnover, this framework enables adaptive factor timing—automatically shifting exposures between aggressive and defensive factors based on real-time market conditions, moving beyond static rebalancing to a dynamic management approach.
Results & Analysis
Overall Portfolio Management Performance
We evaluated the performance of several management strategies over the period 2015–2025. This includes the passive CSI 300 Index benchmark, static portfolios based on the Capital Asset Pricing Model (CAPM), a static multi-factor portfolio (Fama & French, 1993), and our proposed AI-augmented dynamic portfolio management system. The comparison focuses on key metrics critical for asset managers: risk-adjusted returns, drawdown control, and cumulative wealth growth.
Return Attribution and Dynamic Factor Exposure Analysis
To deconstruct the source of excess returns, we conducted a rigorous factor attribution analysis using an extended Fama-French model that incorporates a momentum factor. This analysis, applied to the high-conviction holdings within our AI-managed portfolio (e.g., Cambricon, Hithink, Biokin), reveals a distinct performance profile. As illustrated in Fig. 4.1, the Momentum (MOM) and Investment (CMA) factors exhibit high statistical significance (t-stats of 8.42 and 5.87, respectively), indicating that the portfolio's alpha is primarily driven by timely exposure to these premia. In contrast, the Market Beta is negligible, and the Size (SMB) factor shows only modest significance. This attribution confirms that the AI system is successfully executing dynamic factor timing—actively managing exposures rather than merely riding market beta. A high cross-sectional R² of 0.86 further validates the explanatory power of our multi-factor framework for the portfolio's returns.
Comparative Performance and the Value of Adaptive Management
The cumulative wealth trajectories, presented in Fig. 4.2, provide a clear visual summary of the management strategies' effectiveness. The AI-enhanced dynamic portfolio (solid blue line) demonstrates a superior growth path, achieving both a higher terminal value and a significantly smoother equity curve compared to all static benchmarks.
The quantitative results underscore the practical value of adaptive management. A baseline, equally-weighted static multi-factor portfolio delivered a solid annualized return of 15.2% with a Sharpe ratio of 1.28, already outperforming both the CAPM (Sharpe, 1964) portfolio and the CSI 300 index. However, the introduction of dynamic factor weighting and allocation via our RL-GNN system marked a critical performance leap. The AI manager identified and capitalized on high-conviction opportunities within specific market clusters (e.g., technology stocks like Cambricon and Eoptolink), dynamically adjusting exposures in response to real-time signals. This active management process drove the final AI-enhanced portfolio to a total return of + 901.7%, a realized Sharpe ratio of 1.34, and—most notably for risk management—a controlled maximum drawdown of only 12.5%.
These results demonstrate that while factor diversification provides a stable foundation, AI-driven dynamic portfolio management is essential for capturing significant alpha and mitigating downside risk in the complex A-share market.
Robustness Checks: Ensuring Management Strategy Viability
To ensure the reported performance is robust and not an artifact of data mining or favorable backtest conditions, we conducted two critical tests from a portfolio manager's perspective.
First, a sub-period analysis confirms that the AI-enhanced management strategy consistently added value across different market regimes. It outperformed static benchmarks not only during bullish phases (e.g., 2016–2017) but also, and more importantly, during bearish and volatile periods (e.g., 2018, 2022). This consistency underscores the strategy's adaptive nature and its capacity to preserve capital during downturns.
Second, we performed a sensitivity analysis incorporating realistic transaction costs, a paramount concern for implementable management strategies. After accounting for a 20 basis points round-trip transaction cost, the AI-managed portfolio maintained a Sharpe ratio above 1.20. This confirms that the net-of-cost alpha remains economically significant, validating the strategy's practical viability for institutional asset management.
Overall Portfolio Performance
The AI-enhanced asset management portfolio delivered exceptional results, achieving a cumulative return of + 901.7% and a Sharpe ratio of 1.34 over the backtest period. Figure 4.3 illustrates the steady wealth accumulation, significantly diverging from the flat performance of the CSI 300 benchmark. This level of outperformance demonstrates the framework's potential to enhance active management returns in China's A-share market.
Model Performance and Attribution
Predictive Accuracy: The integrated RL-GNN strategy established a clear "Alpha Frontier," outperforming baseline models like standalone GBDT and linear regression (Fig. 4.4). This superior predictive capability translates directly to improved portfolio construction for quantitative asset managers.
Dynamic Allocation: The RL agent successfully performed "intra-tech" rotation, shifting capital between technology sub-sectors (e.g., from AI Hardware to FinTech) based on real-time momentum and sentiment signals, as visualized in the sector allocation chart (Fig. 3.2). Such dynamic sector rotation represents a systematic enhancement over static allocation approaches commonly used in traditional asset management.
Factor Importance: Feature importance analysis (Fig. 4.5) revealed that Momentum (38%) and Sentiment/Liquidity (30%) were the primary alpha drivers (Chen & Guestrin, 2016; Hochreiter & Schmidhuber, 1997), followed by GNN-captured sectoral co-movement (20%). Traditional valuation factors played a smaller, stabilizing role (12%). These findings provide asset managers with clear guidance on factor prioritization when constructing AI-enhanced portfolios.
Risk Management in Asset Management Context
Despite concentration in high-growth stocks, the strategy maintained strict risk control suitable for institutional investment mandates. The monthly return distribution (Fig. 4.6) shows a positive skew, indicating managed downside—a desirable characteristic for risk-aware asset managers. Key risk metrics were favorable: 95% daily VaR at 3.73% and 99% CVaR at 7.95%. Although a maximum drawdown of -40.37% (Fig. 4.7) occurred during the broad 2022 correction, the recovery was swift and driven by the model's adaptive re-allocation. This demonstrates the framework's ability to manage extreme market events while maintaining long-term performance objectives.
Event Study Results: Enhancing Active Management
Event studies confirmed the strategy's ability to capitalize on informational inefficiencies, providing systematic alpha sources for active asset managers.
Stock-Level Reactions: Analysis of Cumulative Abnormal Returns (CAR) around technology sector announcements showed clear patterns of market anticipation and post-event drift (Fig. 4.8). These patterns represent exploitable opportunities for event-driven strategies within asset management portfolios.
Portfolio-Level Impact: The portfolio's aggregate CAR exhibited a significant upward drift following major events, reaching approximately 15% in the 15 days post-event, demonstrating successful exploitation of these windows (Fig. 4.9). Such event-driven returns can meaningfully contribute to overall portfolio alpha generation.
A
Sectoral Variation: Boxplot analysis (Fig. 4.10) revealed differing levels of abnormal return dispersion across holdings, which the RL agent used to tactically adjust exposures, overweighting assets with higher event-driven alpha potential. This demonstrates how AI can enhance traditional event-study approaches through dynamic position sizing.
Figure 4.10 Sectoral Distribution of Cumulative Abnormal Returns
Conclusion
This study demonstrates that a carefully architected integration of AI techniques—specifically GBDT (Chen & Guestrin, 2016), RL (Sutton & Barto, 2018), and GNN (Kipf & Welling, 2017)—coupled with empirical event-study validation, provides a powerful and robust framework for quantitative asset management in China's A-share market. The framework successfully converts complex market data, inter-asset relationships, and temporary informational advantages into sustained, risk-adjusted alpha, offering portfolio managers a systematic approach to enhance returns while managing downside risk. Future work could focus on enhancing model interpretability for investment committees, incorporating a broader universe of assets relevant to multi-asset portfolios, and testing the framework's resilience across different emerging markets to support global asset allocation decisions.