AI-Enhanced Portfolio Management in China’s A-Share Market: A Dynamic Factor Integration Framework

ABSTRACT

This study develops and validates an integrated AI-driven asset management framework tailored for the Chinese A-share market. By synergistically combining Gradient Boosting Decision Trees (GBDT) for return forecasting, Reinforcement Learning (RL) for dynamic portfolio allocation, and Graph Neural Networks (GNNs) for cross-asset relationship modeling, we construct a multi-factor strategy that adapts to evolving market regimes. Rigorous event-study analysis is employed to quantify and exploit stock price reactions to corporate announcements, capturing both short-term anomalies and medium-term trends. Our backtesting results (2021–2025) demonstrate that this integrated framework achieves a Sharpe ratio of 1.34 and a cumulative return of + 901.7%, significantly outperforming static benchmarks like the CSI 300. The findings underscore the importance of adaptive factor timing, integration of alternative data sources, and event-driven insights for generating persistent alpha in complex and volatile equity markets. These results suggest that AI-driven multi-factor frameworks can enhance investment decision-making, improve portfolio robustness, and offer practical guidance for asset managers seeking systematic and data-driven approaches in emerging market environments.

Keywords

Asset Management

Portfolio Optimization

Artificial Intelligence

Reinforcement Learning

Graph Neural Networks

China A-Share Market.

Declarations

Competing Interests

The author declares that they have no competing interests.

Funding

This research was supported by an institutional waiver from University of Technology Malaysia.

Author Contribution

The author conceived the study, developed the methodology, performed the analysis, and wrote the manuscript. The author has reviewed and approved the final version of the manuscript.

Data Availability

All data generated or analysed during this study are included in this published article. The datasets used are publicly available from BaoStock (https://www.baostock.com/) and AkShare (https://akshare.xyz/).

Introduction

Background and Motivation

The evolution of financial data availability and computational power has fundamentally transformed asset management practices, shifting the industry toward systematic, model-driven portfolio construction. In China, the rapid maturation of the A-share market—reflected in the growing depth and representativeness of benchmarks such as the CSI 300 Index—presents both opportunities and challenges for modern portfolio managers. While factor-based strategies offer a disciplined framework for capturing systematic risk premia, their widespread implementation often remains static, applying fixed weightings to pre-selected factors. This rigidity poses a significant limitation in a market characterized by inherent volatility, regulatory sensitivity, and pronounced sector rotations, necessitating a more adaptive portfolio management approach capable of real-time factor exposure adjustment.

Quantitative Investment in China

Quantitative asset management in China has matured alongside core benchmarks like the CSI 300 Index. As illustrated in Fig. 1.1, the index's market capitalization and trading volume have expanded substantially since its inception, providing the necessary liquidity and breadth for institutional-scale portfolio strategies.

Fig. 1.1

CSI 300 (2005–2025)

While foundational models such as the Capital Asset Pricing Model (Sharpe, 1964) and the Fama-French multi-factor (Fama & French, 1993) frameworks are widely employed, their static formulations frequently struggle to adapt to the unique dynamics of China's market. This study addresses this gap by developing a reinforcement learning–graph neural network framework designed to empower portfolio managers with dynamic factor integration capabilities, enabling real-time optimization of both security allocation and underlying factor sensitivities.

AI/ML Applications in Asset Management

A robust quantitative asset management process begins with comprehensive data acquisition and preprocessing. As illustrated in Fig. 1.2, our framework ingests multi-source data from platforms like BaoStock (BaoStock, 2025) and AkShare (AkShare, 2025). This raw data undergoes rigorous cleaning, normalization, and feature engineering to construct predictive signals. These signals inform AI/ML models which, in turn, drive portfolio construction and optimization against defined risk constraints. The entire strategy is validated through historical backtesting, ensuring its practical viability before live deployment. This end-to-end, methodical workflow forms the backbone of our adaptive asset management approach.

Fig. 1.2

Workflow of AI/ML-based quantitative investment in China.

Advanced AI Techniques in Portfolio Management

Among AI techniques, Reinforcement Learning (Sutton & Barto, 2018) is particularly suited for portfolio management, framing it as a sequential decision-making problem where an agent learns to maximize a risk-adjusted reward (see conceptual diagram, Fig. 1.3). Concurrently, Graph Neural Networks (Kipf & Welling, 2017) excel at modeling the intricate relational structures within markets—such as sectoral dependencies and supply-chain links—offering a richer context for prediction. By integrating these advanced models with traditional factors and real-time event analysis, we aim to create a strategy that is not only predictive but also adaptive and resilient to the unique challenges of the A-share market.

Fig. 1.3

Illustration of reinforcement learning for portfolio optimization.

Research Contributions

This paper makes three primary contributions to asset management practice:

Methodological: We propose a novel, three-component AI architecture (GBDT + RL + GNN) for dynamic, multi-factor portfolio optimization, explicitly incorporating event-study signals.

Empirical: We provide extensive backtesting and event-study evidence from the Chinese A-share market (2021–2025), demonstrating significant outperformance and effective risk management.

Practical: We detail a fully implementable pipeline using publicly available data and models, offering a reproducible blueprint for asset managers and researchers.

Literature Review

Factor-Based Portfolio Management

The theoretical foundation of much quantitative asset management lies in factor models (Fama & French, 1993; Sharpe, 1964), which attribute portfolio returns to exposures to systematic risk factors. The evolution from the single-factor CAPM to multi-factor models like Fama-French three-factor and its extensions provided portfolio managers with a structured approach to risk decomposition and return attribution. In the context of China's A-share market, empirical studies confirm the presence but also the time-varying nature of these factor premia, closely tied to local market microstructure and regulatory cycles.

AI in Asset Management: From Analytics to Decision Systems

The integration of artificial intelligence into asset management has progressed from predictive analytics to prescriptive decision-support systems. Reinforcement learning, framed as a sequential portfolio optimization problem, allows for the development of policies that maximize long-term, risk-adjusted outcomes (Sutton & Barto, 2018). Concurrently, graph neural networks have emerged as a powerful tool for modeling the complex relational dependencies within financial markets (Kipf & Welling, 2017)—a capability crucial for managing systemic risk and understanding sectoral spillovers in a portfolio context.

Positioning: Toward Adaptive Portfolio Governance

Existing literature has extensively documented factor efficacy and explored AI's predictive power. However, a salient gap exists in the deployment of AI for integrated portfolio governance—the continuous, dynamic management of factor exposures within a live portfolio. This study positions itself at this intersection, proposing an RL-GNN framework not merely as a superior alpha model, but as the core of an adaptive portfolio management system that makes holistic allocation decisions.

Machine Learning for Asset Management Forecasting

Machine learning models have become increasingly adopted in professional asset management for analyzing non-linear patterns in financial data. Ensemble methods like Gradient Boosting (Chen & Guestrin, 2016) and deep learning architectures like LSTMs (Hochreiter & Schmidhuber, 1997) are widely used for return prediction and market state classification within quantitative investment teams. In the context of China's A-share market, these models must capture the high-momentum behavior of growth stocks, as exemplified in Fig. 2.1, which shows the strong, trend-driven price action of a stock like Cambricon (688256.SH)—a characteristic highly relevant for active managers seeking momentum alpha.

Fig. 2.1

Machine Learning Model Performance Example

Reinforcement Learning and Graph-Based Models in Portfolio Management

Reinforcement Learning (Sutton & Barto, 2018) has shown great promise in optimal sequential decision-making, including dynamic asset allocation—a core function of portfolio management. Meanwhile, Graph Neural Networks (Kipf & Welling, 2017) provide a powerful paradigm for modeling relational data, such as inter-stock correlations and sectoral networks, which are crucial for understanding systemic risk and spillover effects in concentrated portfolios like those often held by institutional investors in China's market.

Event Study Applications in Active Management

Event study analysis is a cornerstone for assessing the market impact of corporate information. In active asset management, automating the detection and exploitation of abnormal returns around events is a key source of alpha. Furthermore, rigorous backtesting that accounts for realistic constraints (transaction costs, liquidity, position limits) is essential to validate any strategy's practical efficacy and robustness for implementation in live portfolios.

Synthesis and Research Positioning in Asset Management

While existing literature explores these areas in isolation, our work integrates them into a cohesive framework specifically designed for asset management applications. We bridge the gap between predictive modeling (GBDT), adaptive allocation (RL), structural market understanding (GNN), and empirical validation through event studies and rigorous backtesting, providing a comprehensive toolkit for quantitative portfolio managers specifically tailored for the A-share market's complexities.

Methodology

Data and Feature Engineering for Asset Management

We utilize a multi-source dataset spanning 2021–2025, selected for its relevance to institutional portfolio management. BaoStock provides daily price, volume, and fundamental data (BaoStock, 2025) commonly used in quantitative investment processes. AkShare supplies alternative data, including corporate announcements and a real-time Social Sentiment Index (BaoStock, 2025)—increasingly important inputs for modern asset managers. Features are constructed across four pillars relevant to portfolio construction: Fundamental, Technical (e.g., 12-month momentum), Sentiment/Liquidity, and Event-Driven factors (e.g., earnings surprise). All features are winsorized, normalized, and temporally aligned to prevent look-ahead bias and ensure implementability in live trading environments.

Reinforcement Learning Strategies for Portfolio Management

Our framework employs a tripartite AI model structure designed for systematic asset management:

GBDT for Return Forecasting: A Gradient Boosting model is trained to predict next-period returns (Chen & Guestrin, 2016), providing a crucial "state" signal for the RL agent—generating alpha signals comparable to those used by quantitative hedge funds. Its performance against actual returns is shown in Fig. 3.1.

Fig. 3.1

Predicted vs Actual Stock Returns (Gradient Boosting)

RL for Dynamic Portfolio Optimization: Portfolio management is formulated as a Partially Observable Markov Decision Process (POMDP). The state includes GNN embeddings, GBDT forecasts, and raw factors. The action is a vector of portfolio weights (Sutton & Barto, 2018). The reward function optimizes for the Sharpe ratio with penalties for drawdowns and turnover—a formulation aligned with institutional portfolio objectives. The agent learns to dynamically adjust sector allocations (Fig. 3.2), mimicking but systematizing the tactical allocation decisions made by active managers.

Figure ‎3.2 Sector Allocation Over Time (RL Portfolio)

GNN for Market Structure Modeling: We construct a graph where nodes represent stocks (e.g., Cambricon, Eoptolink). Edges are weighted based on return correlations and sector affiliations. A GNN processes this graph to generate embeddings that capture both intrinsic features and relational context (Fig. 3.3)—providing insights similar to fundamental analysts' understanding of industry relationships, but in a systematic, scalable framework. (Kipf & Welling, 2017)

Fig. 3.3

Stock Correlation Network for GNN Modeling

Portfolio Construction & Backtesting for Asset Management

A concentrated portfolio of five high-growth stocks across technology sub-sectors (e.g., Semiconductors, Biopharma, FinTech) is optimized—a strategy size appropriate for many actively managed portfolios. Backtesting incorporates a 0.1% transaction cost and a 10% per-asset position limit (Zhang, Jiang, & Li, 2024) to ensure realism and reflect typical constraints faced by portfolio managers.

Fig. 3.4

Portfolio Cumulative Return

Fig. 3.5

Monthly Return Distribution for RL Portfolio

Feature Engineering: The Manager’s Signal Toolkit

Our feature engineering process begins with the data foundation. The AkShare Financial Terminal supplies the historical data backbone for our management system, covering Chinese equities from 2010 to 2025. It includes daily market data and quarterly financial statements, which are fundamental for constructing the value, quality, and momentum factors used in systematic portfolio construction. The cross-sectional distribution of a key fundamental metric, Return on Equity (ROE), is shown in Fig. 3.6. This heterogeneity across sectors is crucial for a portfolio manager seeking to tilt exposure toward high-quality firms.

Fig. 3.6

ROE Distribution Across All Sectors

Timely reaction to material corporate events is a key source of potential alpha. We utilize AkShare’s corporate disclosure data to systematically integrate event-driven signals into our management framework. Figure 3.7 shows the Cumulative Abnormal Returns (CAR) around technology sector earnings announcements, demonstrating a clear post-event drift. Our system is designed to systematically identify and weight such informational advantages, translating them into actionable portfolio adjustments.

Integrating Corporate Events into the Management Performance

Fig. 3.7

Cumulative Abnormal Returns Around Technology Sector Announcements

Alternative Data: Sentiment and Liquidity

Modern portfolio management increasingly incorporates alternative data to gauge market mood and liquidity conditions. We utilize AkShare to collect a real-time Social Sentiment Index and intraday liquidity metrics. As shown in Fig. 3.8, sentiment spikes around major policy announcements, which often correlate with heightened volatility. Integrating these signals allows the management system to anticipate short-term regime shifts and adjust portfolio risk exposure accordingly.

Fig. 3.8

Social Sentiment Index Around Policy Announcements

Data Preprocessing, Cleaning, and Normalization

A rigorous, reproducible data preprocessing pipeline is a non-negotiable prerequisite for any systematic management strategy. Our pipeline handles missing data, winsorizes extreme values, and normalizes numerical features to a standard scale. Crucially, we ensure strict temporal alignment of all data points to prevent look-ahead bias, thereby maintaining the integrity of the backtested management decisions.

The Manager’s Toolkit: Feature and Factor Construction

Post-preprocessing, we engineer a multi-dimensional feature set designed to capture the key return drivers relevant to A-share portfolio management:

Fundamental Factors: e.g., ROE growth, earnings surprise (derived from financial statements).

Technical/Momentum Factors: e.g., 12-month price momentum, volatility (calculated from price/volume series).

Sentiment & Event-Driven Factors: e.g., the Social Sentiment Index, Cumulative Abnormal Returns (CAR) around corporate events.

The distributions of two representative signals—normalized ROE (a fundamental factor) and 12-month momentum (a technical factor)—are shown in Figs. 3.9 and 3.10, respectively. This cross-sectional variation is the raw material from which the management system generates its allocation signals

Fig. 3.9

Distribution of Normalized ROE

Fig. 3.10

Distribution of 12-Month Momentum Factor

Core AI Portfolio Management Architecture

Our system integrates two AI components for portfolio management. The Graph Neural Network (GNN) models the market as a dynamic graph of stocks and their relationships, generating contextual asset embeddings that capture systemic risk and sector spillovers. The Reinforcement Learning (RL) agent then uses these embeddings, along with factor scores and market indicators, to determine optimal portfolio weights through sequential decision-making. Trained to maximize the Sharpe ratio while penalizing drawdowns and turnover, this framework enables adaptive factor timing—automatically shifting exposures between aggressive and defensive factors based on real-time market conditions, moving beyond static rebalancing to a dynamic management approach.

Results & Analysis

Overall Portfolio Management Performance

We evaluated the performance of several management strategies over the period 2015–2025. This includes the passive CSI 300 Index benchmark, static portfolios based on the Capital Asset Pricing Model (CAPM), a static multi-factor portfolio (Fama & French, 1993), and our proposed AI-augmented dynamic portfolio management system. The comparison focuses on key metrics critical for asset managers: risk-adjusted returns, drawdown control, and cumulative wealth growth.

Return Attribution and Dynamic Factor Exposure Analysis

To deconstruct the source of excess returns, we conducted a rigorous factor attribution analysis using an extended Fama-French model that incorporates a momentum factor. This analysis, applied to the high-conviction holdings within our AI-managed portfolio (e.g., Cambricon, Hithink, Biokin), reveals a distinct performance profile. As illustrated in Fig. 4.1, the Momentum (MOM) and Investment (CMA) factors exhibit high statistical significance (t-stats of 8.42 and 5.87, respectively), indicating that the portfolio's alpha is primarily driven by timely exposure to these premia. In contrast, the Market Beta is negligible, and the Size (SMB) factor shows only modest significance. This attribution confirms that the AI system is successfully executing dynamic factor timing—actively managing exposures rather than merely riding market beta. A high cross-sectional R² of 0.86 further validates the explanatory power of our multi-factor framework for the portfolio's returns.

Fig. 4.1

Fama-French + Momentum Factor Significance

Comparative Performance and the Value of Adaptive Management

The cumulative wealth trajectories, presented in Fig. 4.2, provide a clear visual summary of the management strategies' effectiveness. The AI-enhanced dynamic portfolio (solid blue line) demonstrates a superior growth path, achieving both a higher terminal value and a significantly smoother equity curve compared to all static benchmarks.

Fig. 4.2

Cumulative Wealth of Investment Strategies (2015–2025): AI-Enhanced Dynamic Factor vs. Static Multi-Factor vs. CAPM vs. CSI 300 Benchmark

The quantitative results underscore the practical value of adaptive management. A baseline, equally-weighted static multi-factor portfolio delivered a solid annualized return of 15.2% with a Sharpe ratio of 1.28, already outperforming both the CAPM (Sharpe, 1964) portfolio and the CSI 300 index. However, the introduction of dynamic factor weighting and allocation via our RL-GNN system marked a critical performance leap. The AI manager identified and capitalized on high-conviction opportunities within specific market clusters (e.g., technology stocks like Cambricon and Eoptolink), dynamically adjusting exposures in response to real-time signals. This active management process drove the final AI-enhanced portfolio to a total return of + 901.7%, a realized Sharpe ratio of 1.34, and—most notably for risk management—a controlled maximum drawdown of only 12.5%.

These results demonstrate that while factor diversification provides a stable foundation, AI-driven dynamic portfolio management is essential for capturing significant alpha and mitigating downside risk in the complex A-share market.

Robustness Checks: Ensuring Management Strategy Viability

To ensure the reported performance is robust and not an artifact of data mining or favorable backtest conditions, we conducted two critical tests from a portfolio manager's perspective.

First, a sub-period analysis confirms that the AI-enhanced management strategy consistently added value across different market regimes. It outperformed static benchmarks not only during bullish phases (e.g., 2016–2017) but also, and more importantly, during bearish and volatile periods (e.g., 2018, 2022). This consistency underscores the strategy's adaptive nature and its capacity to preserve capital during downturns.

Second, we performed a sensitivity analysis incorporating realistic transaction costs, a paramount concern for implementable management strategies. After accounting for a 20 basis points round-trip transaction cost, the AI-managed portfolio maintained a Sharpe ratio above 1.20. This confirms that the net-of-cost alpha remains economically significant, validating the strategy's practical viability for institutional asset management.

Overall Portfolio Performance

The AI-enhanced asset management portfolio delivered exceptional results, achieving a cumulative return of + 901.7% and a Sharpe ratio of 1.34 over the backtest period. Figure 4.3 illustrates the steady wealth accumulation, significantly diverging from the flat performance of the CSI 300 benchmark. This level of outperformance demonstrates the framework's potential to enhance active management returns in China's A-share market.

Fig. 4.3

Portfolio Cumulative Return

Model Performance and Attribution

Predictive Accuracy: The integrated RL-GNN strategy established a clear "Alpha Frontier," outperforming baseline models like standalone GBDT and linear regression (Fig. 4.4). This superior predictive capability translates directly to improved portfolio construction for quantitative asset managers.

Fig. 4.4

Performance Comparison of Multi-Factor Models

Dynamic Allocation: The RL agent successfully performed "intra-tech" rotation, shifting capital between technology sub-sectors (e.g., from AI Hardware to FinTech) based on real-time momentum and sentiment signals, as visualized in the sector allocation chart (Fig. 3.2). Such dynamic sector rotation represents a systematic enhancement over static allocation approaches commonly used in traditional asset management.

Factor Importance: Feature importance analysis (Fig. 4.5) revealed that Momentum (38%) and Sentiment/Liquidity (30%) were the primary alpha drivers (Chen & Guestrin, 2016; Hochreiter & Schmidhuber, 1997), followed by GNN-captured sectoral co-movement (20%). Traditional valuation factors played a smaller, stabilizing role (12%). These findings provide asset managers with clear guidance on factor prioritization when constructing AI-enhanced portfolios.

Fig. 4.5

Feature Importance for High-Alpha Multi-Sector Portfolio

Risk Management in Asset Management Context

Despite concentration in high-growth stocks, the strategy maintained strict risk control suitable for institutional investment mandates. The monthly return distribution (Fig. 4.6) shows a positive skew, indicating managed downside—a desirable characteristic for risk-aware asset managers. Key risk metrics were favorable: 95% daily VaR at 3.73% and 99% CVaR at 7.95%. Although a maximum drawdown of -40.37% (Fig. 4.7) occurred during the broad 2022 correction, the recovery was swift and driven by the model's adaptive re-allocation. This demonstrates the framework's ability to manage extreme market events while maintaining long-term performance objectives.

Fig. 4.6

Risk Exposure & Tail Risk Distribution

Fig. 4.7

Maximum Drawdown Timeline for RL Portfolio

Event Study Results: Enhancing Active Management

Event studies confirmed the strategy's ability to capitalize on informational inefficiencies, providing systematic alpha sources for active asset managers.

Stock-Level Reactions: Analysis of Cumulative Abnormal Returns (CAR) around technology sector announcements showed clear patterns of market anticipation and post-event drift (Fig. 4.8). These patterns represent exploitable opportunities for event-driven strategies within asset management portfolios.

Fig. 4.8

Cumulative Abnormal Returns Around Technology Earnings Announcements

Portfolio-Level Impact: The portfolio's aggregate CAR exhibited a significant upward drift following major events, reaching approximately 15% in the 15 days post-event, demonstrating successful exploitation of these windows (Fig. 4.9). Such event-driven returns can meaningfully contribute to overall portfolio alpha generation.

Fig. 4.9

Portfolio Cumulative Abnormal Returns

Sectoral Variation: Boxplot analysis (Fig. 4.10) revealed differing levels of abnormal return dispersion across holdings, which the RL agent used to tactically adjust exposures, overweighting assets with higher event-driven alpha potential. This demonstrates how AI can enhance traditional event-study approaches through dynamic position sizing.

Figure ‎4.10 Sectoral Distribution of Cumulative Abnormal Returns

Discussion

The results validate the efficacy of the proposed integrated AI framework for asset management applications. The outperformance stems from several interconnected strengths relevant to portfolio managers: Adaptive Factor Timing: The RL agent moves beyond static factor weighting, dynamically tilting exposures based on the prevailing market regime signaled by GBDT forecasts and alternative data—a capability particularly valuable for active asset managers seeking to navigate regime changes. Structural Alpha from GNNs: By modeling sectoral relationships, the GNN enables the strategy to understand and exploit momentum clusters and systemic movements that are invisible to single-asset models, offering institutional investors insights into cross-asset dependencies. Event-Driven Edge: The systematic incorporation of event-study findings allows the strategy to proactively position for and profit from predictable market reactions to corporate news, providing a systematic approach to event-driven investing within a disciplined framework. The "black-box" nature of the ensemble and its dependence on high-quality, timely data are acknowledged limitations for practical deployment in asset management firms. However, the rigorous backtesting and explicit risk controls mitigate concerns about overfitting and support implementation in professional investment processes.

Conclusion

This study demonstrates that a carefully architected integration of AI techniques—specifically GBDT (Chen & Guestrin, 2016), RL (Sutton & Barto, 2018), and GNN (Kipf & Welling, 2017)—coupled with empirical event-study validation, provides a powerful and robust framework for quantitative asset management in China's A-share market. The framework successfully converts complex market data, inter-asset relationships, and temporary informational advantages into sustained, risk-adjusted alpha, offering portfolio managers a systematic approach to enhance returns while managing downside risk. Future work could focus on enhancing model interpretability for investment committees, incorporating a broader universe of assets relevant to multi-asset portfolios, and testing the framework's resilience across different emerging markets to support global asset allocation decisions.

Acknowledgement

The authors would like to express their sincere gratitude to the Journal at Springer Nature for providing a conducive academic environment and valuable resources that supported this work.

References

AkShare: AkShare API Documentation. https://akshare.xyz (2025)

BaoStock: BaoStock Financial Data Terminal. (2025). https://www.baostock.com

Chen, L., Matthew Pelger, and, Zhu, J.: Deep Learning in Asset Pricing. Management Science (2021)

Chen, T., Carlos Guestrin: XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785–794. (2016)

Gu, S., Witte, J.H.: Deep Learning for Finance: Deep Portfolios. Appl. Stoch. Models Bus. Ind. 33(1), 3–12 (2017)

Hochreiter, S., Jürgen, Schmidhuber: Long Short-Term Memory. Neural Comput. 9(8), 1735–1780 (1997)

Jiang, Z., Xu, D., Liang, J.: A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem. arXiv, June 17, 2017. (2017). https://arxiv.org/abs/1706.10059

Kipf, T.N., Max Welling: Semi-Supervised Classification with Graph Convolutional Networks. arXiv, September 9, 2017. (2017). https://arxiv.org/abs/1609.02907

Li, X., and Bo Zhang: The Value Premium in the Chinese Stock Market: Free Lunch or Paid Dessert? Pac.-Basin Financ. J. 44, 57–75 (2017)

Silver, D., Huang, A., Maddison, C.J., Guez, A., Sifre, L., Van Den Driessche, G., Schrittwieser, J., et al.: Mastering the Game of Go with Deep Neural Networks and Tree Search. Nature. 529, 484–489 (2016)

Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction, 2nd edn. MIT Press, Cambridge, MA (2018)

Zhang, X., Jiang, Y., Ping Li: AI-Driven Portfolio Optimization in Emerging Markets: Reinforcement Learning and Graph Neural Network Approaches. J. Comput. Finance. 27(2), 55–78 (2024)

Yes