References
1. Mestav Sarica, G. & Pan, T.-C. Seismic loss dynamics in three Asian megacities using a macro-level approach based on socioeconomic exposure indicators. Communications Earth & Environment 3, 101 (2022).
2. Birkmann, J., Welle, T., Solecki, W., Lwasa, S. & Garschagen, M. Boost resilience of small and mid-sized cities. Nature 537, 605–608 (2016).
3. Persson, P., Andersen, L. V., Persson, K. & Bucinskas, P. Effect of structural design on traffic-induced building vibrations. Procedia Engineering 199, 2711–2716 (2017).
4. Zhao, S., Zhang, C., Dai, X. & Yan, Z. Review of wind-induced effects estimation through nonlinear analysis of tall buildings, high-rise structures, flexible bridges and transmission lines. Buildings 13, 2033 (2023).
5. Wang, Y., Zhao, D. & Li, H. Finite Element Model Updating Technique for Super High-Rise Building Based on Response Surface Method. Buildings 15, 126 (2025).
6. Abdelnaby, A. E. & Elnashai, A. S. Numerical modeling and analysis of RC frames subjected to multiple earthquakes. Earthquakes and Structures 9, 957–981 (2015).
7. Li, X. et al. Mechanics-informed autoencoder enables automated detection and localization of unforeseen structural damage. Nature Communications 15, 9229 (2024).
8. Cao, Z. et al. Dynamic sensitivity-based finite element model updating for nonlinear structures using time-domain responses. International Journal of Mechanical Sciences 184, 105788 (2020).
9. Yao, M. Nonlinear structural dynamic finite element analysis using Ritz vector reduced basis method. Shock and Vibration 3, 259–268 (1996).
10. Ohsaki, M. et al. High-precision finite element analysis of elastoplastic dynamic responses of super‐high‐rise steel frames. Earthquake engineering & structural dynamics 38, 635–654 (2009).
11. McKenna, F. OpenSees: a framework for earthquake engineering simulation. Computing in Science & Engineering 13, 58–66 (2011).
12. Moaveni, B., Conte, J. P. & Hemez, F. M. Uncertainty and Sensitivity Analysis of Damage Identification Results Obtained Using Finite Element Model Updating. Computer-Aided Civil and Infrastructure Engineering 24, 320–334 (2009).
13. Butt, F. & Omenzetter, P. Seismic response trends evaluation and finite element model calibration of an instrumented RC building considering soil-structure interaction and non-structural components. Engineering Structures 65, 111–123 (2014).
14. Karapiperis, K. & Kochmann, D. M. Prediction and control of fracture paths in disordered architected materials using graph neural networks. Communications Engineering 2, 32 (2023).
15. Cremen, G., Galasso, C. & McCloskey, J. A simulation-based framework for earthquake risk‐informed and people‐centered decision making on future urban planning. Earth's Future 10, e2021EF002388 (2022).
16. Li, Y. et al. The physics-based deterministic scenarios for earthquake hazards and losses of the Zhujiangkou fault in southern China. npj Natural Hazards 2, 6 (2025).
17. Ereiz, S., Duvnjak, I. & Jiménez-Alonso, J. F. Review of finite element model updating methods for structural applications. In Proceedings of the Structures, 684–723 (2022).
18. Wang, X. et al. Kriging-based surrogate data-enriching artificial neural network prediction of strength and permeability of permeable cement-stabilized base. Nature Communications 15, 4891 (2024).
19. LeCun, Y. & Bengio, Y. Convolutional networks for images, speech, and time series. The handbook of brain theory and neural networks 3361, 1995 (1995).
20. Wang, T. Y. et al. Probabilistic Seismic Response Prediction of Three-Dimensional Structures Based on Bayesian Convolutional Neural Network. Sensors 22 (2022).
21. Wu, R.-T. & Jahanshahi, M. R. Deep convolutional neural network for structural dynamic response estimation and system identification. Journal of Engineering Mechanics 145, 04018125 (2019).
22. Chung, J., Gulcehre, C., Cho, K. & Bengio, Y. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555 (2014).
23. Perez-Ramirez, C. A. et al. Recurrent neural network model with Bayesian training and mutual information for response prediction of large buildings. Engineering Structures 178, 603–615 (2019).
24. Hu, Y., Tsang, H. H., Lam, N. & Lumantarna, E. Physics-informed neural networks for enhancing structural seismic response prediction with pseudo-labelling. Archives of Civil and Mechanical Engineering 24 (2023).
25. Zhang, R. et al. Deep long short-term memory networks for nonlinear structural seismic response prediction. Computers & Structures 220, 55–68 (2019).
26. Peng, H., Yan, J. W., Yu, Y. & Luo, Y. Z. Time series estimation based on deep Learning for structural dynamic nonlinear prediction. Structures 29, 1016–1031 (2021).
27. Liu, F., Li, J. & Wang, L. PI-LSTM: Physics-informed long short-term memory network for structural response modeling. Engineering Structures 292, 116500 (2023).
28. Xu, Z., Chen, J., Shen, J. & Xiang, M. Recursive long short-term memory network for predicting nonlinear structural seismic response. Engineering Structures 250, 113406 (2022).
29. Vaswani, A. et al. Attention is all you need. Advances in neural information processing systems 30 (2017).
30. Zhang, Q. et al. Transformer-based structural seismic response prediction. Structures 61, 105929 (2024).
31. Eshkevari, S. S., Takác, M., Pakzad, S. N. & Jahani, M. DynNet: Physics-based neural architecture design for nonlinear structural response modeling and prediction. Engineering Structures 229 (2021).
32. Yu, Y., Yao, H. & Liu, Y. Structural dynamics simulation using a novel physics-guided machine learning method. Engineering Applications of Artificial Intelligence 96, 103947 (2020).
33. Zhang, R., Liu, Y. & Sun, H. Physics-guided convolutional neural network (PhyCNN) for data-driven seismic response modeling. Engineering Structures 215, 110704 (2020).
34. Zhang, R., Liu, Y. & Sun, H. Physics-informed multi-LSTM networks for metamodeling of nonlinear structures. Computer Methods in Applied Mechanics and Engineering 369, 113226 (2020).
35. Zhou, Y., Meng, S., Lou, Y. & Kong, Q. Physics-informed deep learning-based real-time structural response prediction method. Engineering 35, 140–157 (2024).
36. Chen, Z., Liu, Y. & Sun, H. Physics-informed learning of governing equations from scarce data. Nature communications 12, 6136 (2021).
37. Okazaki, T., Ito, T., Hirahara, K. & Ueda, N. Physics-informed deep learning approach for modeling crustal deformation. Nature Communications 13, 7092 (2022).
38. Mangalathu, S. & Jeon, J.-S. Classification of failure mode and prediction of shear strength for reinforced concrete beam-column joints using machine learning techniques. Engineering Structures 160, 85–94 (2018).
39. Sun, H., Burton, H. V. & Huang, H. Machine learning applications for building structural design and performance assessment: State-of-the-art review. Journal of Building Engineering 33, 101816 (2021).
40. Zhou, Y. et al. Seismic performance of a frame-supported shear wall over‐track building through shaking table test. The Structural Design of Tall and Special Buildings 33, e2098 (2024).
41. Ancheta, T. D., Darragh, R. B., Stewart, J. P., Seyhan, E., Silva, W. J., Chiou, B. S. J., Donahue, J. L.,. NGA-West2 Database. PEER Report 2013/03. Pacific Earthquake Engineering Research Center, University of California, Berkeley, 2014.
42. Zhang, L., Rao, A. & Agrawala, M. Adding conditional control to text-to-image diffusion models. In Proceedings of the the IEEE/CVF international conference on computer vision, 3836–3847 (2023).
43. Ministry of Housing and Urban-Rural Development of the People’s Republic of China. Code for Seismic Design of Buildings. GB 50011–2010. Beijing: 2016.
44. Ministry of Housing and Urban-Rural Development of the PRC. Technical Specification for Concrete Structures of Tall Buildings. JGJ 3-2010. Beijing: China Architecture & Building Press, 2010.
45. Hendrycks, D. & Gimpel, K. Gaussian error linear units (gelus). arXiv preprint arXiv:1606.08415 (2016).
46. Ainslie, J. et al. Gqa: Training generalized multi-query transformer models from multi-head checkpoints. arXiv preprint arXiv:2305.13245 (2023).
47. Su, J. et al. Roformer: Enhanced transformer with rotary position embedding. Neurocomputing 568, 127063 (2024).
48. Shazeer, N. Glu variants improve transformer. arXiv preprint arXiv:2002.05202 (2020). Methods Numerical module: Dataset acquisition and processing The development of a robust numerical module is essential for laying the foundation of the deep learning model in this study. This module focuses on acquiring and processing high-quality data that accurately represents real-world dynamic scenarios. Acknowledging the critical role of high-fidelity data in enhancing deep learning performance, we created a comprehensive dataset using non-linear time history analysis (NLTHA) simulations.
1. -External excitations Selection: To robustly generalize structural response predictions across a diverse spectrum of dynamic loading scenarios, the external excitation database includes seismic ground motions, stochastic white noise, subway-induced vibrations, and impulsive loading. A total of 26,000 dynamic records were assembled, with the majority sourced from realistic seismic ground-motion records41 spanning predominantly mid-frequency ranges (approximately 1–10 Hz). These seismic records were supplemented with other dynamic excitations strategically selected or synthesized to comprehensively cover lower (< 1 Hz) and higher (> 10 Hz) frequency domains. Amplitude scaling was consistently applied to maintain realistic intensity distributions aligned with established seismic design codes and structural engineering guidelines. All excitation records were uniformly sampled at a time interval of 0.02 seconds, ensuring high temporal resolution for subsequent nonlinear time-history analyses (NLTHA). By incorporating dynamic phenomena across multiple frequency scales—extending over three orders of magnitude—the diversified excitation dataset enhances SeisGPT’s predictive accuracy and generalizability, thereby advancing its potential as a foundational model for accurately predicting structural responses under a wide variety of external dynamic conditions
2. -Building Models: A total of 270,000 detailed macro-scale building models were developed for this study, encompassing four principal structural categories: 150,000 frame structures, 60,000 frame–shear wall structures, 60,000 shear wall structures, and 694 models derived from real-world buildings. These models span a wide spectrum of typologies—including high-rise, mid-rise, and low-rise configurations—and incorporate diverse functional programs, such as residential, office, and commercial types. A summary of all finite element models and corresponding nonlinear time-history simulation data used in this study is provided in Extended Data Table 4. To enable large-scale, diverse, and structurally valid building model generation, we developed a dedicated generative design pipeline built upon a family of customized diffusion models: ArchiFlux, StructFlux, and BeamFlux. These models were designed and trained by the authors, extending the capabilities of the base Flux architecture through targeted fine-tuning using ControlNet42, which conditions generation on architectural constraints. This framework allows for the synthesis of high-fidelity structural configurations with semantic consistency and compliance with seismic design codes. The training data consisted of semantically labeled architectural drawings from real-world buildings, in which key components—including partitions, windows, doors, beams, columns, and shear walls—were annotated using distinct color channels to facilitate visual and algorithmic parsing. These annotated plans formed a structured dataset for training our domain-specific generative models. Each model in the suite serves a specialized function: ArchiFlux synthesizes floor plans with spatial segmentation aligned to functional zoning; StructFlux generates the spatial arrangement of primary load-resisting elements such as columns and shear walls; and BeamFlux defines the distribution of secondary framing components. The outputs from these models are then post-processed and evaluated to ensure structural coherence, such as alignment between vertical and horizontal load paths and adherence to minimum clearances and layout feasibility. To ensure compliance with seismic performance criteria, a final optimization phase adjusts the geometric dimensions and reinforcement ratios of structural components. This phase incorporates drift ratio constraints and flexural strength checks to satisfy inter-story performance limits and material capacity bounds. The result is a collection of 270,000 high-fidelity macro-models, encompassing 150,000 frame structures, 60,000 frame–shear wall structures, and 60,000 shear wall structures, all of which conform to relevant structural design specifications. The integration of advanced diffusion-based generation with physics-informed post-optimization marks a significant methodological advance, enabling automated, scalable production of realistic and code-compliant structural models. Further details of the model architecture, training protocol, and optimization workflow are provided in Extended Data Fig. 4. The input variables for these building structures and their structural elements were defined according to the specifications in GB/T50011-201043 and JGJ 3-201044. The generation of each building geometry and configuration model adhered to a structured process that maintained realistic relationships between parameters while allowing for controlled variability. To ensure a robust and versatile dataset, the range of values and combinations was intentionally extended beyond typical correlations, allowing for a more comprehensive exploration of the input space. This approach enhances the model’s generalizability by training it on a diverse set of building configurations. Key geometric parameters include floor height (ranging from 2.8 to 4.5 m), slab thickness (80 to 120 mm), and shear wall thickness (200 to 400 mm). The dimensions of columns and beams were automatically designed per design codes, with calculations for axial compression and flexural strength to ensure compliance. If any generated dimensions were found to be unreasonable, they were regenerated. The reinforcement ratio for all structural elements was automatically calculated according to the applicable design standards. The material properties, such as concrete strength (ranging from C25 to C50) and reinforcement steel bar strength (ranging from 355 to 400 MPa), were considered. This detailed generation process ensures that each building configuration is unique, realistic, and internally consistent, effectively representing real-world structural designs. The design parameters and their ranges for reinforced concrete (RC) buildings are summarized in Extended Data Table 5.
3. -NLTHA Simulations: Each of the 270,694 building models was subjected to three randomly selected ground motions in both the x and y directions, resulting in a total of 2,053,880 dynamic simulations. All simulations were conducted using the OpenSees11, a widely adopted open-source platform for nonlinear structural analysis. Implicit analysis was employed for these simulations, as it computes the response at each time step under the assumption of static equilibrium, making it particularly suitable for handling complex material behavior and large deformations, both of which are essential in dynamic simulations. The final output from the simulations consists of detailed NLTHA responses, providing crucial response data for each RC structure under the specified external excitation. This response data is critical for training, validating, and testing the deep learning models.
4. -Dataset Splitting and Characteristics: To ensure robust model evaluation and generalizability, the dataset was divided into training and testing sets using a random selection process. The final split allocated 267,629, buildings for training and 3,065 buildings for testing. With a time interval of 0.02 seconds for each input excitation and the duration of the simulations, the entire dataset represents approximately 10 billion data points. This vast and diverse dataset provides a solid foundation for model training and evaluation across different structural types and scenarios.
The comprehensive nature of the dataset—encompassing various building types, external dynamic excitations, and structural responses—ensures that the resulting deep learning model will be well-equipped to handle a wide range of real-world scenarios. This robust numerical module forms the cornerstone of our study, enabling the development of a highly accurate and versatile response prediction model.
Simplified dynamic response (SDR) module
The simplified dynamic‑response (SDR) module is devised for rapid, coarse‑resolution estimation of global structural response, converting a high-fidelity FE model into a compact mechanical surrogate. The complete workflow is illustrated in Extended Data Fig. 5. Floor masses were obtained by integrating the self-weight of the slabs, tributary masses from vertical members, and prescribed live-load allowances, resulting in the lumped mass matrix (
M). This approach preserves the true vertical distribution of inertia, which governs modal participation in tall or irregular buildings. To capture lateral and torsional stiffness without using spring analogies, unit horizontal forces
were applied sequentially at each floor of the FE model, and the resulting displacement field
was assembled. Enforcing equilibrium
and imposing the symmetry constraint
, a dense, full-bandwidth stiffness matrix
was obtained by constrained least-squares minimisation, faithfully capturing shear–flexure interaction, diaphragm eccentricity, and other higher-order coupling effects that conventional tri-diagonal stick models overlook. The pair
thus defines a reduced multi-degree-of-freedom system that reproduces the fundamental natural frequencies and participation factors of the parent FE model while enabling time-history analyses several orders of magnitude faster. Agnostic to structural typology—moment frames, braced cores, or coupled shear walls—and tolerant of non-uniform mass–stiffness distributions, the SDR surrogate furnishes a mechanics-grounded backbone for downstream ML inference across heterogeneous building inventories, combining computational efficiency with accuracy.
Simplified dynamic response calculation
The SDR module produces a coarse-resolution approximation of the floor-wise dynamic response by transforming a high-fidelity finite element model into a simplified mechanical surrogate. By integrating story mass, stiffness, and structural dynamic equations, it embeds key structural parameters into the calculations, streamlining the task of fitting deep learning models to structural responses under external dynamic excitation.
The calculation method leverages a simplified numerical approach, specifically the Newmark-β method, which assumes linear acceleration changes within a time interval. This method incorporates floor displacement and acceleration, as described by the following equations:
Here,
and
are constants. The structural dynamics equation that must be satisfied is:
By substituting equations (1) and (2) into Eq. (3), the equation for
is obtained:
Once
is found, it is introduced back into equations (1) and (2) to calculate
. The damping matrix
uses Rayleigh damping, expressed as:
Parameters α and β are determined by:
where
is the damping ratio, and
and
are the first- and second-order natural frequencies.
The structural dynamics equation is solved to obtain the nonlinear response, considering the damping matrix expressed via Rayleigh damping. This iterative process, based on the structural dynamic information extraction module, allows for accurate estimation of the nonlinear structural response.
SeisGPT core deep learning model
The SeisGPT model incorporates a feature embedding module, a physics-informed structural encoder (PhySE), and a response feature decoder (RFD). Its architecture integrates state-of-the-art techniques from machine learning and structural engineering to model complex interactions within structural systems and capture long-range dependencies in response predictions. The core components of the model include the PhySE, enhanced gated fusion modules, and Transformer blocks, each playing a critical role in learning physics-informed and data-driven representations. A detailed schematic of the SeisGPT architecture is shown in Fig. 6.
Feature embedding module: This module processes multiple data sources, including excitation data, simplified structural response data, and structural matrices (stiffness and mass). It transforms these inputs into a unified latent space utilized by the model’s subsequent layers, particularly the Transformer blocks. The excitation data
and simplified structural response data
were initially passed through linear transformations to embed the temporal sequences, followed by a dropout operation to mitigate overfitting during training.
Time embeddings were then computed for each input sequence, encoding the excitation and simplified structural response data into time-specific representations that capture the temporal dynamics of the inputs. In parallel, floor embeddings were generated by applying linear transformations to the structural data, ensuring consistent encoding across varying numbers of floors and capturing the physical characteristics of the structures.
The encoded features were subsequently processed by the PhySE, which integrates temporal and structural features along with the stiffness matrix
and mass matrix
. The excitation, structural, and simplified response features were fused using the enhanced gated fusion mechanism, allowing the model to dynamically learn meaningful interactions among external excitation, simplified structural response, and building properties. To further capture temporal dependencies, sinusoidal positional encodings were precomputed and applied to the features, enabling the model to effectively represent the temporal structure inherent in the data.
Physics-informed structural encoder: The PhySE is central to incorporating structural knowledge into the learning process, leveraging both graph-based learning and physics-informed feature extraction. The encoder is designed to process the stiffness matrix
and the mass matrix
, which represent key structural properties, and produce feature embeddings that capture the structural response.
The physics-informed graph neural network (PIGNN) incorporates structural domain knowledge into the GNN architecture to enhance the modeling of responses in structural systems. The model constructs a physics-informed graph, where nodes represent structural elements (floors in a building), and edges encode the physical connections between adjacent nodes, such as beams or walls. Node features were derived from the stiffness matrix
and the mass matrix
, where
is the batch size and
is the number of floors. The graph structure is formed by extracting the diagonal elements of the stiffness matrix to represent the stiffness associated with each node.
In the physics-informed graph attention layer, input features first undergo a feature projection step, where a fully connected layer is applied to reduce dimensionality. Each input node feature
is then processed through multiple single-head attention mechanisms. For each attention head
, the feature vector
undergoes a linear transformation:
where
is the weight matrix of the attention head. Edge features are computed as follows:
The attention coefficients are calculated using the following equation:
where
is a learnable weight matrix associated with each attention head, and || denotes vector concatenation. In the subsequent feature aggregation step, each node aggregates information from its neighboring nodes, weighted by the learned attention coefficients:
where 𝜎 denotes an activation function, and Z
represents the set of neighboring nodes of node 𝑖. The outputs from all attention heads were then concatenated to form the updated feature representation for each node. This updated representation was subsequently processed through a layer normalization step and passed through a GELU
45 activation function. Finally, a residual connection adds the processed features back to the original input to complete the computation.
Response feature decoder (RFD) is an advanced transformer-based neural network. It begins with an embedding layer that maps input tokens into higher-dimensional vector representations, expanding the number of channels to
and timesteps to
. This transformation enables the model to address data heterogeneity and handle diverse feature types by representing them in a format suitable for subsequent processing. root mean square (RMS) normalization was then applied to stabilize and normalize the embedded vectors, which was particularly beneficial for time-series data.
where
represents the normalized activation value,
denotes the original activation value, and
indicates the dimension of the token vector.
A key feature of the model is the grouped query attention46 (GQA) mechanism, a variant of the standard multi-head attention. GQA segments queries into groups, with each group sharing a single set of keys and value heads, thus enhancing both performance and memory efficiency. This mechanism plays a critical role in focusing on relevant features by grouping input elements and applying self-attention to identify the most important ones within each group. This prioritization of informative features significantly boosts the model’s performance. Rotary position embedding47 (RoPE) was applied to the queries and keys within the GQA mechanism to effectively encode positional information. The output from this attention layer then passed through a feed-forward network (FFN) that employs the SwiGLU48 (Sigmoid-Weighted Linear Unit) activation function, which has been shown to outperform the traditional ReLU activation function in certain cases.
Residual connections and additional RMS normalization were applied after both the attention and feed-forward layers to facilitate gradient flow and improve model training. This block configuration was repeated N times, indicating the stacking of transformer blocks, which deepens the model and enables it to learn complex patterns. After these repetitions, the output underwent another RMS normalization, followed by a linear transformation to map it to the desired dimensions. This architecture integrated modern techniques such as grouped query attention and SwiGLU, enhancing both performance and efficiency in processing sequential data. The RFD’s advanced design allowed it to effectively manage diverse and complex inputs, making it a powerful tool for real-time structural response prediction.
Fine-Tuning with SeisGPT: While SeisGPT was designed for general structural response prediction, it could be fine-tuned to enhance accuracy for a specific building, provided prior structural response data was available. During the fine-tuning process, the model was adapted to better predict responses for the target building, ensuring improved prediction accuracy for that specific structure. This adaptation was achieved by introducing LoRA layers to all linear layers in the response feature decoder. During fine-tuning, all model parameters, except those in the LoRA layers, were frozen. Only the small parameters in the LoRA layers were updated during training. This approach allowed SeisGPT to efficiently adapt to the structural characteristics of the target building and provide more accurate responses, enhancing the model’s performance in real-world applications where building-specific data was available.
SeisGPT-R: SeisGPT-R augments the SeisGPT framework to enable accurate reconstruction of structural responses from sparse sensor measurements. The model ingests three primary input types: external excitation sequences (
), simplified response predictions (
) generated via the SDR module, and partial observational data (
) derived from sparse sensor instrumentation. To accommodate the sparse and irregular nature of
, dedicated time embedding and floor embedding layers preprocess these sensor inputs, ensuring appropriate temporal and spatial representation.
Critically, SeisGPT-R employs a learned gated fusion mechanism within its physics-informed structural encoder. Specifically, structural representations generated by the physics-informed graph neural network (PIGNN), denoted by
, are adaptively fused with sensor-derived embeddings (
). This fusion occurs at the feature level—after structural topology and dynamics have been encoded by the PIGNN—rather than at the raw input level. The floor-wise fusion is governed by an adaptive gating weight,
, calculated via a linear transformation followed by a sigmoid activation function, as follows:
The resulting fused feature,
, is computed as a weighted combination:
where
W and
b are learnable parameters. This gating mechanism dynamically balances physics-informed structural priors against real observational data based on the reliability and availability of sensor measurements at each floor. Subsequently, the fused representation
is passed through the decoder network to reconstruct detailed structural response profiles. This architecture not only ensures robust spatial continuity and physical coherence of the predicted responses but also provides adaptability in data-limited scenarios, significantly enhancing reconstruction accuracy and generalization across diverse structural typologies and external loading conditions.
Average prediction performance of SeisGPT-Base and baseline models (SeisGRU, SeisLSTM, TimesNet, N-Beats, and Informer) evaluated on 3,000 previously unseen buildings, equally distributed across frame, frame–shear wall, and shear wall structures. Metrics include Floor-wise Normalized Mean Absolute Error (FNMAE), Floor-wise Normalized Root Mean Squared Error (FNRMSE), and Pearson correlation coefficient (R), computed separately for acceleration and displacement predictions. SeisGPT-Base consistently achieves the lowest error and highest correlation across all tasks and typologies, demonstrating superior generalization and fidelity in full-building response modeling.
Evaluation of structural response prediction accuracy for six models—SeisGPT-Enhanced, SeisGRU, SeisLSTM, TimesNet, N-Beats, and Informer—on a test set of 65 real buildings. Metrics reported include Floor-wise Normalized Mean Absolute Error (FNMAE), Floor-wise Normalized Root Mean Squared Error (FNRMSE), and Pearson correlation coefficient (R), computed separately for acceleration and displacement predictions. SeisGPT-Enhanced consistently achieves the lowest prediction errors and highest correlation across both tasks, demonstrating superior alignment with finite element reference responses and improved generalization to real structural systems.
Comparison of three model configurations evaluated on 65 real buildings: SeisGPT-Base (A1), pretrained on the large-scale synthetic building dataset; SeisGPT-Enhanced (A2), obtained by fine-tuning SeisGPT-Base on real building data; and a non-pretrained model (A3), trained from scratch on the same real-world building dataset. Fine-tuning consistently improves predictive accuracy over pretraining alone, while models trained without pretraining exhibit notably higher errors, underscoring the value of large-scale synthetic building dataset for learning transferable structural representations.
This table presents the distribution of structural types and the corresponding number of FEM cases in the dataset used in this study. The statistics include the number of different building structure types as well as the number of cases analyzed for each type using finite element analysis.
This table presents the ranges of design parameters for the generated buildings, produced by the AI-based algorithm used in this study. The parameters include key structural design values and their corresponding ranges.