References
Alexandrov, V., & Hoogenboom, G. (2000). The impact of climate variability and change on crop yield in Bulgaria. Agricultural and Forest Meteorology, 104, 315–327.
Arvor, D., Bégué, A., Dubreuil, V., & Nelson, A. (2023). Global maize yield prediction using machine learning approaches: Evidence from 37 developing countries. arXiv. https://arxiv.org/abs/2312.02254
Asare, E., Aidoo, O. F., & Boateng, E. (2023). Application of random forest for maize yield prediction under varying soil and climatic conditions in Ghana. Frontiers in Sustainable Food Systems, 7, 11403005. https://doi.org/10.3389/fsufs.2023.11403005
Aschonitis, V., Mastrocicco, M., Colombani, N., Salemi, E., Kazakis, N., Voudouris, K., & Castaldelli, G. (2012). Assessment of the intrinsic vulnerability of agricultural land to water and nitrogen losses via deterministic approach and regression analysis. Water, Air, and Soil Pollution, 223, 1605–1614.
Awad, M., & Khanna, R. (2015). Efficient learning machines: Theories, concepts, and applications for engineers and system designers. Apress. https://doi.org/10.1007/978-1-4302-5990-9
Basso, B., & Liu, L. (2019). Seasonal crop yield forecast: Methods, applications, and accuracies. Advances in Agronomy, 154, 201–255. https://doi.org/10.1016/bs.agron.2018.11.002
Belgiu, M., & Drăguţ, L. (2016). Random forest in remote sensing: A review of applications and future directions. ISPRS Journal of Photogrammetry and Remote Sensing, 114, 24–31. https://doi.org/10.1016/j.isprsjprs.2016.01.011
Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324
Cai, Y., Moore, K., Pellegrini, A., Elhaddad, A., Lessel, J., Townsend, C., et al. (2017). Crop yield predictions: High-resolution statistical model for intra-season forecasts applied to corn in the U.S. Gro Intelligence, Inc.
Challinor, A. J., Watson, J., Lobell, D. B., Howden, S., Smith, D., & Chhetri, N. (2014). A meta-analysis of crop yield under climate change and adaptation. Nature Climate Change, 4, 287–291.
Chen, L., Wang, J., Sun, Y., & Huang, Z. (2025). UAV-based multispectral imagery and machine learning for maize yield prediction. Computers and Electronics in Agriculture, 224, 109050. https://doi.org/10.1016/j.compag.2025.109050
Chen, T., & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 785–794). ACM. https://doi.org/10.1145/2939672.2939785
Chen, X., Li, J., & Wang, Y. (2025). Machine learning-based prediction of maize yield using socioeconomic and climatic data. Agricultural Systems, 209, 103721. https://doi.org/10.1016/j.agsy.2025.103721
Cheng, E., Zhang, B., Peng, D., Zhong, L., Yu, L., Liu, Y., Xiao, C., Li, C., Li, X., Chen, Y., Ye, H., Wang, H., Yu, R., Hu, J., & Yang, S. (2022). Wheat yield estimation using remote sensing data based on machine learning approaches. Frontiers in Plant Science, 13, 1090970. https://doi.org/10.3389/fpls.2022.1090970
Chicco, D., & Jurman, G. (2020). The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation. BMC Genomics, 21(1), 6. https://doi.org/10.1186/s12864-019-6413-7
Chlingaryan, A., Sukkarieh, S., & Whelan, B. (2018). Machine learning approaches for crop yield prediction and nitrogen status estimation in precision agriculture: A review. Computers and Electronics in Agriculture, 151, 61–69. https://doi.org/10.1016/j.compag.2018.05.012
Cover, T. M., & Hart, P. E. (1967). Nearest neighbor pattern classification. IEEE Transactions on Information Theory, 13(1), 21–27. https://doi.org/10.1109/TIT.1967.1053964
Crane-Droesch, A. (2018). Machine learning methods for crop yield prediction and climate change impact assessment in agriculture. Environmental Research Letters, 13(11), 114003. https://doi.org/10.1088/1748-9326/aae159
Dietterich, T. G. (2000). Ensemble methods in machine learning. In Multiple Classifier Systems (pp. 1–15). Springer.
Drummond, S. T., Sudduth, K. A., Joshi, A., Birrell, S. J., & Kitchen, N. R. (2003). Statistical and neural methods for site-specific yield prediction. Transactions of the ASAE, 46(1), 5–14. https://doi.org/10.13031/2013.12541
Fan, J., Chen, X., & Zhang, Y. (2023). Machine learning-based crop yield prediction: A comparative study of XGBoost, Random Forest, and deep learning models. Computers and Electronics in Agriculture, 210, 107999. https://doi.org/10.1016/j.compag.2023.107999
Fix, E., & Hodges, J. L. (1951). Discriminatory analysis: Nonparametric discrimination—Consistency properties (Technical Report No. 4). USAF School of Aviation Medicine.
Freund, Y., & Schapire, R. E. (1997). A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1), 119–139. https://doi.org/10.1006/jcss.1997.1504
Friedman, J. H. (2001). Greedy function approximation: A gradient boosting machine. Annals of Statistics, 29(5), 1189–1232. https://doi.org/10.1214/aos/1013203451
Fukuda, S., Spreer, W., Yasunaga, E., Yuge, K., Sardsud, V., & Müller, J. (2013). Random Forests modeling for the estimation of mango (Mangifera indica L. cv. Chok Anan) fruit yields under different irrigation regimes. Agricultural Water Management, 116, 142–150. https://doi.org/10.1016/j.agwat.2012.07.003
Gao, J., Zhang, Y., Feng, P., Liu, Y., & Li, X. (2023). Maize yield prediction with machine learning, spectral variables, and irrigation management. Computers and Electronics in Agriculture, 205, 107624. https://doi.org/10.1016/j.compag.2023.107624
González Sánchez, A., Frausto Solís, J., & Ojeda Bustamante, W. (2014). Predictive ability of machine learning methods for massive crop yield prediction. Spanish Journal of Agricultural Research, 12(2), 313–328. https://doi.org/10.5424/sjar/2014122-4439
Habib-ur-Rahman, M., Ahmad, A., Raza, A., Hasnain, M. U., Alharby, H. F., Alzahrani, Y. M., Bamagoos, A. A., Hakeem, K. R., Ahmad, S., Nasim, W., Ali, S., Mansour, F., & El Sabagh, A. (2022). Impact of climate change on agricultural production; Issues, challenges, and opportunities in Asia. Frontiers in Plant Science, 13, 925548. https://doi.org/10.3389/fpls.2022.925548
Hagan, M. T., & Menhaj, M. B. (1994). Training feedforward networks with the Marquardt algorithm. IEEE Transactions on Neural Networks, 5(6), 989–993. https://doi.org/10.1109/72.329697
Han, L., Yang, G., Feng, H., Zhou, C., & Yang, H. (2020). Hyperspectral-based prediction of wheat yield using machine learning techniques. Remote Sensing, 12(9), 1–20.
Hoerl, A. E., & Kennard, R. W. (1970). Ridge regression: Biased estimation for nonorthogonal problems. Technometrics, 12(1), 55–67.
Hoogenboom, G., White, J. W., & Messina, C. D. (2004). From genome to crop: Integration through simulation modeling. Field Crops Research, 90(1), 145–163. https://doi.org/10.1016/j.fcr.2004.07.014
Hornik, K., Stinchcombe, M., & White, H. (1989). Multilayer feedforward networks are universal approximators. Neural Networks, 2(5), 359–366. https://doi.org/10.1016/0893-6080(89)90020-8
Iizumi, T., & Ramankutty, N. (2015). How do weather and climate influence cropping area and intensity? Global Food Security, 4, 46–50.
James, G., Witten, D., Hastie, T., & Tibshirani, R. (2013). An introduction to statistical learning: With applications in R. Springer. https://doi.org/10.1007/978-1-4614-7138-7
Jeong, J. H., Resop, J. P., Mueller, N. D., Fleisher, D. H., Yun, K., Butler, E. E., et al. (2016). Random forests for global and regional crop yield predictions. PLoS ONE, 11(6), e0156571. https://doi.org/10.1371/journal.pone.0156571
Karimi, Y., Prasher, S., Madani, A., & Kim, S. (2008). Application of support vector machine technology for the estimation of crop biophysical parameters using aerial hyperspectral observations. Canadian Biosystems Engineering, 50(7), 13–20.
Kharal, A. S., Mahar, S. A., Mushtaque, M. I., Magsi, A., & Mahar, J. A. (2024). A model for wheat yield prediction to reduce the effect of climate change using support vector regression. VFAST Transactions on Software Engineering, 12(2), 192–212. https://doi.org/10.21015/vtse.v12i2.1855
Kim, S., & Kim, H. (2016). A new metric of absolute percentage error for intermittent demand forecasts. International Journal of Forecasting, 32(3), 669–679.
Kingsley, J., Afu, S. M., Isong, I. A., Chapman, P. A., Kebonye, N. M., & Ayito, E. O. (2021). Estimation of soil organic carbon distribution by geostatistical and deterministic interpolation methods: A case study of the southeastern soils of Nigeria. Environmental Engineering and Management Journal, 20, 1077–1085.
Kouadio, L., Deo, R. C., Byrareddy, V., Adamowski, J. F., Mushtaq, S., & Nguyen, V. P. (2018). Artificial intelligence approach for the prediction of Robusta coffee yield using soil fertility properties. Computers and Electronics in Agriculture, 155, 324–338.
Kuhn, M., & Johnson, K. (2013). Applied predictive modeling. Springer.
LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. Nature, 521(7553), 436–444. https://doi.org/10.1038/nature14539
Li, B., Yang, W., & Li, X. (2018). Application of combined model with DGM(1,1) and linear regression in grain yield prediction. Grey Systems: Theory and Application, 8(1), 25–34. https://doi.org/10.1108/GS-07-2017-0020
Li, H., Wang, J., & Zhao, Y. (2023). Comparative performance of machine learning algorithms for crop yield prediction: Evidence from support vector regression and ensemble methods. Agricultural Systems, 207, 103613. https://doi.org/10.1016/j.agsy.2023.103613
Li, T., Zhou, Y., Li, X., Wu, J., & He, T. (2019). Forecasting daily crude oil prices using improved CEEMDAN and ridge regression-based predictors. Energies, 12(19), Article 3603. https://doi.org/10.3390/en12193603
Li, Y., Zhang, H., & Liu, Q. (2025). Application of XGBoost model and multi-source data for winter wheat yield prediction in Henan Province of China. Computers and Electronics in Agriculture, 215, 108528. https://doi.org/10.1016/j.compag.2025.108528
Lischeid, G., Webber, H., Sommer, M., Nendel, C., & Ewert, F. (2022). Machine learning in crop yield modeling: A powerful tool, but no surrogate for science. Agricultural and Forest Meteorology, 312, 108698. https://doi.org/10.1016/j.agrformet.2021.108698
Maseko, S., Van Der Laan, M., Tesfamariam, E. H., Delport, M., & Otterman, H. (2024). Evaluating machine learning models and identifying key factors influencing spatial maize yield predictions in data intensive farm management. European Journal of Agronomy, 157, 127193. https://doi.org/10.1016/j.eja.2024.127193
Masocha, M., Mutanga, O., & Sibanda, M. (2023). Integrating UAV imagery and machine learning for maize yield prediction across growth stages in South Africa. Remote Sensing Applications: Society and Environment, 29, 100982. https://doi.org/10.1016/j.rsase.2023.100982
Maulud, D., & Abdulazeez, A. M. (2020). A review on linear regression comprehensive in machine learning. Journal of Applied Science and Technology Trends, 1(2), 140–147. https://doi.org/10.38094/jastt1457
Mehdizadeh, S., Behmanesh, J., & Khalili, K. (2017). Using MARS, SVM, GEP and empirical equations for estimation of monthly mean reference evapotranspiration. Computers and Electronics in Agriculture, 139, 103–114. https://doi.org/10.1016/j.compag.2017.05.002
Miller, T., Rahman, A., & Ghosh, S. (2023). Neural networks for crop yield prediction: A comparative analysis with machine learning models. Computers and Electronics in Agriculture, 208, 108012. https://doi.org/10.1016/j.compag.2023.108012
Mitchell, R., & Frank, E. (2017). Accelerating the XGBoost algorithm using GPU computing. PeerJ Computer Science, 3, e127. https://doi.org/10.7717/peerj-cs.127
Mohammadi, K., Shamshirband, S., Motamedi, S., Petković, D., Hashim, R., & Gocić, M. (2015). Extreme learning machine based prediction of daily dew point temperature. Computers and Electronics in Agriculture, 117, 214–225. https://doi.org/10.1016/j.compag.2015.08.008
Montesinos López, O. A., Montesinos López, A., & Crossa, J. (2022). Overfitting, model tuning, and evaluation of prediction performance. In O. A. Montesinos López, A. Montesinos López, & J. Crossa, Multivariate statistical machine learning methods for genomic prediction (pp. 109–139). Springer International Publishing. https://doi.org/10.1007/978-3-030-89010-0_4
Montgomery, D. C., Peck, E. A., & Vining, G. G. (2012). Introduction to linear regression analysis (5th ed.). Wiley.
Morellos, A., Pantazi, X.-E., Moshou, D., Alexandridis, T., Whetton, R., Tziotzios, G., et al. (2016). Machine learning-based prediction of soil total nitrogen, organic carbon, and moisture content using VIS-NIR spectroscopy. Biosystems Engineering, 152, 104–116. https://doi.org/10.1016/j.biosystemseng.2016.04.018
Mutanga, O., Adam, E., & Cho, M. (2012). High density biomass estimation for wetland vegetation using WorldView-2 imagery and random forest regression algorithm. International Journal of Applied Earth Observation and Geoinformation, 18, 399–406. https://doi.org/10.1016/j.jag.2012.03.012
Nagelkerke, N. J. D. (1991). A note on a general definition of the coefficient of determination. Biometrika, 78(3), 691–692.
Naik, J., Satapathy, P., & Dash, P. (2018). Short-term wind speed and wind power prediction using hybrid empirical mode decomposition and kernel ridge regression. Applied Soft Computing, 70, 1167–1188. https://doi.org/10.1016/j.asoc.2018.06.008
Pantazi, X. E., Moshou, D., Alexandridis, T., Whetton, R. L., & Mouazen, A. M. (2016). Wheat yield prediction using machine learning and advanced sensing techniques. Computers and Electronics in Agriculture, 121, 57–65. https://doi.org/10.1016/j.compag.2015.11.018
Pham, H., & Olafsson, S. (2019a). Bagged ensembles with tunable parameters. Computational Intelligence, 35(1), 184–203. https://doi.org/10.1111/coin.12198
Pham, H., & Olafsson, S. (2019b). On Cesaro averages for weighted trees in the random forest. Journal of Classification, 1–14. https://doi.org/10.1007/s00357-019-09322-8
Romeijn, H., Faggian, R., Diogo, V., & Sposito, V. (2016). Evaluation of deterministic and complex analytical hierarchy process methods for agricultural land suitability analysis in a changing climate. ISPRS International Journal of Geo-Information, 5, 99.
Rosenberg, N. J. (1992). Adaptation of agriculture to climate change. Climatic Change, 21, 385–405.
Ruane, A. C., Major, D. C., Winston, H. Y., Alam, M., Hussain, S. G., Khan, A. S., Hassan, A., Al Hossain, B. M. T., Goldberg, R., & Horton, R. M. (2013). Multi-factor impact analysis of agricultural production in Bangladesh with climate change. Global Environmental Change, 23, 338–350.
Sajedi-Hosseini, F., Malekian, A., Choubin, B., Rahmati, O., Cipullo, S., Coulon, F., et al. (2018). A novel machine learning-based approach for the risk assessment of nitrate groundwater contamination. Science of the Total Environment, 644, 954–962. https://doi.org/10.1016/j.scitotenv.2018.07.054
Saunders, C., Gammerman, A., & Vovk, V. (1998). Ridge regression learning algorithm in dual variables. In Proceedings of the 15th International Conference on Machine Learning (pp. 515–521). Morgan Kaufmann.
Schmidhuber, J. (2015). Deep learning in neural networks: An overview. Neural Networks, 61, 85–117.
Shahhosseini, M., Hu, G., & Pham, H. (2019a). Optimizing ensemble weights and hyperparameters of machine learning models for regression problems. arXiv:1908.05287.
Shahhosseini, M., Hu, G., & Pham, H. (2019b). Optimizing ensemble weights for machine learning models: A case study for housing price prediction. In H. Yang, R. Qiu, & W. Chen (Eds.), Smart service systems, operations management, and analytics (pp. 1–14). Springer. https://doi.org/10.1007/978-3-030-30967-1_9
Shekoofa, A., Emam, Y., Shekoufa, N., Ebrahimi, M., & Ebrahimie, E. (2014). Determining the most important physiological and agronomic traits contributing to maize grain yield through machine learning algorithms: A new avenue in intelligent agriculture. PLoS ONE, 9(5), e97288. https://doi.org/10.1371/journal.pone.0097288
Smola, A. J., & Schölkopf, B. (2004). A tutorial on support vector regression. Statistics and Computing, 14(3), 199–222. https://doi.org/10.1023/B:STCO.0000035301.49549.88
Subedi, B., Poudel, A., & Aryal, S. (2023). The impact of climate change on insect pest biology and ecology: Implications for pest management strategies, crop production, and food security. Journal of Agriculture and Food Research, 14, 100733. https://doi.org/10.1016/j.jafr.2023.100733
Tadesse, T., Demisse, G. B., Zaitchik, B., et al. (2018). Building resilience to food insecurity in data-scarce regions. Agricultural and Forest Meteorology, 262, 402–413.
Tao, F., Li, Y., Wei, Y., Zhang, C., & Zuo, Y. (2025). Data–model fusion methods and applications toward smart manufacturing and digital engineering. Engineering, S2095809925000244. https://doi.org/10.1016/j.eng.2024.12.034
Thornton, P. K., Jones, P. G., Alagarswamy, G., & Andresen, J. (2009). Spatial variation of crop yield response to climate change in East Africa. Global Environmental Change, 19, 54–65.
Vincenzi, S., Zucchetta, M., Franzoi, P., Pellizzato, M., Pranovi, F., De Leo, G. A., et al. (2011). Application of a Random Forest algorithm to predict spatial distribution of the potential yield of Ruditapes philippinarum in the Venice lagoon, Italy. Ecological Modelling, 222(8), 1471–1478. https://doi.org/10.1016/j.ecolmodel.2011.02.007
Willmott, C. J., & Matsuura, K. (2005). Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance. Climate Research, 30, 79–82.
Wolpert, D. H. (1992). Stacked generalization. Neural Networks, 5(2), 241–259. https://doi.org/10.1016/S0893-6080(05)80023-1
Yang, Q., Li, P., & Sun, J. (2024). Deep learning approaches for crop yield prediction: Challenges and opportunities. Agricultural Systems, 212, 103754. https://doi.org/10.1016/j.agsy.2024.103754
Yang, S., Li, L., Fei, S., Yang, M., Tao, Z., Meng, Y., & Xiao, Y. (2024). Wheat yield prediction using machine learning method based on UAV remote sensing data. Drones, 8(7), 284. https://doi.org/10.3390/drones8070284
Zhang, C., & Ma, Y. (Eds.). (2012). Ensemble machine learning: Methods and applications. Springer.
Zhang, G. P., Patuwo, B. E., & Hu, M. Y. (1998). Forecasting with artificial neural networks: The state of the art. International Journal of Forecasting, 14(1), 35–62. https://doi.org/10.1016/S0169-2070(97)00044-7
Zhao, L., Wang, H., & Li, P. (2024). Enhancing crop yield prediction with XGBoost and remote sensing data: Evidence from maize and wheat systems. Agricultural Systems, 212, 103753. https://doi.org/10.1016/j.agsy.2024.103753
Zou, H., & Hastie, T. (2005). Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B, 67(2), 301–320.