Operational Sentinel-2 Based System for Near Real-Time Irrigated Area Monitoring in the Limpopo River Basin

*Correspondence: Z.Kiala@cgiar.org

Zolo Kiala¹*, Karthikeyan Matheswaran², Chris Dickens², Mariangel Garcia Andarcia², Fulco Ludwig², Surajit Ghosh²

¹International Water Management Institute, Block G Ground Floor, Hatfield Gardens, 333 Grosvenor St, Hatfield, Pretoria, 0083, South Africa

²International Water Management Institute, 127 Sunil Mawatha, Battaramulla, Sri Lanka 10120

Abstract

Monitoring irrigated agriculture is critical in water-scarce regions like the Limpopo River Basin (LRB), where irrigation consumes significant freshwater. This study presents a scalable, semi-supervised machine-learning framework for monthly mapping of irrigated croplands and water use in the LRB from 2019 to 2024. The framework combines Sentinel-2 imagery, Random Forest classification, time-lagged precipitation–vegetation analysis, and slope masking for monthly irrigation mapping. FAO’s WaPOR evapotranspiration data was used to estimate water use. At 10 m resolution, the framework achieves 80% accuracy (κ = 0.60), capturing smallholder plots and seasonal dynamics. Dry-season irrigated area declined from ~ 211,281 ha in 2019 to ~ 184,771 ha in 2024, while water use rose from ~ 103 to ~ 134 million m³, indicating intensifying demand. Irrigation concentrates in key sub-basins, with potential for sustainable expansion if water is available. The methodology highlights the effectiveness of combining high-resolution imagery with time-lagged analysis for accurate irrigation mapping. Its open-access implementation via Google Earth Engine offers a replicable model for water-stressed transboundary basins, enhancing resource management and resilience in climatically vulnerable regions.

Keywords:

CHIRPS

Digital Twin

Google Earth Engine

irrigated agriculture

Random Forest

Sentinel-2

WaPOR

1. Introduction

Irrigated agriculture accounts for ~ 70% of global freshwater withdrawals and is a critical pillar of food and livelihood security (Ozdogan et al. 2010). The challenges of irrigation are particularly acute in water-scarce basins, where factors such as population growth, changes in dietary shifts, and climate variability are driving the increased demand. The Limpopo River Basin (LRB) is one of Southern Africa’s major transboundary river systems. It is home to 18 million predominantly rural inhabitants and experiences severe water stress and prolonged dry spells, declining baseflows, and rising temperatures (Sitoe and Qwist-Hoffman 2013; Dickens et al. 2020). Irrigation supports employment and agricultural production. Between 2000 and 2012, annual water consumption in the irrigation sector more than doubled, from 1,400 to 3,000 10⁶ m³ straining already over-allocated sub-basins of the LRB (Kapangaziwiri et al. 2021).

The LRB hosts both large-scale commercial farms and small-holder irrigation schemes. Subsistence agriculture is also common. In this setting, near real-time irrigation monitoring systems are essential. They help track where and when irrigation occurs, as well as the amount of water used (Magidi et al. 2021). Previous studies have mostly focused on historical irrigation estimates, which offer limited applicability for timely decision-making during the current growing season (Senay et al. 2016; Mahapatra et al. 2024). Furthermore, conventional approaches often require significant field inputs and ground data collected during the irrigation season, constraining their scalability and responsiveness (Thenkabail et al. 2009). Near real-time monitoring can capture the progression of irrigated areas and water use, providing critical, timely insights to basin managers, farmers, and policymakers. This enables them to track agricultural activities, identify fallow lands, and optimize water allocations on a season-by-season basis, supporting sustainable water management in the face of growing demand and climate variability. Implementing such systems in the Limpopo can empower stakeholders to respond adaptively to water scarcity, improving resource use efficiency and resilience across both commercial and smallholder farm contexts.

Remote sensing based irrigated area mapping focused on coarse to medium-resolution imagery such as AVHRR (Thenkabail et al. 2009), MODIS (Ozdogan and Gutman 2008), and Landsat (McAllister et al. 2015) to differentiate irrigated from rainfed systems using vegetation indices and seasonal phenology. These approaches proved effective for large-scale mapping but struggled in heterogeneous agricultural landscapes. In sub-Saharan Africa, where smallholder plots often measure less than one hectare, coarse spatial resolutions (> 250 m) systematically under-estimated irrigated area (Thenkabail et al. 2009). High-resolution sensors such as SPOT (Vogels et al. 2019), and more recently Sentinel-2, have addressed some of these limitations by offering finer discrimination of crop phenology and water use dynamics (Magidi et al. 2021; Zurqani et al. 2021). Despite these advances, major challenges remain: fragmented fields, intra-seasonal variability, and informal irrigation practices continue to evade detection in many African contexts. This underscores the need for operational systems that combine high spatial and temporal resolution imagery with robust classification frameworks to generate actionable, near real-time irrigation intelligence.

In the LRB, prior mapping efforts have advanced understanding but remain constrained by scale and frequency. Van Niekerk et al. (2018) estimated evapotranspiration at 250 m resolution using Landsat-8 and Earth Observation datasets, while (Mpakairi et al. 2023) applied Sentinel-2 and advanced classifiers to separate irrigated from rainfed areas at finer resolution. Cai et., al (2017) developed a methodology combining Landsat imagery, vegetation indices, and classification techniques to map irrigated areas in South Africa’s Limpopo Province, demonstrating moderate success while emphasizing challenges in distinguishing irrigated from rainfed agriculture due to spatial heterogeneity and limited ground-truth data. However, neither approaches captured monthly changes across the basin nor linked mapped extents directly to quantified water use, leaving a critical gap for water allocation planning. Smallholder schemes, often under a few hectares, remain especially under-represented, despite their significance in local livelihood security, food production and groundwater abstraction.

Cloud based remote sensing processing platforms like Google Earth Engine (GEE) now enable basin-scale, high-resolution, and temporally consistent irrigation monitoring (Gorelick et al. 2017). Platforms like FAO’s WaPOR provide analysis ready crop water use data at 100 m resolution (FAO 2019) to support near real-time analytics over larger areas. Machine learning approaches for mapping irrigated agriculture, including Random Forest and deep learning, have achieved strong performance in other African basins (Mpakairi et al. 2023; Mohammedshum et al. 2023). Hybrid frameworks integrating multi-sensor data and terrain models can further improve classification accuracy. Currently, the LRB lacks an operational, high-resolution monitoring system that can map irrigated areas monthly at sub-field scale, capture interannual trends in irrigated extent; and quantify corresponding water use to inform allocation strategies.

This study addresses this gap by developing and validating a fully automated, semi-supervised machine learning framework for monthly mapping of irrigated areas and estimation of dry-season water use in the LRB from 2019 to 2024. Implemented as part of the basin’s Digital Twin initiative (https://digitaltwins.demos-only.iwmi.org/), the system integrates Sentinel-2 imagery, Random Forest classification, time-lagged precipitation–vegetation relationships, and slope-based masking to delineate irrigation without reliance on in-season field data. Water consumption is quantified using WaPOR evapotranspiration products, enabling direct linkage between mapped extent and use.

2. Materials and Methods

2.1 Study area

The LRB, located between latitudes 22°S-26°S and longitudes 26°E-35°E, is the fourth-largest transboundary river basin in Southern Africa, draining 416,300 km² (Fig. 1). It is shared by Botswana (19%), Zimbabwe (15%), South Africa (45%), and Mozambique (21%). The LRB hosts 18 million people, predominantly in rural areas and dependent on rainfed agriculture, which is projected to grow to 20 million by 2040 (Dickens et al. 2020). In 2007, South Africa held the largest share of the Limpopo River Basin’s population with about 10.7 million people, while Botswana, though smaller in absolute terms with about 1.2 million people, had a striking concentration, with nearly 69% of its national population residing within the basin (Limpopo Basin Permanent Technical Committee 2010). Economic activities span mining, irrigated agriculture, tourism, energy, forestry, and manufacturing, with irrigated farming the second-largest employer after mining (Sitoe and Qwist-Hoffman 2013). With an average elevation of ~ 840 m, the LRB spans diverse climatic, topographic, and ecological zones. The LRB exhibits strong spatial variability, transitioning from arid conditions in the west with annual rainfall around 200 mm to semi-arid and wetter temperate climates towards the central and eastern regions receiving up to 1,500 mm annually (Limpopo Basin Permanent Technical Committee 2010; Sitoe and Qwist-Hoffman 2013). The mean annual precipitation is approximately 530 mm, with isolated sub-humid pockets in the central basin. Rainfall follows a distinct east–west and north–south gradient, increasing towards the Mozambique coastal plains, and inner parts of South Africa. The wet season extends from October to April, peaking in December through February (> 100 mm/month), while the dry season spans May to September, with minimal rainfall (< 20 mm/month) (Fig. 2). Temperature extremes are notable, with summer daytime temperatures frequently exceeding 40°C, while winter nights can fall to 0°C with profound implications for agricultural activities and associated livelihoods.

Shallow sandy soils dominate the northern parts of LRB, moderately deep sandy to sandy clay loams characterise the south, and deeper sandy soils are prevalent in western and eastern zones. Floodplains in Mozambique have loamy and clay soils, though overall soil water retention remains low. The LRB’s geology comprises significant formations such as the Limpopo Mobile Belt, Kalahari Craton, Karoo System, and Bushveld Igneous Complex (Limpopo Basin Permanent Technical Committee 2010).

Fig. 1

Map of the Limpopo River Basin showing basin boundaries, major rivers (Limpopo in blue), and key urban centers. Elevation ranges from 251 m (light yellow) to 2,320 m (dark brown), sourced from Copernicus GLO-30 DEM (2021).

Fig. 2

Long term (1981–2024) mean monthly rainfall in the LRB computed from CHIRPS data. Blue bars represent the wet season (October–April) and red bars denote the dry season (May–September).

2.2 Methodology

This study presents a fully automated, semi-supervised approach for mapping irrigated areas in the LRB. The workflow consists of five main steps:

High confidence cropland delineation.

Generation of reference data from known irrigated areas.

Model development and classification.

Post-classification filtering using time-lagged regression and terrain slope.

Validation through ground-truth data.

The methodology was applied for each dry-season month from 2019 to 2024. All data processing and analysis were performed using the Google Earth Engine (GEE) cloud computing platform. An interactive web application that implements the methodology is available at: https://huggingface.co/spaces/IWMIHQ/irrigated-agriculture-extractor. This application allows for greater customisation of the methodology and facilitates its replication in other basins with similar environmental conditions.

2.2.1 Cropland Delineation

The first step involved defining the cropland extent within which irrigation mapping was conducted. For the South African portion of the LRB, cropland boundaries provided by the Department of Agriculture, Land Reform and Rural Development (DALRRD) (released in April 2017) were used. These cropland boundaries were further refined using updated field extents from the 2022 South African National Land Cover (SANLC) dataset (Zakariyyaa 2025), which was considered more accurate than available global land cover products. For the remaining parts of the basin, covering Botswana, Mozambique, and Zimbabwe, national datasets of comparable accuracy were not available. Therefore, the European Space Agency’s (ESA) 10-meter resolution global land cover product (2021 version) was used to delineate cropland areas in these regions (Zanaga et al. 2022). The ESA 10m cropland map in these three countries was further validated at 100 random points with high resolution Google Earth basemap. The ESA 10 m cropland map was combined with cropland boundaries from SANLC dataset to produce the LRB cropland base map.

2.2.2 Generation of Reference Data

To train the classification model, reference data for irrigated and non-irrigated areas were derived using an irrigation-sensitive vegetation index.

Irrigation Index (Normalized Green Index - NGI)

Monthly median composites of Sentinel-2 (S2) imagery were generated, and two vegetation indices, the Normalized Difference Vegetation Index (NDVI) and the Green Index (GI), were calculated for each composite. The product of these indices produced the Normalized Green Index (NGI), a proxy for vegetation vigour and phenology (Pun et al. 2017):

NGI = NDVI × GI

(1)

The NGI enhances the differentiation between irrigated and non-irrigated croplands, especially during the dry season, when irrigation has the most pronounced effect on vegetation greenness.

Sampling Irrigated and Non-Irrigated Pixels

From the Normalized Green Index (NGI) layer, the following methodology was applied:

The top 10% of pixel values, representing the healthiest vegetation, were classified as irrigated croplands.

The bottom 10% were labelled as non-irrigated croplands.

These thresholds reflect the assumption that strong vegetation vigor during the dry season is primarily sustained by irrigation (Cai et al. 2017). Class-specific binary masks were created using Google Earth Engine’s percentileReducer and reduceRegion functions. The masks were then polygonized, and a random sample of 100 polygons per class was selected to form the reference training dataset. This process negates the need for prior data collection and adapts the existing approach for near real time irrigated area mapping and monitoring applications.

Figure 3 illustrates the workflow for deriving reference data from the NGI: green indicates irrigated fields, while red indicates non-irrigated fields.

Fig. 3

Workflow for generating reference data from the Normalized Green Index (NGI).

2.2.3 Model Development

Monthly Sentinel-2 Level-2A surface reflectance imagery was processed on Google Earth Engine (GEE) to produce stable inputs for irrigated area classification. To ensure data quality, a cloud removal procedure was first applied using the QA60 cloud mask band, effectively excluding cloud-contaminated pixels. From the resulting cloud-free imagery, monthly median composites were generated within the cropland mask to minimize the effects of atmospheric variability and residual cloud contamination, yielding cleaner and more reliable datasets for subsequent analyses.

From these composites, the extracted features consisted of the raw Sentinel-2 spectral bands (B2, B3, B4, B5, B6, B7, B8, B8A, B11, and B12) and a suite of derived vegetation and spectral indices (NDVI, NDBI, GI, CI, GCVI, LSWI, EVI, and BSI). These layers were then overlaid with the NGI-derived reference data, and the corresponding raster reflectance values were extracted at every reference location. This procedure ensured that each reference sample was linked with a complete set of spectral and biophysical attributes, combining raw reflectance, vegetation indices, and class labels (irrigated vs. non-irrigated). Together, these inputs formed the feature dataset used to train the classification model.

The extracted feature dataset was then used to train a Random Forest (RF) classifier (Breiman 2001). RF is an ensemble learning algorithm that constructs multiple decision trees on bootstrapped subsets of the training data and combines their predictions through majority voting. In this study, the RF was configured with 100 trees, a parameter choice that balanced computational efficiency with classification stability, and a random seed of 42 was set to ensure reproducibility. RF was chosen for its robustness against overfitting, its ability to handle high-dimensional feature spaces, and its proven effectiveness in remote sensing applications (Breiman 2001). The trained model produced both binary classification outputs (irrigated/non-irrigated) and class probabilities, enabling confidence-based post-classification analysis and supporting accurate, large-scale mapping of irrigated croplands.

The detailed workflow for classifying irrigated areas is outlined in Fig. 4.

Fig. 4

Workflow for mapping irrigated areas using Sentinel-2 bands and indices, Random Forest classification, and in-situ accuracy assessment

2.2.4 Post-Classification Refinement

To enhance the accuracy of the irrigated class and reduce false positives, particularly from perennial crops or terrain-induced anomalies, two post-classification refinement techniques were applied to the LRB irrigated maps which consist of time-lagged regression and slope masking.

Time-Lagged Regression

Within the Limpopo River Basin, certain non-irrigated perennial crops and plantation forests present challenges for irrigation classification. Perennial fruit orchards such as avocado and citrus can maintain high vegetation vigour during the dry season due to their deep rooting systems. In addition, isolated sub-humid pockets in the central basin sustain relatively high soil moisture levels, allowing orchards and some rainfed crops to remain greener for longer than in surrounding semi-arid zones. Furthermore, plantations of Acacia spp., particularly around Tzaneen in Limpopo, retain green foliage during the dry season because of their access to deeper soil water reserves. These conditions can cause orchards and Acacia plantations to be misclassified as irrigated croplands when relying solely on vegetation indices. To reduce such errors, a time-lagged regression analysis was applied to evaluate the strength of the relationship between vegetation dynamics (NDVI) and precipitation. Areas where vegetation greenness showed a strong seasonal correlation with rainfall were flagged as rainfed, whereas persistently green areas without rainfall were more confidently identified as irrigated. This adjustment was critical for refining the accuracy of irrigation mapping across the basin, particularly in regions where perennial crops and plantation forestry complicate classification.

A cross-correlation function (CCF) method (Box et al. 2015) was applied using Google Earth Engine (GEE) to evaluate the lagged response of NDVI (derived from Landsat-8) to precipitation data from CHIRPS (Funk et al. 2015), calculated over 16-day intervals. Pixels exhibiting a positive correlation coefficient (r > 0.35) were interpreted as rainfed and subsequently removed from the irrigated class. This threshold was determined empirically via sensitivity trials to balance omission and commission errors.

Slope Masking

Given the technical and economic constraints associated with irrigating steep terrain, a slope-based mask was applied to exclude areas with slopes exceeding 5 degrees. This threshold aligns with previous recommendations for irrigation suitability mapping (Ragettli et al. 2018). Slope values were derived from the Copernicus GLO-30 Digital Elevation Model, ensuring consistency with high-resolution terrain data across the basin.

2.2.5 Model validation

Model performance was evaluated using three complementary approaches. First, independent in-situ validation data collected in September 2024 served as the primary benchmark. September is a critical period in the study region because it coincides with the late dry season, when rainfall-dependent croplands typically exhibit vegetation stress or senescence, while irrigated fields remain green and productive. This contrast maximizes the separability between irrigated and non-irrigated croplands, providing a robust temporal window for validation.

Second, internal testing datasets were used to calculate standard classification metrics, including overall accuracy, confusion matrix, and producer’s and user’s accuracies. These metrics offered a quantitative assessment of model reliability and helped to identify potential sources of misclassification between irrigated and non-irrigated classes.

Finally, model outputs were compared against the locations of known irrigation schemes, encompassing both smallholder and commercial systems. This step provided an additional qualitative assessment by verifying whether the classified irrigated areas corresponded with established irrigation infrastructure, thereby strengthening confidence in the model’s applicability for operational monitoring.

2.3 Ground-Truth Data Collection

A field validation campaign was conducted from 30 September to 5 October 2024 in Limpopo Province, South Africa, to evaluate the accuracy of irrigated area maps. Due to logistical constraints, field validation focused primarily on South Africa, which may limit the generalizability of findings to other LRB countries, including Botswana, Mozambique, and Zimbabwe. Data were collected from 120 agricultural plots using the KoboToolbox mobile data collection platform. The dataset included details on irrigation presence and type (e.g., center-pivot, drip, canal, sprinkler), crop type (e.g., maize, potatoes, citrus, avocado, cabbage), and water source (e.g., groundwater, rivers, canals).

Additionally, ten semi-structured interviews with local farmers provided qualitative insights into irrigation practices. The results revealed that commercial farms predominantly used center-pivot irrigation systems, while smallholder farmers employed a variety of irrigation methods. At the time of the survey, most crops were in the growing or maturity stages, with groundwater identified as the primary irrigation water source.

Figure 5 presents representative photographs from field observations of agricultural land. Images (a) and (c) depict non-irrigated crops, with (a) showing sparse, dry trees on reddish soil and (c) displaying a fenced area with minimal vegetation. In contrast, images (b) and (d) illustrate irrigated crops, with (b) showcasing a lush green field and (d) highlighting well-developed cabbage rows with visible irrigation tubing.

Fig. 5

Field observation photographs of non-irrigated (a, c) and irrigated crops (b, d)

To supplement the in-situ validation data, an additional set of 70 reference samples was derived through visual interpretation of a high-resolution PlanetScope composite (4.77 m spatial resolution, four spectral bands) accessed via Google Earth Engine (GEE). These samples were in areas overlapping with known irrigation schemes, including both smallholder and commercial systems, thereby ensuring that the training and validation datasets reflected the full diversity of irrigation practices in the Limpopo River Basin. Of the PlanetScope-derived samples, 38 were classified as non-irrigated and 32 as irrigated, which, when combined with field observations, yielded a balanced dataset of 98 non-irrigated and 92 irrigated reference points. This integration of field-based data, scheme-level knowledge, and high-resolution imagery strengthened the representativeness of the reference dataset and reduced the likelihood of sampling bias in model training and accuracy assessment.

The locations of the surveyed irrigated fields within Limpopo Province are also shown in Fig. 6.

Fig. 6

Locations of surveyed irrigated fields

2.4 Estimation of Irrigation Water Use

Water use by irrigated agriculture was quantified using the FAO Water Productivity Open-access Portal, version 3 (WaPOR v3) dataset (FAO 2019). We employed the Level-2 monthly Actual Evapotranspiration and Interception (AETI) product, available at 100 m spatial resolution for the African continent. To integrate with the 10 m irrigated area map, the AETI data were resampled using the nearest-neighbour method, which preserved the original pixel values and thus the integrity of the WaPOR measurements (Lillesand et al. 2015).

The classified irrigated area raster was then converted into vector polygons, and mean AETI values (mm month⁻¹) were extracted for each polygon using a zonal statistics procedure. WaPOR Level-2 AETI values are provided as digital numbers (DN) with a scale factor of 0.1, corresponding to millimetres per month. Conversion to water volume was carried out in two steps:

$\:{AETI}_{mm}=DN\times\:0.1$

$\:V\left(m3\right)=\:{AETI}_{mm}\times\:{10}^{-3}\times\:A\:$

where

$\:A$

is the polygon area in square metres. This ensured that evapotranspiration depth (mm) was consistently converted to volumetric water use (m³). Finally, total irrigation water use for the basin was obtained by aggregating volumes across all polygons.

3. Results

3.1 Cropland Extent

The composite cropland mask for the LRB covered 4.37 million ha (9.5% of the basin), with South Africa accounting for 3.02 million ha (~ 70%). The remainder was in Botswana (0.59 million ha), Zimbabwe (0.53 million ha), and Mozambique (0.24 million ha) (Fig. 7). The mask, encompassing both rainfed and irrigated areas, formed the baseline for near real-time monthly irrigated area mapping.

Fig. 7

Cropland mask of the Limpopo River Basin, derived from DALRRD cropland (2017) and SANLC (2022) for South Africa, and ESA WorldCover 10 m 2021 v200 for Botswana, Mozambique, and Zimbabwe.

Cropland distribution varied substantially among sub-basins (Table 1). The Middle Olifants (659,301.6 ha) and Upper Olifants (582,542.6 ha) together contained over 28% of total cropland, followed by Crocodile (443,986.9 ha), Sand (269,569.4 ha), Shashe (247,169.2 ha), and Notwane (207,940.1 ha). In contrast, the Lower Middle Limpopo (12,529.1 ha), Matlabas (20,954.3 ha), and Shingwedzi (21,533 ha) had minimal coverage.

Table 1
Cropland extent of Limpopo sub-basins.
Sub-basin	Cropland extent (ha)
Letaba	176,696.5
Lower Olifants	75,784.1
Sand	269,569.4
Mahalapswe	94,066.9
Matlabas	20,954.3
Nzhelele	26,690.8
Crocodile	443,986.9
Marico	95,785.3
Notwane	207,940.1
Bonwapitse	52,160.5
Mokolo	107,816.4
Lephalala	76,121.9
Mogalakwena	278,890.5
Motloutse	77,381.4
Shashe	247,169.2
Mzingwani	148,396.9
Bubi	58,425
Luvuvhu	87,327.4
Mwenezi	176,034.3
Upper Olifants	582,542.6
Middle Olifants	659,301.6
Steelpoort	96,457
Shingwedzi	21,533
Lower Middle Limpopo	12,529.1
Changane	75,594.5
Lower Limpopo	137,754.9
Lotsane	70,647.6

3.2 Classification Accuracy Assessment

Validation of the September 2024 irrigated area map using 198 field reference points yielded an overall accuracy of 80% and a Cohen’s Kappa of 0.60 (Fig. 8; Table 2). For the non-irrigated class, producer accuracy was 87.5% and user accuracy 68.5%, with an F-score of 82.4%. The irrigated class achieved a producer accuracy of 75.4% and user accuracy of 90.8% (F-score: 76.8%).

The confusion matrix showed 89/98 non-irrigated and 63/92 irrigated samples correctly classified. Misclassification was more common for irrigated plots (29 labelled non-irrigated) than for non-irrigated areas (9 cases). Many errors occurred in fields where crops were at early phenological stages with high bare-soil fractions, producing spectral confusion with fallow/dryland areas.

McNemar’s test confirmed a significant asymmetry in errors (χ² = 10.89, p < 0.001), indicating a tendency to under-predict irrigated areas.

Fig. 8

Confusion matrix of September 2024 irrigated area classification

Table 2 User, producer and overall accuracy for irrigated agriculture for the algorithm models

Figure 9 shows model predictions of irrigated agriculture (highlighted in green) overlaid on true-color composites from PlanetScope imagery. Each pair of panels compares the original satellite view (left) with the corresponding classification output (right). The predictions capture a range of irrigation patterns, including irregularly shaped plots, circular center-pivot systems, and rectangular fields, demonstrating the model’s ability to identify irrigated areas across diverse field geometries and scales.

Fig. 9

Examples of irrigated areas derived from classification (right) alongside true-color PlanetScope composites (left, September 2024), provided through the Norwegian International Climate and Forest Initiative (NICFI). Panels illustrate (a) Silalabuhwa Irrigation Scheme, (b) a private commercial center-pivot irrigation farm on the Mahalapye River, (c) Zholube Irrigation Scheme, and (d) a smallholder irrigation scheme on the Mzingwane River in Matabeleland South, Zimbabwe.

3.3 Temporal Trends in Irrigated Area (2019–2024)

The monthly distribution of irrigated areas during dry season (May–September) showed clear interannual variability (Fig. 10). The year 2019 recorded the largest areal extent (~ 221,000 ha), significantly higher than subsequent years (ANOVA, F = 8.72, p < 0.001). The lowest seasonal mean occurred in 2020 (~ 171,112 ha), with July and August minima of 116,000 ha and 120,000 ha, respectively. Tukey’s HSD identified 2020 as significantly different (p < 0.05) from 2019, 2022, and 2023. From 2021 to 2023, irrigated area extents partially recovered (from 179,000 to 191,000 ha). In 2024, early-season values exceeded 210,000 ha but declined later in the season. A linear regression of the irrigated areas from 2019 to 2024 showed a downward trend with an average annual decline of ~ 10,200 ha. The month of May showed significantly higher values than those for August or September (paired t-test, p < 0.01), confirming continuation of crops from the previous rainfed season.

Fig. 10

Monthly irrigated area extents for the dry seasons from 2019 to 2024

3.4 Spatial Distribution of Irrigated Area at the Sub-Basin Level

Identified irrigated agriculture in the Middle Olifants, Crocodile, and Letaba sub-basins of LRB together accounted for the largest share of the basin’s total irrigated area (Fig. 11). An exception occurred in September 2023, when the Crocodile surpassed the Middle Olifants.

Sub-basins such as Lotsane, Changane, Lower Middle Limpopo, Bubi, Bonwapitse, and Mahalapswe consistently exhibited lower irrigated area extents from 2019. Global Moran’s I indicated significant positive spatial autocorrelation each year (I = 0.41–0.55, p < 0.01), with high-value clusters in the central/northeastern basin (e.g., Middle Olifants, Crocodile, Letaba) and low-value clusters in the west and south. These findings demonstrate that irrigated agriculture within the LRB is not randomly distributed but is spatially concentrated in specific sub-basins mainly in South Africa, likely reflecting the combined influence of hydrological availability, irrigation infrastructure, institutions and socio-economic drivers.

Fig. 11

Spatial distribution of irrigated areas across LRB sub-basins (2019–2024), highlighting concentration in Middle Olifants, Crocodile, Letaba, and Luvuvhu.

3.5 Progression of Water Consumption in Irrigated Agriculture (2019–2024)

Dry-season irrigation water consumption peaked in May and September, with a pronounced minimum in June (Fig. 12). The highest single-month value was recorded in May 2019 (189.8 10⁶ m³), while the lowest occurred in June 2020 indicating transition from rainfed to irrigated season (46.2 10⁶ m³). August consistently exhibited high and stable consumption across all years. Mean dry-season use increased from approximately 103 10⁶ m³ in 2020 to 134 10⁶ m³ in 2024. Across months, May had the highest mean consumption (147.4 10⁶ m³), whereas June recorded both the lowest mean (83.7 10⁶ m³) and the highest interannual variability (CV = 33.6%). A one-way ANOVA indicated significant differences among months (F(4,25) = 8.54, p < 0.001). Post-hoc Tukey’s HSD revealed that August consumption was significantly higher than June (mean difference = 43.3 10⁶ m³, p = 0.026). These results confirm a strong seasonal pattern, with peaks at the start and end of the dry season, stable high usage in August, and a June minimum.

Fig. 12

Progression of water consumption of irrigated agriculture within LRB between 2019 and 2024

4. Discussion

4.1 Changes in Irrigated Area (2019–2024)

The results reveal a significant decline in irrigated areas, from a mean of 211,281 ha in 2019 to 184,771 ha in 2024. This trend aligns with findings from (Cai et al. 2017) and (Van Niekerk et al. 2018), who reported irrigated areas of 262,000 ha and 218,302 ha in the Limpopo Province, respectively. These estimates were based on coarser 30 m Landsat imagery which probably resulted in an overestimation due to mixed-pixel issues in heterogeneous landscapes (He et al. 2020; Gxokwe et al. 2022). The finer 10 m resolution of Sentinel-2 used in this study better captures smallholder plots, which are critical in the LRB’s fragmented agricultural systems. The observed decline may reflect multiple drivers, including increasing water stress, as reported by (Kapangaziwiri et al. 2021), who noted a doubling of water consumption from 2000 to 2012. The sharp drop in 2020 (171,112 ha) likely resulted from COVID-19 lockdowns across the LRB’s four countries, which disrupted agricultural activities and led to farm abandonment (Paganini et al. 2020). Persistent drought and water scarcity have likely contributed to the limited recovery of irrigated areas post-2020, with water restrictions and reduced reservoir levels disproportionately affecting smallholder farmers (Makhanya 2021; Ferreira et al. 2022; Mukwevho 2023; Mathivha et al. 2024).

Spatially, irrigation is concentrated in the Middle Olifants, Crocodile, Letaba, and Luvuvhu sub-basins, which consistently accounted for the largest irrigated extents, with significant positive spatial autocorrelation (Moran’s I = 0.41–0.55, p < 0.01). This clustering reflects hydrological availability, irrigation infrastructure, and socio-economic factors, as noted by (Sitoe and Qwist-Hoffman 2013). Conversely, sub-basins like Lotsane, Changane, and Lower Middle Limpopo show minimal irrigation, suggesting untapped potential for agricultural expansion.

Intra-annual trends show higher irrigation in May and June compared to August and September, likely due to declining water quotas as the dry season progresses (Mazibuko et al. 2021). Smallholder farmers, reliant on costly irrigation systems, often shift to rainfed agriculture, increasing their vulnerability to drought (Olabanji et al. 2021). The relatively small proportion of irrigated land (0.4% of the basin) indicates room for sustainable expansion, provided water resources are managed effectively.

4.2 Water use dynamics

Water consumption peaked in May (147.4 10⁶ m³) and September, with a minimum in June (83.7 10⁶ m³, CV = 33.6%), reflecting the LRB’s strong seasonality (Fig. 2). These patterns correspond to crop phenological stages, with May aligning with the maturity of summer crops and planting of winter crops, and September marking the growing stage of new crops, both requiring high water inputs (Al-Kaisi and Broner 2009; Chadalavada et al. 2021). The low water use in June, despite relatively high irrigated area, likely coincides with the planting phase of winter crops, which have lower ET demands (Sihlobo 2022). The increase in mean dry-season water use from 103 10⁶ m³ in 2020 to 134 10⁶ m³ in 2024, despite declining irrigated area, suggests intensifying demand, possibly due to drought-driven reliance on irrigation, as noted by (Botai et al. 2020), or improvements in irrigation efficiency, such as revitalized schemes and treated effluent use (Limpopo Basin Permanent Technical Committee 2010). The 2020 minimum likely reflects reduced agricultural activity during COVID-19 restrictions (Paganini et al. 2020).

4.3 Strengths and Limitations

The framework demonstrates strong operational potential by combining automation, scalability, and high spatial resolution. Leveraging Google Earth Engine enables basin-wide monthly mapping without the need for continuous in-season field data, while post-classification refinements reduce misclassification from perennial crops and steep terrain. The integration of WaPOR evapotranspiration adds an essential link between mapped extents and actual water use, enhancing its value for allocation planning.

Validation against independent field observations yielded an overall accuracy of ~ 80% (κ ≈ 0.60), a moderate level of agreement suitable for basin-scale operational monitoring. The model exhibits a conservative bias: irrigated fields are more likely to be undercounted, especially at early phenological stages with high bare-soil fractions, while non-irrigated areas are classified more reliably. This reduces the risk of overestimating irrigation but may lead to omission of marginal or early-season activity.

Despite these strengths, some limitations remain. Field validation was concentrated in South Africa’s Limpopo Province, which may restrict generalizability across the basin’s diverse agro-ecological settings. The reliance on resampled WaPOR data (100 m to 10 m) could introduce minor spatial inaccuracies, even though nearest-neighbor methods were applied to mitigate this. Furthermore, while the semi-supervised approach reduces dependency on costly ground surveys, some level of field data will always be required to benchmark results and calibrate models in new contexts.

Future improvements could involve integrating radar or thermal datasets to better capture irrigation signals under cloudy conditions or soil-dominated canopies and applying temporal deep learning approaches that exploit seasonal dynamics. Broader validation across Botswana, Mozambique, and Zimbabwe, supplemented by high-resolution commercial imagery and irrigation scheme inventories, would further strengthen accuracy and applicability. Overall, the high-resolution, semi-supervised framework marks a clear advancement over coarse-scale mapping, balancing automation, interpretability, and operational feasibility, while acknowledging the need for continued refinement and context-specific validation.

4.4 Implications and Future Directions

The high-resolution, monthly irrigated area maps and associated water use estimates produced by this framework provide an evidence base that was previously unavailable for the Limpopo River Basin. By linking spatially explicit irrigation extents with water consumption, the system enables managers to monitor irrigation dynamics throughout the dry season, identify peaks in demand, and detect shifts in cultivation patterns over time. This near real-time information supports more equitable allocation of scarce water resources, as authorities can see not only how much area is irrigated but also how much water is consumed in different sub-basins.

The capacity to capture smallholder fields at 10 m resolution is particularly valuable. These plots are often underrepresented in coarser-scale assessments, yet they play a central role in household food security and rural livelihoods. By including both small-holder and commercial irrigation in the same monitoring system, the methodology provides a more complete picture of agricultural water use and helps planners design interventions that are socially inclusive.

At the same time, the spatial distribution maps highlight irrigation hotspots, such as the Middle Olifants and Crocodile sub-basins, where water demand is concentrated and resource competition is most acute. This allows basin managers to target monitoring, efficiency improvements, or enforcement efforts where they are most needed. Conversely, the detection of underutilized areas offers opportunities for sustainable expansion, provided hydrological assessments confirm that water resources are available (Perrin et al. 2012).

In the longer term, embedding this monitoring system within the Limpopo Digital Twin initiative will ensure that outputs are directly accessible to decision-makers, farmers, and researchers. This integration provides a platform for evidence-based agricultural and water planning, enhancing resilience in a climatically vulnerable basin.

5. Conclusions

This study developed and validated a scalable, high-resolution, semi-supervised machine learning framework for monthly mapping of irrigated croplands and their water use in the LRB from 2019 to 2024, addressing a critical need for precise irrigation monitoring in a water-stressed transboundary region. By integrating Sentinel-2 imagery, Random Forest classification, time-lagged precipitation–vegetation relationships, slope-based masking, and FAO’s WaPOR evapotranspiration data, the framework achieved an overall accuracy of 80% (Cohen’s Kappa = 0.60), effectively capturing smallholder plots and monthly dynamics at a 10 m resolution. The findings reveal a significant decline in irrigated areas, from ~ 211,281.8 ha in 2019 to ~ 184,771 ha in 2024, alongside an increase in dry-season water use from 103 10⁶ m3 in 2020 to 134 10⁶ m3 in 2024, highlighting intensifying water stress and the impact of factors such as drought and COVID-19 disruptions. Spatial concentration in the Middle Olifants, Crocodile, Letaba, and Luvuvhu sub-basins underscores the need for targeted water management, while underutilized sub-basins present opportunities for sustainable irrigation expansion. Despite limitations, such as misclassification at early crop stages, the framework provides actionable insights for equitable water allocation and agricultural planning. Future enhancements, including expanded validation and additional data sources, could further strengthen its applicability across other water-stressed basins, positioning it as a vital tool for sustainable water resource management in climatically vulnerable regions.

Funding:

This research was funded by CGIAR through the CGIAR Initiative on Digital Innovation, supported by CGIAR Trust Fund contributors.

Data Availability

The full implementation is openly available via a Google Colab–hosted GEE script: [https://shorturl.at/GYUn1](https:/shorturl.at/GYUn1) . The workflow for generating the reference data is available here: [https://code.earthengine.google.com/a97003607b4ae118e717b5faa66bb73d](https:/code.earthengine.google.com/a97003607b4ae118e717b5faa66bb73d) .

Conflicts of Interest:

The authors declare no conflict of interest.

Clinical trial number

not applicable

Ethics, Consent to Participate, and Consent to Publish declarations

not applicable

Author Contribution

Conceptualization, Z.K. and K.M.; methodology, Z.K. and K.M.; validation, Z.K., K.M. and C.D.; formal analysis, Z.K.; investigation, Z.K. and M.G.A.; resources, C.D. and F.L.; data curation, Z.K. and M.G.A.; writing—original draft preparation, Z.K.; writing—review and editing, K.M., C.D., M.G.A., F.L. and S.G.; visualization, Z.K.; supervision, C.D., F.L. and S.G.; project administration, K.M.; funding acquisition, M.G.A.

References

Al-Kaisi MM, Broner I. Crop water use and growth stages. USA: Colorado State University Extension Fort Collins, CO; 2009.

Botai CM, Botai JO, Zwane NN, et al. Hydroclimatic Extremes in the Limpopo River Basin, South Africa, under Changing Climate. Water. 2020;12:3299. https://doi.org/10.3390/w12123299.

Box GE, Jenkins GM, Reinsel GC, Ljung GM. Time series analysis: forecasting and control. Wiley; 2015.

Breiman L. Random forests. Mach Learn. 2001;45:5–32.

Cai X, Magidi J, Nhamo L, Van Koppen B. Mapping irrigated areas in the Limpopo Province, South Africa. International Water Management Institute (IWMI); 2017.

Chadalavada K, Kumari BR, Kumar TS. Sorghum mitigates climate variability and change on crop yield and quality. Planta. 2021;253:113.

Dickens C, Mukuyu P, Ndlovu B et al. (2020) E-flows for the Limpopo River Basin: from vision to management.

FAO. (2019) WaPOR Database methodology: Level 3 data – Using remote sensing in support of solutions to reduce agricultural water productivity gaps. Rome 68.

Ferreira KR, Queiroz GR, Marujo RFB, Costa RW, BUILDING EARTH OBSERVATION DATA CUBES ON AWS. (2022). Int Arch Photogramm Remote Sens Spat Inf Sci XLIII-B3-2022:597–602. https://doi.org/10.5194/isprs-archives-XLIII-B3-2022-597-2022

Funk C, Peterson P, Landsfeld M, et al. The climate hazards infrared precipitation with stations—a new environmental record for monitoring extremes. Sci Data. 2015;2:1–21.

Gorelick N, Hancher M, Dixon M, et al. Google Earth Engine: Planetary-scale geospatial analysis for everyone. Remote Sens Environ. 2017;202:18–27. https://doi.org/10.1016/j.rse.2017.06.031.

Gxokwe S, Dube T, Mazvimavi D. Leveraging Google Earth Engine platform to characterize and map small seasonal wetlands in the semi-arid environments of South Africa. Sci Total Environ. 2022;803:150139.

He Y, Yang J, Guo X. Green Vegetation Cover Dynamics in a Heterogeneous Grassland: Spectral Unmixing of Landsat Time Series from 1999 to 2014. Remote Sens. 2020;12:3826. https://doi.org/10.3390/rs12223826.

Kapangaziwiri E, Kahinda J-MM, Oosthuizen N, et al. TOWARDS THE QUANTIFICATION OF THE HISTORICAL AND. FUTURE WATER RESOURCES OF THE LIMPOPO RIVER BASIN; 2021.

Lillesand T, Kiefer RW, Chipman J. Remote sensing and image interpretation. Wiley; 2015.

Limpopo Basin Permanent Technical Committee. (2010) Joint Limpopo River Basin Study Scoping Phase Final Report.

Magidi J, Nhamo L, Mpandeli S, Mabhaudhi T. Application of the Random Forest Classifier to Map Irrigated Areas Using Google Earth Engine. Remote Sens. 2021;13:876. https://doi.org/10.3390/rs13050876.

Mahapatra S, Velpuri N, Schmitter P, et al. Enhancing the water accounting plus framework to evaluate field-level water use and availability in rice-dominant cropping system and potential for mapping GHG emissions at scale. Bihar, India: a case study for the eastern Gangetic Plains; 2024.

Makhanya NZ. (2021) Potential impacts of climate change on hydrological droughts in the Limpopo river basin.

Mathivha FI, Mabala L, Matimolane S, Mbatha N. El Niño-Induced Drought Impacts on Reservoir Water Resources in South Africa. Atmosphere. 2024;15:249. https://doi.org/10.3390/atmos15030249.

Mazibuko S, Mukwada G, Moeletsi M. Assessing the frequency of drought/flood severity in the Luvuvhu River catchment, Limpopo Province, South Africa. Water SA. 2021;47:172–84.

McAllister A, Whitfield D, Abuzar M. Mapping irrigated farmlands using vegetation and thermal thresholds derived from Landsat and ASTER data in an irrigation district of Australia. Photogramm Eng Remote Sens. 2015;81:229–38.

Mohammedshum AA, Maathuis BHP, Mannaerts CM, Teka D. Mapping Small-Scale Irrigation Areas Using Expert Decision Rules and the Random Forest Classifier in Northern Ethiopia. Remote Sens. 2023;15:5647. https://doi.org/10.3390/rs15245647.

Mpakairi KS, Dube T, Sibanda M, Mutanga O. Fine-scale characterization of irrigated and rainfed croplands at national scale using multi-source data, random forest, and deep learning algorithms. ISPRS J Photogramm Remote Sens. 2023;204:117–30. https://doi.org/10.1016/j.isprsjprs.2023.09.006.

Mukwevho SG. (2023) Modelling impacts of climate change on hydrology of Latonyanda River Quaternary Catchment, Limpopo Province, South Africa.

Olabanji MF, Davis N, Ndarana T, et al. Assessment of smallholder farmers’ perception and adaptation response to climate change in the Olifants catchment, South Africa. J Water Clim Change. 2021;12:3388–403. https://doi.org/10.2166/wcc.2021.138.

Ozdogan M, Gutman G. A new methodology to map irrigated areas using multi-temporal MODIS and ancillary data: An application example in the continental US. Remote Sens Environ. 2008;112:3520–37.

Ozdogan M, Yang Y, Allez G, Cervantes C. Remote sensing of irrigated agriculture: Opportunities and challenges. Remote Sens. 2010;2:2274–304.

Paganini N, Adinata K, Buthelezi N, et al. Growing and Eating Food during the COVID-19 Pandemic: Farmers’ Perspectives on Local Food System Resilience to Shocks in Southern Africa and Indonesia. Sustainability. 2020;12:8556. https://doi.org/10.3390/su12208556.

Perrin J, Ferrant S, Massuel S, et al. Assessing water availability in a semi-arid watershed of southern India using a semi-distributed model. J Hydrol. 2012;460:143–55.

Pun M, Mutiibwa D, Li R. Land use classification: A surface energy balance and vegetation index application to map and monitor irrigated lands. Remote Sens. 2017;9:1256.

Ragettli S, Herberz T, Siegfried T. An Unsupervised Classification Algorithm for Multi-Temporal Irrigated Area Mapping in Central Asia. Remote Sens. 2018;10:1823. https://doi.org/10.3390/rs10111823.

Senay GB, Friedrichs M, Singh RK, Velpuri NM. Evaluating Landsat 8 evapotranspiration for water use mapping in the Colorado River Basin. Remote Sens Environ. 2016;185:171–85.

Sihlobo W. (2022) What the month of April means for SA agriculture. In: What Mon. April Means SA Agric. http://www.agbiz.co.za/. Accessed 24 Nov 2024.

Sitoe S, Qwist-Hoffman P. (2013) Limpopo River Basin Monograph.

Thenkabail PS, Biradar CM, Noojipady P, et al. Global irrigated area map (GIAM), derived from remote sensing, for the end of the last millennium. Int J Remote Sens. 2009;30:3679–733.

Van Niekerk A, Jarmain C, Goudriaan R. An earth observation approach towards mapping irrigated area and quantifying water use by irrigated crops in South Africa. Water Research Commission; 2018.

Vogels MF, de Jong SM, Sterk G, Addink EA. Mapping irrigated agriculture in complex landscapes using SPOT6 imagery and object-based image analysis–A case study in the Central Rift Valley, Ethiopia–. Int J Appl Earth Obs Geoinf. 2019;75:118–29.

Zakariyyaa O. (2025) The 2022 South African National Land Cover Data and its Operational use.

Zanaga D, Van De Kerchove R, Daems D et al. (2022) ESA WorldCover 10 m 2021 v200.

Zurqani HA, Allen JS, Post CJ, et al. Mapping and quantifying agricultural irrigation in heterogeneous landscapes using Google Earth Engine. Remote Sens Appl Soc Environ. 2021;23:100590. https://doi.org/10.1016/j.rsase.2021.100590.

Yes