JiajiaWang1✉Emailjiajiaw2@illinois.edu
ZhihanTao1
BrianDeal1
1A
¹Department of Landscape ArchitectureUniversity of Illinois Urbana- ChampaignChampaignUSA Jiajia Wang1*, Zhihan Tao1, Brian Deal1
¹Department of Landscape Architecture, University of Illinois Urbana-Champaign, Champaign, USA
* Corresponding author: jiajiaw2@illinois.edu
Abstract
Cities worldwide aim to simultaneously achieve environmental protection, social equity, and economic vitality. These seemingly simple goals, however, require evaluating sustainability trade-offs across hundreds of instances, projects and places – a cognitive limitation traditionally mistaken for physical impossibility. In this paper, we present a multi-objective AI framework that analyzes hundreds of census tracts and variable configurations across three competing objectives to achieve maximum performance (in all three) simultaneously (+ 90%). The solution sets of such a large and complex array have been computationally invisible to conventional analysis. Using the city of Chicago as our test case (801 census blocks, 59 configurations) we found 22 uniquely optimal solutions that cluster along century-old green space corridors established by Burnham's original 1909 Plan. Interestingly, these solutions are found in the wealthier north and northwest parts of city with none in the highly disinvested south and west. This pattern demonstrates how historic infrastructure decisions can create path dependencies that may be difficult to overcome. Our AI framework reveals critical thresholds that appear to enable solution success, including critical open space access and population densities. The work provides an evidence-based approach for climate-responsive urban design and planning decisions. The framework supports an emerging urban cognitive ecosystem approach that uses data and AI to create intelligent, adaptive ecosystems that can learn and proactively deliver services, transforming planning from pattern recognition to revealing systemic and historic inequities.
A
Introduction
A
Cities worldwide confront a fundamental tension: how to simultaneously achieve environmental protection, social equity, and economic vitality when these objectives often conflict. Nearly 55% of the global population resides in urban areas, a proportion projected to reach 68% by 2050, intensifying pressures on municipal governments to balance competing sustainability imperatives
1. Spatial planning decisions represent the most powerful intervention point, fundamentally structuring how cities distribute environmental resources, social infrastructure, and economic opportunity
2. Yet configurations optimizing environmental quality typically exclude lower-income residents, while affordable neighborhoods systematically lack environmental amenities and economic opportunities. With a pattern replicated across diverse geographic and economic contexts, Chicago exemplifies this sustainability trilemma. The city's lakefront neighborhoods enjoy air quality 23% better than the city average, yet median home values exceed
$450,000—pricing out 68% of Chicago households
3. Conversely, the South and West sides, where 72% of residents identify as Black or Latino, experience flood risk 3.2 times higher and green space access 47% below the municipal median, despite offering more affordable housing
4. This reveals how spatial planning decisions perpetuate systematic inequalities even within a single metropolitan region.
In addition to the exclusion of the marginalized population, traditional planning approaches struggle with the complexity of spatial distribution of resources. Comprehensive plans establish qualitative goals—"promote sustainable development," "enhance livability"—without quantifying trade-offs or identifying optimal configurations5. Zoning codes evolved to separate incompatible uses rather than optimize sustainability performance6, while master plans operate on 20-year horizons poorly suited to rapidly evolving climate targets7. Cost-benefit analysis typically monetizes outcomes within a single dimension, obscuring fundamental incommensurability between environmental preservation, social justice, and economic efficiency8. These computational limitations have historically masked systemic patterns—a cognitive barrier we have mistaken for physical impossibility. Traditional planning, lacking quantitative frameworks to navigate these trade-offs, defaults to incremental adjustments that perpetuate inequalities.
Early urban planning optimization relied on linear programming for well-defined problems like facility location but proved inadequate for multi-dimensional sustainability challenges with nonlinear relationships and competing objectives9,10. Evolutionary algorithms offered a breakthrough by enabling multi-objective optimization. Models such as NSGA-II became the dominant approach for discovering Pareto-optimal solutions—configurations were improving any one objective necessarily degrades another11. NSGA-II's population-based search maintains diverse non-dominated solutions representing optimal trade-offs12. However, conventional methods require thousands of evaluations13, making it computationally infeasible to analyze all neighborhoods simultaneously, preventing city-wide pattern discovery
Bayesian optimization (BO) enables city-wide analysis through intelligent sequential sampling guided by probabilistic surrogate models14. BO constructs Gaussian Process models that predict objective values and quantify uncertainty, enabling strategic candidate selection with dramatically fewer evaluations. The qEHVI acquisition function extends BO to multi-objective problems by measuring expected improvement to the Pareto front's dominated hypervolume15. Recent algorithmic advances enable simultaneous evaluation of entire urban systems rather than pre-selected neighborhoods16. Despite these computational advantages, existing frameworks employ generic features—raw census variables and land use categories—that fail to capture domain-specific sustainability mechanisms17. Environmental models use basic metrics like "percentage tree cover" without accounting for density interactions. Social assessments rely on raw income without constructing housing cost burden indices. This feature engineering gap limits even sophisticated optimization algorithms.
Three critical gaps constrain existing frameworks: computational tractability, domain awareness, and actionable insights. Here, we integrate domain-specific feature engineering, objective-tailored predictive models, and sample-efficient multi-objective Bayesian optimization to address these limitations. We develop 10 additional composite indices grounded in urban planning theory with separate LightGBM models for each objective. Multi-objective Bayesian optimization with qEHVI discovers Pareto-optimal solutions across all 801 census tracts simultaneously. Applied to Chicago, this framework identifies concrete land use configurations and reveals preconditions enabling multi-objective success, directly supporting UN Sustainable Development Goal 1118.
This research advances urban sustainability science through four contributions. Methodologically, city-wide simultaneous analysis transforms optimization from pre-selected neighborhood evaluation to discovering systemic patterns. Empirically, novel features yield R² >0.96 across objectives, operationalizing urban planning theory within machine learning. Substantively, 59 Pareto-optimal solutions reveal fundamental trade-offs: no solution achieves > 0.50 in both environmental and economic objectives, while balanced solutions converge on 25.3% open space at 16,000–24,000 persons/km². Practically, findings identify transit accessibility and housing affordability as leverage points, with green infrastructure enabling multi-objective success, directly informing zoning reforms and capital budgets. Addressing this requires novel computational approaches capable of simultaneously evaluating environmental, social, and economic outcomes across entire urban systems.
Results
Traditional urban planning approaches evaluate sustainability trade-offs sequentially across tens of neighborhoods—a computational constraint long mistaken for physical impossibility. Our AI-driven multi-objective optimization framework transcended this limitation by simultaneously evaluating Chicago’s 801 census tracts across three competing dimensions, discovering 59 Pareto-optimal configurations that were physically achievable but computationally invisible to conventional analysis. Most critically, these solutions cluster exclusively along century-old infrastructure corridors established by Burnham's 1909 Plan, with zero in historically disinvested South and West Chicago. This geographic pattern reveals how historic investments create path dependencies that contemporary policy struggles to overcome—constraints that remain hidden without simultaneous computational evaluation across all neighborhoods.
The AI framework successfully operationalized urban planning theory within predictive models, achieving robust predictive capability across all three sustainability dimensions (Table 1). The environmental model achieved the highest accuracy (R² = 0.985), followed by economic (R² = 0.982) and social models (R² = 0.968). All models-maintained prediction errors below 2% of the normalized scale (RMSE ≤ 0.020). These high accuracy levels validate that the framework can reliably predict sustainability outcomes for configurations never observed in historical data, enabling genuine optimization rather than mere interpolation. Evaluating all 801 tracts reveals patterns invisible to sequential analysis: zero optimal solutions exist in historically disinvested South and West Chicago.
Table 1
Predictive Model Performance Across Three Sustainability Dimensions (n = 801)
Objectives | R2 | RMSE |
|---|
Environment | 0.985 | 0.013 |
Social | 0.968 | 0.020 |
Economic | 0.982 | 0.014 |
The machine learning feature importance analysis revealed non-obvious drivers that challenge conventional planning assumptions (Table 2). Environmental outcomes were dominated by ecological infrastructure, with plant diversity contributing 45.3% of predictive power—nearly five times more influential than any other environmental factor. This dominance suggests that biodiversity serves as a master indicator of ecological health in urban contexts, synthesizing multiple ecological processes including air quality, carbon sequestration, and ecosystem resilience. Social outcomes prioritized economic opportunity, with the employment index accounting for 37.3% of predictive importance, while traditional demographic factors like education (9.7%) played secondary roles. This finding challenges the conventional planning emphasis on educational attainment as a primary social intervention point, suggesting that job availability may be a more direct lever for social outcomes. Economic predictions relied heavily on housing market dynamics (39.5%), with our engineered economic accessibility feature ranking second (9.8%), validating our domain-specific feature engineering approach. The relatively low importance of direct employment rate (4.6%) compared to housing values indicates that property markets capture cumulative economic vitality better than labor statistics alone. These patterns provide empirical validation for redirecting planning interventions toward leverage points invisible in traditional single-variable analyses.
Table 2
Top 5 Features by Domain (% Importance)
Rank | Environmental | Social | Economic |
|---|
1 | Plant Diversity (45.3%) | Employment Index (37.3%) | Housing Value (39.5%) |
2 | Flood Resilience (19.9%) | College Education (9.7%) | Economic Accessibility (9.8%) |
3 | Air Quality (13.7%) | Renter Rate (2.1%) | Employment Rate (4.6%) |
4 | CO₂ per Household (6.1%) | Social Equity Index (1.7%) | Economic Vulnerability (0.9%) |
5 | Tree Canopy (2.5%) | Transit Stops (0.8%) | Renter Rate (0.8%) |
The Invisible Geography of Sustainability: Spatial Inequity
The spatial distribution reveals a pronounced northward extension from Chicago's Loop, concentrating along corridors stretching from downtown toward northern neighborhoods (Fig. 1). Independent optimization trials repeatedly converged on these same geographic locations, discovering solutions with varying environmental-social-economic trade-offs within these constrained areas. This is mathematical proof that historical infrastructure investment creates compounding advantages that optimization cannot overcome. Zero optimal solutions appear in Chicago's South and West sides despite these areas comprising over half the city's land area and population.
This geographic pattern is not coincidental following the city's established park systems, particularly the lakefront parks and the boulevards designed in Burnham's Plan of Chicago in 1909, which we can still see today (Supplementary Fig. 1). The northern concentration reflects the cumulative advantage of historical green infrastructure investments that create the preconditions for multi-objective optimization. This geographic constraint demonstrates how historic planning decisions fundamentally structure contemporary sustainability possibilities, a path dependency that remains invisible.
The Pareto Fronts: Quantifying Fundamental Trade-offs
We discovered 59 Pareto-optimal solutions spanning environmental scores 0.477–0.704, social scores 0.687–0.769, and economic scores 0.370–0.623 (Fig. 2). These represent the theoretical performance ceiling where improving one objective necessarily degrades another. The environmental-economic frontier shows the strongest negative correlation: no solution achieves > 0.50 in both simultaneously. Environmental-social objectives show strong conflict—solutions achieving environmental scores above 0.65 consistently yield social scores below 0.73, while high social performers (> 0.76) restrict environmental scores to below 0.56. Conversely, the social-economic frontier demonstrates weak positive correlation, with multiple solutions exceeding 0.75 social and 0.50 economic scores concurrently.
The three independent trials validated the optimization's robustness (Fig. 3). Trial 2 identified 28 solutions (47% of total), Trial 0 found 22 (37%), and Trial 1 contributed 9 (15%). All extreme configurations appeared within the first 30 iterations, while intermediate solutions emerged throughout the 50-iteration process, demonstrating the algorithm's efficiency in discovering the complete Pareto front structure with 110 evaluations per trial. The three trials' convergence on the same geographic locations—despite discovering solutions with distinct performance profiles—reinforces that only certain areas of the city can achieve multi-objective optimization under current conditions.
The narrow social score range (0.082) compared to environmental (0.227) and economic (0.253) range reveals social objectives are most robust to land use variations—planners have greater flexibility achieving social equity across different spatial configurations than balancing environmental and economic goals. This finding has important policy implications: social outcomes appear relatively achievable across diverse development patterns, while environmental and economic goals require precise calibration of land use configurations.
We identified four distinct archetypes—environmental leaders, social leaders, economic leaders, and balanced solutions with density spanning nearly seven-fold (4,881 − 33,445 persons/km²) (Fig. 1). Environmental leaders achieve 0.704 at just 4,881 persons/km² but sacrifice economy (0.392). Social leaders peak at 0.769 across flexible densities (15,301 − 31,796 persons/km²), suggesting well-being depends more on service provision than density. Economic leaders reach only 0.620 at 24,049 persons/km² while sacrificing environmental quality (0.490).
Critically, balanced solutions achieve 89.8% environmental (0.632), 97.7% social (0.751), and 79.5% economic (0.494) performance relative to single-objective maxima—demonstrating simultaneous optimization requires only modest compromises. The leading balanced solution (Trial 1, Solution 5) emerges at 16,197 persons/km², with top-performing balanced configurations clustering around 13,000–18,000 persons/km²—a range suggesting natural equilibrium where objectives reinforce rather than conflict.
The Green Infrastructure Threshold
The top-performing balanced solutions consistently appear in census tracts adjacent to or containing significant park space. Balanced solutions converge on 25.3% open space through emergent optimization, not design constraint (Supplementary Table 3, Fig. 4). This represents a quantitative tipping point where parks generate neighborhood-wide benefits while maintaining economic viability. Despite using only 65% as much open space as environmental optimizers (25.3% vs 38.8%), balanced configurations achieve 89.8% of maximum environmental performance. Strategic placement matters more than total quantity. Parks embedded in mixed-use, transit-accessible neighborhoods at moderate densities generate more total value than low-density preserves.
The optimization identified a density threshold: balanced solutions cluster at 13,000–18,000 persons/km², with the leading solution at 16,197 persons/km². Environmental leaders maintain densities below 4,881 persons/km², while social leaders span 15,301 − 33,445 persons/km². Notably, the highest observed density achieving Pareto optimality is 33,445 persons/km², suggesting diminishing sustainability returns beyond this threshold. The balanced land use mix—25.3% open space, 10.9% low, 8.7% medium, 55.1% high-intensity—represents careful calibration where each type contributes to overall performance. Graduated intensity levels create transition zones preserving neighborhood character while accommodating growth. These configurations provide immediately actionable templates: planners can reference specific land use mixes proven mathematically optimal for different policy priorities.
Discussion
AI as Computational Lens: Revealing Invisible Urban Patterns
This study demonstrates how AI-driven multi-objective Bayesian optimization transforms urban sustainability planning from resource-intensive scenario analysis to efficient, scalable, and cost-effective decision support. By evaluating all 801 census tracts simultaneously, we discovered patterns invisible to traditional planning: optimal solutions exist only along Burnham's 1909 corridors, with zero in South and West Chicago. This provides quantitative evidence that century-old infrastructure investments create path dependencies contemporary policy cannot easily overcome without first addressing foundational inequities10. Our computational tools reveal optimal configurations exist in only 22 of 801 census tracts, exposing rather than solving deep inequities.
A
Feature importance analysis from AI models challenges conventional planning priorities. Plant diversity (45.3%) dominates environmental predictions—nearly five times more influential than any other factor—suggesting species diversity synthesizes multiple ecological processes. Employment opportunities (37.3%) outweigh education (9.7%) for social outcomes, while housing values (39.5%) capture economic vitality better than employment rates (4.6%). These findings redirect interventions toward leverage points invisible in single-variable analyses, operationalizing urban planning theory within predictive models that achieve R² >0.96 across all objectives.
Threshold and Optimal Density: Actionable Parameters
We discovered two parameters that provide immediately actionable guidance for advancing UN Sustainable Development Goal 11 on making cities inclusive, safe, resilient and sustainable18. Balanced solutions converge on 25.3% open space through emergent optimization—a quantitative tipping point where parks generate neighborhood-wide benefits while maintaining economic viability. Despite using only 65% as much open space as environmental optimizers, balanced configurations achieve 89.8% environmental performance because strategic placement in mixed-use, transit-accessible neighborhoods amplifies benefits through higher user populations. The density threshold of 13,000–18,000 persons/km² enables viable transit and commerce while maintaining green space, with diminishing sustainability returns beyond 33,445 persons/km². While these Chicago-derived parameters require context-specific calibration, they provide evidence-based starting points for cities lacking optimization resources. The strong environmental-economic trade-off—no solution achieves > 0.50 in both simultaneously—quantifies conflicts often implicit in climate planning. Conversely, social objectives' narrow score range (0.082 vs 0.227 environmental, 0.253 economic) reveals greater flexibility: planners can achieve social equity across diverse configurations while environmental and economic goals require precise calibration. The 25.3% green space threshold provides flood resilience through permeable surfaces while the density threshold enables transit viability, directly supporting climate adaptation and mitigation.
Democratizing Urbanization for Climate Action
By relying on freely available census data, satellite imagery, and open-source machine learning libraries, this framework democratizes sophisticated multi-objective optimization previously limited to wealthy cities with extensive technical capacity14. Any urban area with basic census statistics and environmental data can apply this approach enabling comprehensive city-wide analysis. This accessibility addresses power asymmetries in the emerging urban cognitive ecosystem, where AI capabilities have concentrated in wealthy municipalities. By using only open data sources and reproducible methods, we distribute cognitive infrastructure as readily as we advocate for physical green infrastructure. This accessibility proves critical as cities generate approximately 70% of global carbon emissions19, and face intensifying climate impacts from extreme heat, flooding, and storms20, which makes urban land use optimization essential for achieving 2030 and 2050 climate targets. This computational approach addresses the "planning gap" where cities lack tools to operationalize sustainability commitments5. The framework's iterative nature enables cities to continuously update optimization as census data, climate projections, or policy priorities evolve, supporting adaptive planning that responds to climate impacts, demographic shifts, and infrastructure changes, addressing the need for dynamic, evidence-based urban planning identified in SDG 11.3 on inclusive and sustainable urbanization18. This framework also establishes foundations for integrated systems modeling. Coupling land use optimization with energy and transportation models would enable comprehensive decarbonization strategies, moving from static pattern optimization to dynamic transition pathways that co-optimize across urban systems to achieve climate targets while maintaining equity and economic vitality.
Limitations and Future Directions
Several limitations warrant acknowledgment. Optimization captures a sustainability snapshot but misses temporal dynamics including climate change impacts and development trajectories. Census tract-level analysis cannot capture within-neighborhood inequities—optimization may improve averages while exacerbating disparities for vulnerable subpopulations. The three-objective formulation, while tractable, may oversimplify urban complexity. Future work should develop dynamic optimization incorporating climate projections, integrate citizen preferences through participatory modeling, and conduct multi-city comparative analysis with finer granularity data to distinguish universal patterns from context-specific findings. Despite these limitations, this framework provides a scalable, open-source foundation enabling evidence-based urban sustainability planning previously accessible only to wealthy cities, bridging computational efficiency with participatory governance.
Methods
We developed an AI framework that integrated machine learning and multi-objective Bayesian optimization framework (Fig. 5) to identify sustainable urban land use configurations across urban planning. The approach addresses a critical limitation of traditional planning methods: conventional optimization requires thousands of evaluations, making iterative scenario analysis impractical. Our framework reduces computational stress while quantifying prediction uncertainty, enabling real-time planning support for balancing environmental quality, social equity, and economic vitality21.
Study area. Chicago, Illinois (591 square kilometers, 2.721 million population, 801 census tracts) provides an ideal testbed due to: (1) substantial spatial heterogeneity spanning dense urban core to sprawling periphery, enabling model training across diverse urban forms; (2) pressing sustainability challenges including environmental justice concerns (industrial pollution in minority communities), climate adaptation needs (flooding, heat islands), and economic inequality (median household income ranging from $ 13,438 to $ 250,001 across tracts); (3) comprehensive open data availability enabling reproducible analysis and transferability to cities in other regions.
Workflow. The AI workflow includes compiling multi-source data for city of Chicago (Stage 1), then engineering domain-specific composite indices (Stage 2). Three separate LightGBM models predict environmental, social, and economic objectives (Stage 3). Multi-objective Bayesian optimization with Gaussian Process surrogates and qEHVI acquisition function efficiently discovers Pareto-optimal solutions (Stage 4). This integration of domain-aware features, specialized models, and sample-efficient optimization. This integration enables accurate characterization of sustainability trade-offs while maintaining computational tractability for real-world planning applications.
Data and Preprocessing. We integrated 26 variables (Supplementary Table 1) from U.S. Census American Community Survey22, USGS land cover data23, Chicago City Data24, H + T Index from Center for Neighborhood Technology25, net primary productivity (NPP) from Land Use Evolution and Impact Assessment Model (LEAM) Lab calculation26, and Chicago Chives data from Healthy Regions & Policies Lab27, and OpenStreet Map28 selected based on theoretical relevance, data availability, and alignment with established urban sustainability indicator frameworks29. The 26 variables operationalize the triple-bottom-line framework across environmental quality, social equity, and economic vitality domains, with detailed theoretical justification provided in Supplementary Text 1. Preprocessing included: (1) inversion of negative indicators; (2) min-max normalization to [0,1] scale30; (3) median imputation for missing values (1.8%); (4) verification that land use proportions sum to 100% (± 1%).
Feature Engineering. We constructed 10 additional composite indices (Supplementary Table 2) including environmental features (Development Diversity Index using Shannon entropy, Green Space Per Capita, Sustainable Mobility Index), social features (Housing Diversity Index, Housing Stability Score, Social Equity Index, Transit Accessibility Leverage), and economic features (Economic Vulnerability Index, Transit-Oriented Development Potential, Gentrification Pressure). Supplementary Text 2 provides detailed formulas. This approach reflects interconnected sustainability dimensions where economic, social, and environmental factors must be considered simultaneously21,29.
Predictive Models. Three separate LightGBM gradient boosting models
31 predicted environmental (9 features), social (8 features), and economic (7 features) objectives. The models use an additive ensemble:
where F(x) is the predicted objective value,
are individual decision trees,
are tree weights learned via gradient descent, and M is the number of boosting iterations. LightGBM was selected for its efficiency through Gradient-based One-Side Sampling and Exclusive Feature Bundling, which reduce computational cost while maintaining accuracy
31. Models were trained on 70% of data with 30% holdout testing using 5-fold cross-validation implemented in scikit-learn
32, with grid search and early stopping for hyperparameter optimization
33.
Multi-Objective Optimization Model. Figure
6 depicts the workflows of the multi-objective optimization process over environmental quality, social equity, economic vitality. We maximized three competing objectives across a four-dimensional decision space: population density, open space proportion, low-intensity proportion, and medium-intensity proportion [all 0–1 scaled], with high-intensity as residual. Natural areas were preserved from baseline. Three independent Gaussian Process models with Matérn 5/2 kernels as following
34 approximate LightGBM predictions while quantifying uncertainty.
where
is the mean function and
is the covariance kernel function served as surrogates for LightGBM predictions while quantifying uncertainty.
The q-Expected Hypervolume Improvement (qEHVI) acquisition function
35 guided candidate selection by measuring expected improvement to the Pareto front's dominated hypervolume
36, balancing exploration and exploitation while enabling parallel evaluation.
where
is the batch of candidate points,
is the current Pareto front,
denotes hypervolume, and
is the GP posterior predictive distribution. This approach extends traditional EHVI through cached box decompositions that reduce computational complexity from exponential to polynomial in batch size
37, contrasting with entropy-based methods like PESMO that require complex approximations
38. Acquisition optimization used exact gradients via automatic differentiation
35.
A
Experimental Protocol. Each trial began with 10 samples: three each from environmental-, social-, and economic-optimal regions (identified through single-objective optimization), plus one random sample. The algorithm ran 50 iterations with batch size q = 2, evaluating 110 configurations per trial using sample average approximation for deterministic acquisition optimization
37,39. Three independent trials were conducted. Performance metrics included R², MSE, and MAE on holdout sets. Pareto optimality was verified through dominance checking
38, ensuring no solution could improve one objective without degrading another—a fundamental property of multi-objective optimization
11. Front quality was assessed via hypervolume indicator
36. Analyses used Python 3.9 with scikit-learn
32, LightGBM
31, and BoTorch
39.
Ethical Considerations. This study develops a decision-support framework for urban planners, not an autonomous decision-making system. Key limitations and ethical concerns include: (1) automation bias if planners defer uncritically to algorithmic recommendations without ground-truthing against community knowledge; (2) models trained on historical data may inadvertently encode discriminatory patterns despite equity metrics—the zero optimal solutions in South and West Chicago reflects historical disinvestment, not inherent unsuitability; (3) aggregate optimization at census tract level may obscure inequities within neighborhoods and displacement risks; (4) census data systematically undercounts marginalized populations (homeless, undocumented, highly mobile residents); (5) the framework's computational efficiency may accelerate decision-making beyond community capacity for meaningful participation; (6) cumulative energy footprint if deployed across thousands of cities. Future operational deployment requires mandatory stakeholder co-governance where affected communities guide objective definition and evaluates solutions; algorithmic audits by community representatives; equity impact assessments; and validation against both quantitative census data and qualitative community-generated knowledge. The algorithm identifies mathematically optimal configurations; planners and communities must decide which trade-offs align with democratic values.
Data availability
All datasets used in this study are publicly available. U.S. Census data are available through the American Community Survey (https://www.census.gov/programs-surveys/acs). Land cover data are available from the U.S. Geological Survey NLCD (https://doi.org/10.5066/P94UXNTS). The H + T Index is available at https://htaindex.cnt.org/. Chicago-specific data are available through the Chicago Data Portal (https://data.cityofchicago.org/) and Chicago Chives at https://doi.org/10.5281/zenodo.13774077. OpenStreetMap data are available at https://www.openstreetmap.org/. Net Primary Productivity (NPP) calculations are available from the corresponding author upon reasonable request. Additional NPP data for other regions are available at NASA MODIS (https://modis.gsfc.nasa.gov/data/dataprod/mod17.php). Processed data during this study are available from the corresponding author upon reasonable request.
Code availability
The Python analysis code for multi-objective Bayesian optimization and predictive modeling is not publicly available as it is undergoing documentation and generalization for broader use but is available from the corresponding author upon reasonable request.
Electronic Supplementary Material
Below is the link to the electronic supplementary material
A
Data Availability
All datasets used in this study are publicly available. U.S. Census data are available through the American Community Survey (https://www.census.gov/programs-surveys/acs). Land cover data are available from the U.S. Geological Survey NLCD (https://doi.org/10.5066/P94UXNTS). The H+T Index is available at https://htaindex.cnt.org/. Chicago-specific data are available through the Chicago Data Portal (https://data.cityofchicago.org/) and Chicago Chives at https://doi.org/10.5281/zenodo.13774077. OpenStreetMap data are available at https://www.openstreetmap.org/. Net Primary Productivity (NPP) calculations are available from the corresponding author upon reasonable request. Additional NPP data for other regions are available at NASA MODIS (https://modis.gsfc.nasa.gov/data/dataprod/mod17.php). Processed data during this study are available from the corresponding author upon reasonable request.
References
1.United Nations. (2018). World urbanization prospects: The 2018 revision. Department of Economic and Social Affairs, Population Division.
2.Berke, P. R., & Conroy, M. M. (2000). Are we planning for sustainable development? Journal of the American Planning Association, 66(1), 21–33.
3.U.S. Census Bureau. (2020). American Community Survey 5-year estimates
4.City of Chicago. (2021). Chicago climate action plan. Department of Environment.
5.Berke, P. R., & Godschalk, D. R. (2009). Searching for the good plan. Journal of Planning Literature, 23(3), 227–240.
6.Talen, E. (2012). City rules: How regulations affect urban form. Island Press.
7.Seasons, M. (2003). Monitoring and evaluation in municipal planning. Journal of the American Planning Association, 69(4), 430–440.
8.Ackerman, F., & Heinzerling, L. (2004). Priceless: On knowing the price of everything and the value of nothing. The New Press.
9.Lee, D. B. (1973). Requiem for large-scale models. Journal of the American Institute of Planners, 39(3), 163–178.
10.Batty, M. (2013). The new science of cities. MIT Press.
11.Deb, K., Pratap, A., Agarwal, S., & Meyarivan, T. (2002). A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Transactions on Evolutionary Computation, 6(2), 182–197.
12.Stewart, T. J., Janssen, R., & van Herwijnen, M. (2004). A genetic algorithm approach to multiobjective land use planning. Computers & Operations Research, 31(14), 2293–2313.
13.Coello Coello, C. A., Lamont, G. B., & Van Veldhuizen, D. A. (2007). Evolutionary algorithms for solving multi-objective problems (2nd ed.). Springer.
14.Shahriari, B., Swersky, K., Wang, Z., Adams, R. P., & de Freitas, N. (2016). Taking the human out of the loop: A review of Bayesian optimization. Proceedings of the IEEE, 104(1), 148–175.
15.Emmerich, M. T. M., Deutz, A. H., & Klinkenberg, J. W. (2011). Hypervolume-based expected improvement. Proceedings of the 2011 IEEE Congress on Evolutionary Computation, 2147–2154.
16.Garnett, R. (2023). Bayesian optimization. Cambridge University Press.
17.Masoumi, Z., van Genderen, J., & Maleki, J. (2021). Fire risk assessment in dense urban areas. ISPRS International Journal of Geo-Information, 10(8), 553.
18.United Nations. (2015). Transforming our world: The 2030 agenda for sustainable development. https://sdgs.un.org/goals/goal11
19.Intergovernmental Panel on Climate Change (IPCC). Human settlements, infrastructure, and spatial planning in Climate Change 2014: Mitigation of Climate Change. Contribution of Working Group III to the Fifth Assessment Report of the Intergovernmental Panel on Climate Change 923–1000 (Cambridge University Press, 2015).
20.Intergovernmental Panel on Climate Change (IPCC). Climate Change 2022: Impacts, Adaptation and Vulnerability. Working Group II Contribution to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change (Cambridge University Press, 2023).
21.Yigitcanlar, T. et al. Understanding 'smart cities': Intertwining development drivers with desired outcomes in a multidimensional framework. Cities 81, 145–160 (2018).
22.U.S. Census Bureau. American Community Survey https://www.census.gov/programs-surveys/acs (2025).
23.U.S. Geological Survey. Annual NLCD Collection 1 Science Products https://doi.org/10.5066/P94UXNTS (2024).
24.City of Chicago. Chicago Data Portal https://data.cityofchicago.org/ (2025).
25.Center for Neighborhood Technology. Housing + Transportation Affordability Index. https://htaindex.cnt.org/ (2022).
26.Zeng, Y., Wang, J., Lai, S. & Deal, B. Understanding the carbon sequestration potential of urban landscapes: a state-wide assessment in Illinois. J. Digit. Landsc. Archit. 9, 193–201 (2024).
27.Lambert, S. et al. Chives: An Environmental Justice Geospatial Dashboard for Chicago (v3.0.0). Zenodo https://doi.org/10.5281/zenodo.13774077 (2024).
28.OpenStreetMap contributors. OpenStreetMap https://www.openstreetmap.org/ (2025).
29.Verma, P. & Raghubanshi, A. S. Urban sustainability indicators: Challenges and opportunities. Ecol. Indic. 93, 282–291 (2018).
30.Kontokosta, C. E. & Malik, A. The Resilience to Emergencies and Disasters Index: Applying big data to benchmark and validate neighborhood resilience capacity. Sustain. Cities Soc. 36, 272–285 (2018).
31.Ke, G. et al. LightGBM: A highly efficient gradient boosting decision tree. Adv. Neural Inf. Process. Syst. 30, 3146–3154 (2017).
32.Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
33.Bergstra, J., Yamins, D. & Cox, D. D. Making a science of model search: Hyperparameter optimization in hundreds of dimensions for vision architectures. Proc. 30th Int. Conf. Mach. Learn. 28, 115–123 (2013).
34.Rasmussen, C. E. & Williams, C. K. I. Gaussian processes for machine learning (MIT Press, 2006).
35.Daulton, S., Balandat, M. & Bakshy, E. Differentiable expected hypervolume improvement for parallel multi-objective Bayesian optimization. Adv. Neural Inf. Process. Syst. 33, 9851–9864 (2020).
36.Zitzler, E. & Thiele, L. Multiobjective evolutionary algorithms: A comparative case study and the strength Pareto approach. IEEE Trans. Evol. Comput. 3, 257–271 (1999).
37.Daulton, S., Balandat, M. & Bakshy, E. Parallel Bayesian optimization of multiple noisy objectives with expected hypervolume improvement. Adv. Neural Inf. Process. Syst. 34, 2187–2200 (2021).
38.Hernández-Lobato, D., Hernández-Lobato, J., Shah, A. & Adams, R. Predictive entropy search for multi-objective Bayesian optimization. Proc. 33rd Int. Conf. Mach. Learn. 48, 1492–1501 (2016).
39.Balandat, M. et al. BoTorch: A framework for efficient Monte-Carlo Bayesian optimization. Adv. Neural Inf. Process. Syst. 33, 21524–21538 (2020).
A
Acknowledgement
We gratefully acknowledge the resources and support provided by the Land Use Evolution and Impact Assessment Model (LEAM) Laboratory at the University of Illinois Urbana-Champaign. We thank the U.S. Census Bureau for providing American Community Survey data, the U.S. Geological Survey for land cover data, and the City of Chicago, Center for Neighborhood Technology, Healthy Regions & Policies Lab, and OpenStreetMap contributors for open data access.
A
Author Contribution
J.W. conceptualized the study, developed the AI framework, performed analysis and coding, and drafted the manuscript. Z.T. conceptualized the study and contributed to manuscript drafting. B.D. conceptualized the study, provided supervision, and contributed to manuscript revision.