Abstract
Concentrations estimates for ambient air pollution are used widely in fields such as environmental epidemiology, health impact assessment, urban planning, environmental equity, and sustainability. This study builds on previous efforts by developing an updated high-resolution geospatial database of population-weighted annual-average concentrations for six criteria air pollutants (PM2.5, PM10, CO, NO2, SO2, O3) across the contiguous U.S. during a five-year period (2016-2020). We developed Land Use Regression (LUR) models within a partial-least-square – universal kriging framework by incorporating several land use, geospatial, and satellite – based predictor variables. The LUR models were validated using conventional and clustered cross-validation, with the former consistently showing superior performance in capturing the variability of air quality. Most models demonstrated reliable performance (e.g., mean squared error – based R2 > 0.8, standardized root mean squared error < 0.1). We used the best modeling approach to develop estimates by Census Block, which were then population-weighted averaged at Census Block Group, Census Tract, and County geographies. Our database provides valuable insights into the dynamics of air pollution, with utility for environmental risk assessment, public health, policy, and urban planning.



![Author ORCID: We display the ORCID iD icon alongside authors names on our website to acknowledge that the ORCiD has been authenticated when entered by the user. To view the users ORCiD record click the icon. [opens in a new tab]](https://www.cambridge.org/engage/assets/public/coe/logo/orcid.png)