Hostname: page-component-89b8bd64d-shngb Total loading time: 0 Render date: 2026-05-07T03:43:56.742Z Has data issue: false hasContentIssue false

Spatial analysis of tails of air pollution PDFs in Europe

Published online by Cambridge University Press:  02 January 2025

Hankun He*
Affiliation:
Centre for Complex Systems, School of Mathematical Sciences, Queen Mary University of London, London, UK
Benjamin Schäfer
Affiliation:
Institute for Automation and Applied Informatics, Karlsruhe Institute of Technology, Karlsruhe, Germany
Christian Beck
Affiliation:
Centre for Complex Systems, School of Mathematical Sciences, Queen Mary University of London, London, UK
*
Corresponding author: Hankun He; Email: h.he@qmul.ac.uk

Abstract

Outdoor air pollution is estimated to cause a huge number of premature deaths worldwide. It catalyzes many diseases on a variety of time scales, and it has a detrimental effect on the environment. In light of these impacts, it is necessary to obtain a better understanding of the dynamics and statistics of measured air pollution concentrations, including temporal fluctuations of observed concentrations and spatial heterogeneities. Here, we present an extensive analysis for measured data from Europe. The observed probability density functions (PDFs) of air pollution concentrations depend very much on the spatial location and the pollutant substance. We analyze a large number of time series data from 3544 different European monitoring sites and show that the PDFs of nitric oxide ($ NO $), nitrogen dioxide ($ {NO}_2 $), and particulate matter ($ {PM}_{10} $ and $ {PM}_{2.5} $) concentrations generically exhibit heavy tails. These are asymptotically well approximated by $ q $-exponential distributions with a given entropic index $ q $ and width parameter $ \lambda $. We observe that the power-law parameter $ q $ and the width parameter $ \lambda $ vary widely for the different spatial locations. We present the results of our data analysis in the form of a map that shows which parameters $ q $ and $ \lambda $ are most relevant in a given region. A variety of interesting spatial patterns is observed that correlate to the properties of the geographical region. We also present results on typical time scales associated with the dynamical behavior.

Information

Type
Application Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. (a) The white dots show the positions of all European measuring stations considered. A red dot singles out an example of a station at Illmitz, Austria. For such a given example location, the time series data of the four measured concentrations are shown in panels b and d. The corresponding tails of the PDFs (as shown in c, e) are then automatically analysed with a fitting program He (2024) that extracts the parameters $ \left(q,\lambda \right) $ for the best-fitting $ q $-exponential. This is done for all 3544 sites.

Figure 1

Figure 2. Measured histogram of $ NO $ concentrations, recorded at the example site Barnsley Gawber, UK, together with the best log-normal (blue), Weibull (orange), Gamma (brown), and $ q $-exponential (purple) fits, together with their respective log-likelihood values. Also displayed are the optimum values of the $ q $ and $ \lambda $ parameters for the $ q $-exponential distribution. The $ q $-exponential fits the measured data best, as indicated by the highest log-likelihood value. Similar plots can be produced for all 3544 sites.

Figure 2

Figure 3. Best-fitting parameters of $ q $-exponentials at the various measuring stations. There is an increasing trend of $ q $ versus $ \log \lambda $ for $ NO $ (a) and $ {NO}_2 $ (b), whereas a disk-shaped pattern is observed for $ {PM}_{2.5} $ (c) and $ {PM}_{10} $ (d). The colors encode the area type where the measurements were done. There are predominant patches of a single color in the parameter space for $ NO $ and $ {NO}_2 $, where for $ {PM}_{2.5} $ and $ {PM}_{10} $ the color pattern looks more mixed. Deep green points correspond to cleaner rural areas.

Figure 3

Figure 4. Spatial distribution of best-fitting parameters $ \left(q,\lambda \right) $ characterizing the measured PDFs of $ {NO}_x $ and $ {PM}_x $ pollutants across Europe. The color codes are directly indicated in the individual figures. A large value of $ q $ indicates heavy tails in the distribution. A small value of $ \lambda $ indicates heavy average pollution. The pollutant characteristics across Europe is quite inhomogeneous.

Figure 4

Figure 5. The long time scales $ T $ describing the scale of typical changes of the temporal mean and variance of the measured time series for $ NO $ (a), $ {NO}_2 $ (b), $ {PM}_{2.5} $ (c), and $ {PM}_{10} $ (d). The color coding is explained at the bottom of each figure.

Author comment: Spatial analysis of tails of air pollution PDFs in Europe — R0/PR1

Comments

Submission: Spatial analysis of tails of air pollution PDFs in Europe

Dear Editors,

We hereby submit our manuscript titled “Spatial analysis of tails of air pollution PDFs in Europe” for publication in

Environmental Data Science.

Outdoor air pollution is estimated to cause a huge number of premature deaths worldwide, it catalyses many diseases on a variety of time scales, and it has a detrimental effect on the environment. In light of these impacts it is necessary to obtain a better understanding of the dynamics and statistics of measured air pollution concentrations, including temporal fluctuations of observed concentrations and spatial heterogeneities.

We analyse a large number of time series data from 3544 different European monitoring sites and show that the PDFs of nitric oxide (NO), nitrogen dioxide (NO2) and particulate matter (PM10 and PM2.5) concentrations generically exhibit heavy tails. These are asymptotically well approximated by q-exponential distributions with a given entropic index q and width parameter λ.

We present the results of our data analysis in the form of a map that shows which parameters q andλ are most relevant in a given region. A variety of interesting spatial patterns is observed that correlate to properties of the geographical region. We also present results on typical time scales associated with the dynamical behaviour.

Our submitted manuscript offers a substantial research contribution at the interface of statistical physics, environmental sciences, geography and data analytics. It addresses the complex dynamics of air pollution in Europe, analysing in particular the tails of the observed PDFs which describe high-pollution events. Our work enables a better understanding of the time-varying dynamics of air pollution, which is essential for policy formulation and the construction of suitable stochastic models, as well as for analysing the medical consequences of exposure to polluted air.

Kind regards,

Hankun He

(for all authors)

Review: Spatial analysis of tails of air pollution PDFs in Europe — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

>Summary: In this section please explain in your own words what problem the paper addresses and what it contributes to solving it.

The paper presented a spatial analysis of the air pollution data in Europe using q-exponentials to fit the PDFs which often decay as a power law. The analysis is aimed to better understand the probability densities of measured air pollution time series which may inform local decision-making for improving local air quality.

>Relevance and Impact: Is this paper a significant contribution to interdisciplinary climate informatics?

The paper demonstrated the value of statistical tools to extract information from rich observational time series that is meaningful to understand the temporal and spatial patterns of air pollution data which may inform local decision-making to improve air quality.

>Detailed Comments

The paper is overall well-organized and clearly presented with supporting evidence in results. Below are a few comments for the authors to improve the clarity for the methods and explain the results for readability and reproducibility.

1. What is the filtering criteria applied to the original collection of 9000+ stations to the final 3500+ stations before the analysis? There is only a brief mention of using quality for data filtering but no information on how the filtering is carried out. It will be important to include the information in the method so it can be reproduced by others.

2. Although the paper use an existing method to fit the PDFs based on q-exponentials, it will be good to briefly summarize the goodness-of-fit for the 3500+ stations after the fitting before conducting the analysis of the parameters extracted from the fitting. Without the information of the goodness-of-fit, it will be hard for readers to assess the quality of fitting.

3. The color scheme for Figure 2 is not very accessible for readers who might be vision impaired (e.g., color blindness). I would suggest authors to redo the figure to use color-blindness friendly color scheme. Additionally, the ledgen for urban background is not visible (assuming it is a white dot on a white background).

4. Regarding spatial pattern, although it is important to preserve the information of each station, it is hard to make sense of more than 3500+ data points that have heavy overlap on the map. It might be better to apply a spatial kernal density function for aggregate the information and presentedas a map from these discrete data point to better extract the spatial pattern. Additionally, it appears in Figure 3, there is different spatial coverage for different species of air pollutant. Is this because of the availability of the observations? If so, this should be noted in the paper as well.

5. Regarding the time scales, it appears that both NO2 and PM2.5 have a very large range (a week to a year). Does this appear to be an appropriate pattern based on their lifecycle? Would this be caused by local meteorological and anthopogenic factors? If so, are there any existing research to explain the wide range of temporal scales?

6. Similarly, are their any meteorological or anthopogenic factors that affects the spatial patterns of both q & lambda in Figure 3? This will be more useful information for the local policy makers to inform local decisions as presented as one of the potential for this study.

Recommendation: Spatial analysis of tails of air pollution PDFs in Europe — R0/PR3

Comments

This article was accepted into Climate Informatics 2024 Conference after the authors addressed the comments in the reviews provided. It has been accepted for publication in Environmental Data Science on the strength of the Climate Informatics Review Process.

Decision: Spatial analysis of tails of air pollution PDFs in Europe — R0/PR4

Comments

No accompanying comment.