Hostname: page-component-77f85d65b8-t6st2 Total loading time: 0 Render date: 2026-04-19T11:05:53.647Z Has data issue: false hasContentIssue false

Actively inferring methane sources with drones

Published online by Cambridge University Press:  28 January 2026

Alouette van Hove*
Affiliation:
Department of Geosciences, University of Oslo, Oslo, Norway
Kristoffer Aalstad
Affiliation:
Department of Geosciences, University of Oslo, Oslo, Norway
Norbert Pirk
Affiliation:
Department of Geosciences, University of Oslo, Oslo, Norway
*
Corresponding author: Alouette van Hove; Email: a.van.hove@geo.uio.no

Abstract

This study presents a framework that combines Bayesian inference with reinforcement learning to guide drone-based sampling for methane source estimation. Synthetic gas concentration and wind observations are generated using a calibrated model derived from real-world drone measurements, providing a more representative testbed that captures atmospheric boundary layer variability. We compare three path planning strategies—preplanned, myopic (short-sighted), and non-myopic (long-term)—and find that non-myopic policies trained via deep reinforcement learning consistently yield more precise and accurate estimates of both source location and emission rate. We further investigate centralized multi-agent collaboration and observe comparable performance to independent agents in the tested single-source scenario. Our results suggest that effective source term estimation depends on correctly identifying the plume and obtaining low-noise concentration measurements within it. Precise localization further requires sampling in close proximity to the source, including slightly upwind. In more complex environments with multiple emission sources, multi-agent systems may offer advantages by enabling individual drones to specialize in tracking distinct plumes. These findings support the development of intelligent, data-driven sampling strategies for drone-based environmental monitoring, with potential applications in climate monitoring, emission inventories, and regulatory compliance.

Information

Type
Methods Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press
Figure 0

Figure 1. Schematic overview of the developed framework, consisting of two main components: source term estimation (top box, Section 2.1) and drone path planning strategies (bottom box, Section 2.2). Source estimation uses Bayesian inverse modeling in the Box–Cox transformed observation space, where “B-C” denotes the Box–Cox transformation. Bayesian updating is sequential in the synthetic experiments, with each posterior becoming the prior for the next step. Path planning is guided by the agent (i.e., drone) policy: preplanned (deterministic), myopic (greedy one-step lookahead), or non-myopic (greedy via a trained state-value function). Dashed lines indicate components used only during reinforcement learning training, where a neural network approximates state-action values using epsilon-greedy exploration.

Figure 1

Table 1. Uniform prior bounds, training/test scenario bounds, and true values used in the illustrative run for the five uncertain parameters. These parameters define the search space for Bayesian inference and reinforcement learning-based path planning strategies

Figure 2

Figure 2. (a) The drone used in the field experiment, equipped with a methane sensor mounted underneath, and a 3D sonic anemometer on top. The pole with reflectors marks the location of the leaking borehole. Methane was released by drilling five shallow holes through the ice to access the underlying water. Credit: Gina Schulz. (b) The drone is collecting atmospheric observations near the borehole.

Figure 3

Figure 3. Methane (CH$ {}_4 $) concentrations observed during the field experiment. The drone followed a preplanned lawnmower flight path at ~2, ~4, and ~6 m above ground level. Each sub-figure corresponds to a single flight leg at constant altitude. Wind vectors are shown as quivers, with lengths scaled to $ 20\% $ of the axis length for visual clarity. All data are resampled to $ 0.1\;\mathrm{Hz} $.

Figure 4

Figure 4. (a) Logarithm of Bayesian model evidence for different settings of the Box–Cox transformation parameter $ \lambda $ and the corresponding transformed error standard deviation parameter $ {\sigma}_C^{\left(\lambda \right)} $ for methane concentration data. Higher model evidence values indicate transformation settings that yield an observation model in the nature run that more closely matches the observed data from the field campaign. (b) Example turbulent synthetic methane concentration observations $ c\left(\boldsymbol{x},t\right) $ obtained via the inverse Box–Cox transformation of $ C{\left(\boldsymbol{x}\right)}^{\left(\lambda \right)}+{\unicode{x025B}}_C^{\left(\lambda \right)} $, with $ \lambda =0 $ and $ {\unicode{x025B}}_C^{\left(\lambda \right)}\sim \mathcal{N}\left(0,{\sigma}_C^{\left(\lambda \right)}\right) $, where $ {\sigma}_C^{\left(\lambda \right)}=0.15 $. The predicted mean concentrations $ C\left(\boldsymbol{x}\right) $ are generated by the forward model, Eq. (2.3), assuming a background concentration of $ {C}_0=2.09\;\mathrm{ppm} $.

Figure 5

Figure 5. Illustrative flight paths for the lawnmower flight plan (top), myopic flight plan (middle), and non-myopic flight plan (bottom). The true parameters are $ \boldsymbol{\theta} =\left[Q=3.9\;\mathrm{kg}\hskip0.1em {\mathrm{h}}^{-1},{x}_{\mathrm{s}}=150\;\mathrm{m},{y}_{\mathrm{s}}=66\;\mathrm{m},V=2.7\hskip0.22em \mathrm{m}\hskip0.1em {\mathrm{s}}^{-1},\Phi =200{}^{\circ}\right] $. The expected mean concentration field, following Eq. (2.3), is shown as a background heatmap, with the true source location marked by a cross. The flight paths are overlaid, along with the obtained noisy observations depicted by colored nodes: the first two observations are shown as a triangle, the final as a square, and intermediate observations as circles. The rewards and cumulative rewards for each flight path are plotted. The cumulative rewards of the other flight paths are shown by a thin line for comparison. Histograms display the initial uniform prior in light shading, the final posterior in dark shading, and the true value depicted by a dashed line.

Figure 6

Table 2. Performance metrics for the lawnmower, myopic, and non-myopic flight strategies across 3D and 2D domains, and episode lengths of 108, 54, and 27 steps. Metrics include: (i) cumulative information gain or entropy reduction ($ \varDelta \mathcal{H}=\mathcal{H}({s}_0)-\mathcal{H}({s}_{\mathrm{final}}) $), (ii) Continuous Ranked Probability Score ($ \mathrm{CRPS} $) for the emission rate ($ {\mathrm{CRPS}}_Q $), (iii) $ \mathrm{CRPS} $ for the Euclidean distance from the true source location ($ {\mathrm{CRPS}}_{x_s} $), and (iv) $ \mathrm{CRPS} $ ratio of the posterior compared to the initial prior, averaged over both $ Q $ and $ {\boldsymbol{x}}_s $ ($ {\mathrm{CRPS}}_{\mathrm{norm}} $). Results are aggregated over 25,000 synthetic test scenarios and reported as median values, followed by the 5th and 95th percentiles in between square brackets. Lower $ \mathrm{CRPS} $ values indicate better performance

Figure 7

Figure 6. Annotated matrix illustrating pairwise outperformance fractions between flight strategies and durations. Each cell represents the proportion of simulation runs in which the strategy and duration indicated by the row achieve a lower $ {\mathrm{CRPS}}_{\mathrm{norm}} $ than the corresponding configuration in the column. Values are based on 25,000 synthetic testbed runs for each configuration and reflect comparative posterior accuracy and precision across configurations.

Figure 8

Figure 7. Illustrative flight paths of two drones without collaboration (top), and with collaboration (bottom) using a non-myopic policy. These examples are not representative of typical behavior in either case; they were chosen to highlight the impact of observation noise. The true parameters are $ \boldsymbol{\theta} =\left[Q=3.9\;\mathrm{kg}\hskip0.1em {\mathrm{h}}^{-1},{x}_{\mathrm{s}}=150\;\mathrm{m},{y}_{\mathrm{s}}=66\;\mathrm{m},V=2.7\;\mathrm{m}\hskip0.22em {\mathrm{s}}^{-1},\Phi =200{}^{\circ}\right] $. The expected mean concentration field, following Eq. (2.3), is shown as a background heatmap, including the true source location marked by a cross. The flight paths of the two agents, shown in black and blue, are overlaid, along with the obtained noisy observations depicted by colored nodes: the first two observations are shown as a triangle, the final as a square, and intermediate observations as circles. The cumulative rewards of each flight path are plotted. The cumulative rewards of the other flight paths are shown by a thin line for comparison. Histograms display the uniform prior in light shading, the final posterior in dark shading, and the true value depicted by a dashed line.

Figure 9

Figure F1. (a) Example turbulent synthetic wind speed observations $ v\left(\boldsymbol{x},t\right) $ obtained via the inverse Box–Cox transformation of $ {V}^{\left(\lambda \right)}+{\unicode{x025B}}_V^{\left(\lambda \right)} $, with $ \lambda =0.38 $ and $ {\unicode{x025B}}_V^{\left(\lambda \right)}\sim \mathcal{N}\left(0,{\sigma}_V^{\left(\lambda \right)}\right) $, where $ {\sigma}_V^{\left(\lambda \right)}=0.83 $. All observations are positive because the Box–Cox transformation is defined exclusively for strictly positive input values, thereby ensuring that transformed wind speed measurements remain >0. (b) Example turbulent synthetic wind direction observations $ \phi \left(\boldsymbol{x},t\right) $ obtained via the inverse Box–Cox transformation of $ {\Phi}^{\left(\lambda \right)}+{\unicode{x025B}}_{\Phi}^{\left(\lambda \right)} $, with $ \lambda =1 $ and $ {\unicode{x025B}}_{\Phi}^{\left(\lambda \right)}\sim \mathcal{N}\left(0,{\sigma}_{\Phi}^{\left(\lambda \right)}\right) $, where $ {\sigma}_{\Phi}^{\left(\lambda \right)}=98.09 $. Observations exceeding 360° (< $ 4\% $ of plotted cases) are folded over the $ 0-360{}^{\circ} $ range by a modulus operation.

Author comment: Actively inferring methane sources with drones — R0/PR1

Comments

Dear editors,

We are pleased to submit our manuscript ‘Actively inferring methane sources with drones’ for consideration in the special edition of Environmental Data Science on ‘Connecting Data-Driven and Physical Approaches: Application to Climate Modeling and Earth System Observation’.

In this study, we present a framework that integrates Bayesian inference with deep reinforcement learning (RL) to guide drone-based sampling for methane source estimation in unfamiliar environments. Using a calibrated observation model derived from real-world drone measurements, we demonstrate that non-myopic (far-sighted) navigation strategies trained via RL consistently outperform both myopic (short-sighted) and preplanned lawnmower strategies in terms of precision and accuracy of estimated source location and emission rate.

Our findings advance intelligent, data-driven sampling strategies for environmental monitoring, with potential applications in climate science, emissions reporting, and regulatory compliance.

We appreciate your consideration and look forward to your response.

Kind regards,

Alouette van Hove

Kristoffer Aalstad

Norbert Pirk

Review: Actively inferring methane sources with drones — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

Comments to the Author

Review of “Actively Inferring methane sources with drones”

This paper presents a framework that uses several techniques to improve the drone-based methane source term estimates such as source location and emission rate. I think the manuscript is very well written, and presents valuable information and novel methodologies that can improve the current scientific field. The manuscript is suitable for publication in Environmental Data Science with minor revisions. I have listed some comments below that can help the authors to improve their manuscript.

Comments:

• Page 1 Line 20-21: Could you please break this sentence down into two since it is really hard to understand it in its current form?

• I found the introduction part a bit slim. Drone-based emission quantification methodologies are not limited by Gaussian plume inversions; there are other methods such as mass balance and vertical profile-based quantifications. Although the focus of this study is not on quantification methodologies, I think at least authors need to acknowledge these methods in the introduction section because the aim is to have an improved drone-based emission quantification methodology.

• Page 5 Table 1: Could you please elaborate how the prior bounds are selected?

• Page 5 Line 50: I would strongly advise the authors to use the term drone instead of agent, as it is more intuitive and easier to understand.

• Page 8, Nature run: I did not quite understand the term nature run; is this something related to the experiment or to the simulation? If so, how is the simulation conducted? Does it use the Gaussian plume inversion equation that was defined in 2.3?

• Page 9 Synthetic experiments: Why is the resolution of 30 m selected for discretization? Why not smaller or larger resolution?

• I think the methodology section will benefit from a schematic workflow that shows the steps of each method.

• I think section 3.1 is more suited for the methods rather than results section, as authors describe how they used box-cox transformation to normalize CH4 concentration, wind speed, and wind direction.

• Figure 4: in flight paths, the authors use different symbols like triangle, circles, or square, why is that? It would help if authors can designate the starting point of the flight, as well.

• Page 18 Line 35: I am a bit puzzled how measurements from upwind and outside of the plume center would help localizing the source. Is it due to background concentration estimation? Could you please elaborate this more?

• As a future work, maybe the authors can add some discussion about the investigation of the curtain flights and mass balance approaches with similar framework.

Review: Actively inferring methane sources with drones — R0/PR3

Conflict of interest statement

I am a coauthor on several papers cited in my review. These works are cited solely because they are directly relevant to the technical issues discussed.

Comments

The authors have studied Bayesian source estimation using drones. They describe a procedure to estimate model parameters and apply it to field measurements of a turbulent concentration field. They then integrate the field measurements into in silico experiments which are used to train a strategy for collecting observations which minimizes the posterior entropy of the model parameters. Overall, the paper is interesting, well-written, and novel, and it appears to be scientifically rigorous. While the study is generally convincing, I have a few technical and interpretive concerns that should be addressed prior to acceptance.

1. The authors model the concentration by a spatially-dependent mean and a noise term, whose parameters are spatially-independent. I would expect on the contrary that the noise will depend significantly on the position relative to the source. Can the authors comment? Even if their data are insufficient to calibrate spatially varying noise, they could at least discuss its expected qualitative form (e.g., elevated variance near the plume core).

2. The authors have neglected to consider perturbations to the flow by the drone, which may indeed be significant. If these effects are implicitly absorbed into the error terms, this should be stated explicitly; otherwise, an estimate of their magnitude (e.g., from prior studies) would help justify their neglect.

3. The details of the synthetic experiment are not clear as written. Can the authors precisely elaborate on how the synthetic observations were drawn?

4. Recent studies (e.g., Heinonen et al., Phys. Rev. Fluids 10, 064614, 2025; Piro et al., J. Turbulence 26(5), 153–173, 2025; Piro et al., arXiv:2504.11291, 2025) have emphasized the importance of properly treating small-scale correlations in the concentration field. These effects, which will also be present in the velocity field, can materially alter Bayesian source estimation. The authors should clarify whether such correlations are artificially suppressed in their synthetic setup and discuss possible implications for field data.

5. The authors find that the best Box-Cox transformation for the error on the wind direction is an identity, implying a standard normal. The assumption of a normal error model seems difficult to justify given the circular domain and large reported standard deviation (>90°). A von Mises or wrapped normal distribution would appear more appropriate; could the authors comment on this modeling choice?

6. The authors note that “estimating the emission rate is generally more challenging than localizing the source.” This behavior is unsurprising given the structure of the Vergassola model, in which mean concentration depends exponentially on position and linearly on emission rate (see, e.g., Piro et al., J. Turbulence 26(5), 153–173, 2025). A brief acknowledgment of this relationship would strengthen the discussion.

7. When discussing neural network input representations, the authors might also consider citing Heinonen et al. (2023), which independently developed the same methodology for the belief state representation.

Recommendation: Actively inferring methane sources with drones — R0/PR4

Comments

The github link provided for the Data Availability Statement does nor work. Please check it out as it is a requirement for publication.

Thank you very much,

Paula L.

Decision: Actively inferring methane sources with drones — R0/PR5

Comments

No accompanying comment.

Author comment: Actively inferring methane sources with drones — R1/PR6

Comments

Dear Dr. Claire Monteleoni,

We sincerely appreciate the constructive feedback and valuable suggestions provided by the reviewers, which we believe have significantly improved the quality of our manuscript.

In the uploaded document responses_to_reviewers, we address each comment systematically, providing detailed responses and outlining the corresponding adjustments made to the manuscript. The file manuscript_diff highlights all changes implemented throughout the revision process.

During the revision, we identified and corrected an error in a unit conversion within the observation model. This correction affects only the scaling of the final emission rate estimates, and the manuscript has been updated accordingly.

Additionally, the GitHub repository is now publicly accessible.

Thank you for your time and consideration. Please let us know if any further clarifications are needed.

Kind regards,

Alouette, Kristoffer and Norbert

Review: Actively inferring methane sources with drones — R1/PR7

Conflict of interest statement

Reviewer declares none.

Comments

The authors have responded thoroughly to all my comments, and the article is ready for publication.

Review: Actively inferring methane sources with drones — R1/PR8

Conflict of interest statement

Reviewer declares none.

Comments

I have no further comments to the Authors.

Recommendation: Actively inferring methane sources with drones — R1/PR9

Comments

No accompanying comment.

Decision: Actively inferring methane sources with drones — R1/PR10

Comments

No accompanying comment.