Hostname: page-component-6766d58669-zlvph Total loading time: 0 Render date: 2026-05-24T11:22:31.867Z Has data issue: false hasContentIssue false

Discrete variational autoencoders for synthetic nighttime visible satellite imagery

Published online by Cambridge University Press:  22 August 2025

Mickell D. Als*
Affiliation:
Department of Computer Science, University of Toronto, Toronto, ON, Canada
David Tomarov
Affiliation:
Department of Computer Science, University of Toronto, Toronto, ON, Canada School of Electrical Engineering, Tel-Aviv University, Tel-Aviv, Israel
Steve Easterbrook
Affiliation:
Department of Computer Science, University of Toronto, Toronto, ON, Canada
*
Corresponding author: Mickell D. Als; Email: mickellals@cs.toronto.edu

Abstract

Visible satellite imagery (VIS) is essential for monitoring weather patterns and tracking ground surface changes associated with climate change. However, its availability is limited during nighttime. To address this limitation, we present a discrete variational autoencoder (VQVAE) method for translating infrared satellite imagery to VIS. This method departs from previous efforts that utilize a U-Net architecture. By removing the connections between corresponding layers of the encoder and decoder, the model learns a discrete and rich codebook of latent priors for the translation task. We train and test our model on mesoscale data from the Geostationary Operational Environmental Satellite (GOES) West Advanced Baseline Imager (ABI) sensor, spanning 4 years (2019 to 2022) using the Conditional Generative Adversarial Nets (CGAN) framework. This work demonstrates the practical use of a VQVAE for meteorological satellite image translation. Our approach provides a modular framework for data compression and reconstruction, with a latent representation space specifically designed for handling meteorological satellite imagery.

Information

Type
Application Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Figure 1. Diagram of the VQGAN model for image-to-image translation in the training pipeline.

Figure 1

Table 1. Final configurations of evaluated models

Figure 2

Table 2. Performance metrics comparison

Figure 3

Figure A1. Comparison of model outputs for land and ocean cover. Row 1 shows land cover, while Row 2 shows ocean cover. Columns, from left to right, represent: input, ground truth (GT), baseline model, embedding dimension $ {Z}^D=4 $, and embedding dimension $ {Z}^D=6 $.

Figure 4

Figure A2. Comparison of model outputs for nighttime. From left to right: input, ground truth (GT), baseline model, embedding dimension $ 4 $, and embedding dimension $ 6 $.

Figure 5

Table A1. Spectral and spatial characteristics of GOES ABI sensor bands. Reflective bands operate during daytime; radiance bands are thermal and operate continuously

Figure 6

Table A2. Evaluation of reconstruction quality across different codebook sizes. Highlighted values (yellow) indicate best performance for each metric

Figure 7

Figure A3. Comparison of the effect of reduced discriminator influence on image reconstruction quality. (a) Default model. (b) Model with 2 residual blocks before each downsample and upsample block. (c) Model with 4 residual blocks before each downsample and upsample block. (d) Ground truth image.

Figure 8

Figure A4. Nighttime visible images over land. (a) 3 Band nighttime IR image. (b) Generated nighttime visible image.

Figure 9

Figure A5. Comparison of the impact of embedding dimension on models trained with ABI bands 10, 11, and 14. (a) Baseline model. (b) Model with 4-dimensional embedding. (c) Model with 6-dimensional embedding. (d) Ground truth VIS.

Figure 10

Figure A6. Visual comparison of model outputs when evaluated on inputs it was not trained on—specifically, the Band 11, 13, and 14 combination. (a) Input LWIR imagery. (b) Ground truth VIS image. (c) Baseline model output. (d) Output from the trained model using $ {Z}^D= 4 $.

Author comment: Discrete variational autoencoders for synthetic nighttime visible satellite imagery — R0/PR1

Comments

To the Editors of Environmental Data Science,

Thank you for the opportunity to publish our work titled “Discrete Variational Autoencoders for Synthetic Nighttime Visible Satellite Imagery.” This study addresses the challenge of generating visible meteorological satellite imagery from longwave infrared inputs. While previous efforts have relied on models such as UNet, multilayer perceptrons, or proposed advanced generative approaches like latent diffusion models, our work introduces an alternative: the Vector-Quantized Variational Autoencoder (VQ-VAE).

The VQ-VAE architecture offers a distinct advantage by decoupling the encoder, latent space, and decoder, allowing each component to be used independently once training is complete. This work demonstrates the viability of alternative architectures for this task while also emphasizing that many limitations observed in previous models persist. We suggest that these challenges arise not solely from model architecture, but from deeper representational issues inherent in the task. As such, progress will depend on rethinking how meteorological satellite imagery is represented and processed. Finally, we conclude by proposing the integration of spatially explicit artificial intelligence techniques to better capture the spatial heterogeneity characteristics of remote sensing data.

The authors gratefully acknowledge the Data Analytics for Canadian Climate Services (DACCS) for providing access to the Marble platform, which was instrumental for data storage and exploration throughout this project. We also extend our thanks to the School of Electrical Engineering and the Lowy International School at Tel Aviv University for facilitating David Tomarov’s participation in the student exchange program at the University of Toronto, where this research was conducted. Additional thanks are owed to the Natural Sciences and Engineering Research Council (NSERC) of Canada for supporting this work through grant RGPIN-2019-07042. The authors declare no conflicts of interest.

It is the hope of the authors this work will spur further interest in GeoAI and its associated techniques.

Kind Regards,

_____________

Mickell Als

Department of Computer Science

University of Toronto

Review: Discrete variational autoencoders for synthetic nighttime visible satellite imagery — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

1. Summary: In this section please explain in your own words what problem the paper addresses and what it contributes to solving it.

The article addresses the problem of monitoring the Earth’s surface at night, since satellites can capture wavelengths that are only emitted during the day. The article therefore proposes a model that can capture the electromagnetic waves emitted at night, which could help with more continuous monitoring of the Earth’s surface.

2. Please select a score of relevance to climate informatics which promotes the interdisciplinary research between climate science, data science, and computer science.

Highly relevant

3. Relevance and Impact: Is this paper a significant contribution to interdisciplinary climate informatics?

I present my opinion on the work entitled “Discrete Variational Autoencoders for Synthetic Nighttime

Visible Satellite Imagery” claiming that the article meets all the requirements for presentation at Climate Informatics 2025 without the need for corrections because it presents the problem clearly and objectively, with a well-designed methodology and clear results without any doubts. In addition, the article mentions the creation of a computer model to work with satellite data systems in an automated way in order to study the climatic influences on land surface cover.

4. Overall recommendation of the submission.

Accept: Good paper to be accepted as it.

5. Detailed Comments

The article presents a highly relevant problem in terms of monitoring the Earth’s surface at night and in areas covered by clouds, proposing a model that can enable observations in these conditions. Although the model presented several challenges and limitations in addition to its capacity to support larger data, the model presented demonstrates a major step forward for this field of observations in which most satellites are limited to capturing radiance during the day.

I would like to congratulate the authors on their excellent work and their success in improving the system.

Review: Discrete variational autoencoders for synthetic nighttime visible satellite imagery — R0/PR3

Conflict of interest statement

Reviewer declares none.

Comments

1. Summary: In this section please explain in your own words what problem the paper addresses and what it contributes to solving it.

The paper describes an experiment that applies machine learning techniques to predict night-time visible bands from infrared bands. This modelling process is achieved through a Conditional Generative Adversarial Network (CGAN), which replaces the generator component with a Discrete Variational Autoencoder (VQVAE).

The authors claim the main contributions of their work:

1) The application of Autoencoder models to meteorological satellite data, with a focus on night-time image generation, in the visible spectrum;

2) A contribution of a pipeline suited for GOES West ABI sensor data, including synthetic green band prediction and scaling;

3) Foundational insights to advance generative modelling, in super-resolution and latent diffusion tasks for atmospheric remote sensing.

The paper follows with “related research”, “background”, “methodology”, “results and discussion”, and a “conclusion and limitations” section.

The appendix contains examples of images produced by the aforementioned generative process and consolidated metrics of reconstruction errors.

2. Please select a score of relevance to climate informatics which promotes the interdisciplinary research between climate science, data science, and computer science.

Somewhat relevant

3. Relevance and Impact: Is this paper a significant contribution to interdisciplinary climate informatics?

Addressing the author’s contributions stated in the paper (which are mainly computational):

1) Autoencoders and GANs began to be used in Atmospheric Sciences Modelling as of 2018 (e.g. 2018 AGU Fall Meeting, “Earth and Space Science Informatics” section abstracts) and the paper does not advance much the state-of-the-art, as other works (even cited in the manuscript) already use GANs for domain translation in remote sensing. The authors did not provide a comparison with the methods they mention (neither a discussion of suitable baselines for comparison), which somehow compromises the assessment of how the results (of methods they combined) compare to other published work, both quantitatively as qualitatively (including in aspects of remote sensing of the atmosphere and its physics);

2) The pipeline for GOES West ABI is an important contribution, as pipelines for such dataset are not easily found. However, when inspected the data generation component (https://github.com/Convolution/nighttime_vqgan/blob/main/create_datasets.py ; ref. ThreddsData class), it points out to an endpoint at the University of Toronto (UofT) that seems to be a THREDDS server, through a proxy (likely a UofT MARBLE service), not the NOAA THREDDS service, which makes reproducibility conditioned by access to that service.

3) In terms of the proposed insights about GANs (for super resolution and latent diffusion tasks), the authors did some remarks in terms of lessons learned during the parametrization of their training (e.g. codebook size effects ), but the discussion was quite brief and limited to perceptual analysis and aggregated error metrics.

4. Overall recommendation of the submission.

Major Revision: Clearly below the acceptance threshold and require notable changes.

5. Detailed Comments

The manuscript needs some changes, mostly in terms of clarifying how their chosen experimental design, based on a combination of established methods, contributes to their research objectives (mainly the advancement of domain adaptation among bands in remote sensing data). I hope that the authors find these useful, as I believe this will highlight their contributions and enhance the readership experience.

In that regard, the authors point out a criticism of current techniques being limited, given a hypothesis that they introduce spatial biases (line 13), and suggesting that their method advances this aspect. However, no spatial analysis other than visual inspection of images is provided. I suppose the authors want to pursuit such contribution. In that case, it is recommended that they formalize it by using metrics designed for such purpose (e.g. Spatial Autocorrelation metrics such as Moran’s I, Geary’s c), and provide statistics for the entirety of images generated. In addition to the manuscript mentioned metrics, I would also recommend other metrics more sensitive to distribution shift (e.g. K-L divergence), and apply these along the tensor dimensions (lat, long, and for each band). Comparing their model, to other prestablished models (the ones mentioned in their related research) using the same data, and general experimental design, is highly recommended, so the reader can better position their findings.

In the same line, it is recommended to use a protocol to assess reconstruction biases physically. The SCuBA protocol (Chapter 5 of Kurihana, 2024—https://doi.org/10.6082/uchicago.11340) addresses such situations with the aid of climate model runs for classification tasks, and it can be adapted for continuous cases.

To address the issues in reproducibility, I strongly encourage the authors to follow FAIR principles (https://www.go-fair.org/fair-principles/) for their treatment of their data sources, as they are available in their original data repository (NOAA - https://www.ncei.noaa.gov/access/metadata/landing-page/bin/iso?id=gov.noaa.ncdc:C01502 ) in a format that allows the usage of their already programmed THREDDS calls. The GitHub codebase needs a better description, not only in code comments, but in terms of the clarity of the steps necessary to replicate results, ideally not bound by elements of the MARBLE service. Parameterizations of the experiment should be of clear inspection (one possibility is to use a YAML/JSON message only for such purpose). The codebase also needs to be synced or replicated to a versioned permanent archival system (e.g. Zenodo), as well as the data. This last one, as NSERC funded the research, it is highly recommended that these data outputs be stored at the Borealis repository ( https://onesearch.library.utoronto.ca/researchdata/about-u-t-dataverse ). You can find a more structured protocol for reproducible research at The Turing Way (https://book.the-turing-way.org) and geosciences at the Environmental Data Science Book (https://edsbook.org/welcome.html).

Other comments are about the manuscript organization and formatting. I suggest the authors consolidate some sections in which the information is close (e.g., the “Background” section has content that is quite close to the “Related Research” and “Methodology” sections—some of the items in the “Results” section are also part of “Methodology” in my view, as they are parameterizations used in the learning process).

The “Data Processing” section does not include any information about the spatiotemporal time range of the ABI-L2-MCMIPM product retrieved, and such information is also not found in the reproduction instructions within the provided GitHub repository. Along with the ML modeling parametrization, these are crucial elements to reproduce the experiment and better understand the limits of the author’s statements and conclusions.

Regarding referencing styling, I would strongly encourage the authors to do a careful review. It seems they have used LaTeX to produce their manuscript, which has generated sentences that begin with a citation (e.g., line 40) with bibtex calls that are prone to create errors once the manuscript is rendered. As Cambridge University Press uses the American Mathematical Society formatting, I recommend reading their user guide for the amsrefs package (https://www.ams.org/arc/tex/amsrefs/amsrdoc.pdf), to avoid such pitfalls.

6. Reviewer’s confidence

The reviewer has multi-year research expertise in the relevant field and is strongly confident for the evaluation

Recommendation: Discrete variational autoencoders for synthetic nighttime visible satellite imagery — R0/PR4

Comments

This article was accepted into the Climate Informatics 2025 Conference after the authors addressed the comments in the reviews provided. It has been accepted for publication in Environmental Data Science on the strength of the Climate Informatics Review Process.

Decision: Discrete variational autoencoders for synthetic nighttime visible satellite imagery — R0/PR5

Comments

No accompanying comment.