Hostname: page-component-89b8bd64d-mmrw7 Total loading time: 0 Render date: 2026-05-12T03:12:33.981Z Has data issue: false hasContentIssue false

Opening a conversation on responsible environmental data science in the age of large language models

Published online by Cambridge University Press:  09 May 2024

Ruth Y. Oliver*
Affiliation:
Bren School of Environmental Science and Management, University of California Santa Barbara, Santa Barbara, CA, USA
Melissa Chapman
Affiliation:
National Center for Ecological Analysis and Synthesis, University of California Santa Barbara, Santa Barbara, CA, USA
Nathan Emery
Affiliation:
Center for Innovative Teaching, Research, and Learning, University of California Santa Barbara, Santa Barbara, CA, USA
Lauren Gillespie
Affiliation:
Department of Computer Science, Stanford University, Palo Alto, CA, USA
Natasha Gownaris
Affiliation:
Department of Environmental Studies, Gettysburg College, Gettysburg, PA, USA
Sophia Leiker
Affiliation:
Bren School of Environmental Science and Management, University of California Santa Barbara, Santa Barbara, CA, USA
Anna C. Nisi
Affiliation:
Department of Biology, Center for Ecosystem Sentinels, University of Washington, Seattle, WA, USA
David Ayers
Affiliation:
Wildlife, Fish and Conservation Biology Department, University of California Davis, Davis, CA, USA
Ian Breckheimer
Affiliation:
Rocky Mountain Biological Laboratory, Crested Butte, CO, USA
Hannah Blondin
Affiliation:
Cooperative Institute for Marine and Atmospheric Studies (CIMAS), University of Miami, Miami, FL, USA
Ava Hoffman
Affiliation:
Data Science Lab, Fred Hutchinson Cancer Center, Seattle, WA, USA
Camille M.L.S. Pagniello
Affiliation:
Hawai’i Institute of Marine Biology, University of Hawai’i at Mānoa, Kaneohe, HI, USA
Naupaka Zimmerman
Affiliation:
Department of Biology, University of San Francisco, San Francisco, CA, USA
*
Corresponding author: Ruth Y. Oliver; Email: rutholiver@bren.ucsb.edu

Abstract

The general public and scientific community alike are abuzz over the release of ChatGPT and GPT-4. Among many concerns being raised about the emergence and widespread use of tools based on large language models (LLMs) is the potential for them to propagate biases and inequities. We hope to open a conversation within the environmental data science community to encourage the circumspect and responsible use of LLMs. Here, we pose a series of questions aimed at fostering discussion and initiating a larger dialogue. To improve literacy on these tools, we provide background information on the LLMs that underpin tools like ChatGPT. We identify key areas in research and teaching in environmental data science where these tools may be applied, and discuss limitations to their use and points of concern. We also discuss ethical considerations surrounding the use of LLMs to ensure that as environmental data scientists, researchers, and instructors, we can make well-considered and informed choices about engagement with these tools. Our goal is to spark forward-looking discussion and research on how as a community we can responsibly integrate generative AI technologies into our work.

Information

Type
Position Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press
Figure 0

Figure 1. Collaborating with ChatGPT on an environmental data science project. In this hypothetical example, we explore how ChatGPT’s (3.5) core functionalities may be used for environmental data science. We have annotated to highlight notable components of responses. Responses have been truncated for brevity. A full transcript can be found in the Supplementary material.

Figure 1

Table 1. General overview of the key functionalities, applications, and potential benefits and pitfalls of ChatGPT

Supplementary material: File

Oliver et al. supplementary material

Oliver et al. supplementary material
Download Oliver et al. supplementary material(File)
File 19.8 KB