elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Accessibility | Contact | Deutsch
Fontsize: [-] Text [+]

Climateset: A large-scale climate model dataset for machine learning

Kaltenborn, Julia and Lange, Charlotte Emilie Elektra and Ramesh, Venkatesh and Brouillard, Philippe and Gurwicz, Yaniv and Nagda, Chandni and Runge, Jakob and Nowack, Peer and Rolnick, David (2023) Climateset: A large-scale climate model dataset for machine learning. In: 37th Conference on Neural Information Processing Systems, NeurIPS 2023, pp. 21757-21792. Advances in Neural Information Processing Systems. Thirty-seventh Conference on Neural Information Processing Systems, 2023-12-10, New Orleans, USA. ISSN 1049-5258.

Full text not available from this repository.

Official URL: https://proceedings.neurips.cc/paper_files/paper/2023/file/44a6769fe6c695f8dfb347c649f7c9f0-Paper-Datasets_and_Benchmarks.pdf

Abstract

Climate models have been key for assessing the impact of climate change and simulating future climate scenarios. The machine learning (ML) community has taken an increased interest in supporting climate scientists’ efforts on various tasks such as climate model emulation, downscaling, and prediction tasks. Many of those tasks have been addressed on datasets created with single climate models. However, both the climate science and ML communities have suggested that to address those tasks at scale, we need large, consistent, and ML-ready climate model datasets. Here, we introduce ClimateSet, a dataset containing the inputs and outputs of 36 climate models from the Input4MIPs and CMIP6 archives. In addition, we provide a modular dataset pipeline for retrieving and preprocessing additional climate models and scenarios. We showcase the potential of our dataset by using it as a benchmark for ML-based climate model emulation. We gain new insights about the performance and generalization capabilities of the different ML models by analyzing their performance across different climate models. Furthermore, the dataset can be used to train an ML emulator on several climate models instead of just one. Such a “super-emulator” can quickly project new climate change scenarios, complementing existing scenarios already provided to policymakers. We believe ClimateSet will create the basis needed for the ML community to tackle climate-related tasks at scale

Item URL in elib:https://elib.dlr.de/210927/
Document Type:Conference or Workshop Item (Poster)
Title:Climateset: A large-scale climate model dataset for machine learning
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Kaltenborn, JuliaUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Lange, Charlotte Emilie ElektraUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Ramesh, VenkateshUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Brouillard, PhilippeUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Gurwicz, YanivUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Nagda, ChandniUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Runge, JakobJakob.Runge (at) dlr.deUNSPECIFIEDUNSPECIFIED
Nowack, PeerUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Rolnick, DavidUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Date:December 2023
Journal or Publication Title:37th Conference on Neural Information Processing Systems, NeurIPS 2023
Refereed publication:Yes
Open Access:No
Gold Open Access:No
In SCOPUS:Yes
In ISI Web of Science:No
Page Range:pp. 21757-21792
Editors:
EditorsEmailEditor's ORCID iDORCID Put Code
Globerson, AmirUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Hardt, MoritzUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Levine, SergeyUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Saenko, KateUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Publisher:Advances in Neural Information Processing Systems
ISSN:1049-5258
Status:Published
Keywords:Benchmarking, Machine Learning, Climate Models
Event Title:Thirty-seventh Conference on Neural Information Processing Systems
Event Location:New Orleans, USA
Event Type:international Conference
Event Date:10 December 2023
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Space
HGF - Program Themes:other
DLR - Research area:Raumfahrt
DLR - Program:R - no assignment
DLR - Research theme (Project):R - no assignment, D - no assignment
Location: Jena
Institutes and Institutions:Institute of Data Science > Data Analysis and Intelligence
Deposited By: Hochsprung, Tom
Deposited On:06 Jan 2025 11:34
Last Modified:06 Jan 2025 11:34

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
OpenAIRE Validator logo electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.