elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Contact | Deutsch
Fontsize: [-] Text [+]

Automatic machine-learning based water body detection algorithm as part of the hydrological conditioning of the TanDEM-X DEM

Keller, Carolin and Warmedinger, Leena Julia and Huber, Martin and Gorzawski, Larissa and Anand, Mira and Lehner, Bernhard and Thieme, Michele and Wessel, Birgit and Roth, Achim (2023) Automatic machine-learning based water body detection algorithm as part of the hydrological conditioning of the TanDEM-X DEM. TerraSAR-X / TanDEM-X Science Team Meeting 2023, 2023-10-18 - 2023-10-20, Oberpfaffenhofen, Wessling, Deutschland.

[img] PDF - Only accessible within DLR
5MB

Abstract

The TanDEM-X mission provides a high-resolution global digital elevation model (DEM) which works as a base for data analyses where the Earths relief is a key source of information. The unedited version of the TanDEM-X DEM contains artefacts such as rough surfaces of open water due to decorrelation effects, which impair the derivation of streams and hydrological assessments. To counteract distortions inherent to the interferometric synthetic aperture radar acquisition and processing and to support the conditioning of the elevation data in order to improve hydrological analyses, a global water body mask (WBM) is generated. The presented technique is a major part of the pre-conditioning of the global TanDEM-X DEM for producing a second and refined version of the HydroSHEDS database. Its core is an automatic machine-learning (ML) based water body detection using a Gradient Boosted Decision Tree (GBDT) algorithm as underlying classifier with Bayesian hyperparameter optimization. The features considered by the algorithm are mainly based on the TanDEM-X datasets assisted by a global Normalized Difference Vegetation Index (NDVI) derived from Sentinel-2 data. The ML model is sustained by hyperparameter tuning in order to learn the spectral signatures and their varieties for water bodies and land surfaces, enabling accurate detection and classification. As sufficient training data is required and to avoid manual hyperparameter tuning, the automation generates training data by using a maximum WBM created from reference data which allows for automatic random sampling throughout the feature space to train the classifier. For each TanDEM-X tile, a classified WBM is generated by the automatic ML algorithm as well as an alternative WBM based on the WBM quality layer of the Copernicus (COP) DEM modified using the TanDEM-X dataset. Then, either the superior water body mask or a combination of both is selected. First results show that the classified WBM is able to improve the alternative COP WBM in approximately 45% of tiles processed so far, underlining the quality of the used ML algorithm. However, regional analysis uncovers large differences. While the COP WBM achieves better results in arid areas, the classified WBM is superior in northern regions. Frozen water imposes great challenges to water detection algorithms. The ML based approach shows large improvements in northern regions compared to the COP WBM as it has the flexibility of including the minimum observed amplitude preferably capturing open water. Arid regions often show too little water resulting in an insufficient number of training pixels to perform a classification.

Item URL in elib:https://elib.dlr.de/199211/
Document Type:Conference or Workshop Item (Speech)
Title:Automatic machine-learning based water body detection algorithm as part of the hydrological conditioning of the TanDEM-X DEM
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Keller, CarolinUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Warmedinger, Leena JuliaUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Huber, MartinUNSPECIFIEDhttps://orcid.org/0000-0003-2665-2149UNSPECIFIED
Gorzawski, LarissaUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Anand, MiraUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Lehner, BernhardUNSPECIFIEDhttps://orcid.org/0000-0003-3712-2581UNSPECIFIED
Thieme, MicheleUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Wessel, BirgitUNSPECIFIEDhttps://orcid.org/0000-0002-8673-2485146765882
Roth, AchimUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Date:19 October 2023
Refereed publication:No
Open Access:No
Gold Open Access:No
In SCOPUS:No
In ISI Web of Science:No
Status:Published
Keywords:HydroSHEDS Machine-Learning water classification
Event Title:TerraSAR-X / TanDEM-X Science Team Meeting 2023
Event Location:Oberpfaffenhofen, Wessling, Deutschland
Event Type:international Conference
Event Start Date:18 October 2023
Event End Date:20 October 2023
Organizer:Deutsches Zentrum für Luft- und Raumfahrt (DLR)
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Space
HGF - Program Themes:Earth Observation
DLR - Research area:Raumfahrt
DLR - Program:R EO - Earth Observation
DLR - Research theme (Project):R - TerraSAR/TanDEM
Location: Oberpfaffenhofen
Institutes and Institutions:German Remote Sensing Data Center
Deposited By: Keller, Carolin
Deposited On:16 Nov 2023 10:11
Last Modified:24 Apr 2024 20:59

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
OpenAIRE Validator logo electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.