elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Contact | Deutsch
Fontsize: [-] Text [+]

Cross-Task Transfer for Geotagged Audiovisual Aerial Scene Recognition

Hu, Di and Li, Xuhong and Mou, LiChao and Jin, Pu and Chen, Dong and Zhu, Xiao Xiang and Dou, Dejing (2020) Cross-Task Transfer for Geotagged Audiovisual Aerial Scene Recognition. In: 16th European Conference on Computer Vision, ECCV 2020, 12369, pp. 68-84. Springer. ECCV 2020, 2020-08-23 - 2020-08-28, Glasgow, UK. doi: 10.1007/978-3-030-58586-0_5. ISBN 978-303058541-9. ISSN 0302-9743.

[img] PDF
2MB

Official URL: https://link.springer.com/chapter/10.1007/978-3-030-58586-0_5

Abstract

Aerial scene recognition is a fundamental task in remote sensing and has recently received increased interest. While the visual information from overhead images with powerful models and efficient algorithms yields considerable performance on scene recognition, it still suffers from the variation of ground objects, lighting conditions etc. Inspired by the multi-channel perception theory in cognition science, in this paper, for improving the performance on the aerial scene recognition, we explore a novel audiovisual aerial scene recognition task using both images and sounds as input. Based on an observation that some specific sound events are more likely to be heard at a given geographic location, we propose to exploit the knowledge from the sound events to improve the performance on the aerial scene recognition. For this purpose, we have constructed a new dataset named AuDio Visual Aerial sceNe reCognition datasEt (ADVANCE). With the help of this dataset, we evaluate three proposed approaches for transferring the sound event knowledge to the aerial scene recognition task in a multimodal learning framework, and show the benefit of exploiting the audio information for the aerial scene recognition. The source code is publicly available for reproducibility purposes.

Item URL in elib:https://elib.dlr.de/139789/
Document Type:Conference or Workshop Item (Speech)
Title:Cross-Task Transfer for Geotagged Audiovisual Aerial Scene Recognition
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Hu, DiUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Li, XuhongUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Mou, LiChaoUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Jin, PuUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Chen, DongUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Zhu, Xiao XiangUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Dou, DejingUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Date:August 2020
Journal or Publication Title:16th European Conference on Computer Vision, ECCV 2020
Refereed publication:Yes
Open Access:Yes
Gold Open Access:No
In SCOPUS:Yes
In ISI Web of Science:No
Volume:12369
DOI:10.1007/978-3-030-58586-0_5
Page Range:pp. 68-84
Publisher:Springer
Series Name:Lecture Notes in Computer Science
ISSN:0302-9743
ISBN:978-303058541-9
Status:Published
Keywords:Cross-task transfer, aerial scene classification, geotagged sound, multimodal learning, remote sensing
Event Title:ECCV 2020
Event Location:Glasgow, UK
Event Type:international Conference
Event Start Date:23 August 2020
Event End Date:28 August 2020
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Space
HGF - Program Themes:Earth Observation
DLR - Research area:Raumfahrt
DLR - Program:R EO - Earth Observation
DLR - Research theme (Project):R - Remote Sensing and Geo Research
Location: Oberpfaffenhofen
Institutes and Institutions:Remote Sensing Technology Institute > EO Data Science
Deposited By: Bratasanu, Ion-Dragos
Deposited On:18 Dec 2020 13:13
Last Modified:24 Apr 2024 20:40

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.