elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Contact | Deutsch
Fontsize: [-] Text [+]

FuTH-Net: Fusing Temporal Relations and Holistic Features for Aerial Video Classification

Jin, Pu and Mou, LiChao and Hua, Yuansheng and Xia, Gui-Song and Zhu, Xiao Xiang (2022) FuTH-Net: Fusing Temporal Relations and Holistic Features for Aerial Video Classification. IEEE Transactions on Geoscience and Remote Sensing, 60, p. 5618913. IEEE - Institute of Electrical and Electronics Engineers. doi: 10.1109/TGRS.2022.3150917. ISSN 0196-2892.

[img] PDF - Published version
6MB

Official URL: https://ieeexplore.ieee.org/document/9709809

Abstract

Unmanned aerial vehicles (UAVs) are now widely applied to data acquisition due to its low cost and fast mobility. With the increasing volume of aerial videos, the demand for automatically parsing these videos is surging. To achieve this, current research mainly focuses on extracting a holistic feature with convolutions along both spatial and temporal dimensions. However, these methods are limited by small temporal receptive fields and cannot adequately capture long-term temporal dependencies that are important for describing complicated dynamics. In this article, we propose a novel deep neural network, termed Fusing Temporal relations and Holistic features for aerial video classification (FuTH-Net), to model not only holistic features but also temporal relations for aerial video classification. Furthermore, the holistic features are refined by the multiscale temporal relations in a novel fusion module for yielding more discriminative video representations. More specially, FuTH-Net employs a two-pathway architecture: 1) a holistic representation pathway to learn a general feature of both frame appearances and short-term temporal variations and 2) a temporal relation pathway to capture multiscale temporal relations across arbitrary frames, providing long-term temporal dependencies. Afterward, a novel fusion module is proposed to spatiotemporally integrate the two features learned from the two pathways. Our model is evaluated on two aerial video classification datasets, ERA and Drone-Action, and achieves the state-of-the-art results. This demonstrates its effectiveness and good generalization capacity across different recognition tasks (event classification and human action recognition). To facilitate further research, we release the code at https://gitlab.lrz.de/ai4eo/reasoning/futh-net .

Item URL in elib:https://elib.dlr.de/192765/
Document Type:Article
Title:FuTH-Net: Fusing Temporal Relations and Holistic Features for Aerial Video Classification
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Jin, PuUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Mou, LiChaoUNSPECIFIEDhttps://orcid.org/0000-0001-8407-6413UNSPECIFIED
Hua, YuanshengUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Xia, Gui-SongUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Zhu, Xiao XiangUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Date:February 2022
Journal or Publication Title:IEEE Transactions on Geoscience and Remote Sensing
Refereed publication:Yes
Open Access:Yes
Gold Open Access:No
In SCOPUS:Yes
In ISI Web of Science:Yes
Volume:60
DOI:10.1109/TGRS.2022.3150917
Page Range:p. 5618913
Publisher:IEEE - Institute of Electrical and Electronics Engineers
ISSN:0196-2892
Status:Published
Keywords:Aerial video classification, convolutional neural networks (CNNs), holistic features, temporal relations, two-pathway, unmanned aerial vehicle (UAV)
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Space
HGF - Program Themes:Earth Observation
DLR - Research area:Raumfahrt
DLR - Program:R EO - Earth Observation
DLR - Research theme (Project):R - Artificial Intelligence
Location: Oberpfaffenhofen
Institutes and Institutions:Remote Sensing Technology Institute > EO Data Science
Deposited By: Haschberger, Dr.-Ing. Peter
Deposited On:22 Dec 2022 09:06
Last Modified:30 Jan 2024 11:10

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.