A Novel Deep Learning Framework Based on Transfer Learning and Joint Time-Frequency Analysis

Huang, Zhongling und Dumitru, Corneliu Octavian und Pan, Zongxu und Le, Bin und Datcu, Mihai (2019) A Novel Deep Learning Framework Based on Transfer Learning and Joint Time-Frequency Analysis. TerraSAR-X Science Team Meeting 2019, 2019-10-21 - 2019-10-24, Oberpfaffenhofen, Germany.

PDF
101kB

Offizielle URL: https://tandemx-science.dlr.de/cgi-bin/wcm.pl?page=Tdm-Science-Team-Meeting

Kurzfassung

We propose a novel SAR-specific deep learning framework Deep SAR-Net (DSN) for complex-valued SAR images based on transfer learning and joint time-frequency analysis. Conventional methods for deep convolutional neural networks usually take the amplitude information of single-polarization SAR images as input to learn hierarchical spatial features automatically, which may have difficulties in discriminating objects with similar texture but with discriminative scattering patterns. As a result, we analyzed complex-valued SAR images to learn both spatial texture information and the backscattering patterns of objects on the ground.

Firstly, we experimented on a large-scale SAR land cover dataset collected from TerraSAR-X images, with a hierarchical three-level annotation of 150 categories and comprising more than 100,000 image patches. With three main challenges of highly imbalanced classes, geographic diversity, and label noise, in automatically interpreting the dataset, a deep transfer learning method based on a similarly annotated optical land cover dataset (NWPU-RESISC45) was used to learn a deep Residual convolutional neural network, optimizing a combined top-2 smooth loss function with cost-sensitive parameters. Rather than applying the ImageNet pre-trained model of ResNet-18 to SAR images directly, the optical remote sensing land cover dataset narrows the gap between SAR and natural images which results in a significant improvement in feature transferability, and the proposed combined loss function is successful in accelerating the training process, and is reducing the model bias to noisy labels. The trained deep Residual CNN model shows a good generalization for other SAR image processing tasks, including MSTAR target recognition, land cover, and land use localization.

Based on this pre-trained model, we transferred the first two residual blocks to extract the mid-level representative spatial features from the intensity images of single-look complex (SLC) SAR data, which have a similar resolution and pixel spacing along range and azimuth directions to avoid large distortions. Then, a joint time-frequency analysis was applied to SLC data to obtain a 4-D representation with information in all sub-bands, where the radar spectrograms reveal the backscattering diversity versus range and azimuth frequencies of objects on the ground. A stacked convolutional auto-encoder was designed to learn the latent features from the radar spectrograms in the frequency domain, related to physical target properties. Later, the frequency features were spatially aligned corresponding to the spatial information in the 4-D representation to be fused with the transferred spatial features. A post-learning sub-net consisting of two bottleneck residual blocks was designed to make the final decisions.

This is the first time to exploit the full use of single-polarization SLC SAR data in deep learning. Compared with conventional CNNs which are based on intensity information only, the proposed DSN shows a superior performance in SAR image land cover and land use classification, especially for man-made objects. In some cases, the shapes and textures are similar to intensity images which confuse CNNs to make a right decision, but the spectrogram amplitudes present prominently different characteristics, helping DSNs to reach a better understanding of the objects on the ground. On the other hand, for natural surfaces, the radar spectrograms present similar backscattering patterns without a specific mechanism for distinguishing the features in the frequency domain, so that they cannot provide enough extra information on natural surfaces to support the interpretation of SAR images. The experiments are conducted on Sentinel-1 Stripmap SAR images and we believe the proposed DSN can be also applied to TerraSAR-X SLC data.

elib-URL des Eintrags:

https://elib.dlr.de/130268/

Dokumentart:

Konferenzbeitrag (Vortrag)

Titel:

A Novel Deep Learning Framework Based on Transfer Learning and Joint Time-Frequency Analysis

Autoren:

Autoren	Institution oder E-Mail-Adresse	Autoren-ORCID-iD	ORCID Put Code
Huang, Zhongling	huangzhongling15 (at) mails.ucas.ac.cn	NICHT SPEZIFIZIERT	NICHT SPEZIFIZIERT
Dumitru, Corneliu Octavian	Corneliu.Dumitru (at) dlr.de	NICHT SPEZIFIZIERT	NICHT SPEZIFIZIERT
Pan, Zongxu	Institute of Geology and Geophysics, CAS	NICHT SPEZIFIZIERT	NICHT SPEZIFIZIERT
Le, Bin	Chinese Academy of Science	NICHT SPEZIFIZIERT	NICHT SPEZIFIZIERT
Datcu, Mihai	Mihai.Datcu (at) dlr.de	NICHT SPEZIFIZIERT	NICHT SPEZIFIZIERT

Datum:

Oktober 2019

Referierte Publikation:

Nein

Open Access:

Gold Open Access:

Nein

In SCOPUS:

Nein

In ISI Web of Science:

Nein

Status:

veröffentlicht

Stichwörter:

Deep Learning, Transfer Learning, Joint Time-Frequency Analysis

Veranstaltungstitel:

TerraSAR-X Science Team Meeting 2019

Veranstaltungsort:

Oberpfaffenhofen, Germany

Veranstaltungsart:

internationale Konferenz

Veranstaltungsbeginn:

21 Oktober 2019

Veranstaltungsende:

24 Oktober 2019

HGF - Forschungsbereich:

Luftfahrt, Raumfahrt und Verkehr

HGF - Programm:

Raumfahrt

HGF - Programmthema:

Erdbeobachtung

DLR - Schwerpunkt:

Raumfahrt

DLR - Forschungsgebiet:

R EO - Erdbeobachtung

DLR - Teilgebiet (Projekt, Vorhaben):

R - Vorhaben hochauflösende Fernerkundungsverfahren (alt)

Standort:

Oberpfaffenhofen

Institute & Einrichtungen:

Institut für Methodik der Fernerkundung > EO Data Science

Hinterlegt von:

Karmakar, Chandrabali

Hinterlegt am:

21 Nov 2019 14:12

Letzte Änderung:

24 Apr 2024 20:33

Nur für Mitarbeiter des Archivs: Kontrollseite des Eintrags