DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Contact | Deutsch
Fontsize: [-] Text [+]

Multi-core-CPU and GPU-accelerated radiative transfer models based on the discrete ordinate method

Efremenko, Dmitry and Loyola, Diego and Doicu, Adrian and Spurr, Robert (2014) Multi-core-CPU and GPU-accelerated radiative transfer models based on the discrete ordinate method. Computer Physics Communications, 185 (12), pp. 3079-3089. Elsevier. doi: 10.1016/j.cpc.2014.07.018. ISSN 0010-4655.

Full text not available from this repository.

Official URL: http://www.sciencedirect.com/science/article/pii/S0010465514002616


The operational processing of remote sensing data usually requires high-performance radiative transfer model (RTM) simulations. To date, multi-core CPUs and also Graphical Processing Units (GPUs) have been used for highly intensive parallel computations. In this paper, we have compared multi-core and GPU implementations of an RTM based on the discrete ordinate solution method. To implement GPUs, the original CPU code has been redesigned using the C-oriented Compute Unified Device Architecture (CUDA) developed by NVIDIA. GPU memory management is a crucial issue regarding the performance. To cope with limitations of GPU registers, we have adapted an RTM based on the matrix operator technique together with the interaction principle for multilayer atmospheric systems. The speed-up of such an implementation depends on the number of discrete ordinates used in the RTM. To reduce the CPU/GPU communication overhead, we have exploited the asynchronous data transfer between host and device. To obtain optimal performance, we have also used overlapping of CPU and GPU computations by distributing the workload between them. With GPUs, we have achieved a 20x–40x speed-up for the multi-stream RTM, and 50x speed-up for the two-stream RTM with respect to the original single-threaded CPU codes. Based on these performance tests, an optimal workload distribution scheme between GPU and CPU is proposed. Additionally, CPU/GPU benchmark tests regarding basic matrix operations are given. Finally, we discuss the performance obtained with the multi-core-CPU and GPU implementations of the RTM.

Item URL in elib:https://elib.dlr.de/92914/
Document Type:Article
Title:Multi-core-CPU and GPU-accelerated radiative transfer models based on the discrete ordinate method
AuthorsInstitution or Email of AuthorsAuthor's ORCID iD
Efremenko, Dmitrydmitry.efremenko (at) dlr.deUNSPECIFIED
Loyola, Diegodiego.loyola (at) dlr.deUNSPECIFIED
Doicu, Adrianadrian.doicu (at) dlr.deUNSPECIFIED
Spurr, RobertRT SolutionsUNSPECIFIED
Date:December 2014
Journal or Publication Title:Computer Physics Communications
Refereed publication:Yes
Open Access:No
Gold Open Access:No
In ISI Web of Science:Yes
DOI :10.1016/j.cpc.2014.07.018
Page Range:pp. 3079-3089
EditorsEmailEditor's ORCID iD
Fritzsche, StephanFriedrich Schiller University of Jena, Jena, GermanyUNSPECIFIED
Keywords:Radiative transfer models; Discrete ordinate method; CUDA
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Space
HGF - Program Themes:Earth Observation
DLR - Research area:Raumfahrt
DLR - Program:R EO - Earth Observation
DLR - Research theme (Project):R - Vorhaben Informationstechnische Systeme für die Fernerkundung (old)
Location: Oberpfaffenhofen
Institutes and Institutions:Remote Sensing Technology Institute > Atmospheric Processors
Deposited By: Efremenko, Dr Dmitry
Deposited On:01 Dec 2014 15:11
Last Modified:06 Sep 2019 15:17

Repository Staff Only: item control page

Help & Contact
electronic library is running on EPrints 3.3.12
Copyright © 2008-2017 German Aerospace Center (DLR). All rights reserved.