Efremenko, Dmitry und Loyola, Diego und Doicu, Adrian und Spurr, Robert (2014) Multi-core-CPU and GPU-accelerated radiative transfer models based on the discrete ordinate method. Computer Physics Communications, 185 (12), Seiten 3079-3089. Elsevier. doi: 10.1016/j.cpc.2014.07.018. ISSN 0010-4655.
Dieses Archiv kann nicht den Volltext zur Verfügung stellen.
Offizielle URL: http://www.sciencedirect.com/science/article/pii/S0010465514002616
Kurzfassung
The operational processing of remote sensing data usually requires high-performance radiative transfer model (RTM) simulations. To date, multi-core CPUs and also Graphical Processing Units (GPUs) have been used for highly intensive parallel computations. In this paper, we have compared multi-core and GPU implementations of an RTM based on the discrete ordinate solution method. To implement GPUs, the original CPU code has been redesigned using the C-oriented Compute Unified Device Architecture (CUDA) developed by NVIDIA. GPU memory management is a crucial issue regarding the performance. To cope with limitations of GPU registers, we have adapted an RTM based on the matrix operator technique together with the interaction principle for multilayer atmospheric systems. The speed-up of such an implementation depends on the number of discrete ordinates used in the RTM. To reduce the CPU/GPU communication overhead, we have exploited the asynchronous data transfer between host and device. To obtain optimal performance, we have also used overlapping of CPU and GPU computations by distributing the workload between them. With GPUs, we have achieved a 20x–40x speed-up for the multi-stream RTM, and 50x speed-up for the two-stream RTM with respect to the original single-threaded CPU codes. Based on these performance tests, an optimal workload distribution scheme between GPU and CPU is proposed. Additionally, CPU/GPU benchmark tests regarding basic matrix operations are given. Finally, we discuss the performance obtained with the multi-core-CPU and GPU implementations of the RTM.
elib-URL des Eintrags: | https://elib.dlr.de/92914/ | ||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Dokumentart: | Zeitschriftenbeitrag | ||||||||||||||||||||
Titel: | Multi-core-CPU and GPU-accelerated radiative transfer models based on the discrete ordinate method | ||||||||||||||||||||
Autoren: |
| ||||||||||||||||||||
Datum: | Dezember 2014 | ||||||||||||||||||||
Erschienen in: | Computer Physics Communications | ||||||||||||||||||||
Referierte Publikation: | Ja | ||||||||||||||||||||
Open Access: | Nein | ||||||||||||||||||||
Gold Open Access: | Nein | ||||||||||||||||||||
In SCOPUS: | Ja | ||||||||||||||||||||
In ISI Web of Science: | Ja | ||||||||||||||||||||
Band: | 185 | ||||||||||||||||||||
DOI: | 10.1016/j.cpc.2014.07.018 | ||||||||||||||||||||
Seitenbereich: | Seiten 3079-3089 | ||||||||||||||||||||
Herausgeber: |
| ||||||||||||||||||||
Verlag: | Elsevier | ||||||||||||||||||||
ISSN: | 0010-4655 | ||||||||||||||||||||
Status: | veröffentlicht | ||||||||||||||||||||
Stichwörter: | Radiative transfer models; Discrete ordinate method; CUDA | ||||||||||||||||||||
HGF - Forschungsbereich: | Luftfahrt, Raumfahrt und Verkehr | ||||||||||||||||||||
HGF - Programm: | Raumfahrt | ||||||||||||||||||||
HGF - Programmthema: | Erdbeobachtung | ||||||||||||||||||||
DLR - Schwerpunkt: | Raumfahrt | ||||||||||||||||||||
DLR - Forschungsgebiet: | R EO - Erdbeobachtung | ||||||||||||||||||||
DLR - Teilgebiet (Projekt, Vorhaben): | R - Vorhaben Informationstechnische Systeme für die Fernerkundung (alt) | ||||||||||||||||||||
Standort: | Oberpfaffenhofen | ||||||||||||||||||||
Institute & Einrichtungen: | Institut für Methodik der Fernerkundung > Atmosphärenprozessoren | ||||||||||||||||||||
Hinterlegt von: | Efremenko, Dr Dmitry | ||||||||||||||||||||
Hinterlegt am: | 01 Dez 2014 15:11 | ||||||||||||||||||||
Letzte Änderung: | 29 Nov 2023 13:22 |
Nur für Mitarbeiter des Archivs: Kontrollseite des Eintrags