elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Contact | Deutsch
Fontsize: [-] Text [+]

Implementierung exemplarischer, paralleler, numerischer Verfahren aus dem CFD-Bereich auf Grafikkarten unter Verwendung von OpenCL

Kautz, Frank (2011) Implementierung exemplarischer, paralleler, numerischer Verfahren aus dem CFD-Bereich auf Grafikkarten unter Verwendung von OpenCL. Master's, Hochschule Darmstadt.

[img]
Preview
PDF
3MB

Abstract

This master thesis examines whether the linear module of the flow solver TRACE (Turbo machinery Research Aerodynamic Computational Environment) can be accelerated by the use of GPUs. The target platforms of the current implementation are CPU Clusters and employs MPI (Message Passing Interface) for parallelization. Different data formats are examined in this thesis. The basics of the General-Purpose Computation on Graphics Processing Unit programming (GPGPU) with the use of Open Computing Language (OpenCL) and Compute Unified Device Architecture CUDA are described. The first step is to analyze the flow solver TRACE. Which algorithm is used? Which parts of the algorithm are candidates to be executed by GPU? Which data format is used? TRACE uses the Blocked Compressed Sparse Row format (BCSR) with a block size of 5x5 values. It is examined if this format is suitable for processing by the GPU or if another format works better on this architecture. For testing different formats several data converters have been implemented. TRACE uses the Generalized Minimum Residual algorithm (GMRES). GMRES contains a sparse matrix-vector multiplication (SpMV); this will be ported onto the GPU. The SpMV is implemented with the use of different data formats. By trying different optimization approaches it is tested if a better performance is possible. The various implementations with OpenCL and CUDA are compared on a test system with the MPI version of TRACE. The performance of the implemented SpMV kernels is measured and compared. It is found that block variants of the ELL format give the best results.

Item URL in elib:https://elib.dlr.de/79339/
Document Type:Thesis (Master's)
Title:Implementierung exemplarischer, paralleler, numerischer Verfahren aus dem CFD-Bereich auf Grafikkarten unter Verwendung von OpenCL
Authors:
AuthorsInstitution or Email of AuthorsAuthors ORCID iD
Kautz, FrankSC-VKUNSPECIFIED
Date:13 April 2011
Refereed publication:No
Open Access:Yes
Gold Open Access:No
In SCOPUS:No
In ISI Web of Science:No
Number of Pages:87
Status:Published
Keywords:GPGPU; parallelism; sparse matrix-vector multiplication; compressed storage formats; OpenCL; CUDA; CFD
Institution:Hochschule Darmstadt
Department:Fachbereich Informatik
HGF - Research field:Aeronautics, Space and Transport (old)
HGF - Program:Space (old)
HGF - Program Themes:W SY - Technik für Raumfahrtsysteme
DLR - Research area:Space
DLR - Program:W SY - Technik für Raumfahrtsysteme
DLR - Research theme (Project):W - Vorhaben SISTEC (old)
Location: Köln-Porz
Institutes and Institutions:Institut of Simulation and Software Technology > Distributed Systems and Component Software
Deposited By: Basermann, Dr.-Ing. Achim
Deposited On:13 Dec 2012 16:07
Last Modified:31 Jul 2019 19:38

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
electronic library is running on EPrints 3.3.12
Copyright © 2008-2017 German Aerospace Center (DLR). All rights reserved.