Kautz, Frank (2011) Implementierung exemplarischer, paralleler, numerischer Verfahren aus dem CFD-Bereich auf Grafikkarten unter Verwendung von OpenCL. Master's thesis, Hochschule Darmstadt.
Abstract
This master's thesis examines whether the linear module of the flow solver TRACE (Turbomachinery Research Aerodynamic Computational Environment) can be accelerated with GPUs. The current implementation targets CPU clusters and uses MPI (Message Passing Interface) for parallelization. Different data formats are examined. The thesis describes the basics of general-purpose computation on graphics processing units (GPGPU) with the Open Computing Language (OpenCL) and the Compute Unified Device Architecture (CUDA). The first step is an analysis of TRACE: Which algorithm is used? Which parts of the algorithm are candidates for execution on the GPU? Which data format is used? TRACE uses the Blocked Compressed Sparse Row (BCSR) format with a block size of 5x5 values. The thesis examines whether this format is suitable for GPU processing or whether another format works better on this architecture; several data converters were implemented to test different formats. TRACE uses the Generalized Minimal Residual algorithm (GMRES), which contains a sparse matrix-vector multiplication (SpMV); this operation is ported to the GPU. The SpMV is implemented with different data formats, and several optimization approaches are tried to see whether performance improves. The OpenCL and CUDA implementations are compared on a test system with the MPI version of TRACE, and the performance of the implemented SpMV kernels is measured. Block variants of the ELL format give the best results.
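To make the storage formats named in the abstract concrete, here is a minimal sketch of a CSR-to-ELL converter and an SpMV over the resulting ELL arrays. It is not code from the thesis: it uses plain scalar ELL instead of the block variants the thesis found fastest, it runs sequentially in Python instead of as an OpenCL or CUDA kernel, and the function names `csr_to_ell` and `spmv_ell` and the 3x3 example matrix are assumptions chosen for illustration.

```python
def csr_to_ell(row_ptr, col_idx, values, pad_col=0):
    """Convert CSR arrays to ELL: every row is padded to the same width.

    Hypothetical helper for illustration. Padding slots store value 0.0 and
    an arbitrary valid column index (pad_col), so they add nothing to y.
    """
    n_rows = len(row_ptr) - 1
    width = max((row_ptr[i + 1] - row_ptr[i] for i in range(n_rows)), default=0)
    ell_cols = [[pad_col] * width for _ in range(n_rows)]
    ell_vals = [[0.0] * width for _ in range(n_rows)]
    for i in range(n_rows):
        for k, j in enumerate(range(row_ptr[i], row_ptr[i + 1])):
            ell_cols[i][k] = col_idx[j]
            ell_vals[i][k] = values[j]
    return ell_cols, ell_vals


def spmv_ell(ell_cols, ell_vals, x):
    """Compute y = A x for a matrix A stored in ELL form.

    The outer loop is the part a GPU would parallelize, typically with one
    thread or work-item per row; every row has the same trip count.
    """
    y = []
    for cols, vals in zip(ell_cols, ell_vals):
        acc = 0.0
        for c, v in zip(cols, vals):
            acc += v * x[c]  # padded slots contribute 0.0 * x[pad_col]
        y.append(acc)
    return y


# Example matrix (assumed for illustration):
# [[4, 0, 1],
#  [0, 2, 0],
#  [3, 0, 5]]
row_ptr = [0, 2, 3, 5]
col_idx = [0, 2, 1, 0, 2]
values = [4.0, 1.0, 2.0, 3.0, 5.0]

ell_cols, ell_vals = csr_to_ell(row_ptr, col_idx, values)
y = spmv_ell(ell_cols, ell_vals, [1.0, 2.0, 3.0])
print(y)  # [7.0, 4.0, 18.0]
```

The fixed row width is what makes ELL attractive on GPUs: all rows do the same amount of work and the arrays can be laid out for regular memory access, at the cost of storing padding for short rows.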
elib URL of this entry: | https://elib.dlr.de/79339/
---|---
Document type: | University thesis (Master's thesis)
Title: | Implementierung exemplarischer, paralleler, numerischer Verfahren aus dem CFD-Bereich auf Grafikkarten unter Verwendung von OpenCL
Authors: | Kautz, Frank
Date: | 13 April 2011
Refereed publication: | No
Open Access: | Yes
Number of pages: | 87
Status: | Published
Keywords: | GPGPU; parallelism; sparse matrix-vector multiplication; compressed storage formats; OpenCL; CUDA; CFD
Institution: | Hochschule Darmstadt
Department: | Fachbereich Informatik
HGF - Research field: | Transport and Space (old)
HGF - Program: | Space (old)
HGF - Program theme: | W SY - Technik für Raumfahrtsysteme
DLR - Research focus: | Space
DLR - Research area: | W SY - Technik für Raumfahrtsysteme
DLR - Sub-area (project): | W - Vorhaben SISTEC (old)
Location: | Köln-Porz
Institutes & facilities: | Institut für Simulations- und Softwaretechnik > Verteilte Systeme und Komponentensoftware
Deposited by: | Basermann, Dr.-Ing. Achim
Deposited on: | 13 Dec 2012 16:07
Last modified: | 31 Jul 2019 19:38