Röhrig-Zöllner, Melven und Thies, Jonas und Basermann, Achim (2026) Performance of linear algebra building blocks for low-rank tensor algorithms on current multi-core CPUs and GPUs. SIAM Conference on Parallel Processing for Scientific Computing, 2026-03-02 - 2026-03-06, Berlin, Deutschland.
|
PDF
2MB |
Kurzfassung
In this talk we discuss the node-level performance of basic operations needed in low-rank tensor algorithms. As starting point, we consider two specific problems: (1) compressing large dense data in the tensor-train (TT) format and (2) solving linear systems with an operator and right-hand side vector given in TT format. For both problems, we analyze and optimize suitable algorithms focussing on the required underlying operations such as tensor contractions and matrix decompositions. In particular, we obtain a significant speedup for orthogonalization and truncation steps by using a high-performance implementation of a "Q-less" tall-skinny QR decomposition. On multi-core CPUs, our implementation achieves a speedup of ~50x over reference implementations for the TT compression, and up to ~5x for solving linear systems. For GPUs, we give an overview of the performance of the most important operations and shortly discuss the implementation of a tall-skinny QR decomposition.
| elib-URL des Eintrags: | https://elib.dlr.de/223282/ | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Dokumentart: | Konferenzbeitrag (Vortrag) | ||||||||||||||||
| Titel: | Performance of linear algebra building blocks for low-rank tensor algorithms on current multi-core CPUs and GPUs | ||||||||||||||||
| Autoren: |
| ||||||||||||||||
| Datum: | 5 März 2026 | ||||||||||||||||
| Referierte Publikation: | Nein | ||||||||||||||||
| Open Access: | Ja | ||||||||||||||||
| Gold Open Access: | Nein | ||||||||||||||||
| In SCOPUS: | Nein | ||||||||||||||||
| In ISI Web of Science: | Nein | ||||||||||||||||
| Status: | veröffentlicht | ||||||||||||||||
| Stichwörter: | linear algebra, high-performance computing, tensor operations, tensor-trains | ||||||||||||||||
| Veranstaltungstitel: | SIAM Conference on Parallel Processing for Scientific Computing | ||||||||||||||||
| Veranstaltungsort: | Berlin, Deutschland | ||||||||||||||||
| Veranstaltungsart: | internationale Konferenz | ||||||||||||||||
| Veranstaltungsbeginn: | 2 März 2026 | ||||||||||||||||
| Veranstaltungsende: | 6 März 2026 | ||||||||||||||||
| Veranstalter : | Society for Industrial and Applied Mathematics (SIAM) | ||||||||||||||||
| HGF - Forschungsbereich: | keine Zuordnung | ||||||||||||||||
| HGF - Programm: | keine Zuordnung | ||||||||||||||||
| HGF - Programmthema: | keine Zuordnung | ||||||||||||||||
| DLR - Schwerpunkt: | Quantencomputing-Initiative | ||||||||||||||||
| DLR - Forschungsgebiet: | QC AW - Anwendungen | ||||||||||||||||
| DLR - Teilgebiet (Projekt, Vorhaben): | QC - QuTeNet | ||||||||||||||||
| Standort: | Köln-Porz | ||||||||||||||||
| Institute & Einrichtungen: | Institut für Softwaretechnologie Institut für Softwaretechnologie > High-Performance Computing | ||||||||||||||||
| Hinterlegt von: | Röhrig-Zöllner, Melven | ||||||||||||||||
| Hinterlegt am: | 08 Apr 2026 10:55 | ||||||||||||||||
| Letzte Änderung: | 08 Apr 2026 10:55 |
Nur für Mitarbeiter des Archivs: Kontrollseite des Eintrags