elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Accessibility | Contact | Deutsch
Fontsize: [-] Text [+]

Performance of linear algebra building blocks for low-rank tensor algorithms on current multi-core CPUs and GPUs

Röhrig-Zöllner, Melven and Thies, Jonas and Basermann, Achim (2026) Performance of linear algebra building blocks for low-rank tensor algorithms on current multi-core CPUs and GPUs. SIAM Conference on Parallel Processing for Scientific Computing, 2026-03-02 - 2026-03-06, Berlin, Deutschland.

[img] PDF
2MB

Abstract

In this talk we discuss the node-level performance of basic operations needed in low-rank tensor algorithms. As starting point, we consider two specific problems: (1) compressing large dense data in the tensor-train (TT) format and (2) solving linear systems with an operator and right-hand side vector given in TT format. For both problems, we analyze and optimize suitable algorithms focussing on the required underlying operations such as tensor contractions and matrix decompositions. In particular, we obtain a significant speedup for orthogonalization and truncation steps by using a high-performance implementation of a "Q-less" tall-skinny QR decomposition. On multi-core CPUs, our implementation achieves a speedup of ~50x over reference implementations for the TT compression, and up to ~5x for solving linear systems. For GPUs, we give an overview of the performance of the most important operations and shortly discuss the implementation of a tall-skinny QR decomposition.

Item URL in elib:https://elib.dlr.de/223282/
Document Type:Conference or Workshop Item (Speech)
Title:Performance of linear algebra building blocks for low-rank tensor algorithms on current multi-core CPUs and GPUs
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Röhrig-Zöllner, MelvenMelven.Roehrig-Zoellner (at) dlr.dehttps://orcid.org/0000-0001-9851-5886UNSPECIFIED
Thies, JonasJ.Thies (at) tudelft.nlhttps://orcid.org/0000-0001-9231-9999UNSPECIFIED
Basermann, AchimAchim.Basermann (at) dlr.dehttps://orcid.org/0000-0003-3637-3231UNSPECIFIED
Date:5 March 2026
Refereed publication:No
Open Access:Yes
Gold Open Access:No
In SCOPUS:No
In ISI Web of Science:No
Status:Published
Keywords:linear algebra, high-performance computing, tensor operations, tensor-trains
Event Title:SIAM Conference on Parallel Processing for Scientific Computing
Event Location:Berlin, Deutschland
Event Type:international Conference
Event Start Date:2 March 2026
Event End Date:6 March 2026
Organizer:Society for Industrial and Applied Mathematics (SIAM)
HGF - Research field:other
HGF - Program:other
HGF - Program Themes:other
DLR - Research area:Quantum Computing Initiative
DLR - Program:QC AW - Applications
DLR - Research theme (Project):QC - QuTeNet
Location: Köln-Porz
Institutes and Institutions:Institute of Software Technology
Institute of Software Technology > High-Performance Computing
Deposited By: Röhrig-Zöllner, Melven
Deposited On:08 Apr 2026 10:55
Last Modified:08 Apr 2026 10:55

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
OpenAIRE Validator logo electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.