elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Accessibility | Contact | Deutsch
Fontsize: [-] Text [+]

Designing a Framework for DataOps: Improving Data Quality and Pipeline Efficiency in Data Science

Haertel, Christian and Sagavakar, Kunal Sanjay and Staegemann, Daniel and Pohl, Matthias and Volk, Matthias and Turowski, Klaus (2026) Designing a Framework for DataOps: Improving Data Quality and Pipeline Efficiency in Data Science. Procedia Computer Science. Elsevier. ISSN 1877-0509. (In Press)

Full text not available from this repository.

Abstract

Data Science (DS), with the application of methods from Data Analytics, can assist organizations in deriving value from huge amounts of data to improve performance. To be able to provide actionable insights from this resource, a significant effort in DS is attributed to the data collection, cleaning, and transformation to construct a target high-quality dataset for analytics. However, the traditional, predominantly manual approach to data preparation is considered inefficient and error-prone, failing to meet the requirements of Big Data environments and the adaptability to react to changing conditions. DataOps aims to combine best practices and technologies from DevOps to automate stages of the data lifecycle to increase the quality and reliability of data pipelines. Hence, in this paper, a framework is proposed to guide DataOps implementation, using the Design Science Research methodology. The applicability of the artifact is demonstrated in a case study in urban mobility analytics.

Item URL in elib:https://elib.dlr.de/221861/
Document Type:Article
Title:Designing a Framework for DataOps: Improving Data Quality and Pipeline Efficiency in Data Science
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Haertel, ChristianUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Sagavakar, Kunal SanjayUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Staegemann, DanielUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Pohl, MatthiasUNSPECIFIEDhttps://orcid.org/0000-0002-6241-7675UNSPECIFIED
Volk, MatthiasUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Turowski, KlausUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Date:2026
Journal or Publication Title:Procedia Computer Science
Refereed publication:Yes
Open Access:No
Gold Open Access:No
In SCOPUS:Yes
In ISI Web of Science:No
Publisher:Elsevier
ISSN:1877-0509
Status:In Press
Keywords:Data Science; Data Analytics; DataOps; Design Science Research; Cloud Computing.
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Space
HGF - Program Themes:other
DLR - Research area:Raumfahrt
DLR - Program:R - no assignment
DLR - Research theme (Project):R - no assignment
Location: Jena
Institutes and Institutions:Institute of Data Science > Data Management and Enrichment
Deposited By: Pohl, Matthias
Deposited On:09 Jan 2026 08:13
Last Modified:09 Jan 2026 08:13

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
OpenAIRE Validator logo electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.