Haertel, Christian and Sagavakar, Kunal Sanjay and Staegemann, Daniel and Pohl, Matthias and Volk, Matthias and Turowski, Klaus (2026) Designing a Framework for DataOps: Improving Data Quality and Pipeline Efficiency in Data Science. Procedia Computer Science. Elsevier. ISSN 1877-0509. (In Press)
Full text not available from this repository.
Abstract
Data Science (DS), with the application of methods from Data Analytics, can assist organizations in deriving value from huge amounts of data to improve performance. To be able to provide actionable insights from this resource, a significant effort in DS is attributed to the data collection, cleaning, and transformation to construct a target high-quality dataset for analytics. However, the traditional, predominantly manual approach to data preparation is considered inefficient and error-prone, failing to meet the requirements of Big Data environments and the adaptability to react to changing conditions. DataOps aims to combine best practices and technologies from DevOps to automate stages of the data lifecycle to increase the quality and reliability of data pipelines. Hence, in this paper, a framework is proposed to guide DataOps implementation, using the Design Science Research methodology. The applicability of the artifact is demonstrated in a case study in urban mobility analytics.
| Item URL in elib: | https://elib.dlr.de/221861/ | ||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Document Type: | Article | ||||||||||||||||||||||||||||
| Title: | Designing a Framework for DataOps: Improving Data Quality and Pipeline Efficiency in Data Science | ||||||||||||||||||||||||||||
| Authors: |
| ||||||||||||||||||||||||||||
| Date: | 2026 | ||||||||||||||||||||||||||||
| Journal or Publication Title: | Procedia Computer Science | ||||||||||||||||||||||||||||
| Refereed publication: | Yes | ||||||||||||||||||||||||||||
| Open Access: | No | ||||||||||||||||||||||||||||
| Gold Open Access: | No | ||||||||||||||||||||||||||||
| In SCOPUS: | Yes | ||||||||||||||||||||||||||||
| In ISI Web of Science: | No | ||||||||||||||||||||||||||||
| Publisher: | Elsevier | ||||||||||||||||||||||||||||
| ISSN: | 1877-0509 | ||||||||||||||||||||||||||||
| Status: | In Press | ||||||||||||||||||||||||||||
| Keywords: | Data Science; Data Analytics; DataOps; Design Science Research; Cloud Computing. | ||||||||||||||||||||||||||||
| HGF - Research field: | Aeronautics, Space and Transport | ||||||||||||||||||||||||||||
| HGF - Program: | Space | ||||||||||||||||||||||||||||
| HGF - Program Themes: | other | ||||||||||||||||||||||||||||
| DLR - Research area: | Raumfahrt | ||||||||||||||||||||||||||||
| DLR - Program: | R - no assignment | ||||||||||||||||||||||||||||
| DLR - Research theme (Project): | R - no assignment | ||||||||||||||||||||||||||||
| Location: | Jena | ||||||||||||||||||||||||||||
| Institutes and Institutions: | Institute of Data Science > Data Management and Enrichment | ||||||||||||||||||||||||||||
| Deposited By: | Pohl, Matthias | ||||||||||||||||||||||||||||
| Deposited On: | 09 Jan 2026 08:13 | ||||||||||||||||||||||||||||
| Last Modified: | 09 Jan 2026 08:13 |
Repository Staff Only: item control page