Memon, Shahbaz and Jadebeck, Johann F. and Osthege, Michael and Wendler, Anna Clara and Kerkmann, David and Zunker, Henrik and Wiechert, Wolfgang and Nöh, Katharina and Göbbert, Jens Henrik and Hagemeier, Björn and Riedel, Morris and Kühn, Martin Joachim (2024) Automated Processing of Pipelines Managing Now- and Forecasting of Infectious Diseases. In: 2024 47th ICT and Electronics Convention, MIPRO 2024 - Proceedings. IEEE. 47th International Convention on Information and Communication Technology, Electronics and Microelectronics, MIPRO 2024, 2024-05-20, Croatia. doi: 10.1109/MIPRO60963.2024.10569336. ISBN 979-835038249-5. ISSN 2623-8764.
![]() |
PDF
248kB |
Official URL: https://ieeexplore.ieee.org/document/10569336
Abstract
When faced with the challenge of now- and forecasting infectious diseases, multiple data sources and state-of-the-art models have to be considered. Automatic aggregation, processing, and publishing to relevant data sinks is paramount to achieving consistent, reproducible, and timely results given daily-reported data. To facilitate scientific collaboration and reproducibility of workflows, open and extensible architectures for compute pipelines are required. In this research, we devise an architecture realizing the seamless management and processing of reproducible pipelines. Our case-study is a daily pipeline for nowcasting the state of SARS-CoV-2 in Germany based on public data and state-of-the-art models implemented in the simulation software MEmilio. The results of our pipeline are pushed to ESID (Epidemiological Scenarios for Infectious Diseases), a user interface to epidemiological simulations. To realize the given pipeline, a workflow management system is required to ensure pipeline processing and secure access to multiple heterogeneous data storages. For this purpose, we based our work on an open-source workflow management system - Apache Airflow, which provides the orchestration, coordination and management of complex connected tasks. S3 is utilized as an intermediate data storage service for sharing data between workflow steps and persisting experiment output. We provide a comprehensive view on our work on automated, end-to-end and reproducible pipelines, with detailed commentary on use case, and its realization.
Item URL in elib: | https://elib.dlr.de/205435/ | ||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Document Type: | Conference or Workshop Item (Speech) | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Title: | Automated Processing of Pipelines Managing Now- and Forecasting of Infectious Diseases | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Authors: |
| ||||||||||||||||||||||||||||||||||||||||||||||||||||
Date: | 2024 | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Journal or Publication Title: | 2024 47th ICT and Electronics Convention, MIPRO 2024 - Proceedings | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Refereed publication: | Yes | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Open Access: | Yes | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Gold Open Access: | No | ||||||||||||||||||||||||||||||||||||||||||||||||||||
In SCOPUS: | Yes | ||||||||||||||||||||||||||||||||||||||||||||||||||||
In ISI Web of Science: | No | ||||||||||||||||||||||||||||||||||||||||||||||||||||
DOI: | 10.1109/MIPRO60963.2024.10569336 | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Publisher: | IEEE | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Series Name: | 47th International Convention on Information and Communication Technology, Electronics and Microelectronics, MIPRO 2024 | ||||||||||||||||||||||||||||||||||||||||||||||||||||
ISSN: | 2623-8764 | ||||||||||||||||||||||||||||||||||||||||||||||||||||
ISBN: | 979-835038249-5 | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Status: | Published | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Keywords: | nowcasting, forecasting, automatization, pipeline, workflow management, end-to-end, processing | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Event Title: | 47th International Convention on Information and Communication Technology, Electronics and Microelectronics, MIPRO 2024 | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Event Location: | Croatia | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Event Type: | international Conference | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Event Date: | 20 May 2024 | ||||||||||||||||||||||||||||||||||||||||||||||||||||
HGF - Research field: | Aeronautics, Space and Transport | ||||||||||||||||||||||||||||||||||||||||||||||||||||
HGF - Program: | Space | ||||||||||||||||||||||||||||||||||||||||||||||||||||
HGF - Program Themes: | Space System Technology | ||||||||||||||||||||||||||||||||||||||||||||||||||||
DLR - Research area: | Raumfahrt | ||||||||||||||||||||||||||||||||||||||||||||||||||||
DLR - Program: | R SY - Space System Technology | ||||||||||||||||||||||||||||||||||||||||||||||||||||
DLR - Research theme (Project): | R - Tasks SISTEC | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Location: | Köln-Porz | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Institutes and Institutions: | Institute of Software Technology Institute of Software Technology > High-Performance Computing | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Deposited By: | Kühn, Dr. Martin Joachim | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Deposited On: | 02 Aug 2024 10:50 | ||||||||||||||||||||||||||||||||||||||||||||||||||||
Last Modified: | 02 Aug 2024 10:50 |
Repository Staff Only: item control page