elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Contact | Deutsch
Fontsize: [-] Text [+]

Automated Processing of Pipelines Managing Now- and Forecasting of Infectious Diseases

Memon, Shahbaz and Jadebeck, Johann F. and Osthege, Michael and Wendler, Anna Clara and Kerkmann, David and Zunker, Henrik and Wiechert, Wolfgang and Nöh, Katharina and Göbbert, Jens Henrik and Hagemeier, Björn and Riedel, Morris and Kühn, Martin Joachim (2024) Automated Processing of Pipelines Managing Now- and Forecasting of Infectious Diseases. In: 2024 47th ICT and Electronics Convention, MIPRO 2024 - Proceedings. IEEE. 47th International Convention on Information and Communication Technology, Electronics and Microelectronics, MIPRO 2024, 2024-05-20, Croatia. doi: 10.1109/MIPRO60963.2024.10569336. ISBN 979-835038249-5. ISSN 2623-8764.

[img] PDF
248kB

Official URL: https://ieeexplore.ieee.org/document/10569336

Abstract

When faced with the challenge of now- and forecasting infectious diseases, multiple data sources and state-of-the-art models have to be considered. Automatic aggregation, processing, and publishing to relevant data sinks is paramount to achieving consistent, reproducible, and timely results given daily-reported data. To facilitate scientific collaboration and reproducibility of workflows, open and extensible architectures for compute pipelines are required. In this research, we devise an architecture realizing the seamless management and processing of reproducible pipelines. Our case-study is a daily pipeline for nowcasting the state of SARS-CoV-2 in Germany based on public data and state-of-the-art models implemented in the simulation software MEmilio. The results of our pipeline are pushed to ESID (Epidemiological Scenarios for Infectious Diseases), a user interface to epidemiological simulations. To realize the given pipeline, a workflow management system is required to ensure pipeline processing and secure access to multiple heterogeneous data storages. For this purpose, we based our work on an open-source workflow management system - Apache Airflow, which provides the orchestration, coordination and management of complex connected tasks. S3 is utilized as an intermediate data storage service for sharing data between workflow steps and persisting experiment output. We provide a comprehensive view on our work on automated, end-to-end and reproducible pipelines, with detailed commentary on use case, and its realization.

Item URL in elib:https://elib.dlr.de/205435/
Document Type:Conference or Workshop Item (Speech)
Title:Automated Processing of Pipelines Managing Now- and Forecasting of Infectious Diseases
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Memon, ShahbazJülich Supercomputing Centre, Forschungszentrum Jülich GmbHUNSPECIFIEDUNSPECIFIED
Jadebeck, Johann F.Institute for Bio- and Geosciences, IBG-1: Biotechnology, Forschungszentrum Jülich GmbHUNSPECIFIEDUNSPECIFIED
Osthege, MichaelInstitute for Bio- and Geosciences, IBG-1: Biotechnology, Forschungszentrum Jülich GmbHUNSPECIFIEDUNSPECIFIED
Wendler, Anna ClaraUNSPECIFIEDhttps://orcid.org/0000-0002-1816-8907UNSPECIFIED
Kerkmann, DavidUNSPECIFIEDUNSPECIFIEDUNSPECIFIED
Zunker, HenrikUNSPECIFIEDhttps://orcid.org/0000-0002-9825-365X164795614
Wiechert, WolfgangInstitute for Bio- and Geosciences, IBG-1: Biotechnology, Forschungszentrum Jülich GmbHUNSPECIFIEDUNSPECIFIED
Nöh, KatharinaInstitute for Bio- and Geosciences, IBG-1: Biotechnology, Forschungszentrum Jülich GmbHUNSPECIFIEDUNSPECIFIED
Göbbert, Jens HenrikJülich Supercomputing Centre, Forschungszentrum Jülich GmbHUNSPECIFIEDUNSPECIFIED
Hagemeier, BjörnJülich Supercomputing Centre, Forschungszentrum Jülich GmbHUNSPECIFIEDUNSPECIFIED
Riedel, MorrisJülich Supercomputing Centre, Forschungszentrum Jülich GmbHUNSPECIFIEDUNSPECIFIED
Kühn, Martin JoachimUNSPECIFIEDhttps://orcid.org/0000-0002-0906-6984UNSPECIFIED
Date:2024
Journal or Publication Title:2024 47th ICT and Electronics Convention, MIPRO 2024 - Proceedings
Refereed publication:Yes
Open Access:Yes
Gold Open Access:No
In SCOPUS:Yes
In ISI Web of Science:No
DOI:10.1109/MIPRO60963.2024.10569336
Publisher:IEEE
Series Name:47th International Convention on Information and Communication Technology, Electronics and Microelectronics, MIPRO 2024
ISSN:2623-8764
ISBN:979-835038249-5
Status:Published
Keywords:nowcasting, forecasting, automatization, pipeline, workflow management, end-to-end, processing
Event Title:47th International Convention on Information and Communication Technology, Electronics and Microelectronics, MIPRO 2024
Event Location:Croatia
Event Type:international Conference
Event Date:20 May 2024
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Space
HGF - Program Themes:Space System Technology
DLR - Research area:Raumfahrt
DLR - Program:R SY - Space System Technology
DLR - Research theme (Project):R - Tasks SISTEC
Location: Köln-Porz
Institutes and Institutions:Institute of Software Technology
Institute of Software Technology > High-Performance Computing
Deposited By: Kühn, Dr. Martin Joachim
Deposited On:02 Aug 2024 10:50
Last Modified:02 Aug 2024 10:50

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
OpenAIRE Validator logo electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.