Osterthun, Arne and Pohl, Matthias (2025) FoxBench - Benchmark Array File Formats. [Other publication]
This archive cannot provide the full text.
Abstract
For effective data exchange and transfer, choosing the proper file format is crucial. Different domains have specific standards for file formats. While CSV files are commonly used, they lack reusability. Data files are well-suited for computing clusters as they enable efficient data processing and storage across multiple interconnected systems. Data analytics pipelines can be time-consuming due to handling large volumes of data. Timely data access is crucial for efficient processing and analysis. Earth system science (ESS) data commonly manifests as dense or sparse n-dimensional data. Dense n-dimensional data is conventionally stored in arrays, while sparse n-dimensional data is typically housed in data frames. In the realm of ESS, an array of file formats is leveraged for the storage of dense n-dimensional data, including NetCDF4, TileDB, and Zarr. The paper at hand aims to evaluate data file formats for retrieving multidimensional data, specifically focusing on tools within the ESS domain. The insights from this exploration will be applicable to other data analytics projects.
| elib URL of the record: | https://elib.dlr.de/219250/ |
|---|---|
| Document type: | Other publication |
| Title: | FoxBench - Benchmark Array File Formats |
| Authors: | Osterthun, Arne; Pohl, Matthias |
| Date: | 16 January 2025 |
| Refereed publication: | No |
| Open Access: | No |
| DOI: | 10.5281/zenodo.14644865 |
| Status: | Published |
| Keywords: | Benchmark, Data Access, Storage, Cost-based Valuation, File Formats, Big Data |
| HGF - Research field: | Aeronautics, Space and Transport |
| HGF - Program: | Space |
| HGF - Program topic: | Not assigned |
| DLR - Research area: | Space |
| DLR - Research field: | R - not assigned |
| DLR - Subfield (project): | R - not assigned |
| Location: | Jena |
| Institutes & facilities: | Institut für Datenwissenschaften > Datenmanagement und -aufbereitung |
| Deposited by: | Osterthun, Arne |
| Deposited on: | 19 Nov 2025 08:17 |
| Last modified: | 19 Nov 2025 08:17 |