Schreiber, Andreas (2017) Traceability and Reproducibility of Big Data Analytics Workflows using Provenance. 2nd European GeoInformation Symposium and Exposition, 2017-06-20 - 2017-06-22, Berlin, Deutschland.
PDF
4MB |
Offizielle URL: https://www.afcea.org/event/?q=GEO17
Kurzfassung
The provenance of data provides detailed information about the origin of that data. That includes information about ownership and both actions and modifications performed on the data. With provenance information, data will be traceable and reproducible. In data science, results that are not reproducible by peer scientists are valueless and of no significance. In engineering, users can be more confident in the quality of products that ware developed based on simulations and data analytics workflows. To specify and store provenance information, W3C has standardized the provenance model PROV. Using PROV and associated implementations, users can record provenance of data analytics processes. The provenance information are directed acyclic graphs that can be analyzed to get insight into the data analytics processes. The talk describes the architecture of provenance management and how to apply provenance recording and provenance analytics to data science and big data analytics workflows.
elib-URL des Eintrags: | https://elib.dlr.de/113545/ | ||||||||
---|---|---|---|---|---|---|---|---|---|
Dokumentart: | Konferenzbeitrag (Vortrag) | ||||||||
Titel: | Traceability and Reproducibility of Big Data Analytics Workflows using Provenance | ||||||||
Autoren: |
| ||||||||
Datum: | 21 Juni 2017 | ||||||||
Referierte Publikation: | Nein | ||||||||
Open Access: | Ja | ||||||||
Gold Open Access: | Nein | ||||||||
In SCOPUS: | Nein | ||||||||
In ISI Web of Science: | Nein | ||||||||
Status: | veröffentlicht | ||||||||
Stichwörter: | big data, data science, reproducibility, traceability, provenance | ||||||||
Veranstaltungstitel: | 2nd European GeoInformation Symposium and Exposition | ||||||||
Veranstaltungsort: | Berlin, Deutschland | ||||||||
Veranstaltungsart: | internationale Konferenz | ||||||||
Veranstaltungsbeginn: | 20 Juni 2017 | ||||||||
Veranstaltungsende: | 22 Juni 2017 | ||||||||
Veranstalter : | AFCEA International | ||||||||
HGF - Forschungsbereich: | Luftfahrt, Raumfahrt und Verkehr | ||||||||
HGF - Programm: | Raumfahrt | ||||||||
HGF - Programmthema: | Technik für Raumfahrtsysteme | ||||||||
DLR - Schwerpunkt: | Raumfahrt | ||||||||
DLR - Forschungsgebiet: | R SY - Technik für Raumfahrtsysteme | ||||||||
DLR - Teilgebiet (Projekt, Vorhaben): | R - Vorhaben SISTEC (alt) | ||||||||
Standort: | Köln-Porz | ||||||||
Institute & Einrichtungen: | Institut für Simulations- und Softwaretechnik Institut für Simulations- und Softwaretechnik > Verteilte Systeme und Komponentensoftware | ||||||||
Hinterlegt von: | Schreiber, Andreas | ||||||||
Hinterlegt am: | 07 Nov 2017 13:55 | ||||||||
Letzte Änderung: | 24 Apr 2024 20:18 |
Nur für Mitarbeiter des Archivs: Kontrollseite des Eintrags