elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Impressum | Datenschutz | Kontakt | English
Schriftgröße: [-] Text [+]

Sharing data pipelines: Why sharing data may not be enough, and what to do about it.

Käthner, David (2020) Sharing data pipelines: Why sharing data may not be enough, and what to do about it. Sharing psychological research data: Best practices and new developments (CSPD2020), 2020-12-07 - 2020-12-08, Online.

[img] PDF
740kB

Kurzfassung

New research challenges and low-cost technological solutions drive the motivation to record behavior in multivariate ways using high temporal resolution. Making such data accessible and usable is more complicated than it may seem. Methods like ECG, EEG, and eye tracking can produce very large amounts of data in a short time. Further, context data to explain the observed behavior must be recorded as well. E.g., in a field study using an instrumented research vehicle, the position of the vehicle and the distance to the vehicle in front could act as context data. To make this multitude of data analyzable, data must be cleaned and fused in data pipelines. Cleaning happens in multiple stages, and requires decisions which have direct effects on patterns in the data. Time series data are often up- or down sampled, potentially altering characteristics of signals of interest. Sharing the data pipeline alongside an uncleaned version of the data therefore should be the default when publishing research results. Data science has developed a number of solutions to store and document data and data pipelines, whose benefits and costs will be discussed in this talk. These approaches can be structured in three interdependent dimensions: data storage, data processing, and competencies required by developers and users of data pipelines. Data from empirical studies can be very challenging to store, process, and document. Solutions to these issues do exist, but they require a training which is yet to be implemented in the typical Psychology curriculum.

elib-URL des Eintrags:https://elib.dlr.de/140472/
Dokumentart:Konferenzbeitrag (Vortrag)
Titel:Sharing data pipelines: Why sharing data may not be enough, and what to do about it.
Autoren:
AutorenInstitution oder E-Mail-AdresseAutoren-ORCID-iDORCID Put Code
Käthner, DavidDavid.Kaethner (at) dlr.dehttps://orcid.org/0000-0003-4168-2266NICHT SPEZIFIZIERT
Datum:7 Dezember 2020
Referierte Publikation:Nein
Open Access:Ja
Gold Open Access:Nein
In SCOPUS:Nein
In ISI Web of Science:Nein
Status:veröffentlicht
Stichwörter:data pipeline; data; reproducible research;
Veranstaltungstitel:Sharing psychological research data: Best practices and new developments (CSPD2020)
Veranstaltungsort:Online
Veranstaltungsart:internationale Konferenz
Veranstaltungsbeginn:7 Dezember 2020
Veranstaltungsende:8 Dezember 2020
Veranstalter :Leibniz Institute for Psychology Information (ZPID)
HGF - Forschungsbereich:Luftfahrt, Raumfahrt und Verkehr
HGF - Programm:Verkehr
HGF - Programmthema:Verkehrssystem
DLR - Schwerpunkt:Verkehr
DLR - Forschungsgebiet:V VS - Verkehrssystem
DLR - Teilgebiet (Projekt, Vorhaben):V - Energie und Verkehr (alt)
Standort: Braunschweig
Institute & Einrichtungen:Institut für Verkehrssystemtechnik
Hinterlegt von: Käthner, David
Hinterlegt am:15 Jan 2021 10:52
Letzte Änderung:24 Apr 2024 20:41

Nur für Mitarbeiter des Archivs: Kontrollseite des Eintrags

Blättern
Suchen
Hilfe & Kontakt
Informationen
electronic library verwendet EPrints 3.3.12
Gestaltung Webseite und Datenbank: Copyright © Deutsches Zentrum für Luft- und Raumfahrt (DLR). Alle Rechte vorbehalten.