Lukats, Daniel and Zielinski, Oliver and Hahn, Axel and Stahl, Frederic (2024) A benchmark and survey of fully unsupervised concept drift detectors on real-world data streams. International Journal of Data Science and Analytics. Springer. doi: 10.1007/s41060-024-00620-y. ISSN 2364-415X.
PDF
- Publisher's version (published version)
2MB
Official URL: https://link.springer.com/article/10.1007/s41060-024-00620-y#citeas
Abstract
Concept drift detection techniques can be used to discover substantial changes of the patterns encoded in data streams in real-time. If left unaddressed, these changes can render deployed machine learning models unreliable because their training data no longer matches the patterns present in the data stream. Most algorithms proposed in the literature depend on the immediate availability of ground truth class labels. This is unrealistic for many applications due to the associated cost of labeling. Therefore, this study reviews the availability of fully unsupervised concept drift detectors, which can operate entirely without labeled data. Ten algorithms are analyzed in terms of architectural choices, core ideas and assumptions about data because they fulfilled several inclusion criteria designed to ensure faithful and reliable implementations. Seven of these algorithms are evaluated with common concept drift detection metrics on eleven real-world data streams; the remaining three performed too slowly or depended on chance. Based on the results of these experiments, three concept drift detectors (Discriminative Drift Detector, Image-Based Drift Detector and Semi-Parametric Log-Likelihood) can be recommended depending on the desired target metric. This study further reveals issues with the evaluation metrics Mean Time Ratio and lift-per-drift. Finally, it highlights open research challenges.
elib URL of the record: | https://elib.dlr.de/206088/
---|---
Document type: | Journal article
Title: | A benchmark and survey of fully unsupervised concept drift detectors on real-world data streams
Authors: | Lukats, Daniel; Zielinski, Oliver; Hahn, Axel; Stahl, Frederic
Date: | 27 August 2024
Published in: | International Journal of Data Science and Analytics
Refereed publication: | Yes
Open Access: | Yes
Gold Open Access: | No
In SCOPUS: | Yes
In ISI Web of Science: | Yes
DOI: | 10.1007/s41060-024-00620-y
Publisher: | Springer
ISSN: | 2364-415X
Status: | published
Keywords: | Unsupervised concept drift detection; Non-stationary data analysis; Metrics for concept drift evaluation; Benchmark data streams
HGF - Research field: | no assignment
HGF - Program: | no assignment
HGF - Program theme: | no assignment
DLR - Research area: | no assignment
DLR - Program: | no assignment
DLR - Research theme (project): | no assignment
Location: | Oldenburg
Institutes and institutions: | Institut für Systems Engineering für zukünftige Mobilität
Deposited by: | Jupe-Weinauer, Julia
Deposited on: | 04 Sep 2024 11:12
Last modified: | 09 Sep 2024 09:52