Lehmler, Stephan Johann und Saif-ur-Rehman, Muhammad und Glasmachers, Tobias und Iossifidis, Ioannis (2026) Distributional Properties of ReLU-Activations in Artificial Neural Networks that Learn by Memorization. In: 11th International Conference on Machine Learning, Optimization, and Data Science, LOD 2025, 16467, Seiten 410-423. Springer. LOD2025, 2025-09-21 - 2025-09-24, Castiglione della Pescaia, Italy. doi: 10.1007/978-3-032-21477-5_27. ISBN 978-303221476-8. ISSN 0302-9743.
|
PDF
- Nur DLR-intern zugänglich
4MB |
Offizielle URL: https://dx.doi.org/10.1007/978-3-032-21477-5_27
Kurzfassung
We investigate the distributional properties of layers in Artificial Neural Network (ANN) that can be used to distinguish between networks learning by generalization and memorizing networks. Starting from the notion of memorization being essentially definable as learning ‘rare’ features of the input data, we propose the activation probability of Rectified Linear Units (ReLU)-neurons as an important indicator of memorization. Building on this idea, we show how hypotheses about distributional properties over whole networks structures can be derived from the activation probability of singular neurons in memorizing ANNs. We such extend previous work on using Poisson process models of activations in ANN by considering the correlation between neurons. Using this approach, we further simulate the effect of memorizing neurons on distributional properties of weight matrices and activation magnitudes and find a connection between L1/L2-norm regularization of weight matrices. We empirically evaluate the distributions of activation rate, correlation structure and weight matrices in memorizing and generalizing ANNs on a simple MNIST-classification task. Our initial findings show how the activation frequency and intra-layer correlation structure can be used to distinguish generalizing from memorizing networks and for inferring distributional properties on affected parts of the networks. This work presents a building block to later derive online metrics for memorization in ANNs.
| elib-URL des Eintrags: | https://elib.dlr.de/225152/ | ||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Dokumentart: | Konferenzbeitrag (Vortrag) | ||||||||||||||||||||||||||||||||
| Titel: | Distributional Properties of ReLU-Activations in Artificial Neural Networks that Learn by Memorization | ||||||||||||||||||||||||||||||||
| Autoren: |
| ||||||||||||||||||||||||||||||||
| Datum: | 1 Mai 2026 | ||||||||||||||||||||||||||||||||
| Erschienen in: | 11th International Conference on Machine Learning, Optimization, and Data Science, LOD 2025 | ||||||||||||||||||||||||||||||||
| Referierte Publikation: | Ja | ||||||||||||||||||||||||||||||||
| Open Access: | Nein | ||||||||||||||||||||||||||||||||
| Gold Open Access: | Nein | ||||||||||||||||||||||||||||||||
| In SCOPUS: | Ja | ||||||||||||||||||||||||||||||||
| In ISI Web of Science: | Nein | ||||||||||||||||||||||||||||||||
| Band: | 16467 | ||||||||||||||||||||||||||||||||
| DOI: | 10.1007/978-3-032-21477-5_27 | ||||||||||||||||||||||||||||||||
| Seitenbereich: | Seiten 410-423 | ||||||||||||||||||||||||||||||||
| Herausgeber: |
| ||||||||||||||||||||||||||||||||
| Verlag: | Springer | ||||||||||||||||||||||||||||||||
| Name der Reihe: | Lecture Notes in Computer Science | ||||||||||||||||||||||||||||||||
| ISSN: | 0302-9743 | ||||||||||||||||||||||||||||||||
| ISBN: | 978-303221476-8 | ||||||||||||||||||||||||||||||||
| Status: | veröffentlicht | ||||||||||||||||||||||||||||||||
| Stichwörter: | Artificial Neural Networks, Memorization, Statistical Modeling | ||||||||||||||||||||||||||||||||
| Veranstaltungstitel: | LOD2025 | ||||||||||||||||||||||||||||||||
| Veranstaltungsort: | Castiglione della Pescaia, Italy | ||||||||||||||||||||||||||||||||
| Veranstaltungsart: | internationale Konferenz | ||||||||||||||||||||||||||||||||
| Veranstaltungsbeginn: | 21 September 2025 | ||||||||||||||||||||||||||||||||
| Veranstaltungsende: | 24 September 2025 | ||||||||||||||||||||||||||||||||
| HGF - Forschungsbereich: | keine Zuordnung | ||||||||||||||||||||||||||||||||
| HGF - Programm: | keine Zuordnung | ||||||||||||||||||||||||||||||||
| HGF - Programmthema: | keine Zuordnung | ||||||||||||||||||||||||||||||||
| DLR - Schwerpunkt: | Digitalisierung | ||||||||||||||||||||||||||||||||
| DLR - Forschungsgebiet: | D KIZ - Künstliche Intelligenz | ||||||||||||||||||||||||||||||||
| DLR - Teilgebiet (Projekt, Vorhaben): | D - Kurzstudien [KIZ] | ||||||||||||||||||||||||||||||||
| Standort: | Ulm | ||||||||||||||||||||||||||||||||
| Institute & Einrichtungen: | Institut für KI-Sicherheit | ||||||||||||||||||||||||||||||||
| Hinterlegt von: | Lehmler, Stephan | ||||||||||||||||||||||||||||||||
| Hinterlegt am: | 18 Jun 2026 10:43 | ||||||||||||||||||||||||||||||||
| Letzte Änderung: | 22 Jun 2026 10:18 |
Nur für Mitarbeiter des Archivs: Kontrollseite des Eintrags