Distributional Properties of ReLU-Activations in Artificial Neural Networks that Learn by Memorization

Lehmler, Stephan Johann und Saif-ur-Rehman, Muhammad und Glasmachers, Tobias und Iossifidis, Ioannis (2026) Distributional Properties of ReLU-Activations in Artificial Neural Networks that Learn by Memorization. In: 11th International Conference on Machine Learning, Optimization, and Data Science, LOD 2025, 16467, Seiten 410-423. Springer. LOD2025, 2025-09-21 - 2025-09-24, Castiglione della Pescaia, Italy. doi: 10.1007/978-3-032-21477-5_27. ISBN 978-303221476-8. ISSN 0302-9743.

PDF - Nur DLR-intern zugänglich
4MB

Offizielle URL: https://dx.doi.org/10.1007/978-3-032-21477-5_27

Kurzfassung

We investigate the distributional properties of layers in Artificial Neural Network (ANN) that can be used to distinguish between networks learning by generalization and memorizing networks. Starting from the notion of memorization being essentially definable as learning ‘rare’ features of the input data, we propose the activation probability of Rectified Linear Units (ReLU)-neurons as an important indicator of memorization. Building on this idea, we show how hypotheses about distributional properties over whole networks structures can be derived from the activation probability of singular neurons in memorizing ANNs. We such extend previous work on using Poisson process models of activations in ANN by considering the correlation between neurons. Using this approach, we further simulate the effect of memorizing neurons on distributional properties of weight matrices and activation magnitudes and find a connection between L1/L2-norm regularization of weight matrices. We empirically evaluate the distributions of activation rate, correlation structure and weight matrices in memorizing and generalizing ANNs on a simple MNIST-classification task. Our initial findings show how the activation frequency and intra-layer correlation structure can be used to distinguish generalizing from memorizing networks and for inferring distributional properties on affected parts of the networks. This work presents a building block to later derive online metrics for memorization in ANNs.

elib-URL des Eintrags:

https://elib.dlr.de/225152/

Dokumentart:

Konferenzbeitrag (Vortrag)

Titel:

Distributional Properties of ReLU-Activations in Artificial Neural Networks that Learn by Memorization

Autoren:

Autoren	Institution oder E-Mail-Adresse	Autoren-ORCID-iD	ORCID Put Code
Lehmler, Stephan Johann	stephan.lehmler (at) dlr.de	https://orcid.org/0000-0002-6373-4074	218076637
Saif-ur-Rehman, Muhammad	NICHT SPEZIFIZIERT	https://orcid.org/0000-0003-1774-7330	NICHT SPEZIFIZIERT
Glasmachers, Tobias	NICHT SPEZIFIZIERT	https://orcid.org/0000-0003-1886-1696	NICHT SPEZIFIZIERT
Iossifidis, Ioannis	NICHT SPEZIFIZIERT	https://orcid.org/0000-0002-9876-4396	NICHT SPEZIFIZIERT

Datum:

1 Mai 2026

Erschienen in:

11th International Conference on Machine Learning, Optimization, and Data Science, LOD 2025

Referierte Publikation:

Open Access:

Nein

Gold Open Access:

Nein

In SCOPUS:

In ISI Web of Science:

Nein

Band:

16467

DOI:

10.1007/978-3-032-21477-5_27

Seitenbereich:

Seiten 410-423

Herausgeber:

Herausgeber	Institution und/oder E-Mail-Adresse der Herausgeber	Herausgeber-ORCID-iD	ORCID Put Code
Nicosia, Guiseppe	University of Catania	NICHT SPEZIFIZIERT	NICHT SPEZIFIZIERT
Ojha, Varun	Newcastle University	NICHT SPEZIFIZIERT	NICHT SPEZIFIZIERT
Giesselbach, Sven	Fraunhofer Institute - IAIS	NICHT SPEZIFIZIERT	NICHT SPEZIFIZIERT
Padalos, Panos	University of Florida	NICHT SPEZIFIZIERT	NICHT SPEZIFIZIERT
Umeton, Renato	Dana-Farber Cancer Institute	NICHT SPEZIFIZIERT	NICHT SPEZIFIZIERT
La Malfa, Emanuele	University of Oxford	NICHT SPEZIFIZIERT	NICHT SPEZIFIZIERT
La Malfa, Gabriele	King’s College London	NICHT SPEZIFIZIERT	NICHT SPEZIFIZIERT

Verlag:

Springer

Name der Reihe:

Lecture Notes in Computer Science

ISSN:

0302-9743

ISBN:

978-303221476-8

Status:

veröffentlicht

Stichwörter:

Artificial Neural Networks, Memorization, Statistical Modeling

Veranstaltungstitel:

LOD2025

Veranstaltungsort:

Castiglione della Pescaia, Italy

Veranstaltungsart:

internationale Konferenz

Veranstaltungsbeginn:

21 September 2025

Veranstaltungsende:

24 September 2025

HGF - Forschungsbereich:

keine Zuordnung

HGF - Programm:

keine Zuordnung

HGF - Programmthema:

keine Zuordnung

DLR - Schwerpunkt:

Digitalisierung

DLR - Forschungsgebiet:

D KIZ - Künstliche Intelligenz

DLR - Teilgebiet (Projekt, Vorhaben):

D - Kurzstudien [KIZ]

Standort:

Ulm

Institute & Einrichtungen:

Institut für KI-Sicherheit

Hinterlegt von:

Lehmler, Stephan

Hinterlegt am:

18 Jun 2026 10:43

Letzte Änderung:

22 Jun 2026 10:18

Nur für Mitarbeiter des Archivs: Kontrollseite des Eintrags