Interpretability of deep-learning methods applied to large-scale structure surveys

Aymerich, Gaspard; Kacprzak, Tomasz; Refregier, A.; Thomsen, Arne

Interpretability of deep-learning methods applied to large-scale structure surveys

dc.contributor.author	Aymerich, Gaspard
dc.contributor.author	Kacprzak, Tomasz
dc.contributor.author	Refregier, A.
dc.contributor.author	Thomsen, Arne
dc.date.accessioned	2026-05-08T12:52:46Z
dc.date.issued	2026
dc.description.abstract	Deep learning and convolutional neural networks in particular are powerful and promising tools for cosmological analysis of large-scale structure surveys. They already provide similar performance levels to classical analysis methods using fixed summary statistics and show potential to break key degeneracies through better probe combinations. They will also likely improve rapidly in the coming years as progress is made in terms of physical modelling through both software and hardware improvement. One key issue remains: unlike classical analysis, a convolutional neural network’s inference process is hidden from the user as the network optimises millions of parameters with no interpretable physical meaning. This prevents a clear understanding of the potential limitations and biases of the analysis, making it hard to rely on as a main analysis method. In this work, we explored the behaviour of such a convolutional neural network through a novel method. Instead of trying to analyse a network a posteriori, i.e. after training has been completed, we studied the impact on the constraining power of training the network and predicting parameters with degraded data, where we removed part of the information. This allowed us to gain an understanding of which parts and features of tomographic, weak gravitational lensing maps are most important in the network’s inference process. For Stage-III-like noise levels, we find that the network’s inference process relies on a mix of both Gaussian and non-Gaussian information, and it seems to put an emphasis on structures whose scales are at the limit between linear and non-linear regimes. When studying a noiseless survey, we find that the relative importance of small scales increases, indicating that they hold relevant cosmological information that is inaccessible when including realistic levels of shape noise.
dc.identifier.doi	10.1051/0004-6361/202553963
dc.identifier.issn	1432-0746
dc.identifier.issn	0004-6361
dc.identifier.uri	https://irf.fhnw.ch/handle/11654/56708
dc.identifier.uri	https://doi.org/10.26041/fhnw-16228
dc.language.iso	en
dc.publisher	EDP Sciences
dc.relation.ispartof	Astronomy & Astrophysics
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject.ddc	005 - Computer Programmierung, Programme und Daten
dc.title	Interpretability of deep-learning methods applied to large-scale structure surveys
dc.type	01A - Beitrag in wissenschaftlicher Zeitschrift
dc.volume	709
dspace.entity.type	Publication
fhnw.InventedHere	Yes
fhnw.ReviewType	peer-reviewed
fhnw.affiliation.hochschule	Hochschule für Informatik FHNW	de_CH
fhnw.affiliation.institut	Institut für Data Science	de_CH
fhnw.oastatus.aurora	Version: Published * Embargo: None * Licence: CC BY *** URL: https://v2.sherpa.ac.uk/id/publication/11142
fhnw.openAccessCategory	Gold
fhnw.pagination	A78
fhnw.publicationState	Published
fhnw.targetcollection	b508cce9-5084-49ae-a565-d8e5c348c3ab
relation.isAuthorOfPublication	04d4b858-38f9-4cc8-ad93-05a7d40c7476
relation.isAuthorOfPublication.latestForDiscovery	04d4b858-38f9-4cc8-ad93-05a7d40c7476

Dateien

Originalbündel

Gerade angezeigt 1 - 1 von 1

Name:: aa53963-25.pdf
Größe:: 29.55 MB
Format:: Adobe Portable Document Format

Herunterladen

Lizenzbündel

Gerade angezeigt 1 - 1 von 1

Name:: license.txt
Größe:: 2.66 KB
Format:: Item-specific license agreed upon to submission
Beschreibung:

Herunterladen

Sammlung

Institut für Data Science