Interpretability of deep-learning methods applied to large-scale structure surveys

dc.contributor.authorAymerich, Gaspard
dc.contributor.authorKacprzak, Tomasz
dc.contributor.authorRefregier, A.
dc.contributor.authorThomsen, Arne
dc.date.accessioned2026-05-08T12:52:46Z
dc.date.issued2026
dc.description.abstractDeep learning and convolutional neural networks in particular are powerful and promising tools for cosmological analysis of large-scale structure surveys. They already provide similar performance levels to classical analysis methods using fixed summary statistics and show potential to break key degeneracies through better probe combinations. They will also likely improve rapidly in the coming years as progress is made in terms of physical modelling through both software and hardware improvement. One key issue remains: unlike classical analysis, a convolutional neural network’s inference process is hidden from the user as the network optimises millions of parameters with no interpretable physical meaning. This prevents a clear understanding of the potential limitations and biases of the analysis, making it hard to rely on as a main analysis method. In this work, we explored the behaviour of such a convolutional neural network through a novel method. Instead of trying to analyse a network a posteriori, i.e. after training has been completed, we studied the impact on the constraining power of training the network and predicting parameters with degraded data, where we removed part of the information. This allowed us to gain an understanding of which parts and features of tomographic, weak gravitational lensing maps are most important in the network’s inference process. For Stage-III-like noise levels, we find that the network’s inference process relies on a mix of both Gaussian and non-Gaussian information, and it seems to put an emphasis on structures whose scales are at the limit between linear and non-linear regimes. When studying a noiseless survey, we find that the relative importance of small scales increases, indicating that they hold relevant cosmological information that is inaccessible when including realistic levels of shape noise.
dc.identifier.doi10.1051/0004-6361/202553963
dc.identifier.issn1432-0746
dc.identifier.issn0004-6361
dc.identifier.urihttps://irf.fhnw.ch/handle/11654/56708
dc.identifier.urihttps://doi.org/10.26041/fhnw-16228
dc.language.isoen
dc.publisherEDP Sciences
dc.relation.ispartofAstronomy & Astrophysics
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subject.ddc005 - Computer Programmierung, Programme und Daten
dc.titleInterpretability of deep-learning methods applied to large-scale structure surveys
dc.type01A - Beitrag in wissenschaftlicher Zeitschrift
dc.volume709
dspace.entity.typePublication
fhnw.InventedHereYes
fhnw.ReviewTypepeer-reviewed
fhnw.affiliation.hochschuleHochschule für Informatik FHNWde_CH
fhnw.affiliation.institutInstitut für Data Sciencede_CH
fhnw.oastatus.auroraVersion: Published *** Embargo: None *** Licence: CC BY *** URL: https://v2.sherpa.ac.uk/id/publication/11142
fhnw.openAccessCategoryGold
fhnw.paginationA78
fhnw.publicationStatePublished
fhnw.targetcollectionb508cce9-5084-49ae-a565-d8e5c348c3ab
relation.isAuthorOfPublication04d4b858-38f9-4cc8-ad93-05a7d40c7476
relation.isAuthorOfPublication.latestForDiscovery04d4b858-38f9-4cc8-ad93-05a7d40c7476
Dateien

Originalbündel

Gerade angezeigt 1 - 1 von 1
Lade...
Vorschaubild
Name:
aa53963-25.pdf
Größe:
29.55 MB
Format:
Adobe Portable Document Format

Lizenzbündel

Gerade angezeigt 1 - 1 von 1
Lade...
Vorschaubild
Name:
license.txt
Größe:
2.66 KB
Format:
Item-specific license agreed upon to submission
Beschreibung: