Interpretability of deep-learning methods applied to large-scale structure surveys
| dc.contributor.author | Aymerich, Gaspard | |
| dc.contributor.author | Kacprzak, Tomasz | |
| dc.contributor.author | Refregier, A. | |
| dc.contributor.author | Thomsen, Arne | |
| dc.date.accessioned | 2026-05-08T12:52:46Z | |
| dc.date.issued | 2026 | |
| dc.description.abstract | Deep learning and convolutional neural networks in particular are powerful and promising tools for cosmological analysis of large-scale structure surveys. They already provide similar performance levels to classical analysis methods using fixed summary statistics and show potential to break key degeneracies through better probe combinations. They will also likely improve rapidly in the coming years as progress is made in terms of physical modelling through both software and hardware improvement. One key issue remains: unlike classical analysis, a convolutional neural network’s inference process is hidden from the user as the network optimises millions of parameters with no interpretable physical meaning. This prevents a clear understanding of the potential limitations and biases of the analysis, making it hard to rely on as a main analysis method. In this work, we explored the behaviour of such a convolutional neural network through a novel method. Instead of trying to analyse a network a posteriori, i.e. after training has been completed, we studied the impact on the constraining power of training the network and predicting parameters with degraded data, where we removed part of the information. This allowed us to gain an understanding of which parts and features of tomographic, weak gravitational lensing maps are most important in the network’s inference process. For Stage-III-like noise levels, we find that the network’s inference process relies on a mix of both Gaussian and non-Gaussian information, and it seems to put an emphasis on structures whose scales are at the limit between linear and non-linear regimes. When studying a noiseless survey, we find that the relative importance of small scales increases, indicating that they hold relevant cosmological information that is inaccessible when including realistic levels of shape noise. | |
| dc.identifier.doi | 10.1051/0004-6361/202553963 | |
| dc.identifier.issn | 1432-0746 | |
| dc.identifier.issn | 0004-6361 | |
| dc.identifier.uri | https://irf.fhnw.ch/handle/11654/56708 | |
| dc.identifier.uri | https://doi.org/10.26041/fhnw-16228 | |
| dc.language.iso | en | |
| dc.publisher | EDP Sciences | |
| dc.relation.ispartof | Astronomy & Astrophysics | |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
| dc.subject.ddc | 005 - Computer Programmierung, Programme und Daten | |
| dc.title | Interpretability of deep-learning methods applied to large-scale structure surveys | |
| dc.type | 01A - Beitrag in wissenschaftlicher Zeitschrift | |
| dc.volume | 709 | |
| dspace.entity.type | Publication | |
| fhnw.InventedHere | Yes | |
| fhnw.ReviewType | peer-reviewed | |
| fhnw.affiliation.hochschule | Hochschule für Informatik FHNW | de_CH |
| fhnw.affiliation.institut | Institut für Data Science | de_CH |
| fhnw.oastatus.aurora | Version: Published *** Embargo: None *** Licence: CC BY *** URL: https://v2.sherpa.ac.uk/id/publication/11142 | |
| fhnw.openAccessCategory | Gold | |
| fhnw.pagination | A78 | |
| fhnw.publicationState | Published | |
| fhnw.targetcollection | b508cce9-5084-49ae-a565-d8e5c348c3ab | |
| relation.isAuthorOfPublication | 04d4b858-38f9-4cc8-ad93-05a7d40c7476 | |
| relation.isAuthorOfPublication.latestForDiscovery | 04d4b858-38f9-4cc8-ad93-05a7d40c7476 |
Dateien
Originalbündel
1 - 1 von 1
Lizenzbündel
1 - 1 von 1
Lade...
- Name:
- license.txt
- Größe:
- 2.66 KB
- Format:
- Item-specific license agreed upon to submission
- Beschreibung: