Solving the 2-level atom non-LTE problem using soft actor-critic reinforcement learning

Panos, Brandon; Milić, Ivan

Solving the 2-level atom non-LTE problem using soft actor-critic reinforcement learning

dc.contributor.author	Panos, Brandon
dc.contributor.author	Milić, Ivan
dc.date.accessioned	2026-02-10T13:04:52Z
dc.date.issued	2026
dc.description.abstract	We present a novel reinforcement learning (RL) approach for solving the classical 2-level atom non-LTE radiative transfer problem by framing it as a control task in which an RL agent learns a depth-dependent source function S(tau) that self-consistently satisfies the equation of statistical equilibrium (SE). The agent’s policy is optimized entirely via reward-based interactions with a radiative transfer engine, without explicit knowledge of the ground truth. This method bypasses the need for constructing approximate lambda operators (Lambda^) common in accelerated iterative schemes. Additionally, it requires no extensive precomputed labelled data sets to extract a supervisory signal, and avoids backpropagating gradients through the complex RT solver itself. Finally, we show through experiment that a simple feedforward neural network trained greedily cannot solve for SE, possibly due to the moving target nature of the problem. Our Lambda^-Free method offers potential advantages for complex scenarios (e.g. atmospheres with enhanced velocity fields, multidimensional geometries, or complex microphysics) where Lambda^* construction or solver differentiability is challenging. Additionally, the agent can be incentivized to find more efficient policies by manipulating the discount factor, leading to a reprioritization of immediate rewards. If demonstrated to generalize past its training data, this RL framework could serve as an alternative or accelerated formalism to achieve SE. To the best of our knowledge, this study represents the first application of reinforcement learning in solar physics that directly solves for a fundamental physical constraint.
dc.identifier.doi	10.1093/rasti/rzag005
dc.identifier.issn	2752-8200
dc.identifier.uri	https://irf.fhnw.ch/handle/11654/55600
dc.identifier.uri	https://doi.org/10.26041/fhnw-15415
dc.language.iso	en
dc.publisher	Oxford University Press
dc.relation.ispartof	RAS Techniques and Instruments
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject.ddc	004 - Computer Wissenschaften, Internet
dc.subject.ddc	520 - Astronomie, Kartografie
dc.title	Solving the 2-level atom non-LTE problem using soft actor-critic reinforcement learning
dc.type	01A - Beitrag in wissenschaftlicher Zeitschrift
dc.volume	5
dspace.entity.type	Publication
fhnw.InventedHere	Yes
fhnw.ReviewType	peer-reviewed
fhnw.affiliation.hochschule	Hochschule für Informatik FHNW	de_CH
fhnw.affiliation.institut	Institut für Data Science	de_CH
fhnw.oastatus.aurora	Version: Published * Embargo: None * Licence: CC BY *** URL: https://v2.sherpa.ac.uk/id/publication/41233
fhnw.openAccessCategory	Gold
fhnw.publicationState	Published
fhnw.targetcollection	b508cce9-5084-49ae-a565-d8e5c348c3ab
relation.isAuthorOfPublication	5cc45827-ef02-4fac-b0a2-7f3e223994d9
relation.isAuthorOfPublication.latestForDiscovery	5cc45827-ef02-4fac-b0a2-7f3e223994d9

Dateien

Originalbündel

Gerade angezeigt 1 - 1 von 1

Name:: rzag005.pdf
Größe:: 1.62 MB
Format:: Adobe Portable Document Format

Herunterladen

Lizenzbündel

Gerade angezeigt 1 - 1 von 1

Name:: license.txt
Größe:: 2.66 KB
Format:: Item-specific license agreed upon to submission
Beschreibung:

Herunterladen

Sammlung

Institut für Data Science