From Ticks to Flows: Dynamics of Neural Reinforcement Learning in Continuous Environments
Published in ICLR, 2026
We present a novel theoretical framework for deep RL in continuous environments by modeling the problem as a continuous-time stochastic process, deriving equations describing how the state distribution evolves over gradient steps in the infinite width limit.
Recommended citation: Saket Tiwari, Tejas Kotwal, & George Konidaris. (2026). "From Ticks to Flows: Dynamics of Neural Reinforcement Learning in Continuous Environments." arXiv:2606.04275 https://arxiv.org/abs/2606.04275
