Yogi Optimizer Link

Enter the .

In DRL, the data distribution changes constantly as the agent learns. Yogi’s resistance to sudden variance spikes helps maintain stable policy updates, often outperforming Adam on tasks like Atari games and robotic control. yogi optimizer