REINFORCE
Seeing the World Blindfolded: The observational dropout technique, explained
In reinforcement learning, if researchers want an agent to have an internal representation of its environment, they’ll build and train a world model that it can refer to. New research shows that world models can emerge from standard training, rather than needing to be built separately.