Csaba Szepesvari

TalkRL: The Reinforcement Learning Podcast - Un pódcast de Robin Ranjit Singh Chauhan

prueba podimo gratis durante 60 días!

Miles de audiolibros y podcasts exclusivos, haz clic aquí para probar

Csaba Szepesvari of DeepMind shares his views on Bandits, Adversaries, PUCT in AlphaGo / AlphaZero / MuZero, AGI and RL, what is timeless, and more!