
Arxiv paper - Welcome to the Era of Experience
04/21/25 • 7 min
In this episode, we discuss Welcome to the Era of Experience by David Silver and Richard S. Sutton. The paper discusses the forthcoming era of artificial intelligence marked by agents with superhuman capabilities. These agents will primarily learn through experience. The note highlights the essential features that will characterize this new phase in AI development.
Previous Episode

Arxiv paper - MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation
In this episode, we discuss MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation by Sihyun Yu, Meera Hahn, Dan Kondratyuk, Jinwoo Shin, Agrim Gupta, José Lezama, Irfan Essa, David Ross, Jonathan Huang. The paper introduces MALT Diffusion, a new diffusion model designed for generating long videos by dividing them into short segments and using recurrent attention to maintain a memory latent vector for long-term context. It presents training techniques to ensure consistent quality over extended frames and demonstrates superior performance on long video benchmarks, significantly improving FVD scores. Additionally, MALT shows strong results in text-to-video generation and can produce longer videos than existing methods.
Next Episode

Arxiv paper - LLM Post-Training: A Deep Dive into Reasoning Large Language Models
In this episode, we discuss LLM Post-Training: A Deep Dive into Reasoning Large Language Models by Komal Kumar, Tajamul Ashraf, Omkar Thawakar, Rao Muhammad Anwer, Hisham Cholakkal, Mubarak Shah, Ming-Hsuan Yang, Phillip H. S. Torr, Fahad Shahbaz Khan, Salman Khan. The paper surveys post-training techniques for Large Language Models (LLMs) that enhance performance beyond initial pretraining, focusing on fine-tuning, reinforcement learning, and test-time scaling. It addresses challenges like catastrophic forgetting and reward hacking while exploring model alignment and scalable adaptation. The survey also provides a public repository to track ongoing advancements in post-training methods.