Today we’re joined by Kamyar Azizzadenesheli, a staff researcher at Nvidia, to continue our AI Trends 2024 series. In our conversation, Kamyar updates us on the latest developments in reinforcement learning (RL), and how the RL community is taking advantage of the abstract reasoning abilities of large language models (LLMs). Kamyar shares his insights on how LLMs are pushing RL performance forward in a variety of applications, such as ALOHA, a robot that can learn to fold clothes, and Voyager, an RL agent that uses GPT-4 to outperform prior systems at playing Minecraft. We also explore the progress being made in assessing and addressing the risks of RL-based decision-making in domains such as finance, healthcare, and agriculture. Finally, we discuss the future of deep reinforcement learning, Kamyar’s top predictions for the field, and how greater compute capabilities will be critical in achieving general intelligence.
The complete show notes for this episode can be found at twimlai.com/go/670.
Brain-Inspired Hardware and Algorithm Co-Design with Melika Payvand - #585
Equivariant Priors for Compressed Sensing with Arash Behboodi - #584
Managing Data Labeling Ops for Success with Audrey Smith - #583
Engineering an ML-Powered Developer-First Search Engine with Richard Socher - #582
On The Path Towards Robot Vision with Aljosa Osep - #581
More Language, Less Labeling with Kate Saenko - #580
Optical Flow Estimation, Panoptic Segmentation, and Vision Transformers with Fatih Porikli - #579
Data Governance for Data Science with Adam Wood - #578
Feature Platforms for Data-Centric AI with Mike Del Balso - #577
The Fallacy of "Ground Truth" with Shayan Mohanty - #576
Principle-centric AI with Adrien Gaidon - #575
Data Debt in Machine Learning with D. Sculley - #574
AI for Enterprise Decisioning at Scale with Rob Walker - #573
Data Rights, Quantification and Governance for Ethical AI with Margaret Mitchell - #572
Studying Machine Intelligence with Been Kim - #571
Advances in Neural Compression with Auke Wiggers - #570
Mixture-of-Experts and Trends in Large-Scale Language Modeling with Irwan Bello - #569
Daring to DAIR: Distributed AI Research with Timnit Gebru - #568
Hierarchical and Continual RL with Doina Precup - #567
Open-Source Drug Discovery with DeepChem with Bharath Ramsundar - #566
Create your
podcast in
minutes
It is Free
20/20
The Dropout
Ten Percent Happier with Dan Harris
World News Tonight with David Muir
NEJM This Week