Podcasting
Advertisers
Enterprise
Pricing
Resources
Discover Discover

Log in
Sign up free

AI Breakdown

Arxiv paper - LLM Post-Training: A Deep Dive into Reasoning Large Language Models

2025-04-22

In this episode, we discuss LLM Post-Training: A Deep Dive into Reasoning Large Language Models by Komal Kumar, Tajamul Ashraf, Omkar Thawakar, Rao Muhammad Anwer, Hisham Cholakkal, Mubarak Shah, Ming-Hsuan Yang, Phillip H. S. Torr, Fahad Shahbaz Khan, Salman Khan. The paper surveys post-training techniques for Large Language Models (LLMs) that enhance performance beyond initial pretraining, focusing on fine-tuning, reinforcement learning, and test-time scaling. It addresses challenges like...

In this episode, we discuss LLM Post-Training: A Deep Dive into Reasoning Large Language Models by Komal Kumar, Tajamul Ashraf, Omkar Thawakar, Rao Muhammad Anwer, Hisham Cholakkal, Mubarak Shah, Ming-Hsuan Yang, Phillip H. S. Torr, Fahad Shahbaz Khan, Salman Khan. The paper surveys post-training techniques for Large Language Models (LLMs) that enhance performance beyond initial pretraining, focusing on fine-tuning, reinforcement learning, and test-time scaling. It addresses challenges like catastrophic forgetting and reward hacking while exploring model alignment and scalable adaptation. The survey also provides a public repository to track ongoing advancements in post-training methods.

View more

Comments (3)

More Episodes

You may also like

Self-Mastery Become Your Best

Heal, Survive & Thrive!

One Quote, One Story

The Mel Robbins Podcast

‌BPLUS بی‌پلاس پادکست فارسی خلاصه کتاب

Halacha Headlines

Easy German: Learn German with native speakers | Deutsch lernen mit Muttersprachlern

رادیو راه با مجتبی شکوری

All Ears English Podcast

Clutterbug - Real-Life Hacks and Tips to Declutter, Organize and Clean your Home Fast

Get this podcast on your phone, Free

Create Your Podcast In Minutes

Full-featured podcast site
Unlimited storage and bandwidth
Comprehensive podcast stats
Distribute to Apple Podcasts, Spotify, and more
Make money with your podcast

It is Free

Podcast Services
MONETIZATION & MORE
KNOWLEDGE BASE
Support
Podbean

Privacy Policy
Cookie Policy
Terms of Use
Consent Preferences
Copyright © 2015-2025 Podbean.com