arxiv preprint - Retentive Network: A Successor to Transformer for Large Language Models
AI Breakdown

2023-07-25
In this episode we discuss Retentive Network: A Successor to Transformer for Large Language Models by Yutao Sun, Li Dong, Shaohan Huang, Shuming Ma, Yuqing Xia, Jilong Xue, Jianyong Wang, Furu Wei. The paper introduces RETNET as a successor to the Transformer architecture for language models. RETNET utilizes a retention mechanism that supports parallel, recurrent, and chunkwise recurrent computation paradigms for efficient training and inference. Experimental results show that RETNET...
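The retention mechanism discussed in the episode can be expressed in two mathematically equivalent forms: a parallel form used for training and a recurrent form used for constant-cost-per-token inference. Below is a minimal single-head NumPy sketch of that equivalence; the function names are illustrative, and the paper's position rotations, multi-head decays, and normalization are deliberately omitted, so this is a simplified sketch rather than the authors' implementation.

```python
import numpy as np

def recurrent_retention(Q, K, V, gamma):
    """Recurrent form: maintain a d x d state of decayed key-value outer products.

    Q, K, V have shape (seq_len, d); gamma is a scalar decay in (0, 1).
    Each step costs O(d^2) regardless of sequence length, which is what
    makes inference memory and compute independent of context length.
    """
    seq_len, d = Q.shape
    S = np.zeros((d, d))                      # S_n = gamma * S_{n-1} + k_n^T v_n
    outputs = np.zeros((seq_len, d))
    for n in range(seq_len):
        S = gamma * S + np.outer(K[n], V[n])
        outputs[n] = Q[n] @ S                 # o_n = q_n S_n
    return outputs

def parallel_retention(Q, K, V, gamma):
    """Parallel form: a causal attention-like product with a decay mask D."""
    seq_len = Q.shape[0]
    idx = np.arange(seq_len)
    D = np.tril(gamma ** (idx[:, None] - idx[None, :]))  # D[n, m] = gamma^(n-m), m <= n
    return (Q @ K.T * D) @ V

# The two forms produce the same outputs (up to floating-point error):
rng = np.random.default_rng(0)
Q, K, V = (rng.standard_normal((8, 4)) for _ in range(3))
assert np.allclose(recurrent_retention(Q, K, V, 0.9),
                   parallel_retention(Q, K, V, 0.9))
```

The chunkwise recurrent paradigm mentioned in the summary combines these two: the parallel form is applied within each chunk, while a recurrent state like S above carries information across chunk boundaries.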