AI Breakdown

ArXiv Preprint - S-LoRA: Serving Thousands of Concurrent LoRA Adapters

2023-11-21

In this episode we discuss S-LoRA: Serving Thousands of Concurrent LoRA Adapters by Ying Sheng, Shiyi Cao, Dacheng Li, Coleman Hooper, Nicholas Lee, Shuo Yang, Christopher Chou, Banghua Zhu, Lianmin Zheng, Kurt Keutzer, Joseph E. Gonzalez, Ion Stoica. The paper introduces S-LoRA, a system for efficiently serving a large number of Low-Rank Adaptation (LoRA) adapters for language models. It keeps all adapters in main memory and fetches those needed by the currently running queries into GPU memory. S-LoRA uses Unified Paging to manage memory, together with custom CUDA kernels and a tensor parallelism strategy for the LoRA computation. Compared to current state-of-the-art libraries, it achieves up to 4 times higher throughput and can serve thousands of adapters on a single GPU or across multiple GPUs. The system enables scalable serving of many fine-tuned models and could support large-scale customized fine-tuning services, and the authors have made their code publicly available.
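
To make the Unified Paging idea more concrete, here is a minimal Python sketch of a single page pool shared by adapter weights and per-request KV cache. It is a toy illustration only, not S-LoRA's implementation: the class `UnifiedPagePool`, its methods, and the sizing rule (one page per unit of adapter rank, one page per cached token) are assumptions made for this example.

```python
class UnifiedPagePool:
    """Toy pool: KV-cache entries and adapter weights share one set of fixed-size pages."""

    def __init__(self, num_pages):
        self.free_pages = list(range(num_pages))  # indices of unused pages
        self.owners = {}                          # owner key -> list of page indices

    def allocate(self, owner, num_pages):
        """Reserve pages (not necessarily contiguous) for owner; False if the pool is too full."""
        if num_pages > len(self.free_pages):
            return False
        pages = [self.free_pages.pop() for _ in range(num_pages)]
        self.owners.setdefault(owner, []).extend(pages)
        return True

    def release(self, owner):
        """Return every page held by owner to the free list."""
        self.free_pages.extend(self.owners.pop(owner, []))


pool = UnifiedPagePool(num_pages=8)
# Assumed sizing: a rank-r adapter takes r pages, a sequence takes one page per token.
pool.allocate(("adapter", "A"), 4)              # rank-4 adapter
pool.allocate(("kv", "request-1"), 3)           # 3 cached tokens
ok = pool.allocate(("adapter", "B"), 2)         # rejected: only 1 page is free
pool.release(("kv", "request-1"))               # request finished, pages return to the pool
ok_after = pool.allocate(("adapter", "B"), 2)   # accepted: freed KV pages are reused
```

Because both kinds of tensors draw from the same free list, pages released by a finished request can immediately hold a newly loaded adapter, which is the kind of fragmentation-avoiding reuse a unified pool is meant to allow.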
