Podbean logo
  • Discover
  • Podcast Features

    Your all-in-one podcasting solution.

    Podcast Studio

    Easy-to-use audio recorder app.

  • Livestream

    High-performing audio live, without limits.

  • Podcast App

    The best podcast player & podcast app.

  • Ads Marketplace

    Join Ads Marketplace to earn money
    through sponsorship on your podcast.

    PodAds

    Manage your ads with dynamic ad insertion capability.

  • Patron & Paid Content

    The seamless way for fans to support you directly
    from your podcast.

  •  
  • All Arts Business Comedy Education
  • Fiction Government Health & Fitness History Kids & Family
  • Leisure Music News Religion & Spirituality Science
  • Society & Culture Sports Technology True Crime TV & Film
  • Live
  • How to Start a Podcast
  • How to Start a Live Podcast
  • How to Monetize a podcast
  • How to Promote Your Podcast
  • How to Use Group Recording
  • Log in
  • Start your podcast for free
  • Podcasting
    • Podcast Features
    • Live Stream
    • PodAds
    • Podcast App
    • Podcast Studio
  • Monetization
    • Premium
    • Patron
    • Ads Marketplace
  • Enterprise
  • Pricing
  • Discover
  • Log in
    Sign up free
Papers Read on AI

Papers Read on AI

News:Tech News

SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

2023-03-27
Download
Large language models (LLMs) show excellent performance but are compute- and memory-intensive. Quantization can reduce memory and accelerate inference. However, for LLMs beyond 100 billion parameters, existing methods cannot maintain accuracy or do not run efficiently on hardware. We propose SmoothQuant, a training-free, accuracy-preserving, and general-purpose post-training quantization (PTQ) solution to enable 8-bit weight, 8-bit activation (W8A8) quantization for LLMs. Based on the fact that weights are easy to quantize while activations are not, SmoothQuant smooths the activation outliers by offline migrating the quantization difficulty from activations to weights with a mathematically equivalent transformation. 2022: Guangxuan Xiao, Ji Lin, Mickael Seznec, Julien Demouth, Song Han https://arxiv.org/pdf/2211.10438v4.pdf
view more

More Episodes

SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression
2023-06-09 42
Orca: Progressive Learning from Complex Explanation Traces of GPT-4
2023-06-08 72
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
2023-06-06 58
Let’s Verify Step by Step
2023-06-05 75
Large Language Models as Tool Makers
2023-06-04 82
Gorilla: Large Language Model Connected with Massive APIs
2023-06-02 90
CodeT5+: Open Code Large Language Models for Code Understanding and Generation
2023-05-31 102
VanillaNet: the Power of Minimalism in Deep Learning
2023-05-29 102
QLoRA: Efficient Finetuning of Quantized LLMs
2023-05-26 166
SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities
2023-05-25 111
LLM-Pruner: On the Structural Pruning of Large Language Models
2023-05-24 109
Tree of Thoughts: Deliberate Problem Solving with Large Language Models
2023-05-23 115
Training language models to follow instructions with human feedback
2023-05-19 134
Language Models Trained on Media Diets Can Predict Public Opinion
2023-05-18 87
LoRA: Low-Rank Adaptation of Large Language Models
2023-05-17 129
Pretraining Without Attention
2023-05-15 111
ImageBind: One Embedding Space To Bind Them All
2023-05-12 120
ZipIt! Merging Models from Different Tasks without Training
2023-05-10 124
Chain of Thought Prompting Elicits Reasoning in Large Language Models
2023-05-09 127
CodeGen2: Lessons for Training LLMs on Programming and Natural Languages
2023-05-08 117
  • ←
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • →
012345678910111213141516171819

Get this podcast on your
phone, FREE

Download Podbean app on App Store Download Podbean app on Google Play

Create your
podcast in
minutes

  • Full-featured podcast site
  • Unlimited storage and bandwidth
  • Comprehensive podcast stats
  • Integrate with iTunes and Google
    store
  • Make money with your podcast
Get started

It is Free

  • Podcast Services

    • Podcast Features
    • Pricing
    • Enterprise Solution
    • Private Podcast
    • The Podcast App
    • Live Stream
    • Audio Recorder
    • Remote Recording
  •  
    • Create a Podcast
    • Video Podcast
    • Start Podcasting
    • Start Radio Talk Show
    • Education Podcast
    • Church Podcast
    • Nonprofit Podcast
    • Get Sermons Online
    • Free Audiobooks
  • MONETIZATION & MORE

    • Podcast Advertising
    • Dynamic Ads Insertion
    • Patron Program
    • Affiliate Program
    • Switch to Podbean
    • Submit Your Podcast
    • Podbean Plugins
    • Developers
    • Badges
  • KNOWLEDGE BASE

    • How to Start a Podcast
    • How to Start a Live Podcast
    • How to Monetize a podcast
    • How to Promote Your Podcast
    • How to Use Group Recording
  • Support

    • Support Center
    • Podbean Blog
    • What’s New
    • Free Webinars
    • Podcast Events
    • Podbean Academy
    • Podcasting Smarter
    • Resources
  • Podbean

    • About Us
    • Careers
    • Press and Media
    • Green Initiative
    • Contact Us
  • Privacy Policy
  • Cookie Policy
  • Terms of Use
  • Consent Preferences
  • Copyright © 2006-2023 Podbean.com