Papers Read on AI



SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities


2023-05-25
Multi-modal large language models are regarded as a crucial step towards Artificial General Intelligence (AGI) and have garnered significant interest with the emergence of ChatGPT. However, current speech-language models typically adopt the cascade paradigm, which prevents inter-modal knowledge transfer. In this paper, we propose SpeechGPT, a large language model with intrinsic cross-modal conversational abilities, capable of perceiving and generating multi-modal content. With discrete speech representations, we first construct SpeechInstruct, a large-scale cross-modal speech instruction dataset. Additionally, we employ a three-stage training strategy that includes modality-adaptation pre-training, cross-modal instruction fine-tuning, and chain-of-modality instruction fine-tuning. The experimental results demonstrate that SpeechGPT has an impressive capacity to follow multi-modal human instructions and highlight the potential of handling multiple modalities with one model. Demos are available at https://0nutation.github.io/SpeechGPT.github.io/.

2023: Dong Zhang, Shimin Li, Xin Zhang, Jun Zhan, P. Wang, Yaqian Zhou, Xipeng Qiu

https://arxiv.org/pdf/2305.11000v2.pdf
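
The abstract mentions "discrete speech representations" without giving details. As a rough illustration only, the sketch below shows one way discrete speech unit IDs could share a vocabulary with text tokens so that a single model sees both modalities as one token stream. The marker tokens, the `<unit_N>` naming, the repeat-collapsing step, and all function names are assumptions made for this example; none of them come from the abstract, and this is not the authors' implementation.

```python
from itertools import groupby

# Hypothetical marker tokens; the abstract does not name any.
SPEECH_START = "<speech>"
SPEECH_END = "</speech>"


def units_to_tokens(unit_ids):
    """Turn a sequence of discrete speech unit IDs into token strings.

    Consecutive repeated units are collapsed first (an assumed
    preprocessing step, not something stated in the abstract).
    """
    deduped = [uid for uid, _ in groupby(unit_ids)]
    return [SPEECH_START] + [f"<unit_{uid}>" for uid in deduped] + [SPEECH_END]


def extend_vocab(text_vocab, num_units):
    """Return a copy of a text vocabulary (token -> id) with speech tokens appended."""
    vocab = dict(text_vocab)
    speech_tokens = [SPEECH_START, SPEECH_END] + [f"<unit_{i}>" for i in range(num_units)]
    for tok in speech_tokens:
        # New tokens get the next free integer id; existing ones are left alone.
        vocab.setdefault(tok, len(vocab))
    return vocab


def encode(tokens, vocab):
    """Map token strings to integer ids using the shared vocabulary."""
    return [vocab[t] for t in tokens]


if __name__ == "__main__":
    vocab = extend_vocab({"hello": 0, "world": 1}, num_units=4)
    speech = units_to_tokens([3, 3, 1, 1, 1, 2])
    print(speech)
    print(encode(["hello"] + speech, vocab))
```

With a shared vocabulary like this, text and speech segments can be concatenated into one sequence, which is the general idea behind handling multiple modalities with one model.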

More Episodes

SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression
2023-06-09 63
Orca: Progressive Learning from Complex Explanation Traces of GPT-4
2023-06-08 79
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
2023-06-06 62
Let’s Verify Step by Step
2023-06-05 79
Large Language Models as Tool Makers
2023-06-04 87
Gorilla: Large Language Model Connected with Massive APIs
2023-06-02 93
CodeT5+: Open Code Large Language Models for Code Understanding and Generation
2023-05-31 104
VanillaNet: the Power of Minimalism in Deep Learning
2023-05-29 104
QLoRA: Efficient Finetuning of Quantized LLMs
2023-05-26 169
LLM-Pruner: On the Structural Pruning of Large Language Models
2023-05-24 111
Tree of Thoughts: Deliberate Problem Solving with Large Language Models
2023-05-23 116
Training language models to follow instructions with human feedback
2023-05-19 135
Language Models Trained on Media Diets Can Predict Public Opinion
2023-05-18 88
LoRA: Low-Rank Adaptation of Large Language Models
2023-05-17 130
Pretraining Without Attention
2023-05-15 113
ImageBind: One Embedding Space To Bind Them All
2023-05-12 121
ZipIt! Merging Models from Different Tasks without Training
2023-05-10 125
Chain of Thought Prompting Elicits Reasoning in Large Language Models
2023-05-09 129
CodeGen2: Lessons for Training LLMs on Programming and Natural Languages
2023-05-08 119
