Podbean logo
  • Discover
  • Podcast Features

    Your all-in-one podcasting solution.

    Podcast Studio

    Easy-to-use audio recorder app.

  • Livestream

    High-performing audio live, without limits.

  • Podcast App

    The best podcast player & podcast app.

  • Ads Marketplace

    Join Ads Marketplace to earn money
    through sponsorship on your podcast.

    PodAds

    Manage your ads with dynamic ad insertion capability.

  • Patron & Paid Content

    The seamless way for fans to support you directly
    from your podcast.

  • Apple Podcasts Subscriptions Integration

    Effortlessly publish and manage exclusive episodes for your
    Apple Podcasts subscribers directly from Podbean.

  • All Arts Business Comedy Education
  • Fiction Government Health & Fitness History Kids & Family
  • Leisure Music News Religion & Spirituality Science
  • Society & Culture Sports Technology True Crime TV & Film
  • Live
  • How to Start a Podcast
  • How to Start a Live Podcast
  • How to Monetize a podcast
  • How to Promote Your Podcast
  • How to Use Group Recording
  • Log in
  • Start your podcast for free
  • Podcasting
    • Podcast Features
    • Live Stream
    • PodAds
    • Podcast App
    • Podcast Studio
  • Monetization
    • Premium
    • Patron
    • Apple Podcasts Subscriptions Integration
    • Ads Marketplace
  • Enterprise
  • Pricing
  • Discover
  • Log in
    Sign up free
Papers Read on AI

Papers Read on AI

News:Tech News

Representation Learning for the Automatic Indexing of Sound Effects Libraries

Representation Learning for the Automatic Indexing of Sound Effects Libraries

2022-09-02
Download
Labeling and maintaining a commercial sound effects library is a time-consuming task exacerbated by databases that continually grow in size and undergo taxonomy up-dates. Moreover, sound search and taxonomy creation are complicated by non-uniform metadata, an unrelenting problem even with the introduction of a new industry standard, the Universal Category System. To address these problems and overcome dataset-dependent limitations that inhibit the successful training of deep learning models, we pursue representation learning to train generalized embeddings that can be used for a wide variety of sound effects libraries and are a taxonomy-agnostic representation of sound. We show that a task-specific but dataset-independent representation can successfully address data issues such as class imbalance, inconsistent class labels, and insufficient dataset size, outperforming established representations such as OpenL3. Detailed experimental results show the impact of metric learning approaches and different cross-dataset training methods on representational effectiveness. 2022: Alison B. Ma, Alexander Lerch https://arxiv.org/pdf/2208.09096v1.pdf
view more

More Episodes

GPT Can Solve Mathematical Problems Without a Calculator
2023-09-22 72
Tracking Anything with Decoupled Video Segmentation
2023-09-21 60
ModuleFormer: Modularity Emerges from Mixture-of-Experts
2023-09-20 63
Agents: An Open-source Framework for Autonomous Language Agents
2023-09-18 81
Cognitive Architectures for Language Agents
2023-09-15 103
PyGraft: Configurable Generation of Schemas and Knowledge Graphs at Your Fingertips
2023-09-14 102
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors in Agents
2023-09-13 108
Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP
2023-09-11 116
LLaSM: Large Language and Speech Model
2023-09-07 111
Nougat: Neural Optical Understanding for Academic Documents
2023-09-06 106
Communicative Agents for Software Development
2023-09-04 122
Prompt2Model: Generating Deployable Models from Natural Language Instructions
2023-09-02 124
Code Llama: Open Foundation Models for Code
2023-09-01 130
A Survey on Large Language Model based Autonomous Agents
2023-08-31 107
SoTaNa: The Open-Source Software Development Assistant
2023-08-31 80
Efficient Guided Generation for Large Language Models
2023-08-27 103
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
2023-08-25 102
Graph of Thoughts: Solving Elaborate Problems with Large Language Models
2023-08-25 133
Platypus: Quick, Cheap, and Powerful Refinement of LLMs
2023-08-24 107
FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization
2023-08-23 83
  • ←
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • →
012345678910111213141516171819

Get this podcast on your
phone, FREE

Download Podbean app on App Store Download Podbean app on Google Play

Create your
podcast in
minutes

  • Full-featured podcast site
  • Unlimited storage and bandwidth
  • Comprehensive podcast stats
  • Distribute to Apple Podcasts, Spotify, and more
  • Make money with your podcast
Get started

It is Free

  • Podcast Services

    • Podcast Features
    • Pricing
    • Enterprise Solution
    • Private Podcast
    • The Podcast App
    • Live Stream
    • Audio Recorder
    • Remote Recording
  •  
    • Create a Podcast
    • Video Podcast
    • Start Podcasting
    • Start Radio Talk Show
    • Education Podcast
    • Church Podcast
    • Nonprofit Podcast
    • Get Sermons Online
    • Free Audiobooks
  • MONETIZATION & MORE

    • Podcast Advertising
    • Dynamic Ads Insertion
    • Patron Program
    • Apple Podcasts Subscriptions
    • Switch to Podbean
    • Submit Your Podcast
    • Podbean Plugins
    • Developers
  • KNOWLEDGE BASE

    • How to Start a Podcast
    • How to Start a Live Podcast
    • How to Monetize a podcast
    • How to Promote Your Podcast
    • How to Use Group Recording
  • Support

    • Support Center
    • What’s New
    • Free Webinars
    • Podcast Events
    • Podbean Academy
    • Podcasting Smarter
    • Badges
    • Resources
  • Podbean

    • About Us
    • Podbean Blog
    • Careers
    • Press and Media
    • Green Initiative
    • Affiliate Program
    • Contact Us
  • Privacy Policy
  • Cookie Policy
  • Terms of Use
  • Consent Preferences
  • Copyright © 2015-2023 Podbean.com