Podbean logo
  • Discover
  • Podcast Features

    Your all-in-one podcasting solution.

    Podcast App

    The best podcast player & podcast app.

  • Livestream

    High-performing audio live, without limits.

    Podcast Studio

    Easy-to-use audio recorder app.

  • PodAds

    Dynamic Ad Insertion for podcasts.

  • Premium

    Convert listeners into buyers anywhere, anytime
    with the convenience of Podbean Premium.

    Patron

    The seamless way for fans to support you directly
    from your podcast.

  • Ads Marketplace

    Join Ads Marketplace to earn money
    through sponsorship on your podcast.

  •  
  • All Arts Business Comedy Education
  • Fiction Government Health & Fitness History Kids & Family
  • Leisure Music News Religion & Spirituality Science
  • Society & Culture Sports Technology True Crime TV & Film
  • Live
  • Log in
  • Start your podcast for free
  • Podcasting
    • Podcast Features
    • Live Stream
    • PodAds
    • Podcast App
    • Podcast Studio
  • Monetization
    • Premium
    • Patron
    • Ads Marketplace
  • Enterprise
  • Pricing
  • Discover
  • Log in
    Sign up free
Papers Read on AI

Papers Read on AI

News:Tech News

OCR-free Document Understanding Transformer

OCR-free Document Understanding Transformer

2022-08-03
Download
Understanding document images ( e.g. , invoices) is a core but challenging task since it requires complex functions such as reading text and a holistic understanding of the document . Current Visual Document Understanding (VDU) methods outsource the task of reading text to off-the-shelf Optical Character Recognition (OCR) engines and focus on the understanding task with the OCR outputs. Although such OCR-based approaches have shown promising performance, they suffer from 1) high computational costs for using OCR; 2) inflexibility of OCR models on languages or types of documents; 3) OCR error propagation to the subsequent process. To address these issues, in this paper, we introduce a novel OCR-free VDU model named Donut , which stands for Do cume n t u nderstanding t ransformer. As the first step in OCR-free VDU research, we propose a simple architecture ( i.e. , Transformer) with a pre-training objective ( i.e., cross-entropy loss). Donut is conceptually simple yet effective. 2021: Geewook Kim, Teakgyu Hong, Moonbin Yim, Jeongyeon Nam, Jinyoung Park, Jinyeong Yim, Wonseok Hwang, Sangdoo Yun, Dongyoon Han, Seunghyun Park https://arxiv.org/pdf/2111.15664v2.pdf
view more

More Episodes

Collaborative Neural Rendering using Anime Character Sheets
2022-08-16 1
Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning
2022-08-15 15
Can Wikipedia Help Offline Reinforcement Learning?
2022-08-13 13
Masked Autoencoders that Listen
2022-08-12 16
Reconstructing 3D Human Pose by Watching Humans in the Mirror
2022-08-11 6
A Conversational Paradigm for Program Synthesis
2022-08-10 3
Masked Siamese Networks for Label-Efficient Learning
2022-08-05 2
Multiface: A Dataset for Neural Face Rendering
2022-08-04 1
OpenXAI: Towards a Transparent Evaluation of Model Explanations
2022-08-02 1
YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
2022-08-01 2
Matryoshka Representations for Adaptive Deployment
2022-07-18 35
Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs
2022-07-15 35
More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity
2022-07-14 32
The Web Is Your Oyster - Knowledge-Intensive NLP against a Very Large Web Corpus
2022-07-14 11
GhostNet: More Features From Cheap Operations
2022-07-13 19
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection
2022-07-13 20
Transformer in Transformer
2022-07-04 40
Vision GNN: An Image is Worth Graph of Nodes
2022-07-02 31
TorchGeo: deep learning with geospatial data
2022-06-30 29
  • ←
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • →
01234567910111213141516171819

Get this podcast on your
phone, FREE

Download Podbean app on App Store Download Podbean app on Google Play

Create your
podcast in
minutes

  • Full-featured podcast site
  • Unlimited storage and bandwidth
  • Comprehensive podcast stats
  • Integrate with iTunes and Google
    store
  • Make money with your podcast
Get started

It is Free

  • Podcast Services

    • Podcast Features
    • Pricing
    • Enterprise Solution
    • Private Podcast
    • The Podcast App
    • Live Stream
    • Audio Recorder
    • Remote Recording
  •  
    • Create a Podcast
    • Video Podcast
    • Start Podcasting
    • Start Radio Talk Show
    • Education Podcast
    • Switch to Podbean
    • Submit Your Podcast
    • Podbean Plugins
  •  
    • Church Podcast
    • Nonprofit Podcast
    • Get Sermons Online
    • Free Audiobooks
    • How to Start a Podcast
    • How to Start a Live Podcast
    • How to Monetize a podcast
    • How to Promote Your Podcast
    • How to Use Group Recording
  • MONETIZATION

    • Premium Podcast
    • Podcast Advertising
    • Patron Program
  • Support

    • Contact Us
    • Support Center
    • Developers
    • Resources
    • Free Webinars
    • Podcast Events
    • Podbean Academy
    • Podcasting Smarter
    • Podbean in the Media
  • Podbean

    • About Us
    • Careers
    • Affiliate Program
    • Badges
    • Terms of Use
    • Privacy Policy
    • Podbean Blog

Copyright © 2006-2022 Podbean.com