Papers Read on AI



Nougat: Neural Optical Understanding for Academic Documents


2023-09-06
Scientific knowledge is predominantly stored in books and scientific journals, often in the form of PDFs. However, the PDF format leads to a loss of semantic information, particularly for mathematical expressions. We propose Nougat (Neural Optical Understanding for Academic Documents), a Visual Transformer model that performs an Optical Character Recognition (OCR) task for processing scientific documents into a markup language, and demonstrate the effectiveness of our model on a new dataset of scientific documents. The proposed approach offers a promising solution to enhance the accessibility of scientific knowledge in the digital age, by bridging the gap between human-readable documents and machine-readable text. We release the models and code to accelerate future work on scientific text recognition.

2023: Lukas Blecher, Guillem Cucurull, Thomas Scialom, Robert Stojnic



https://arxiv.org/pdf/2308.13418v1.pdf

More Episodes

GPT Can Solve Mathematical Problems Without a Calculator
2023-09-22 81
Tracking Anything with Decoupled Video Segmentation
2023-09-21 63
ModuleFormer: Modularity Emerges from Mixture-of-Experts
2023-09-20 66
Agents: An Open-source Framework for Autonomous Language Agents
2023-09-18 81
Cognitive Architectures for Language Agents
2023-09-15 104
PyGraft: Configurable Generation of Schemas and Knowledge Graphs at Your Fingertips
2023-09-14 102
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors in Agents
2023-09-13 108
Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP
2023-09-11 116
LLaSM: Large Language and Speech Model
2023-09-07 112
Communicative Agents for Software Development
2023-09-04 122
Prompt2Model: Generating Deployable Models from Natural Language Instructions
2023-09-02 124
Code Llama: Open Foundation Models for Code
2023-09-01 130
A Survey on Large Language Model based Autonomous Agents
2023-08-31 107
SoTaNa: The Open-Source Software Development Assistant
2023-08-31 80
Efficient Guided Generation for Large Language Models
2023-08-27 104
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
2023-08-25 102
Graph of Thoughts: Solving Elaborate Problems with Large Language Models
2023-08-25 133
Platypus: Quick, Cheap, and Powerful Refinement of LLMs
2023-08-24 107
FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization
2023-08-23 83