Close Menu
Ztoog
    What's Hot
    The Future

    Harvest vs Toggl: 2023 detailed comparison

    Crypto

    Ethereum Defies Expectations With Lower Volatility Than Bitcoin

    Science

    The World’s Broken Food System Costs $12.7 Trillion a Year

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      OPPO launches A5 Pro 5G: Premium features at a budget price

      How I Turn Unstructured PDFs into Revenue-Ready Spreadsheets

      Is it the best tool for 2025?

      The clocks that helped define time from London’s Royal Observatory

      Summer Movies Are Here, and So Are the New Popcorn Buckets

    • Technology

      What It Is and Why It Matters—Part 1 – O’Reilly

      Ensure Hard Work Is Recognized With These 3 Steps

      Cicada map 2025: Where will Brood XIV cicadas emerge this spring?

      Is Duolingo the face of an AI jobs crisis?

      The US DOD transfers its AI-based Open Price Exploration for National Security program to nonprofit Critical Minerals Forum to boost Western supply deals (Ernest Scheyder/Reuters)

    • Gadgets

      Maono Caster G1 Neo & PD200X Review: Budget Streaming Gear for Aspiring Creators

      Apple plans to split iPhone 18 launch into two phases in 2026

      Upgrade your desk to Starfleet status with this $95 USB-C hub

      37 Best Graduation Gift Ideas (2025): For College Grads

      Backblaze responds to claims of “sham accounting,” customer backups at risk

    • Mobile

      Samsung Galaxy S25 Edge promo materials leak

      What are people doing with those free T-Mobile lines? Way more than you’d expect

      Samsung doesn’t want budget Galaxy phones to use exclusive AI features

      COROS’s charging adapter is a neat solution to the smartwatch charging cable problem

      Fortnite said to return to the US iOS App Store next week following court verdict

    • Science

      Nothing is stronger than quantum connections – and now we know why

      Failed Soviet probe will soon crash to Earth – and we don’t know where

      Trump administration cuts off all future federal funding to Harvard

      Does kissing spread gluten? New research offers a clue.

      Why Balcony Solar Panels Haven’t Taken Off in the US

    • AI

      Hybrid AI model crafts smooth, high-quality videos in seconds | Ztoog

      How to build a better AI benchmark

      Q&A: A roadmap for revolutionizing health care through data-driven innovation | Ztoog

      This data set helps researchers spot harmful stereotypes in LLMs

      Making AI models more trustworthy for high-stakes settings | Ztoog

    • Crypto

      Ethereum Breaks Key Resistance In One Massive Move – Higher High Confirms Momentum

      ‘The Big Short’ Coming For Bitcoin? Why BTC Will Clear $110,000

      Bitcoin Holds Above $95K Despite Weak Blockchain Activity — Analytics Firm Explains Why

      eToro eyes US IPO launch as early as next week amid easing concerns over Trump’s tariffs

      Cardano ‘Looks Dope,’ Analyst Predicts Big Move Soon

    Ztoog
    Home » Meta AI Releases Nougat: A Visual Transformer Model that Performs OCR for Processing Scientific Documents into a Markup Language
    AI

    Meta AI Releases Nougat: A Visual Transformer Model that Performs OCR for Processing Scientific Documents into a Markup Language

    Facebook Twitter Pinterest WhatsApp
    Meta AI Releases Nougat: A Visual Transformer Model that Performs OCR for Processing Scientific Documents into a Markup Language
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    With the rising developments within the area of Artificial Intelligence, its sub-fields, together with Natural Language Processing, Natural Language Generation, Computer Vision, and many others., have quickly gained a lot of recognition as a result of their intensive use instances. Optical Character Recognition (OCR) is a well-established and closely investigated space of pc imaginative and prescient. It has a variety of makes use of, akin to doc digitization, handwriting recognition, and scene textual content identification. The recognition of mathematical expressions is one space of OCR that has obtained a lot of curiosity in educational research.

    The Portable Document Format (PDF) is among the most generally used codecs for scientific data, which is commonly preserved in books or revealed in scholarly journals. The second most used knowledge format on the web, accounting for 2.4% of the data, PDFs are steadily used for doc supply. Despite their widespread use, extracting info from PDF information may be tough, notably when coping with extremely specialised supplies like scientific analysis articles. In explicit, when these papers are transformed to PDF format, the semantic info of mathematical expressions is steadily misplaced.

    To handle the challenges, a group of researchers from Meta AI has launched a answer known as Nougat, which stands for “Neural Optical Understanding for Academic Documents.” In order to do Optical Character Recognition (OCR) on scientific texts, Nougat is a Visual Transformer mannequin. Its aim is to remodel these information into a markup language so that they could be extra simply accessed and machine-readable.

    To present the efficacy of the methodology, the group has additionally produced a contemporary dataset of educational papers. This technique presents a viable reply for enhancing scientific data accessibility within the digital age. It fills the hole between written supplies that are easy for folks to learn and textual content that computer systems can course of and analyze. Researchers, educators, and anybody concerned with scientific literature can entry and take care of scientific papers extra successfully utilizing Nougat. Nougat is principally a transformer-based mannequin designed to transform pictures of doc pages, notably these from PDFs, into formatted markup textual content.

    The group has summarized their key contributions as follows –

    1. Publication of a Pre-trained Model: The group has created a pre-trained mannequin that can remodel PDFs into a easy markup language. This pre-trained mannequin is made public on GitHub, the place the analysis neighborhood and anybody can entry it, together with the associated code.
    1. Pipeline for Dataset Creation: A technique for constructing datasets that pair PDF paperwork with their related supply code is described within the research. This dataset improvement technique is essential for testing and refining the Nougat mannequin and could also be helpful for future doc evaluation analysis and purposes.
    1. Dependency on the Page’s Image Only: One of Nougat’s standout options is its capability to function solely on the Page’s Image. This makes it a versatile instrument for extracting content material from a number of sources, even when the unique paperwork will not be obtainable in digital textual content codecs. It can course of scanned papers and books.

    Check out the Paper and Github. All Credit For This Research Goes To the Researchers on This Project. Also, don’t neglect to affix our 29k+ ML SubReddit, 40k+ Facebook Community, Discord Channel, and Email Newsletter, the place we share the most recent AI analysis information, cool AI tasks, and extra.

    If you want our work, you’ll love our publication..


    Tanya Malhotra is a remaining 12 months undergrad from the University of Petroleum & Energy Studies, Dehradun, pursuing BTech in Computer Science Engineering with a specialization in Artificial Intelligence and Machine Learning.
    She is a Data Science fanatic with good analytical and demanding pondering, together with an ardent curiosity in buying new expertise, main teams, and managing work in an organized method.


    🚀 CodiumAI allows busy builders to generate significant checks (Sponsored)

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    Hybrid AI model crafts smooth, high-quality videos in seconds | Ztoog

    AI

    How to build a better AI benchmark

    AI

    Q&A: A roadmap for revolutionizing health care through data-driven innovation | Ztoog

    AI

    This data set helps researchers spot harmful stereotypes in LLMs

    AI

    Making AI models more trustworthy for high-stakes settings | Ztoog

    AI

    The AI Hype Index: AI agent cyberattacks, racing robots, and musical models

    AI

    Novel method detects microbial contamination in cell cultures | Ztoog

    AI

    Seeing AI as a collaborator, not a creator

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    Gadgets

    Google Domains is yet another useful service to get the ax in favor of “focus”“

    Enlarge / These Corporate Memphis people are going to have to look elsewhere quickly to…

    The Future

    How to cancel your Hubstaff subscription (+ an alternative)

    559 You can cancel your Hubstaff subscription via: The web site.Email.Customer assist quantity. In this…

    Mobile

    vivo X90s memory options leaked by third-party retailer

    The vivo X90s is arriving on Monday, and we all know the cellphone will seem…

    Science

    Antimatter definitely doesn’t fall up, physicists confirm

    Tracking the trail of antimatter is a difficult enterprisesakkmesterke/iStockphoto/Getty Images If you drop a chunk…

    The Future

    D-Link Aqulia Pro AI Mesh system – Novel look, great value

    D-Link produces some actually stable and great value {hardware} persistently. Everything from easy Wi-Fi entry…

    Our Picks
    Science

    A Groundbreaking Human Brain Cell Atlas Just Dropped

    Science

    China’s New Heavy Lift Rocket Looks a Whole Lot Like SpaceX’s Starship

    Mobile

    OnePlus Buds 3 review – GSMArena.com news

    Categories
    • AI (1,483)
    • Crypto (1,745)
    • Gadgets (1,796)
    • Mobile (1,839)
    • Science (1,854)
    • Technology (1,790)
    • The Future (1,636)
    Most Popular
    Gadgets

    What Is 5G Home Internet? Here’s Everything You Need to Know (2024)

    Science

    Jupiter’s stormy surface replicated in lab

    Technology

    Ant Financial transfers Paytm stake worth $628 to Vijay Shekhar Sharma

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.