Close Menu
Ztoog
    What's Hot
    Science

    Roo-ver: Australia’s first moon rover has name chosen in public vote

    Technology

    Fearing the Wrong Thing – O’Reilly

    Mobile

    Samsung is reportedly ‘testing’ a Galaxy Z Flip 6 with a drastically upgraded camera

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      Today’s NYT Mini Crossword Answers for June 7

      ScanWatch Nova Brilliant – 30-day battery meets luxury design

      How to Get Bot Lobbies in Fortnite? (2025 Guide)

      Can work-life balance tracking improve well-being?

      Any wall can be turned into a camera to see around corners

    • Technology

      Human-Centered AI, Spatial Intelligence, and the Future of Practice – O’Reilly

      Celebrating Engineering Pioneers at IEEE VIC Summit

      What does a millennial midlife crisis look like?

      Elon Musk tries to stick to spaceships

      A Replit employee details a critical security flaw in web apps created using AI-powered app builder Lovable that exposes API keys and personal info of app users (Reed Albergotti/Semafor)

    • Gadgets

      Nintendo Switch 2’s faster chip can dramatically improve original Switch games

      Nothing Phone 3 Officially Set To Launch On July 1st

      Watch Apple’s WWDC 2025 keynote right here

      Future-proof your career by mastering AI skills for just $20

      8 Best Vegan Meal Delivery Services and Kits (2025), Tested and Reviewed

    • Mobile

      Follow these warnings from the FBI and New York Police so you don’t get scammed

      Samsung Galaxy S25 vs Google Pixel 9 deals

      YouTube is testing a leaderboard to show off top live stream fans

      Deals: the Galaxy S25 series comes with a free tablet, Google Pixels heavily discounted

      Microsoft is done being subtle – this new tool screams “upgrade now”

    • Science

      Could we build space-time computers that run on gravity?

      Why it’s taking a century to pin down the speed of the universe

      Some parts of Trump’s proposed budget for NASA are literally draconian

      June skygazing: A strawberry moon, the summer solstice… and Asteroid Day!

      Analysts Say Trump Trade Wars Would Harm the Entire US Energy Sector, From Oil to Solar

    • AI

      Manus has kick-started an AI agent boom in China

      Teaching AI models what they don’t know | Ztoog

      Fueling seamless AI at scale

      Rationale engineering generates a compact new tool for gene therapy | Ztoog

      The AI Hype Index: College students are hooked on ChatGPT

    • Crypto

      Metaplanet’s Bitcoin Bet Just Got Bigger—Here’s What Changed

      JPMorgan Chase set to accept Bitcoin, crypto ETFs as loan collateral

      Bitcoin Maxi Isn’t Buying Hype Around New Crypto Holding Firms

      GameStop bought $500 million of bitcoin

      CoinW Teams Up with Superteam Europe to Conclude Solana Hackathon and Accelerate Web3 Innovation in Europe

    Ztoog
    Home » Meet Surya: A Multilingual Text Line Detection AI Model for Documents
    AI

    Meet Surya: A Multilingual Text Line Detection AI Model for Documents

    Facebook Twitter Pinterest WhatsApp
    Meet Surya: A Multilingual Text Line Detection AI Model for Documents
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp





    In a current tweet from the founding father of Dataquest.io, Vik Paruchuri lately publicized the launch of a multilingual doc OCR toolkit, Surya. The framework can effectively detect line-level bboxes and column breaks in paperwork, scanned photos, or displays. The current textual content detection fashions like Tesseract work on the phrase or character stage, whereas this open-source AI works on the line stage. The greatest problem in constructing a text-line detection mannequin is the unavailability of 100% right datasets with line-level annotations. 

    Surya is an encoder-decoder mannequin utilizing a picture of the doc as enter and produces a picture with containers drawn across the line containers on the unique enter picture. The preliminary layers of the decoder comprise SegFormer, a transformer for semantic segmentation, whereas the 2nd convolutional layer with batch-normalization layers makes the tip of the decoder community. Before utilizing the picture or PDF, the pages are cut up into segments to the utmost dimension of the picture and bear varied pre-processing. 

    For mannequin analysis for the accuracy of bboxes, researchers used precision and recall on the protection space as a substitute of the normal IoU metric (Intersection over union). The precision calculates how properly predicted bboxes cowl floor reality bboxes and recall calculates how properly floor reality bboxes cowl predicted bboxes. Surya is in contrast with Tesseract, experiments instructed that the precision of Surya is way larger than that of Tesseract, and Tesseract’s recall is barely greater than that of Surya however general Surya outperforms Tesseract. Another benefit of Surya over the Tesseract mannequin is that it could actually work each on CPU and GPU and is way quicker than Tesseract.

    Surya, named after the Hindu God of the Sun, has efficiently labored on a number of languages and is anticipated to work on nearly all languages. The limitation of this mannequin isn’t more likely to work on photographs or different photos as it’s specialised on paperwork. Experiments additionally present it doesn’t work properly with photos that seem like advertisements. In spite of this limitation, the mannequin continues to be of nice use and might be additional expanded to textual content detection, desk, and chart detection.


    Pragati Jhunjhunwala is a consulting intern at MarktechPost. She is presently pursuing her B.Tech from the Indian Institute of Technology(IIT), Kharagpur. She is a tech fanatic and has a eager curiosity within the scope of software program and knowledge science purposes. She is at all times studying in regards to the developments in numerous discipline of AI and ML.


    🐝 Join the Fastest Growing AI Research Newsletter Read by Researchers from Google + NVIDIA + Meta + Stanford + MIT + Microsoft and lots of others…






    Previous articleAnthropic AI Experiment Reveals Trained LLMs Harbor Malicious Intent, Defying Safety Measures


    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    Manus has kick-started an AI agent boom in China

    AI

    Teaching AI models what they don’t know | Ztoog

    AI

    Fueling seamless AI at scale

    AI

    Rationale engineering generates a compact new tool for gene therapy | Ztoog

    AI

    The AI Hype Index: College students are hooked on ChatGPT

    AI

    Learning how to predict rare kinds of failures | Ztoog

    AI

    Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

    AI

    AI learns how vision and sound are connected, without human intervention | Ztoog

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    Gadgets

    LG Transforming into a Smart Life Solution Company: 2024 Checkpoint

    For the previous 12 months or so, LG Electronics has been engaged on remodeling itself…

    Gadgets

    Formula 1 chief appalled to find team using Excel to manage 20,000 car parts

    Enlarge / A pit cease in the course of the Bahrain Formula One Grand Prix…

    Crypto

    How Urvashi Barooah broke into venture after everyone told her she couldn’t

    When Urvashi Barooah utilized to MBA packages in 2015, she targeted her purposes round her…

    The Future

    ActivTrak vs Teramind: A detailed 2023 comparison

    Want to know the distinction between ActivTrak vs Teramind? If you’re on the lookout for…

    Gadgets

    Starlink Launches Direct-to-Cell Satellites For T-Mobile And Global Carriers

    SpaceX has launched the primary six Starlink satellites geared up with “Direct to Cell” capabilities,…

    Our Picks
    Crypto

    Analyst Sets Hefty Exit Price

    Crypto

    Coinbase Argues ‘Abuse of Process;’ Seeks to Dismiss SEC Case

    Gadgets

    Belkin Gets On Board With Qi2 Wireless Charging And More

    Categories
    • AI (1,496)
    • Crypto (1,756)
    • Gadgets (1,808)
    • Mobile (1,854)
    • Science (1,870)
    • Technology (1,806)
    • The Future (1,652)
    Most Popular
    AI

    Researchers from NYU and Meta AI Studies Improving Social Conversational Agents by Learning from Natural Dialogue between Users and a Deployed Model, without Extra Annotations

    Gadgets

    Humane urges customers to stop using charging case, citing battery fire concerns

    The Future

    Ilya Sutskever isn’t done working on AI safety

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.