Close Menu
Ztoog
    What's Hot
    Crypto

    Bitcoin and Ethereum decline on the week, Worldcoin to launch a new Orb and Terraform Labs files for bankruptcy

    AI

    UCSC and TU Munich Researchers Propose RECAST: A New Deep Learning-Based Model to Forecast Aftershocks

    Crypto

    No All-Time High For Bitcoin In 2023, Former BitMEX Head Arthur Hayes Predicts

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      Are entangled qubits following a quantum Moore’s law?

      Disneyland’s 70th Anniversary Brings Cartoony Chaos to This Summer’s Celebration

      Story of military airfield in Afghanistan that Biden left in 2021

      Tencent hires WizardLM team, a Microsoft AI group with an odd history

      Today’s NYT Connections Hints, Answers for May 12, #701

    • Technology

      Crypto elite increasingly worried about their personal safety

      Deep dive on the evolution of Microsoft's relationship with OpenAI, from its $1B investment in 2019 through Copilot rollouts and ChatGPT's launch to present day (Bloomberg)

      New leak reveals iPhone Fold won’t look like the Galaxy Z Fold 6 at all

      Apple will use AI and user data in iOS 19 to extend iPhone battery life

      Today’s NYT Wordle Hints, Answer and Help for May 12, #1423

    • Gadgets

      The market’s down, but this OpenAI for the stock market can help you trade up

      We Hand-Picked the 24 Best Deals From the 2025 REI Anniversary Sale

      “Google wanted that”: Nextcloud decries Android permissions as “gatekeeping”

      Google Tests Automatic Password-to-Passkey Conversion On Android

      Maono Caster G1 Neo & PD200X Review: Budget Streaming Gear for Aspiring Creators

    • Mobile

      The Forerunner 570 & 970 have made Garmin’s tiered strategy clearer than ever

      The iPhone Fold is now being tested with an under-display camera

      T-Mobile takes over one of golf’s biggest events, unleashes unique experiences

      Fitbit’s AI experiments just leveled up with 3 new health tracking features

      Motorola’s Moto Watch needs to start living up to the brand name

    • Science

      Risk of a star destroying the solar system is higher than expected

      Do these Buddhist gods hint at the purpose of China’s super-secret satellites?

      From Espresso to Eco-Brick: How Coffee Waste Fuels 3D-Printed Design

      Ancient three-eyed ‘sea moth’ used its butt to breathe

      Intelligence on Earth Evolved Independently at Least Twice

    • AI

      With AI, researchers predict the location of virtually any protein within a human cell | Ztoog

      Google DeepMind’s new AI agent cracks real-world problems better than humans can

      Study shows vision-language models can’t handle queries with negation words | Ztoog

      How a new type of AI is helping police skirt facial recognition bans

      Hybrid AI model crafts smooth, high-quality videos in seconds | Ztoog

    • Crypto

      Is Bitcoin Bull Run Back? Daily RSI Shows Only Mild Bullish Momentum

      Robinhood grows its footprint in Canada by acquiring WonderFi

      HashKey Group Announces Launch of HashKey Global MENA with VASP License in UAE

      Ethereum Breaks Key Resistance In One Massive Move – Higher High Confirms Momentum

      ‘The Big Short’ Coming For Bitcoin? Why BTC Will Clear $110,000

    Ztoog
    Home » Meet Surya: A Multilingual Text Line Detection AI Model for Documents
    AI

    Meet Surya: A Multilingual Text Line Detection AI Model for Documents

    Facebook Twitter Pinterest WhatsApp
    Meet Surya: A Multilingual Text Line Detection AI Model for Documents
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp





    In a current tweet from the founding father of Dataquest.io, Vik Paruchuri lately publicized the launch of a multilingual doc OCR toolkit, Surya. The framework can effectively detect line-level bboxes and column breaks in paperwork, scanned photos, or displays. The current textual content detection fashions like Tesseract work on the phrase or character stage, whereas this open-source AI works on the line stage. The greatest problem in constructing a text-line detection mannequin is the unavailability of 100% right datasets with line-level annotations. 

    Surya is an encoder-decoder mannequin utilizing a picture of the doc as enter and produces a picture with containers drawn across the line containers on the unique enter picture. The preliminary layers of the decoder comprise SegFormer, a transformer for semantic segmentation, whereas the 2nd convolutional layer with batch-normalization layers makes the tip of the decoder community. Before utilizing the picture or PDF, the pages are cut up into segments to the utmost dimension of the picture and bear varied pre-processing. 

    For mannequin analysis for the accuracy of bboxes, researchers used precision and recall on the protection space as a substitute of the normal IoU metric (Intersection over union). The precision calculates how properly predicted bboxes cowl floor reality bboxes and recall calculates how properly floor reality bboxes cowl predicted bboxes. Surya is in contrast with Tesseract, experiments instructed that the precision of Surya is way larger than that of Tesseract, and Tesseract’s recall is barely greater than that of Surya however general Surya outperforms Tesseract. Another benefit of Surya over the Tesseract mannequin is that it could actually work each on CPU and GPU and is way quicker than Tesseract.

    Surya, named after the Hindu God of the Sun, has efficiently labored on a number of languages and is anticipated to work on nearly all languages. The limitation of this mannequin isn’t more likely to work on photographs or different photos as it’s specialised on paperwork. Experiments additionally present it doesn’t work properly with photos that seem like advertisements. In spite of this limitation, the mannequin continues to be of nice use and might be additional expanded to textual content detection, desk, and chart detection.


    Pragati Jhunjhunwala is a consulting intern at MarktechPost. She is presently pursuing her B.Tech from the Indian Institute of Technology(IIT), Kharagpur. She is a tech fanatic and has a eager curiosity within the scope of software program and knowledge science purposes. She is at all times studying in regards to the developments in numerous discipline of AI and ML.


    🐝 Join the Fastest Growing AI Research Newsletter Read by Researchers from Google + NVIDIA + Meta + Stanford + MIT + Microsoft and lots of others…






    Previous articleAnthropic AI Experiment Reveals Trained LLMs Harbor Malicious Intent, Defying Safety Measures


    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    With AI, researchers predict the location of virtually any protein within a human cell | Ztoog

    AI

    Google DeepMind’s new AI agent cracks real-world problems better than humans can

    AI

    Study shows vision-language models can’t handle queries with negation words | Ztoog

    AI

    How a new type of AI is helping police skirt facial recognition bans

    AI

    Hybrid AI model crafts smooth, high-quality videos in seconds | Ztoog

    AI

    How to build a better AI benchmark

    AI

    Q&A: A roadmap for revolutionizing health care through data-driven innovation | Ztoog

    AI

    This data set helps researchers spot harmful stereotypes in LLMs

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    Crypto

    Can Avalanche Find Traction After 15% Plunge?

    AVAX, native token of the Avalanche community, made a convincing entrance into the cryptocurrency markets…

    Technology

    Today’s Wordle Hints and Answer: Help for April 28, #1044

    Today’s Wordle reply should not be too onerous, and it has a generally recognized which…

    Mobile

    Don’t wait for Black Friday: Best Buy just slashed $130 off one of my favorite Chromebooks

    Black Friday would possibly nonetheless be just a few weeks away, however you do not…

    Mobile

    The incredible Galaxy Tab S8 Ultra plunges in price at Best Buy; snag one with a whopping $400 discount while you can

    Attention, consideration! An incredible, jaw-dropping deal is heading your means, and you ought to undoubtedly…

    AI

    Deci AI Introduces DeciLM-7B: A Super Fast and Super Accurate 7 Billion-Parameter Large Language Model (LLM)

    In the ever-evolving area of technological developments, language fashions have develop into indispensable. These techniques,…

    Our Picks
    Mobile

    Honor 90 to make its global debut in July

    The Future

    World Backup Day Deals: 40 Early Deals on SSDs, Flash Drives, SD Cards and More

    Technology

    Enter Netflix House: unforgettable interactive adventures

    Categories
    • AI (1,487)
    • Crypto (1,748)
    • Gadgets (1,800)
    • Mobile (1,844)
    • Science (1,859)
    • Technology (1,795)
    • The Future (1,641)
    Most Popular
    Technology

    The performance brute, reborn with more power and grunt- Technology News, Firstpost

    Gadgets

    Kokopelli Chasm-Lite Stand-Up Paddleboard Review: Inflatable Summer Fun

    Crypto

    Here’s What Happened To Bitcoin The Last Time It Appeared

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.