Ztoog

    Meet Surya: A Multilingual Text Line Detection AI Model for Documents

    In a recent tweet, Vik Paruchuri, the founder of Dataquest.io, announced the launch of Surya, a multilingual document OCR toolkit. The framework can efficiently detect line-level bounding boxes (bboxes) and column breaks in documents, scanned images, or presentations. Existing text detection models such as Tesseract work at the word or character level, whereas this open-source model works at the line level. The biggest challenge in building a text-line detection model is the lack of fully accurate datasets with line-level annotations.

    Surya is an encoder-decoder model that takes an image of the document as input and produces an image with boxes drawn around the text lines of the original input. The initial layers of the decoder comprise SegFormer, a transformer for semantic segmentation, while a second convolutional layer with batch normalization forms the end of the decoder network. Before an image or PDF is processed, its pages are split into segments up to the maximum image size and undergo various pre-processing steps.
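
    The page-splitting step can be pictured with a minimal sketch. The article does not say how Surya chooses segment boundaries, so the function name `split_page`, the purely vertical tiling, and the 1000-pixel limit below are all assumptions made for illustration, not the toolkit's actual code.

```python
def split_page(height: int, max_height: int) -> list[tuple[int, int]]:
    """Return (top, bottom) row ranges that tile a page from top to bottom.

    Hypothetical helper: each segment is at most max_height rows tall,
    and the last segment holds whatever rows remain.
    """
    if height <= 0 or max_height <= 0:
        raise ValueError("height and max_height must be positive")
    segments = []
    top = 0
    while top < height:
        bottom = min(top + max_height, height)
        segments.append((top, bottom))
        top = bottom
    return segments


# A 2500-row page with an assumed 1000-row limit yields three segments.
print(split_page(2500, 1000))
```

    Each segment could then be run through the detector separately, with the predicted boxes shifted by the segment's top offset to map them back onto the full page.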

    To evaluate bbox accuracy, the researchers used precision and recall on the coverage area instead of the traditional Intersection over Union (IoU) metric. Precision measures how well the predicted bboxes cover the ground-truth bboxes, and recall measures how well the ground-truth bboxes cover the predicted bboxes. In a comparison with Tesseract, experiments suggested that Surya's precision is much higher, while Tesseract's recall is slightly higher; overall, Surya outperforms Tesseract. Another advantage of Surya is that it runs on both CPU and GPU and is much faster than Tesseract.
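
    A small sketch may make the coverage idea concrete. It follows one literal reading of the description above: "coverage" is the fraction of one set of boxes' area that is overlapped by the other set. The helper names, the integer pixel coordinates, and the absence of any matching threshold are assumptions; the actual Surya benchmark may compute these scores differently.

```python
Box = tuple[int, int, int, int]  # (x1, y1, x2, y2) in pixels, x2 and y2 exclusive


def pixels(boxes: list[Box]) -> set[tuple[int, int]]:
    """Collect every pixel covered by at least one box (the union of the boxes)."""
    covered = set()
    for x1, y1, x2, y2 in boxes:
        for x in range(x1, x2):
            for y in range(y1, y2):
                covered.add((x, y))
    return covered


def coverage(targets: list[Box], covers: list[Box]) -> float:
    """Fraction of the targets' area that is overlapped by the covers."""
    target_px = pixels(targets)
    if not target_px:
        return 0.0
    return len(target_px & pixels(covers)) / len(target_px)


ground_truth = [(0, 0, 10, 2)]  # one annotated text line, 20 pixels
predicted = [(0, 0, 5, 2)]      # prediction spans only its left half, 10 pixels

# Assumed mapping, taken from the article's wording:
precision = coverage(ground_truth, predicted)  # predictions covering ground truth
recall = coverage(predicted, ground_truth)     # ground truth covering predictions
print(precision, recall)
```

    In this toy case the prediction lies entirely inside the annotated line but spans only half of it, so the two scores differ (0.5 and 1.0), whereas IoU would collapse the same situation into a single number.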

    Surya, named after the Hindu god of the Sun, has worked successfully on several languages and is expected to work on almost all languages. Its main limitation is that it is unlikely to work on photographs or other non-document images, as it is specialized for documents. Experiments also show that it does not work well with images that look like advertisements. Despite these limitations, the model is still very useful and can be further extended to text, table, and chart detection.


    Pragati Jhunjhunwala is a consulting intern at MarktechPost. She is currently pursuing her B.Tech at the Indian Institute of Technology (IIT), Kharagpur. She is a tech enthusiast with a keen interest in software and data science applications, and she is always reading about developments in different fields of AI and ML.



