Close Menu
Ztoog
    What's Hot
    AI

    MusicMagus: Harnessing Diffusion Models for Zero-Shot Text-to-Music Editing

    Crypto

    Andalusia Labs raises $48M Series A to improve digital asset risk infrastructure

    The Future

    Sweater that mimics polar bear fur may keep you warm in extreme cold

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      Any wall can be turned into a camera to see around corners

      JD Vance and President Trump’s Sons Hype Bitcoin at Las Vegas Conference

      AI may already be shrinking entry-level jobs in tech, new research suggests

      Today’s NYT Strands Hints, Answer and Help for May 26 #449

      LiberNovo Omni: The World’s First Dynamic Ergonomic Chair

    • Technology

      A Replit employee details a critical security flaw in web apps created using AI-powered app builder Lovable that exposes API keys and personal info of app users (Reed Albergotti/Semafor)

      Gemini in Google Drive can now help you skip watching that painfully long Zoom meeting

      Apple iPhone exports from China to the US fall 76% as India output surges

      Today’s NYT Wordle Hints, Answer and Help for May 26, #1437

      5 Skills Kids (and Adults) Need in an AI World – O’Reilly

    • Gadgets

      Future-proof your career by mastering AI skills for just $20

      8 Best Vegan Meal Delivery Services and Kits (2025), Tested and Reviewed

      Google Home is getting deeper Gemini integration and a new widget

      Google Announces AI Ultra Subscription Plan With Premium Features

      Google shows off Android XR-based glasses, announces Warby Parker team-up

    • Mobile

      Deals: the Galaxy S25 series comes with a free tablet, Google Pixels heavily discounted

      Microsoft is done being subtle – this new tool screams “upgrade now”

      Wallpaper Wednesday: Android wallpapers 2025-05-28

      Google can make smart glasses accessible with Warby Parker, Gentle Monster deals

      vivo T4 Ultra specs leak

    • Science

      Analysts Say Trump Trade Wars Would Harm the Entire US Energy Sector, From Oil to Solar

      Do we have free will? Quantum experiments may soon reveal the answer

      Was Planet Nine exiled from the solar system as a baby?

      How farmers can help rescue water-loving birds

      A trip to the farm where loofahs grow on vines

    • AI

      Rationale engineering generates a compact new tool for gene therapy | Ztoog

      The AI Hype Index: College students are hooked on ChatGPT

      Learning how to predict rare kinds of failures | Ztoog

      Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

      AI learns how vision and sound are connected, without human intervention | Ztoog

    • Crypto

      GameStop bought $500 million of bitcoin

      CoinW Teams Up with Superteam Europe to Conclude Solana Hackathon and Accelerate Web3 Innovation in Europe

      Ethereum Net Flows Turn Negative As Bulls Push For $3,500

      Bitcoin’s Power Compared To Nuclear Reactor By Brazilian Business Leader

      Senate advances GENIUS Act after cloture vote passes

    Ztoog
    Home » Meta AI Unveils SeamlessM4T: A Foundational Multilingual and Multitask Model that Seamlessly Translates and Transcribes Across Speech and Text
    AI

    Meta AI Unveils SeamlessM4T: A Foundational Multilingual and Multitask Model that Seamlessly Translates and Transcribes Across Speech and Text

    Facebook Twitter Pinterest WhatsApp
    Meta AI Unveils SeamlessM4T: A Foundational Multilingual and Multitask Model that Seamlessly Translates and Transcribes Across Speech and Text
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    In a world the place interactions are more and more international, being multilingual can bridge gaps, foster understanding, and open doorways to numerous alternatives. Learning a number of languages can present insights into language construction and linguistics, deepening one’s understanding of the mechanics of communication and thought. This may be particularly helpful in at this time’s globalized world, the place cross-cultural interactions are frequent. Don’t you suppose this bridge must be crammed even between the people and the AI?

    Researchers from MetaAI and UC Berkley suggest a foundational multilingual and multitask mannequin that seamlessly interprets and transcribes throughout speech and textual content. They name it “SeamlessM4T”. The M4T within the identify stands for Massively Multilingual and Multimodal Machine Translation. It is an AI mannequin with speech-to-text, speech-to-speech, text-to-speech, text-to-text translation, and computerized speech recognition for as much as 100 languages. 

    Who isn’t conversant in Babel Fish ( a web based translator )? What is the issue with it? Babel Fish is a speech-to-speech translation system. Various present methods of such sort are inclined to concentrate on high-resource languages akin to English, Spanish, and French, leaving many low-resource languages behind. Their providers are largely translations from English to different languages and not vice-versa. These methods depend on cascade methods composed of a number of subsystems, so their efficiency doesn’t match their cascade counterparts.

    To resolve these limitations, researchers used over 1 million hours of open speech audio knowledge to study self-supervised speech. They created a multimodal corpus of mechanically aligned speech translations of greater than 470,000 hours! To consider the mannequin’s robustness in opposition to the background noises and speaker, they created open robustness benchmarks and discovered an enchancment of 38% and 49%, respectively.

    Researchers say that they maintained systematic evaluations for his or her system all through their workflow to make sure protected and sturdy efficiency. They used parallel knowledge mining different to utilizing closed knowledge. This methodology includes encoding sentences from varied languages right into a fixed-size embedding area and discovering parallel cases based mostly on a similarity metric.

    Creating a unified massive mannequin that can deal with the complete suite of duties concerned in textual content and speech translation lays the vital groundwork for the subsequent technology of on-device and on-demand multimodal translation. They say that when language applied sciences are developed primarily with this idealogy in thoughts, the wants of half of the world’s inhabitants are resolved, and their future work includes bridging this hole between those that converse excessive and low-resource languages to steer the world in a route that has by no means been extra interconnected. 

    Researchers say that their mannequin SeamlessM4T efficiency could must be extra constant with regards to translating slang or correct nouns throughout excessive and low-resource languages. Their future work would resolve this limitation to have a extra pleasant and reasonable dialog based mostly on one’s mom tongue and slang. 


    Check out the Paper, Project, and Reference Article. All Credit For This Research Goes To the Researchers on This Project. Also, don’t overlook to hitch our 29k+ ML SubReddit, 40k+ Facebook Community, Discord Channel, and Email Newsletter, the place we share the newest AI analysis information, cool AI initiatives, and extra.

    ➡️ Hostinger AI Website Builder: User-Friendly Drag-and-Drop Editor. Try Now (Sponsored)


    Arshad is an intern at MarktechPost. He is at the moment pursuing his Int. MSc Physics from the Indian Institute of Technology Kharagpur. Understanding issues to the elemental stage results in new discoveries which result in development in know-how. He is keen about understanding the character essentially with the assistance of instruments like mathematical fashions, ML fashions and AI.


    🚀 CodiumAI permits busy builders to generate significant assessments (Sponsored)

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    Rationale engineering generates a compact new tool for gene therapy | Ztoog

    AI

    The AI Hype Index: College students are hooked on ChatGPT

    AI

    Learning how to predict rare kinds of failures | Ztoog

    AI

    Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

    AI

    AI learns how vision and sound are connected, without human intervention | Ztoog

    AI

    How AI is introducing errors into courtrooms

    AI

    With AI, researchers predict the location of virtually any protein within a human cell | Ztoog

    AI

    Google DeepMind’s new AI agent cracks real-world problems better than humans can

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    Technology

    Generative AI in the Enterprise – O’Reilly

    Generative AI has been the greatest know-how story of 2023. Almost everyone’s performed with ChatGPT,…

    AI

    Generative AI: Differentiating disruptors from the disrupted

    The overarching message from this analysis is that plans amongst company leaders to disrupt competitors…

    The Future

    Dog Essentials List: 13 Necessities for New Dog Owners

    Everyone desires an cute pet however not everybody understands simply how a lot preparation is…

    Technology

    Prison Architect 2 delayed again, this time to September

    A second delay has been confirmed for Prison Architect 2, lower than a month earlier…

    Technology

    Nest revival? Google’s smart speakers may be poised for a long-due refresh

    Lily Katz / Android AuthorityTL;DR Some code discovered within the newest Google Home app appears…

    Our Picks
    Gadgets

    Samsung’s Bot Fit Wearable Assistive Robot Set For CES 2024 Launch

    The Future

    Doctor Who’s New Streaming Home Has Been a Huge Success

    The Future

    Discord file links will expire after a day to fight malware

    Categories
    • AI (1,493)
    • Crypto (1,753)
    • Gadgets (1,805)
    • Mobile (1,851)
    • Science (1,866)
    • Technology (1,802)
    • The Future (1,648)
    Most Popular
    AI

    Synthetic imagery sets new bar in AI training efficiency | Ztoog

    Crypto

    NFT startup Rario founders to leave a year after $120M funding

    Science

    Explore a digitized collection of doomed Everest climber’s letters home

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.