Close Menu
Ztoog
    What's Hot
    Crypto

    Bitcoin Evangelist Does Not See Bitcoin Hitting $30K

    AI

    Reimagining cloud strategy for AI-first enterprises

    Science

    Europe’s oldest megastructure may be at the bottom of the sea

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      What is Project Management? 5 Best Tools that You Can Try

      Operational excellence strategy and continuous improvement

      Hannah Fry: AI isn’t as powerful as we think

      FanDuel goes all in on responsible gaming push with new Play with a Plan campaign

      Gettyimages.com Is the Best Website on the Internet Right Now

    • Technology

      Iran war: How could it end?

      Democratic senators question CFTC staffing cuts in Chicago enforcement office

      Google’s Cloud AI lead on the three frontiers of model capability

      AMD agrees to backstop a $300M loan from Goldman Sachs for Crusoe to buy AMD AI chips, the first known case of AMD chips used as debt collateral (The Information)

      Productivity apps failed me when I needed them most

    • Gadgets

      macOS Tahoe 26.3.1 update will “upgrade” your M5’s CPU to new “super” cores

      Lenovo Shows Off a ThinkBook Modular AI PC Concept With Swappable Ports and Detachable Displays at MWC 2026

      POCO M8 Review: The Ultimate Budget Smartphone With Some Cons

      The Mission: Impossible of SSDs has arrived with a fingerprint lock

      6 Best Phones With Headphone Jacks (2026), Tested and Reviewed

    • Mobile

      Android’s March update is all about finding people, apps, and your missing bags

      Watch Xiaomi’s global launch event live here

      Our poll shows what buyers actually care about in new smartphones (Hint: it’s not AI)

      Is Strava down for you? You’re not alone

      The Motorola Razr FIFA World Cup 2026 Edition was literally just unveiled, and Verizon is already giving them away

    • Science

      Big Tech Signs White House Data Center Pledge With Good Optics and Little Substance

      Inside the best dark matter detector ever built

      NASA’s Artemis moon exploration programme is getting a major makeover

      Scientists crack the case of “screeching” Scotch tape

      Blue-faced, puffy-lipped monkey scores a rare conservation win

    • AI

      Online harassment is entering its AI era

      Meet NullClaw: The 678 KB Zig AI Agent Framework Running on 1 MB RAM and Booting in Two Milliseconds

      New method could increase LLM training efficiency | Ztoog

      The human work behind humanoid robots is being hidden

      NVIDIA Releases DreamDojo: An Open-Source Robot World Model Trained on 44,711 Hours of Real-World Human Video Data

    • Crypto

      Google paid startup Form Energy $1B for its massive 100-hour battery

      Ethereum Breakout Alert: Corrective Channel Flip Sparks Impulsive Wave

      Show Your ID Or No Deal

      Jane Street sued for alleged front-running trades that accelerated Terraform Labs meltdown

      Bitcoin Trades Below ETF Cost-Basis As MVRV Signals Mounting Pressure

    Ztoog
    Home » English learners can now practice speaking on Search – Google Research Blog
    AI

    English learners can now practice speaking on Search – Google Research Blog

    Facebook Twitter Pinterest WhatsApp
    English learners can now practice speaking on Search – Google Research Blog
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Posted by Christian Plagemann, Director, and Katya Cox, Product Manager, Google Research

    Learning a language can open up new alternatives in an individual’s life. It can assist individuals join with these from totally different cultures, journey the world, and advance their profession. English alone is estimated to have 1.5 billion learners worldwide. Yet proficiency in a brand new language is troublesome to realize, and lots of learners cite a scarcity of alternative to practice speaking actively and receiving actionable suggestions as a barrier to studying.

    We are excited to announce a brand new function of Google Search that helps individuals practice speaking and enhance their language expertise. Within the following few days, Android customers in Argentina, Colombia, India (Hindi), Indonesia, Mexico, and Venezuela can get much more language assist from Google by means of interactive speaking practice in English — increasing to extra nations and languages sooner or later. Google Search is already a beneficial device for language learners, offering translations, definitions, and different assets to enhance vocabulary. Now, learners translating to or from English on their Android telephones will discover a new English speaking practice expertise with personalised suggestions.

    A brand new function of Google Search permits learners
    to practice speaking phrases in context.

    Learners are introduced with real-life prompts after which kind their very own spoken solutions utilizing a offered vocabulary phrase. They interact in practice classes of 3-5 minutes, getting personalised suggestions and the choice to enroll in day by day reminders to maintain practising. With solely a smartphone and a few high quality time, learners can practice at their very own tempo, anytime, wherever.

    Activities with personalised suggestions, to complement current studying instruments

    Designed for use alongside different studying providers and assets, like private tutoring, cellular apps, and courses, the brand new speaking practice function on Google Search is one other device to help learners on their journey.

    We have partnered with linguists, academics, and ESL/EFL pedagogical specialists to create a speaking practice expertise that’s efficient and motivating. Learners practice vocabulary in genuine contexts, and materials is repeated over dynamic intervals to extend retention — approaches which are recognized to be efficient in serving to learners grow to be assured audio system. As one companion of ours shared:

    “Speaking in a given context is a talent that language learners usually lack the chance to practice. Therefore this device could be very helpful to enrich courses and different assets.” – Judit Kormos, Professor, Lancaster University

    We are additionally excited to be working with a number of language studying companions to floor content material they’re serving to create and to attach them with learners around the globe. We look ahead to increasing this program additional and dealing with any companion.

    Personalized real-time suggestions

    Every learner is totally different, so delivering personalised suggestions in actual time is a key a part of efficient practice. Responses are analyzed to supply useful, real-time options and corrections.

    The system provides semantic suggestions, indicating whether or not their response was related to the query and could also be understood by a dialog companion. Grammar suggestions offers insights into attainable grammatical enhancements, and a set of instance solutions at various ranges of language complexity give concrete options for other ways to reply on this context.

    The suggestions consists of three components: Semantic evaluation, grammar correction, and instance solutions.

    Contextual translation

    Among the a number of new applied sciences we developed, contextual translation offers the flexibility to translate particular person phrases and phrases in context. During practice classes, learners can faucet on any phrase they don’t perceive to see the interpretation of that phrase contemplating its context.

    Example of contextual translation function.

    This is a troublesome technical job, since particular person phrases in isolation usually have a number of different meanings, and a number of phrases can kind clusters of that means that should be translated in unison. Our novel strategy interprets your complete sentence, then estimates how the phrases within the authentic and the translated textual content relate to one another. This is usually often called the phrase alignment downside.

    Example of a translated sentence pair and its phrase alignment. A deep studying alignment mannequin connects the totally different phrases that create the that means to recommend a translation.

    The key expertise piece that allows this performance is a novel deep studying mannequin developed in collaboration with the Google Translate workforce, known as Deep Aligner. The fundamental concept is to take a multilingual language mannequin skilled on tons of of languages, then fine-tune a novel alignment mannequin on a set of phrase alignment examples (see the determine above for an instance) offered by human specialists, for a number of language pairs. From this, the only mannequin can then precisely align any language pair, reaching state-of-the-art alignment error charge (AER, a metric to measure the standard of phrase alignments, the place decrease is healthier). This single new mannequin has led to dramatic enhancements in alignment high quality throughout all examined language pairs, decreasing common AER from 25% to five% in comparison with alignment approaches primarily based on Hidden Markov fashions (HMMs).

    Alignment error charges (decrease is healthier) between English (EN) and different languages.

    This mannequin can also be included into Google’s translation APIs, tremendously enhancing, for instance, the formatting of translated PDFs and web sites in Chrome, the interpretation of YouTube captions, and enhancing Google Cloud’s translation API.

    Grammar suggestions

    To allow grammar suggestions for accented spoken language, our analysis groups tailored grammar correction fashions for written textual content (see the weblog and paper) to work on computerized speech recognition (ASR) transcriptions, particularly for the case of accented speech. The key step was fine-tuning the written textual content mannequin on a corpus of human and ASR transcripts of accented speech, with expert-provided grammar corrections. Furthermore, impressed by earlier work, the groups developed a novel edit-based output illustration that leverages the excessive overlap between the inputs and outputs that’s notably well-suited for brief enter sentences widespread in language studying settings.

    The edit illustration can be defined utilizing an instance:

    • Input: I1 am2 so3 unhealthy4 cooking5
    • Correction: I1 am2 so3 unhealthy4 at5 cooking6
    • Edits: (‘at’, 4, PREPOSITION, 4)

    In the above, “at” is the phrase that’s inserted at place 4 and “PREPOSITION” denotes that is an error involving prepositions. We used the error tag to pick tag-dependent acceptance thresholds that improved the mannequin additional. The mannequin elevated the recall of grammar issues from 4.6% to 35%.

    Some instance output from our mannequin and a mannequin skilled on written corpora:

        Example 1     Example 2
    User enter (transcribed speech)

    I reside of my career. I would like a environment friendly card and dependable.
    Text-based grammar mannequin

    I reside by my career. I would like an environment friendly card and a dependable.
    New speech-optimized mannequin

    I reside off my career. I would like an environment friendly and dependable card.

    Semantic evaluation

    A major aim of dialog is to speak one’s intent clearly. Thus, we designed a function that visually communicates to the learner whether or not their response was related to the context and can be understood by a companion. This is a troublesome technical downside, since early language learners’ spoken responses can be syntactically unconventional. We needed to rigorously steadiness this expertise to focus on the readability of intent relatively than correctness of syntax.

    Our system makes use of a mixture of two approaches:

    1. Sensibility classification: Large language fashions like LaMDA or PaLM are designed to present pure responses in a dialog, so it’s no shock that they do effectively on the reverse: judging whether or not a given response is contextually smart.
    2. Similarity to good responses: We used an encoder structure to match the learner’s enter to a set of recognized good responses in a semantic embedding area. This comparability offers one other helpful sign on semantic relevance, additional enhancing the standard of suggestions and options we offer.
    The system offers suggestions about whether or not the response was related to the immediate, and can be understood by a communication companion.

    ML-assisted content material improvement

    Our obtainable practice actions current a mixture of human-expert created content material, and content material that was created with AI help and human evaluation. This consists of speaking prompts, focus phrases, in addition to units of instance solutions that showcase significant and contextual responses.

    An inventory of instance solutions is offered when the learner receives suggestions and after they faucet the assistance button.

    Since learners have totally different ranges of potential, the language complexity of the content material needs to be adjusted appropriately. Prior work on language complexity estimation focuses on textual content of paragraph size or longer, which differs considerably from the kind of responses that our system processes. Thus, we developed novel fashions that can estimate the complexity of a single sentence, phrase, and even particular person phrases. This is difficult as a result of even a phrase composed of straightforward phrases can be exhausting for a language learner (e.g., “Let’s lower to the chase”). Our finest mannequin relies on BERT and achieves complexity predictions closest to human professional consensus. The mannequin was pre-trained utilizing a big set of LLM-labeled examples, after which fine-tuned utilizing a human professional–labeled dataset.

    Mean squared error of assorted approaches’ efficiency estimating content material issue on a various corpus of ~450 conversational passages (textual content / transcriptions). Top row: Human raters labeled the objects on a scale from 0.0 to five.0, roughly aligned to the CEFR scale (from A1 to C2). Bottom 4 rows: Different fashions carried out the identical job, and we present the distinction to the human professional consensus.

    Using this mannequin, we can consider the problem of textual content objects, provide a various vary of options, and most significantly problem learners appropriately for his or her potential ranges. For instance, utilizing our mannequin to label examples, we can fine-tune our system to generate speaking prompts at varied language complexity ranges.

    Vocabulary focus phrases, to be elicited by the questions
        guitar     apple     lion
    Simple     What do you prefer to play?     Do you want fruit?     Do you want massive cats?
    Intermediate     Do you play any musical devices?     What is your favourite fruit?     What is your favourite animal?
    Complex     What stringed instrument do you get pleasure from taking part in?     Which sort of fruit do you get pleasure from consuming for its crunchy texture and candy taste?     Do you get pleasure from watching massive, highly effective predators?

    Furthermore, content material issue estimation is used to step by step enhance the duty issue over time, adapting to the learner’s progress.

    Conclusion

    With these newest updates, which can roll out over the following few days, Google Search has grow to be much more useful. If you might be an Android person in India (Hindi), Indonesia, Argentina, Colombia, Mexico, or Venezuela, give it a strive by translating to or from English with Google.

    We look ahead to increasing to extra nations and languages sooner or later, and to begin providing companion practice content material quickly.

    Acknowledgements

    Many individuals have been concerned within the improvement of this mission. Among many others, we thank our exterior advisers within the language studying area: Jeffrey Davitz, Judit Kormos, Deborah Healey, Anita Bowles, Susan Gaer, Andrea Revesz, Bradley Opatz, and Anne Mcquade.

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    Online harassment is entering its AI era

    AI

    Meet NullClaw: The 678 KB Zig AI Agent Framework Running on 1 MB RAM and Booting in Two Milliseconds

    AI

    New method could increase LLM training efficiency | Ztoog

    AI

    The human work behind humanoid robots is being hidden

    AI

    NVIDIA Releases DreamDojo: An Open-Source Robot World Model Trained on 44,711 Hours of Real-World Human Video Data

    AI

    Personalization features can make LLMs more agreeable | Ztoog

    AI

    AI is already making online crimes easier. It could get much worse.

    AI

    NVIDIA Researchers Introduce KVTC Transform Coding Pipeline to Compress Key-Value Caches by 20x for Efficient LLM Serving

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    AI

    Mistral AI Unveils Breakthrough in Language Models with MoE 8x7B Release

    A Paris-based startup, Mistral AI, has launched a language mannequin, the MoE 8x7B. Mistral LLM…

    Mobile

    Android KitKat reaches end as Google Play services discontinues support

    TL;DR Google has introduced that Google Play services will discontinue support for Android KitKat. The…

    The Future

    Google strikes a deal with Reddit to boost AI possibilities, reportedly for $60M/year

    The partnership between the multinational know-how firm and social information aggregation is about to strengthen…

    The Future

    Bridging the Gap Between Money and Technology

    The trajectory of the total globe should be in step with the achievement of SDG…

    Science

    This Jurassic-era ‘sea murderer’ was among the first of its kind

    A life-sized reconstruction of the head and jaws of the oldest megapredatory pliosaur known as…

    Our Picks
    Science

    Passing star could fling Earth out past Pluto into the Oort cloud

    The Future

    Why Shredder’s Not in Teenage Mutant Ninja Turtles Mutant Mayhem

    The Future

    How Corporate Video Production Agencies Are Revolutionising Traditional Marketing

    Categories
    • AI (1,560)
    • Crypto (1,826)
    • Gadgets (1,870)
    • Mobile (1,910)
    • Science (1,939)
    • Technology (1,862)
    • The Future (1,716)
    Most Popular
    Crypto

    It’s a bird, it’s a plane, no, it’s a crypto whale

    Technology

    The Seven Tech Stocks Driving the S&P 500 to New Highs

    Gadgets

    30 years later, FreeDOS is still keeping the dream of the command prompt alive

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2026 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.