Close Menu
Ztoog
    What's Hot
    The Future

    120+ best Amazon Prime Day tech deals: earbuds, laptops, and more

    AI

    Q&A: A blueprint for sustainable innovation | Ztoog

    Mobile

    Samsung could launch a ring case with the Galaxy Z Fold 5 and Flip 5

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      Any wall can be turned into a camera to see around corners

      JD Vance and President Trump’s Sons Hype Bitcoin at Las Vegas Conference

      AI may already be shrinking entry-level jobs in tech, new research suggests

      Today’s NYT Strands Hints, Answer and Help for May 26 #449

      LiberNovo Omni: The World’s First Dynamic Ergonomic Chair

    • Technology

      A Replit employee details a critical security flaw in web apps created using AI-powered app builder Lovable that exposes API keys and personal info of app users (Reed Albergotti/Semafor)

      Gemini in Google Drive can now help you skip watching that painfully long Zoom meeting

      Apple iPhone exports from China to the US fall 76% as India output surges

      Today’s NYT Wordle Hints, Answer and Help for May 26, #1437

      5 Skills Kids (and Adults) Need in an AI World – O’Reilly

    • Gadgets

      Future-proof your career by mastering AI skills for just $20

      8 Best Vegan Meal Delivery Services and Kits (2025), Tested and Reviewed

      Google Home is getting deeper Gemini integration and a new widget

      Google Announces AI Ultra Subscription Plan With Premium Features

      Google shows off Android XR-based glasses, announces Warby Parker team-up

    • Mobile

      Deals: the Galaxy S25 series comes with a free tablet, Google Pixels heavily discounted

      Microsoft is done being subtle – this new tool screams “upgrade now”

      Wallpaper Wednesday: Android wallpapers 2025-05-28

      Google can make smart glasses accessible with Warby Parker, Gentle Monster deals

      vivo T4 Ultra specs leak

    • Science

      Analysts Say Trump Trade Wars Would Harm the Entire US Energy Sector, From Oil to Solar

      Do we have free will? Quantum experiments may soon reveal the answer

      Was Planet Nine exiled from the solar system as a baby?

      How farmers can help rescue water-loving birds

      A trip to the farm where loofahs grow on vines

    • AI

      Rationale engineering generates a compact new tool for gene therapy | Ztoog

      The AI Hype Index: College students are hooked on ChatGPT

      Learning how to predict rare kinds of failures | Ztoog

      Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

      AI learns how vision and sound are connected, without human intervention | Ztoog

    • Crypto

      GameStop bought $500 million of bitcoin

      CoinW Teams Up with Superteam Europe to Conclude Solana Hackathon and Accelerate Web3 Innovation in Europe

      Ethereum Net Flows Turn Negative As Bulls Push For $3,500

      Bitcoin’s Power Compared To Nuclear Reactor By Brazilian Business Leader

      Senate advances GENIUS Act after cloture vote passes

    Ztoog
    Home » Google Research at I/O 2023 – Ztoog
    AI

    Google Research at I/O 2023 – Ztoog

    Facebook Twitter Pinterest WhatsApp
    Google Research at I/O 2023 – Ztoog
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Posted by James Manyika, SVP Google Research and Technology & Society, and Jeff Dean, Chief Scientist, Google DeepMind and Google Research

    Wednesday, May tenth was an thrilling day for the Google Research group as we watched the outcomes of months and years of our foundational and utilized work get introduced on the Google I/O stage. With the fast tempo of bulletins on stage, it may be tough to convey the substantial effort and distinctive improvements that underlie the applied sciences we offered. So as we speak, we’re excited to disclose extra in regards to the analysis efforts behind a few of the many compelling bulletins at this 12 months’s I/O.

    PaLM 2

    Our next-generation giant language mannequin (LLM), PaLM 2, is constructed on advances in compute-optimal scaling, scaled instruction-fine tuning and improved dataset combination. By fine-tuning and instruction-tuning the mannequin for various functions, we now have been in a position to combine state-of-the-art capabilities into over 25 Google merchandise and options, the place it’s already serving to to tell, help and delight customers. For instance:

    • Bard is an early experiment that allows you to collaborate with generative AI and helps to spice up productiveness, speed up concepts and gas curiosity. It builds on advances in deep studying effectivity and leverages reinforcement studying from human suggestions to supply extra related responses and improve the mannequin’s capacity to observe directions. Bard is now obtainable in 180 international locations, the place customers can work together with it in English, Japanese and Korean, and due to the multilingual capabilities afforded by PaLM 2, help for 40 languages is coming quickly.
    • With Search Generative Experience we’re taking extra of the work out of looking, so that you’ll be capable of perceive a subject sooner, uncover new viewpoints and insights, and get issues carried out extra simply. As a part of this experiment, you’ll see an AI-powered snapshot of key info to contemplate, with hyperlinks to dig deeper.
    • MakerSuite is an easy-to-use prototyping atmosphere for the PaLM API, powered by PaLM 2. In reality, inside consumer engagement with early prototypes of MakerSuite accelerated the event of our PaLM 2 mannequin itself. MakerSuite grew out of analysis targeted on prompting instruments, or instruments explicitly designed for customizing and controlling LLMs. This line of analysis consists of PromptMaker (precursor to MakerSuite), and AI Chains and PromptChainer (one of many first analysis efforts demonstrating the utility of LLM chaining).
    • Project Tailwind additionally made use of early analysis prototypes of MakerSuite to develop options to assist writers and researchers discover concepts and enhance their prose; its AI-first pocket book prototype used PaLM 2 to permit customers to ask questions of the mannequin grounded in paperwork they outline.
    • Med-PaLM 2 is our state-of-the-art medical LLM, constructed on PaLM 2. Med-PaLM 2 achieved 86.5% efficiency on U.S. Medical Licensing Exam–type questions, illustrating its thrilling potential for well being. We’re now exploring multimodal capabilities to synthesize inputs like X-rays.
    • Codey is a model of PaLM 2 fine-tuned on supply code to perform as a developer assistant. It helps a broad vary of Code AI options, together with code completions, code rationalization, bug fixing, supply code migration, error explanations, and extra. Codey is accessible by our trusted tester program by way of IDEs (Colab, Android Studio, Duet AI for Cloud, Firebase) and by way of a 3P-facing API.

    Perhaps much more thrilling for builders, we now have opened up the PaLM APIs & MakerSuite to supply the group alternatives to innovate utilizing this groundbreaking know-how.

    PaLM 2 has superior coding capabilities that allow it to seek out code errors and make strategies in plenty of completely different languages.

    Imagen

    Our Imagen household of picture era and enhancing fashions builds on advances in giant Transformer-based language fashions and diffusion fashions. This household of fashions is being integrated into a number of Google merchandise, together with:

    • Image era in Google Slides and Android’s Generative AI wallpaper are powered by our text-to-image era options.
    • Google Cloud’s Vertex AI permits picture era, picture enhancing, picture upscaling and fine-tuning to assist enterprise prospects meet their enterprise wants.
    • I/O Flip, a digital tackle a traditional card sport, options Google developer mascots on playing cards that have been fully AI generated. This sport showcased a fine-tuning approach known as DreamBooth for adapting pre-trained picture era fashions. Using only a handful of pictures as inputs for fine-tuning, it permits customers to generate customized pictures in minutes. With DreamBooth, customers can synthesize a topic in numerous scenes, poses, views, and lighting circumstances that don’t seem within the reference pictures.
      I/O Flip presents customized card decks designed utilizing DreamBooth.

    Phenaki

    Phenaki, Google’s Transformer-based text-to-video era mannequin was featured within the I/O pre-show. Phenaki is a mannequin that may synthesize practical movies from textual immediate sequences by leveraging two essential elements: an encoder-decoder mannequin that compresses movies to discrete embeddings and a transformer mannequin that interprets textual content embeddings to video tokens.

    ARCore and the Scene Semantic API

    Among the brand new options of ARCore introduced by the AR group at I/O, the Scene Semantic API can acknowledge pixel-wise semantics in an outside scene. This helps customers create customized AR experiences primarily based on the options within the surrounding space. This API is empowered by the out of doors semantic segmentation mannequin, leveraging our current works across the DeepLab structure and an selfish out of doors scene understanding dataset. The newest ARCore launch additionally consists of an improved monocular depth mannequin that gives greater accuracy in out of doors scenes.

    Scene Semantics API makes use of DeepLab-based semantic segmentation mannequin to supply correct pixel-wise labels in a scene outdoor.

    Chirp

    Chirp is Google’s household of state-of-the-art Universal Speech Models educated on 12 million hours of speech to allow automated speech recognition (ASR) for 100+ languages. The fashions can carry out ASR on under-resourced languages, akin to Amharic, Cebuano, and Assamese, along with broadly spoken languages like English and Mandarin. Chirp is ready to cowl such all kinds of languages by leveraging self-supervised studying on unlabeled multilingual dataset with fine-tuning on a smaller set of labeled knowledge. Chirp is now obtainable within the Google Cloud Speech-to-Text API, permitting customers to carry out inference on the mannequin by a easy interface. You can get began with Chirp right here.

    MusicLM

    At I/O, we launched MusicLM, a text-to-music mannequin that generates 20 seconds of music from a textual content immediate. You can strive it your self on AI Test Kitchen, or see it featured through the I/O preshow, the place digital musician and composer Dan Deacon used MusicLM in his efficiency.

    MusicLM, which consists of fashions powered by AudioLM and MuLAN, could make music (from textual content, buzzing, pictures or video) and musical accompaniments to singing. AudioLM generates prime quality audio with long-term consistency. It maps audio to a sequence of discrete tokens and casts audio era as a language modeling process. To synthesize longer outputs effectively, it used a novel method we’ve developed known as SoundStorm.

    Universal Translator dubbing

    Our dubbing efforts leverage dozens of ML applied sciences to translate the complete expressive vary of video content material, making movies accessible to audiences internationally. These applied sciences have been used to dub movies throughout a wide range of merchandise and content material varieties, together with instructional content material, promoting campaigns, and creator content material, with extra to come back. We use deep studying know-how to attain voice preservation and lip matching and allow high-quality video translation. We’ve constructed this product to incorporate human assessment for high quality, security checks to assist stop misuse, and we make it accessible solely to licensed companions.

    AI for world societal good

    We are making use of our AI applied sciences to unravel a few of the largest world challenges, like mitigating local weather change, adapting to a warming planet and bettering human well being and wellbeing. For instance:

    • Traffic engineers use our Green Light suggestions to cut back stop-and-go visitors at intersections and enhance the stream of visitors in cities from Bangalore to Rio de Janeiro and Hamburg. Green Light fashions every intersection, analyzing visitors patterns to develop suggestions that make visitors lights extra environment friendly — for instance, by higher synchronizing timing between adjoining lights, or adjusting the “green time” for a given road and route.
    • We’ve additionally expanded world protection on the Flood Hub to 80 international locations, as a part of our efforts to foretell riverine floods and alert people who find themselves about to be impacted earlier than catastrophe strikes. Our flood forecasting efforts depend on hydrological fashions knowledgeable by satellite tv for pc observations, climate forecasts and in-situ measurements.

    Technologies for inclusive and honest ML functions

    With our continued funding in AI applied sciences, we’re emphasizing accountable AI improvement with the objective of constructing our fashions and instruments helpful and impactful whereas additionally guaranteeing equity, security and alignment with our AI Principles. Some of those efforts have been highlighted at I/O, together with:

    • The launch of the Monk Skin Tone Examples (MST-E) Dataset to assist practitioners acquire a deeper understanding of the MST scale and practice human annotators for extra constant, inclusive, and significant pores and skin tone annotations. You can learn extra about this and different developments on our web site. This is an development on the open supply launch of the Monk Skin Tone (MST) Scale we launched final 12 months to allow builders to construct merchandise which can be extra inclusive and that higher signify their numerous customers.
    • A brand new Kaggle competitors (open till August tenth) by which the ML group is tasked with making a mannequin that may shortly and precisely establish American Sign Language (ASL) fingerspelling — the place every letter of a phrase is spelled out in ASL quickly utilizing a single hand, reasonably than utilizing the precise indicators for complete phrases — and translate it into written textual content. Learn extra in regards to the fingerspelling Kaggle competitors, which encompasses a tune from Sean Forbes, a deaf musician and rapper. We additionally showcased at I/O the successful algorithm from the prior 12 months’s competitors powers PopSign, an ASL studying app for folks of deaf or onerous of listening to kids created by Georgia Tech and Rochester Institute of Technology (RIT).

    Building the way forward for AI collectively

    It’s inspiring to be a part of a group of so many gifted people who’re main the way in which in creating state-of-the-art applied sciences, accountable AI approaches and thrilling consumer experiences. We are within the midst of a interval of unbelievable and transformative change for AI. Stay tuned for extra updates in regards to the methods by which the Google Research group is boldly exploring the frontiers of those applied sciences and utilizing them responsibly to profit folks’s lives all over the world. We hope you are as excited as we’re about the way forward for AI applied sciences and we invite you to have interaction with our groups by the references, websites and instruments that we’ve highlighted right here.

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    Rationale engineering generates a compact new tool for gene therapy | Ztoog

    AI

    The AI Hype Index: College students are hooked on ChatGPT

    AI

    Learning how to predict rare kinds of failures | Ztoog

    AI

    Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

    AI

    AI learns how vision and sound are connected, without human intervention | Ztoog

    AI

    How AI is introducing errors into courtrooms

    AI

    With AI, researchers predict the location of virtually any protein within a human cell | Ztoog

    AI

    Google DeepMind’s new AI agent cracks real-world problems better than humans can

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    The Future

    Can Your Typing Style Be a Privacy Risk?

      Jason Montoya / How-To Geek Your distinctive typing fashion, consisting of rhythm, pace, and…

    Gadgets

    LG tiiun: Empowering Cooking Enthusiasts and Indoor Gardeners with Fresh Herbs

    According to LG, amidst the worldwide pandemic, individuals turned to gardening as a therapeutic pastime…

    Crypto

    What is Solana?

    What is Solana? Since the explosion of cryptocurrencies, each traders and cryptocurrency lovers have carved…

    Crypto

    FTX crypto fraud victims to get their money back — plus interest

    Bankruptcy attorneys representing prospects impacted by the dramatic crash of cryptocurrency alternate FTX 17 months in…

    The Future

    CES has almost fully succeeded in chasing sex-tech off its show floors

    CES has lengthy been a launchpad for innovation and cutting-edge know-how. However, at this yr’s…

    Our Picks
    Gadgets

    Harnessing The Sun: Car Companies Embrace Solar Power for Money-Saving EVs

    Science

    ULA chief says Vulcan rocket will slip to 2024 after ground system issues

    Crypto

    Historical Data Shows What To Expect

    Categories
    • AI (1,493)
    • Crypto (1,753)
    • Gadgets (1,805)
    • Mobile (1,851)
    • Science (1,866)
    • Technology (1,802)
    • The Future (1,648)
    Most Popular
    Crypto

    Bitcoin Price Ready To Go ‘Supersonic’, Analyst Says

    Gadgets

    LG Gram SuperSlim: The Thinnest Laptop With An OLED Screen

    Technology

    Harvey Weinstein’s overturned conviction, explained

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.