Ztoog

    Google VideoPoet: What is it and how does it work?


    Calvin Wankhede / Android Authority

    When Google introduced its PaLM 2 and Gemini language models in mid-2023, the search giant emphasized that its AI was multimodal. This meant it could generate text, images, audio, and even video. Traditionally, language models like ChatGPT’s GPT-4 have only excelled at reproducing text. Google’s latest VideoPoet model challenges that notion, however, as it can convert text-based prompts into AI-generated videos.

    With VideoPoet, Google has become the first tech giant to announce an AI capable of generating videos. And unlike prior attempts, Google says it can also generate scenes with lots of motion rather than just subtle movements. So what’s the magic behind VideoPoet and what can it do? Here’s everything you need to know.

    What is Google VideoPoet?

    [Image: Google VideoPoet block diagram]

    Google VideoPoet is an experimental large language model that can generate videos from a text-based prompt. You can describe a fictional scene, even one as ridiculous as “A robot cat eating spaghetti,” and have a video ready to watch within seconds. If you’ve ever used an AI image generator like Midjourney or DALL-E 3, you already know what to expect from VideoPoet.

    Like AI image generators, VideoPoet can also perform edits on existing video content. For example, you can crop out a portion of the video frame and ask the AI to fill in the gap with something from your imagination instead.

    Google has invested in startups like Runway that are working on AI video generation, but VideoPoet comes courtesy of the company’s internal efforts. The VideoPoet technical paper lists as many as 31 researchers from Google Research.

    How does Google VideoPoet work?

    [Image: How Google VideoPoet works]

    In the aforementioned paper, Google’s researchers explained that VideoPoet differs from conventional text-to-image and text-to-video generators. Unlike Midjourney, for example, VideoPoet does not use a diffusion model to generate images from random noise. That approach works well for individual images but falls flat for videos, where the model needs to account for motion and consistency over time.

    At its core, Google’s VideoPoet is a large language model. This means that it’s based on the same technology powering ChatGPT and Google Bard, which predicts how words fit together to form sentences. VideoPoet takes that concept a step further, as it’s also capable of predicting video and audio chunks, and not just text.

    VideoPoet is a large language model that generates videos instead of text.
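
    To make the idea of "predicting how words fit together" concrete, here is a deliberately tiny sketch. It is not how VideoPoet or any production language model is built (those use large neural networks); it only counts which word most often follows another in a made-up corpus. The function names and the corpus are illustrative assumptions.

```python
from collections import Counter, defaultdict


def train_bigram(corpus):
    """Count, for each word, which words follow it in the corpus."""
    follows = defaultdict(Counter)
    for sentence in corpus:
        words = sentence.lower().split()
        # Pair every word with its immediate successor.
        for current, nxt in zip(words, words[1:]):
            follows[current][nxt] += 1
    return follows


def predict_next(follows, word):
    """Return the most frequent follower of `word`, or None if unseen."""
    candidates = follows.get(word.lower())
    if not candidates:
        return None
    return candidates.most_common(1)[0][0]


# Made-up training sentences, purely for illustration.
corpus = [
    "the robot cat eats spaghetti",
    "the robot cat eats noodles",
    "the robot cat eats spaghetti slowly",
]
model = train_bigram(corpus)

print(predict_next(model, "eats"))   # most common word after "eats"
print(predict_next(model, "robot"))  # most common word after "robot"
```

    A real model replaces the counting table with a learned network, but the loop is the same in spirit: look at what came before, then pick a likely next piece.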

    VideoPoet required a specialized pre-training process that involved translating images, video frames, and audio clips into a common language, called tokens. Put simply, the model learned how to interpret different modalities from the training data. Google says that it used one billion image-text pairs and 270 million public video samples to train VideoPoet. Ultimately, VideoPoet has become capable of predicting video tokens just like a traditional LLM would predict text tokens.
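
    One way to picture a "common language" of tokens is a single ID space in which each modality gets its own range. The sketch below is a minimal illustration under that assumption; the vocabulary sizes, the offset scheme, and the function names are hypothetical and are not taken from Google's paper.

```python
# Hypothetical vocabulary sizes; the article does not state the real ones.
TEXT_VOCAB = 1000
VIDEO_VOCAB = 500
AUDIO_VOCAB = 200

SIZES = {"text": TEXT_VOCAB, "video": VIDEO_VOCAB, "audio": AUDIO_VOCAB}
# Each modality occupies its own slice of one shared ID range.
OFFSETS = {
    "text": 0,
    "video": TEXT_VOCAB,
    "audio": TEXT_VOCAB + VIDEO_VOCAB,
}


def to_shared_id(modality, local_id):
    """Map a modality-local token ID into the shared ID space."""
    if not 0 <= local_id < SIZES[modality]:
        raise ValueError(f"{local_id} is out of range for {modality}")
    return OFFSETS[modality] + local_id


def from_shared_id(shared_id):
    """Recover (modality, local_id) from a shared token ID."""
    for modality, offset in OFFSETS.items():
        if offset <= shared_id < offset + SIZES[modality]:
            return modality, shared_id - offset
    raise ValueError(f"{shared_id} is outside the shared vocabulary")


def build_sequence(text_ids, video_ids, audio_ids):
    """Concatenate tokens from all modalities into one flat sequence."""
    return (
        [to_shared_id("text", t) for t in text_ids]
        + [to_shared_id("video", v) for v in video_ids]
        + [to_shared_id("audio", a) for a in audio_ids]
    )


sequence = build_sequence(text_ids=[5, 7], video_ids=[3], audio_ids=[0])
print(sequence)
print([from_shared_id(i) for i in sequence])
```

    Once everything is a flat list of integers like this, a single model can in principle be trained to predict the next ID regardless of whether it stands for a word, a patch of video, or a slice of audio.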

    Thanks to its training, VideoPoet has a strong foundation that allows it to perform tasks beyond text-to-video generation as well. For example, it can apply styles to existing videos, perform edits like adding background effects, change the look of an existing video with filters, and change the motion of a moving object in an existing video. Google demonstrated the latter with a raccoon dancing in different styles.

    VideoPoet vs. rival AI video generators: What’s the difference?

    [Image: Meta logo on a smartphone]

    Edgar Cervantes / Android Authority

    Google’s VideoPoet differs from most of its rivals, which rely on diffusion models to turn text into videos. However, it’s not exactly the first: a smaller group of Google Brain researchers presented Phenaki last year. Likewise, Meta’s Make-A-Video project made waves in the AI community for generating diverse videos without training on video-text pairs beforehand. However, neither model has been publicly released.

    So given that we don’t have access to any video-generating models, we can only rely on the information Google has provided about VideoPoet. With that in mind, the paper’s authors assert that “In many cases, even the current leading models either generate small motion or, when producing larger motions, exhibit noticeable artifacts.” VideoPoet, on the other hand, can handle more motion.

    VideoPoet can generate longer videos and handle motion more gracefully than the competition.

    Google also says that VideoPoet can generate longer videos than the competition. While it’s limited to an initial burst of two-second videos, it can maintain context across eight to ten seconds of video. That may not sound like much, but it’s impressive given how much a scene can change in that time period. Having said that, Google’s example videos only include a few dozen frames, far from the 24 or 30 frames per second benchmark used for professional video or filmmaking.
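
    The numbers above are easier to judge with some quick arithmetic, and the jump from two-second bursts to longer clips can be sketched as a loop that keeps feeding the tail of the video back in as context. The code below is only a sketch under stated assumptions: the 8 frames per second rate, the one-second context window, and the stub generator (which just counts upward) are hypothetical stand-ins, not details confirmed by the article.

```python
def frames_needed(seconds, fps):
    """Number of frames required for a clip of the given length."""
    return int(round(seconds * fps))


def stub_generate(context_frames, n_new):
    """Stand-in for a video model: continues a counter from the last frame."""
    last = context_frames[-1]
    return [last + i + 1 for i in range(n_new)]


def extend_clip(clip, target_len, context_len, step_len, generate=stub_generate):
    """Grow `clip` to `target_len` frames, conditioning on its most recent frames."""
    if context_len <= 0 or step_len <= 0:
        raise ValueError("context_len and step_len must be positive")
    frames = list(clip)
    while len(frames) < target_len:
        context = frames[-context_len:]                # only the tail is seen
        n_new = min(step_len, target_len - len(frames))
        frames.extend(generate(context, n_new))
    return frames


# Professional benchmarks mentioned in the article.
print(frames_needed(2, 24), frames_needed(10, 30))

# Hypothetical low frame rate to mimic "a few dozen frames".
FPS = 8
initial = list(range(frames_needed(2, FPS)))           # a two-second burst
extended = extend_clip(
    initial,
    target_len=frames_needed(10, FPS),                 # stretch to ten seconds
    context_len=FPS,                                   # look back one second
    step_len=FPS,                                      # add one second per step
)
print(len(initial), len(extended))
```

    Because each step sees only the most recent frames, any real system built this way has to work to keep a scene consistent over the full clip, which is why holding context for eight to ten seconds is notable.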

    Google VideoPoet availability: Is it free?

    [Image: Google VideoPoet sample videos]

    While Google has published dozens of example videos to demonstrate the strengths of VideoPoet, it stopped short of announcing a public rollout. In other words, we don’t know when we’ll be able to use VideoPoet, if at all.

    Google hasn’t announced a product or release date for VideoPoet yet.

    As for pricing, we may have to take a hint from AI image generators like Midjourney, which are only accessible via a subscription. Indeed, AI-generated images and videos are computationally expensive, so opening up access to everyone may not be feasible, even for Google. It may take a disruptive release like OpenAI’s ChatGPT to force the search giant’s hand. Until then, we’ll simply have to wait and watch from the sidelines.
