Ztoog
    Google VideoPoet: What is it and how does it work?


    Calvin Wankhede / Android Authority

    When Google announced its PaLM 2 and Gemini language models in mid-2023, the search giant emphasized that its AI was multimodal. This meant it could generate text, images, audio, and even video. Traditionally, language models like ChatGPT’s GPT-4 have only excelled at reproducing text. Google’s latest VideoPoet model challenges that notion, however, as it can convert text-based prompts into AI-generated videos.

    With VideoPoet, Google has become one of the first tech giants to announce a large language model capable of generating videos. And unlike prior attempts, Google says it can also generate scenes with lots of motion rather than just subtle movements. So what’s the magic behind VideoPoet and what can it do? Here’s everything you need to know.

    What is Google VideoPoet?

    Image: Google VideoPoet block diagram

    Google VideoPoet is an experimental large language model that can generate videos from a text-based prompt. You can describe a fictional scene, even one as ridiculous as “A robot cat eating spaghetti,” and have a video ready to watch within seconds. If you’ve ever used an AI image generator like Midjourney or DALL-E 3, you already know what to expect from VideoPoet.

    Like AI image generators, VideoPoet can also edit existing video content. For example, you can crop out a portion of the video frame and ask the AI to fill in the gap with something from your imagination instead.

    Google has invested in startups like Runway that work on AI video generation, but VideoPoet comes courtesy of the company’s internal efforts. The VideoPoet technical paper lists as many as 31 researchers from Google Research.

    How does Google VideoPoet work?

    In the aforementioned paper, Google’s researchers explained that VideoPoet differs from conventional text-to-image and text-to-video generators. Unlike Midjourney, for example, VideoPoet does not use a diffusion model to generate images from random noise. That approach works well for individual images but falls flat for videos, where the model needs to account for motion and consistency over time.

    At its core, Google’s VideoPoet is a large language model. This means that it’s based on the same technology powering ChatGPT and Google Bard, which predicts how words fit together to form sentences. VideoPoet takes that concept a step further, as it’s also capable of predicting video and audio chunks, not just text.
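
    To make “predicting how words fit together” concrete, here is a toy sketch of next-token prediction. It is not VideoPoet’s architecture: it swaps the neural network for simple bigram counts, and the corpus and function names are invented for illustration. The loop is the same idea at miniature scale, though: look at what has been produced so far, pick the most likely next token, append it, and repeat.

```python
from collections import Counter, defaultdict

def train_bigram(tokens):
    """Count, for every token, which tokens follow it in the training sequence."""
    follow = defaultdict(Counter)
    for current, nxt in zip(tokens, tokens[1:]):
        follow[current][nxt] += 1
    return follow

def generate(follow, start, max_new_tokens):
    """Greedy autoregressive decoding: repeatedly append the most likely next token."""
    out = [start]
    for _ in range(max_new_tokens):
        candidates = follow.get(out[-1])
        if not candidates:
            break  # nothing ever followed this token in the training data
        out.append(candidates.most_common(1)[0][0])
    return out

# Tiny made-up corpus; a real model trains on vastly more data.
corpus = "a robot cat eating spaghetti and a robot cat eating spaghetti slowly".split()
model = train_bigram(corpus)
print(generate(model, "a", 4))
```

    A real large language model replaces the lookup table with a neural network that scores every possible next token, but the generate-one-token-then-repeat loop is the part VideoPoet is described as sharing with text models.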

    VideoPoet is a large language model that generates videos instead of text.

    VideoPoet required a specialized pre-training process that involved translating images, video frames, and audio clips into a common language, called tokens. Put simply, the model learned how to interpret different modalities from the training data. Google says that it used one billion image-text pairs and 270 million public video samples to train VideoPoet. Ultimately, VideoPoet has become capable of predicting video tokens just like a traditional LLM would predict text tokens.
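
    One common way to give several modalities a “common language” is to assign each one its own slice of a single token ID space, so a model sees one flat sequence of integers. The sketch below illustrates that idea only; the vocabulary sizes and function names are hypothetical and are not taken from Google’s paper.

```python
# Hypothetical vocabulary sizes, chosen only for illustration.
VOCAB_SIZES = {"text": 32_000, "video": 262_144, "audio": 4_096}

def build_offsets(vocab_sizes):
    """Give each modality its own contiguous slice of one shared ID space."""
    offsets, start = {}, 0
    for modality, size in vocab_sizes.items():
        offsets[modality] = start
        start += size
    return offsets, start  # start now equals the total vocabulary size

OFFSETS, TOTAL_VOCAB = build_offsets(VOCAB_SIZES)

def to_shared(modality, local_id):
    """Map a modality-specific token ID into the shared ID space."""
    if not 0 <= local_id < VOCAB_SIZES[modality]:
        raise ValueError(f"{local_id} is out of range for {modality}")
    return OFFSETS[modality] + local_id

def from_shared(shared_id):
    """Recover (modality, local_id) from a shared token ID."""
    for modality, offset in OFFSETS.items():
        if offset <= shared_id < offset + VOCAB_SIZES[modality]:
            return modality, shared_id - offset
    raise ValueError(f"{shared_id} is outside the shared vocabulary")

# A mixed sequence: one text token, one video token, one audio token.
sequence = [to_shared("text", 17), to_shared("video", 5), to_shared("audio", 42)]
print(sequence)
print([from_shared(token) for token in sequence])
```

    With every modality expressed as integers in one range, predicting a video token becomes the same kind of task as predicting a text token.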

    Thanks to its training, VideoPoet has a strong foundation that allows it to perform tasks beyond text-to-video generation as well. For example, it can apply styles to existing videos, perform edits like adding background effects, change the look of an existing video with filters, and change the motion of a moving object in an existing video. Google demonstrated the latter with a raccoon dancing in different styles.

    VideoPoet vs. rival AI video generators: What’s the difference?

    Image: Meta logo on a smartphone (Edgar Cervantes / Android Authority)

    Google’s VideoPoet differs from most of its rivals, which rely on diffusion models to turn text into videos. However, it’s not exactly the first: a smaller group of Google Brain researchers presented Phenaki last year. Likewise, Meta’s Make-A-Video project made waves in the AI community for producing diverse videos without training on video-text pairs beforehand. However, neither model has been publicly released.

    So given that we don’t have access to any video-generating models, we can only rely on the information Google has provided about VideoPoet. With that in mind, the paper’s authors assert that “In many cases, even the current leading models either generate small motion or, when producing larger motions, exhibit noticeable artifacts.” VideoPoet, on the other hand, can handle more motion.

    VideoPoet can generate longer videos and handle motion more gracefully than the competition.

    Google also says that VideoPoet can generate longer videos than the competition. While it’s limited to an initial burst of two-second videos, it can maintain context across eight to ten seconds of video. That may not sound like much, but it’s impressive given how much a scene can change in that time period. Having said that, Google’s example videos only include a few dozen frames, far from the 24 or 30 frames per second benchmark used for professional video or filmmaking.
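
    For a sense of scale, the short calculation below works out how many frames a clip needs at the 24 and 30 frames per second benchmarks, and how many follow-up generations it would take to stretch a two-second clip to ten seconds. The one-second extension per step is an assumption made for illustration; the figures above only establish the two-second starting point and the eight-to-ten-second range.

```python
import math

def frames_needed(duration_s, fps):
    """Number of frames required to fill a clip of the given length."""
    return round(duration_s * fps)

def extension_steps(target_s, initial_s=2.0, added_per_step_s=1.0):
    """Follow-up generations needed after the initial clip.

    added_per_step_s is an assumed value, not a published figure.
    """
    remaining = max(0.0, target_s - initial_s)
    return math.ceil(remaining / added_per_step_s)

for fps in (24, 30):
    print(f"{fps} fps: {frames_needed(2, fps)} frames for 2 s, "
          f"{frames_needed(10, fps)} frames for 10 s")

print("Extension steps to reach 10 s:", extension_steps(10))
```

    At 24 frames per second, a ten-second clip needs 240 frames, which puts the “few dozen frames” in Google’s examples in perspective.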

    Google VideoPoet availability: Is it free?

    While Google has published dozens of example videos to demonstrate the strengths of VideoPoet, it stopped short of announcing a public rollout. In other words, we don’t know when we’ll be able to use VideoPoet, if at all.

    Google hasn’t announced a product or release date for VideoPoet yet.

    As for pricing, we may have to take a hint from AI image generators like Midjourney, which are only available via a subscription. Indeed, AI-generated images and videos are computationally expensive, so opening up access to everyone may not be feasible, even for Google. It may take a disruptive release like OpenAI’s ChatGPT to force the search giant’s hand. Until then, we’ll simply have to wait and watch from the sidelines.
