Close Menu
Ztoog
    What's Hot
    AI

    Beyond automatic differentiation – Ztoog

    Gadgets

    Get 6-8 Microsoft apps on your PC or Mac forever with this $69.99 lifetime license

    Crypto

    Bitcoin To Reach $1 Million In Days To Weeks, Crypto Analyst

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      What is Project Management? 5 Best Tools that You Can Try

      Operational excellence strategy and continuous improvement

      Hannah Fry: AI isn’t as powerful as we think

      FanDuel goes all in on responsible gaming push with new Play with a Plan campaign

      Gettyimages.com Is the Best Website on the Internet Right Now

    • Technology

      Iran war: How could it end?

      Democratic senators question CFTC staffing cuts in Chicago enforcement office

      Google’s Cloud AI lead on the three frontiers of model capability

      AMD agrees to backstop a $300M loan from Goldman Sachs for Crusoe to buy AMD AI chips, the first known case of AMD chips used as debt collateral (The Information)

      Productivity apps failed me when I needed them most

    • Gadgets

      macOS Tahoe 26.3.1 update will “upgrade” your M5’s CPU to new “super” cores

      Lenovo Shows Off a ThinkBook Modular AI PC Concept With Swappable Ports and Detachable Displays at MWC 2026

      POCO M8 Review: The Ultimate Budget Smartphone With Some Cons

      The Mission: Impossible of SSDs has arrived with a fingerprint lock

      6 Best Phones With Headphone Jacks (2026), Tested and Reviewed

    • Mobile

      Android’s March update is all about finding people, apps, and your missing bags

      Watch Xiaomi’s global launch event live here

      Our poll shows what buyers actually care about in new smartphones (Hint: it’s not AI)

      Is Strava down for you? You’re not alone

      The Motorola Razr FIFA World Cup 2026 Edition was literally just unveiled, and Verizon is already giving them away

    • Science

      Big Tech Signs White House Data Center Pledge With Good Optics and Little Substance

      Inside the best dark matter detector ever built

      NASA’s Artemis moon exploration programme is getting a major makeover

      Scientists crack the case of “screeching” Scotch tape

      Blue-faced, puffy-lipped monkey scores a rare conservation win

    • AI

      Online harassment is entering its AI era

      Meet NullClaw: The 678 KB Zig AI Agent Framework Running on 1 MB RAM and Booting in Two Milliseconds

      New method could increase LLM training efficiency | Ztoog

      The human work behind humanoid robots is being hidden

      NVIDIA Releases DreamDojo: An Open-Source Robot World Model Trained on 44,711 Hours of Real-World Human Video Data

    • Crypto

      Google paid startup Form Energy $1B for its massive 100-hour battery

      Ethereum Breakout Alert: Corrective Channel Flip Sparks Impulsive Wave

      Show Your ID Or No Deal

      Jane Street sued for alleged front-running trades that accelerated Terraform Labs meltdown

      Bitcoin Trades Below ETF Cost-Basis As MVRV Signals Mounting Pressure

    Ztoog
    Home » What is it and how does it work?
    Mobile

    What is it and how does it work?

    Facebook Twitter Pinterest WhatsApp
    What is it and how does it work?
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Calvin Wankhede / Android Authority

    When Google introduced PaLM 2 and Gemini language fashions in mid-2023, the search large emphasised that its AI was multimodal. This meant it might generate textual content, photos, audio, and even video. Traditionally, language fashions like ChatGPT’s GPT-4 have solely excelled at reproducing textual content. Google’s newest VideoPoet mannequin challenges that notion, nonetheless, as it can convert text-based prompts into AI-generated movies.

    With VideoPoet, Google has develop into the primary tech large to announce an AI able to producing movies. And not like prior makes an attempt, Google says it may also generate scenes with numerous movement relatively than simply refined actions. So what’s the magic behind VideoPoet and what can it do? Here’s all the pieces you should know.

    What is Google VideoPoet?

    google videopoet block diagram

    Google VideoPoet is an experimental massive language mannequin that may generate movies from a text-based immediate. You can describe a fictional scene, even one as ridiculous as “A robot cat eating spaghetti,” and have a video prepared to observe inside seconds. If you’ve ever used an AI picture generator like Midjourney or DALL-E 3, you already know what to anticipate from VideoPoet.

    Like AI picture turbines, VideoPoet may also carry out edits in current video content material. For instance, you possibly can crop out a portion of the video body and ask the AI to fill within the hole with one thing out of your creativeness as a substitute.

    Google has invested in startups like Runway engaged on AI video era, however VideoPoet comes courtesy of the corporate’s inside efforts. The VideoPoet technical paper enlists as many as 31 researchers from Google Research.

    How does Google VideoPoet work?

    google how does videopoet work

    In the aforementioned paper, Google’s researchers defined that VideoPoet differs from standard text-to-image and text-to-video turbines. Unlike Midjourney, for instance, VideoPoet does not use a diffusion mannequin to generate photos from random noise. That strategy works properly for particular person photos however falls flat for movies the place the mannequin must account for movement and consistency over time.

    At its core, Google’s VideoPoet is a big language mannequin. This implies that it’s primarily based on the identical know-how powering ChatGPT and Google Bard that may predict how phrases match collectively to type sentences. VideoPoet takes that idea a step additional as it’s additionally able to predicting video and audio chunks, and not simply textual content.

    VideoPoet is a big language mannequin that generates movies as a substitute of textual content.

    VideoPoet required a specialised pre-training course of which concerned translating photos, video frames, and audio clips into a standard language, known as tokens. Put merely, the mannequin realized how to interpret completely different modalities from the coaching information. Google says that it used one billion image-text pairs and 270 million public video samples to coach VideoPoet. Ultimately, VideoPoet has develop into able to predicting video tokens identical to a conventional LLM mannequin would predict textual content tokens.

    VideoPoet has a strong basis because of its coaching that permits it to carry out duties past text-to-video era as properly. For instance, it can apply types to current movies, carry out edits like including background results, change the look of an current video with filters, and change the movement of a shifting object in an current video. Google demonstrated the latter with a raccoon dancing in numerous types.

    VideoPoet vs. rival AI video turbines: What’s the distinction?

    Meta logo on smartphone stock photo (5)

    Edgar Cervantes / Android Authority

    Google’s VideoPoet differs from most of its rivals that depend on diffusion fashions to show textual content into movies. However, it’s not precisely the primary – a smaller variety of Google Brain researchers offered Phenaki final 12 months. Likewise, Meta’s Make-A-Video challenge made waves within the AI neighborhood for producing numerous movies with out coaching on video-text pairs beforehand. However, neither fashions have been publicly launched.

    So provided that we don’t have entry to any video-generating fashions, we are able to solely depend on the data Google has supplied about VideoPoet. With that in thoughts, the paper’s authors assert that “In many cases, even the current leading models either generate small motion or, when producing larger motions, exhibit noticeable artifacts.” VideoPoet, alternatively, can deal with extra movement.

    VideoPoet can generate longer movies and deal with movement extra gracefully than the competitors.

    Google additionally says that VideoPoet can generate longer movies than the competitors. While it’s restricted to an preliminary burst of two-second movies, it can keep context throughout eight to 10 seconds of video. That might not sound like a lot however it’s spectacular given how a lot a scene might change in that point interval. Having mentioned that, Google’s instance movies solely embrace a number of dozen frames, removed from the 24 or 30 frames per second benchmark used for skilled video or filmmaking.

    Google VideoPoet availability: Is it free?

    google videopoet samples

    While Google has revealed dozens of instance movies to reveal the strengths of VideoPoet, it stopped wanting saying a public rollout. In different phrases, we don’t know after we’ll have the ability to use VideoPoet, if in any respect.

    Google hasn’t introduced a product or launch date for VideoPoet but.

    As for pricing, we might must take the trace from AI picture turbines like Midjourney which might be solely accessible through a subscription. Indeed, AI-generated photos and movies are computationally costly so opening up entry to everybody will not be possible, even for Google. We’ll have to attend for a disruptive launch like OpenAI’s ChatGPT to pressure the search large’s hand. Until then, we’ll merely have to attend and watch from the sidelines.

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    Mobile

    Android’s March update is all about finding people, apps, and your missing bags

    Mobile

    Watch Xiaomi’s global launch event live here

    Mobile

    Our poll shows what buyers actually care about in new smartphones (Hint: it’s not AI)

    Mobile

    Is Strava down for you? You’re not alone

    Mobile

    The Motorola Razr FIFA World Cup 2026 Edition was literally just unveiled, and Verizon is already giving them away

    Mobile

    Xiaomi Tag’s price surfaces – GSMArena.com news

    Mobile

    Galaxy S25 FE becomes even more affordable flagship killer with Amazon’s latest deal

    Mobile

    Gemini Labs arrives, giving a clear home for experimental features

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    The Future

    Gupshup raises $60M in equity and debt, leaves unicorn status hanging

    Gupshup, a enterprise messaging startup that started its journey in India over twenty years in…

    Science

    Is an enormous shield the worst way to protect Earth from asteroids?

    Protecting Earth from any large asteroid that may come our way is sophisticated. If you…

    AI

    Machine-learning system based on light could yield more powerful, efficient large language models | Ztoog

    ChatGPT has made headlines all over the world with its skill to write down essays,…

    Crypto

    The Macroenvironment’s Role In Bitcoin Rally

    The latest surge in Bitcoin costs, defying earlier expectations, has intrigued each cryptocurrency fans and…

    Technology

    The prospect of applying generative AI to contract lifecycle management software has prompted a flurry of deals, as experts expect further market consolidation (Nick Huber/Financial Times)

    Nick Huber / Financial Times: The prospect of applying generative AI to contract lifecycle management…

    Our Picks
    Technology

    Video Friday: Training ARTEMIS – IEEE Spectrum

    Science

    Spider Legs Hold the Key to the Ultimate Anti-Adhesive

    Mobile

    This Galaxy Watch clearance deal could be your LAST CHANCE at this classic model — score $287 off!

    Categories
    • AI (1,560)
    • Crypto (1,826)
    • Gadgets (1,870)
    • Mobile (1,910)
    • Science (1,939)
    • Technology (1,862)
    • The Future (1,716)
    Most Popular
    Crypto

    Bitcoin Bears Risk Losing $7.2 Billion If BTC Price Reaches This Level

    Technology

    Beat the Heat With This $55 Dreo Tower Fan ($45 Off)

    Crypto

    Expert Reveals 4 Reasons To Be Bullish On Q4

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2026 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.