Close Menu
Ztoog
    What's Hot
    Mobile

    Moondrop teases its first Hi-Fi smartphone with new image (Update)

    Science

    Leonardo da Vinci used toxic pigments when he painted the Mona Lisa

    Technology

    Why Bitcoin is surging to a record high

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      How to Get Bot Lobbies in Fortnite? (2025 Guide)

      Can work-life balance tracking improve well-being?

      Any wall can be turned into a camera to see around corners

      JD Vance and President Trump’s Sons Hype Bitcoin at Las Vegas Conference

      AI may already be shrinking entry-level jobs in tech, new research suggests

    • Technology

      What does a millennial midlife crisis look like?

      Elon Musk tries to stick to spaceships

      A Replit employee details a critical security flaw in web apps created using AI-powered app builder Lovable that exposes API keys and personal info of app users (Reed Albergotti/Semafor)

      Gemini in Google Drive can now help you skip watching that painfully long Zoom meeting

      Apple iPhone exports from China to the US fall 76% as India output surges

    • Gadgets

      Watch Apple’s WWDC 2025 keynote right here

      Future-proof your career by mastering AI skills for just $20

      8 Best Vegan Meal Delivery Services and Kits (2025), Tested and Reviewed

      Google Home is getting deeper Gemini integration and a new widget

      Google Announces AI Ultra Subscription Plan With Premium Features

    • Mobile

      YouTube is testing a leaderboard to show off top live stream fans

      Deals: the Galaxy S25 series comes with a free tablet, Google Pixels heavily discounted

      Microsoft is done being subtle – this new tool screams “upgrade now”

      Wallpaper Wednesday: Android wallpapers 2025-05-28

      Google can make smart glasses accessible with Warby Parker, Gentle Monster deals

    • Science

      Some parts of Trump’s proposed budget for NASA are literally draconian

      June skygazing: A strawberry moon, the summer solstice… and Asteroid Day!

      Analysts Say Trump Trade Wars Would Harm the Entire US Energy Sector, From Oil to Solar

      Do we have free will? Quantum experiments may soon reveal the answer

      Was Planet Nine exiled from the solar system as a baby?

    • AI

      Fueling seamless AI at scale

      Rationale engineering generates a compact new tool for gene therapy | Ztoog

      The AI Hype Index: College students are hooked on ChatGPT

      Learning how to predict rare kinds of failures | Ztoog

      Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

    • Crypto

      Bitcoin Maxi Isn’t Buying Hype Around New Crypto Holding Firms

      GameStop bought $500 million of bitcoin

      CoinW Teams Up with Superteam Europe to Conclude Solana Hackathon and Accelerate Web3 Innovation in Europe

      Ethereum Net Flows Turn Negative As Bulls Push For $3,500

      Bitcoin’s Power Compared To Nuclear Reactor By Brazilian Business Leader

    Ztoog
    Home » Meet OmniControl: An Artificial Intelligence Approach for Incorporating Flexible Spatial Control Signals into a Text-Conditioned Human Motion Generation Model Based on the Diffusion Process
    AI

    Meet OmniControl: An Artificial Intelligence Approach for Incorporating Flexible Spatial Control Signals into a Text-Conditioned Human Motion Generation Model Based on the Diffusion Process

    Facebook Twitter Pinterest WhatsApp
    Meet OmniControl: An Artificial Intelligence Approach for Incorporating Flexible Spatial Control Signals into a Text-Conditioned Human Motion Generation Model Based on the Diffusion Process
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Researchers deal with the challenge of mixing spatial management indicators over each joint at any given time into text-conditioned human movement manufacturing. Modern diffusion-based methods could produce assorted and lifelike human movement, however they discover it troublesome to include variable spatial management indicators, that are important for many purposes. For occasion, a mannequin should regulate the hand place to contact the cup at a specific place and time and perceive “pick up” semantics to synthesize the motion for choosing up a cup. Similarly, when transferring by way of a room with low ceilings, a mannequin should fastidiously regulate the peak of the head for a sure period of time to keep away from accidents. 

    Since they’re troublesome to clarify in the textual immediate, these management indicators are sometimes delivered as world positions of joints of curiosity in keyframes. However, earlier inpainting-based approaches can not incorporate versatile management indicators as a consequence of their chosen relative human posture representations. The limits are largely brought on by the relative places of the joints and the pelvis with respect to 1 one other and the prior body. The world pelvic place provided in the management sign should thus be translated to a relative location regarding the earlier body to be enter to the keyframe. Similar to how different joints’ positions have to be enter, the world place of the pelvis should even be transformed. 

    However, the pelvis’ relative places between the diffusion era course of have to be extra current or corrected in each situations. To combine any spatial management sign on joints aside from the pelvis, one should first need assistance managing sparse limitations on the pelvis. Others current a two-stage mannequin, however it nonetheless has hassle regulating different joints as a consequence of the restricted management indicators over the pelvis. In this research, researchers from Northeastern University and Google Research recommend OmniControl, a brand-new diffusion-based human era mannequin that will embrace versatile spatial management indicators over any joint at any given second. Building on OmniControl, realism guiding is added to manage the creation of human actions. 

    Figure 1: Given a written immediate and adaptable spatial management indicators, OmniControl can produce convincing human gestures. Later frames in the collection are indicated by darker colors. The enter management indicators are proven by the inexperienced line or factors.

    For the mannequin to work properly, they use the identical relative human posture representations for enter and output. However, they recommend, in distinction to present approaches, changing the produced movement to world coordinates for direct comparability with the enter management indicators in the spatial steerage module, the place the gradients of the error are employed to enhance the movement. It resolves the shortcomings of the earlier inpainting-based strategies by eradicating the uncertainty relating to the relative places of the pelvis. Additionally, in comparison with earlier approaches, it permits dynamic iterative refining of the produced movement, enhancing management precision. 

    Although efficiently implementing area limits, spatial steerage alone incessantly leads to drifting points and irregular human actions. They current the realism steerage, which outputs the residuals w.r.t. the options in every consideration layer of the movement diffusion mannequin, to unravel these issues by drawing inspiration from the managed image manufacturing. These residuals can explicitly and densely alter whole-body movement. To produce practical, coherent, and constant actions with spatial restrictions, each the spatial and the realism steerage are essential, and they’re complementary in balancing management precision and movement realism. 

    Studies utilizing HumanML3D and KIT-ML reveal that OmniControl performs considerably higher than the most superior text-based movement era methods for pelvic management by way of each movement realism and management accuracy. However, incorporating the spatial limitations over any joint at any second is the place OmniControl excels. Additionally, as illustrated in Fig. 1, they might prepare a single mannequin to regulate quite a few joints collectively fairly than individually (for instance, each the left and proper wrists). 

    These options of OmniControl make it potential for a number of downstream purposes, equivalent to tying produced a human movement to the surrounding surroundings and objects, as seen in Fig. 1’s final column. Their transient contributions are: (1) As far as they’re conscious, OmniControl is the first technique able to combining spatial management indicators over any joint at any second. (2) To efficiently steadiness the management precision and movement realism in the produced movement, they recommend a distinctive management module that makes use of spatial and realism steerage. (3) Tests reveal that OmniControl can management further joints utilizing a single mannequin in text-based movement creation, setting a new customary for controlling the pelvis and opening up varied purposes in human movement manufacturing.


    Check out the Paper and Project. All Credit For This Research Goes To the Researchers on This Project. Also, don’t neglect to affix our 31k+ ML SubReddit, 40k+ Facebook Community, Discord Channel, and Email Newsletter, the place we share the newest AI analysis information, cool AI tasks, and extra.

    If you want our work, you’ll love our publication..

    We are additionally on WhatsApp. Join our AI Channel on Whatsapp..


    Aneesh Tickoo is a consulting intern at MarktechPost. He is presently pursuing his undergraduate diploma in Data Science and Artificial Intelligence from the Indian Institute of Technology(IIT), Bhilai. He spends most of his time working on tasks aimed toward harnessing the energy of machine studying. His analysis curiosity is picture processing and is enthusiastic about constructing options round it. He loves to attach with folks and collaborate on fascinating tasks.


    ▶️ Now Watch AI Research Updates On Our Youtube Channel [Watch Now]

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    Fueling seamless AI at scale

    AI

    Rationale engineering generates a compact new tool for gene therapy | Ztoog

    AI

    The AI Hype Index: College students are hooked on ChatGPT

    AI

    Learning how to predict rare kinds of failures | Ztoog

    AI

    Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

    AI

    AI learns how vision and sound are connected, without human intervention | Ztoog

    AI

    How AI is introducing errors into courtrooms

    AI

    With AI, researchers predict the location of virtually any protein within a human cell | Ztoog

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    Crypto

    Paradigm Raises $850 Million for Early-Stage Crypto Venture Fund

    Paradigm, identified for its early investments in initiatives like crypto change Uniswap and Ethereum scaling…

    Crypto

    Solana Drops Below 100-Day MA On 4-Hour Chart, SOL Price In Danger?

    Having failed to interrupt its earlier excessive for the yr, the value of Solana has…

    Crypto

    Fractal Suggests Major Breakout In Q4

    Este artículo también está disponible en español. Recent Ethereum worth motion noticed ETH reaching one…

    Technology

    Bronny James’s heart reportedly stopped during practice. What could have happened?

    On Monday, Bronny James — Los Angeles Lakers star LeBron James’s 18-year-old son and a…

    Gadgets

    Dealmaster: Apple watches, TV mega-deals, headphone sales, and more

    Enlarge / The Apple Watch Ultra utilizing the Backtrack breadcrumb characteristic throughout the compass.Corey Gaskin…

    Our Picks
    Gadgets

    Android 15 Developer Preview 1 is out for the Pixel 6 and up

    Crypto

    SBF testifying, bitcoin rises amid spot ETF speculation and Walmart’s web3 accelerator surfaces

    The Future

    Prevent Fire and smoke hazards in workplace.

    Categories
    • AI (1,494)
    • Crypto (1,754)
    • Gadgets (1,806)
    • Mobile (1,852)
    • Science (1,868)
    • Technology (1,804)
    • The Future (1,650)
    Most Popular
    AI

    Can Your Chatbot Become Sherlock Holmes? This Paper Explores the Detective Skills of Large Language Models in Information Extraction

    The Future

    The GoPro Hero 12 Black has arrived

    Mobile

    Best Android tips and tricks in 2023

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.