Close Menu
Ztoog
    What's Hot
    Mobile

    DoorDash sued for charging iPhone users more than Android users because they earn more

    Crypto

    Binance Immense XRP Holdings Exposed In POR Report

    Science

    2024 is set to be the year of the moon, but let’s proceed with care

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      Can work-life balance tracking improve well-being?

      Any wall can be turned into a camera to see around corners

      JD Vance and President Trump’s Sons Hype Bitcoin at Las Vegas Conference

      AI may already be shrinking entry-level jobs in tech, new research suggests

      Today’s NYT Strands Hints, Answer and Help for May 26 #449

    • Technology

      Elon Musk tries to stick to spaceships

      A Replit employee details a critical security flaw in web apps created using AI-powered app builder Lovable that exposes API keys and personal info of app users (Reed Albergotti/Semafor)

      Gemini in Google Drive can now help you skip watching that painfully long Zoom meeting

      Apple iPhone exports from China to the US fall 76% as India output surges

      Today’s NYT Wordle Hints, Answer and Help for May 26, #1437

    • Gadgets

      Future-proof your career by mastering AI skills for just $20

      8 Best Vegan Meal Delivery Services and Kits (2025), Tested and Reviewed

      Google Home is getting deeper Gemini integration and a new widget

      Google Announces AI Ultra Subscription Plan With Premium Features

      Google shows off Android XR-based glasses, announces Warby Parker team-up

    • Mobile

      Deals: the Galaxy S25 series comes with a free tablet, Google Pixels heavily discounted

      Microsoft is done being subtle – this new tool screams “upgrade now”

      Wallpaper Wednesday: Android wallpapers 2025-05-28

      Google can make smart glasses accessible with Warby Parker, Gentle Monster deals

      vivo T4 Ultra specs leak

    • Science

      June skygazing: A strawberry moon, the summer solstice… and Asteroid Day!

      Analysts Say Trump Trade Wars Would Harm the Entire US Energy Sector, From Oil to Solar

      Do we have free will? Quantum experiments may soon reveal the answer

      Was Planet Nine exiled from the solar system as a baby?

      How farmers can help rescue water-loving birds

    • AI

      Rationale engineering generates a compact new tool for gene therapy | Ztoog

      The AI Hype Index: College students are hooked on ChatGPT

      Learning how to predict rare kinds of failures | Ztoog

      Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

      AI learns how vision and sound are connected, without human intervention | Ztoog

    • Crypto

      Bitcoin Maxi Isn’t Buying Hype Around New Crypto Holding Firms

      GameStop bought $500 million of bitcoin

      CoinW Teams Up with Superteam Europe to Conclude Solana Hackathon and Accelerate Web3 Innovation in Europe

      Ethereum Net Flows Turn Negative As Bulls Push For $3,500

      Bitcoin’s Power Compared To Nuclear Reactor By Brazilian Business Leader

    Ztoog
    Home » This AI Research Introduces a Novel Two-Stage Pose Distillation for Whole-Body Pose Estimation
    AI

    This AI Research Introduces a Novel Two-Stage Pose Distillation for Whole-Body Pose Estimation

    Facebook Twitter Pinterest WhatsApp
    This AI Research Introduces a Novel Two-Stage Pose Distillation for Whole-Body Pose Estimation
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Numerous human-centric notion, comprehension, and creation duties rely on whole-body pose estimation, together with 3D whole-body mesh restoration, human-object interplay, and posture-conditioned human picture and movement manufacturing. Furthermore, utilizing user-friendly algorithms like OpenPose and MediaPipe, recording human postures for digital content material growth and VR/AR has considerably elevated in reputation. Although these instruments are handy, their efficiency nonetheless wants to enhance, which limits their potential. Therefore, extra developments in human pose evaluation applied sciences are important to realizing the promise of user-driven content material manufacturing. 

    Comparatively talking, whole-body pose estimation presents extra difficulties than human pose estimation with body-only key factors detection as a result of following elements:

    1. The hierarchical constructions of the human physique for fine-grained key factors localization.
    2. The small resolutions of the hand and face.
    3. The complicated physique components match a number of individuals in a picture, particularly for occlusion and troublesome hand poses.
    4. Data limitation, significantly for the whole-body photos’ numerous hand pose and head pose.

    Additionally, a mannequin should be compressed into a skinny community earlier than deployment. Distillation, trimming, and quantization make up the basic compression strategies. 

    Knowledge distillation (KD) can increase a compact mannequin’s effectiveness with out including pointless bills to the inference course of. This technique, which has broad use in varied duties like categorization, detection, and segmentation, permits college students to select up information from a extra skilled instructor. A set of real-time pose estimators with good efficiency and effectivity are produced as a consequence of the investigation of KD for whole-body pose estimation on this work. Researchers from Tsinghua Shenzhen International Graduate School and International Digital Economy Academy particularly recommend a revolutionary two-stage pose distillation structure referred to as DWPose, which, as demonstrated in Fig. 1, supplies cutting-edge efficiency. They use the latest pose estimator, RTMPose, skilled on COCO-WholeBody, as their elementary mannequin. 

    Figure 1 reveals a comparability between their mannequin and comparable fashions for COCO-WholeBody’s whole-body posture estimation.

    They natively use the instructor’s (e.g., RTMPose-x) intermediate layer and ultimate logits within the first stage distillation to direct the coed mannequin (e.g., RTMPose-l). Keypoints could also be distinguished in earlier posture coaching by their visibility, and solely seen key factors are used for monitoring. Instead, they make use of the instructor’s total outputs which embody each seen and invisible key factors—as ultimate logits, which can convey correct and thorough values to assist within the studying course of for the scholars. They additionally use a weight-decay strategy to extend effectiveness, which progressively lowers the gadget’s weight all through the coaching session. The second stage, distillation, suggests a head-aware self-KD to extend the capability of the pinnacle since a higher head would determine a extra correct localization. 

    They construct two similar fashions, selecting one as the coed to be up to date and the opposite as the trainer. Only the pinnacle of the coed is up to date by the logit-based distillation, leaving the remainder of the physique frozen. Notably, this plug-and-play technique works with dense prediction heads and allows the coed to get higher outcomes with 20% much less coaching time, whether or not skilled from the beginning with distillation or with out. The quantity and number of information addressing completely different sizes of human physique components will influence the mannequin’s efficiency. Due to the datasets ‘ want for complete annotated key factors, present estimators should assist precisely localize the fine-grained finger and facial landmarks. 

    Therefore, they incorporate an additional UBody dataset comprising quite a few face and hand key factors photographed in varied real-life settings to look at the information impact. Thus, the next could also be stated about their contributions: 

    • To overcome the whole-body information limitation, they discover extra complete coaching information, particularly on numerous and expressive hand gestures and facial expressions, making it relevant to real-life functions. 

    • They introduce a two-stage pose information distillation technique, pursuing environment friendly and exact whole-body pose estimation. 

    • Their steered distillation and information strategies might significantly improve RTMPose-l from 64.8% to 66.5% AP, even exceeding RTMPose-x teacher with 65.3% AP, utilizing the latest RTMPose as their base mannequin. Additionally, they verify DWPose’s robust efficacy and effectivity in producing work.


    Check out the Paper and GitHub. All Credit For This Research Goes To the Researchers on This Project. Also, don’t overlook to hitch our 27k+ ML SubReddit, 40k+ Facebook Community, Discord Channel, and Email Newsletter, the place we share the newest AI analysis information, cool AI tasks, and extra.


    Aneesh Tickoo is a consulting intern at MarktechPost. He is at present pursuing his undergraduate diploma in Data Science and Artificial Intelligence from the Indian Institute of Technology(IIT), Bhilai. He spends most of his time engaged on tasks aimed toward harnessing the ability of machine studying. His analysis curiosity is picture processing and is enthusiastic about constructing options round it. He loves to attach with individuals and collaborate on attention-grabbing tasks.


    🔥 Use SQL to foretell the long run (Sponsored)

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    Rationale engineering generates a compact new tool for gene therapy | Ztoog

    AI

    The AI Hype Index: College students are hooked on ChatGPT

    AI

    Learning how to predict rare kinds of failures | Ztoog

    AI

    Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

    AI

    AI learns how vision and sound are connected, without human intervention | Ztoog

    AI

    How AI is introducing errors into courtrooms

    AI

    With AI, researchers predict the location of virtually any protein within a human cell | Ztoog

    AI

    Google DeepMind’s new AI agent cracks real-world problems better than humans can

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    AI

    Effector: A Python-based Machine Learning Library Dedicated to Regional Feature Effects

    Global function results strategies, comparable to Partial Dependence Plots (PDP) and SHAP Dependence Plots, have…

    Crypto

    Bitcoin Fever: 99% Of Addresses In Profit As BTC Touches $64,000

    The latest surge within the value of Bitcoin, propelling it to a three-year excessive, has…

    Crypto

    Faction launches $285M early-stage crypto fund

    As the crypto market continues to slog by way of a fundraising winter, Faction Ventures,…

    Mobile

    It’s finally easier to buy a Nothing Phone (2)

    What you want to knowThe Nothing Phone (2) is now obtainable on Amazon within the…

    AI

    Meet Modular Diffusion: A Python Library for Designing and Training Diffusion Models with PyTorch

    We are all the time looking out for cool AI initiatives for marktechpost and this…

    Our Picks
    Technology

    Here’s a first look at Gemini in Google Messages (Update: Google clarifies features)

    AI

    MIT Researchers Use Deep Learning to Get a Better Picture of the Atmospheric Layer Closest to Earth’s Surface: Improving Weather and Drought Prediction

    AI

    AI meets climate: MIT Energy and Climate Hack 2023 | Ztoog

    Categories
    • AI (1,493)
    • Crypto (1,754)
    • Gadgets (1,805)
    • Mobile (1,851)
    • Science (1,867)
    • Technology (1,803)
    • The Future (1,649)
    Most Popular
    Crypto

    Bitcoin Price Imminent Crash To $23,000, These Are The Catalysts

    The Future

    Urlebird Review: Safety, Features, and Alternatives

    Science

    Neil Turok interview: The physicist proposing a mirror-image universe

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.