Close Menu
Ztoog
    What's Hot
    AI

    Separating Fact from Logic: Test of Time ToT Benchmark Isolates Reasoning Skills in LLMs for Improved Temporal Understanding

    Gadgets

    Report: Apple changes film strategy, will rarely do wide theatrical releases

    Mobile

    Check out the new promo videos for Apple Pay

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      What is Project Management? 5 Best Tools that You Can Try

      Operational excellence strategy and continuous improvement

      Hannah Fry: AI isn’t as powerful as we think

      FanDuel goes all in on responsible gaming push with new Play with a Plan campaign

      Gettyimages.com Is the Best Website on the Internet Right Now

    • Technology

      Iran war: How could it end?

      Democratic senators question CFTC staffing cuts in Chicago enforcement office

      Google’s Cloud AI lead on the three frontiers of model capability

      AMD agrees to backstop a $300M loan from Goldman Sachs for Crusoe to buy AMD AI chips, the first known case of AMD chips used as debt collateral (The Information)

      Productivity apps failed me when I needed them most

    • Gadgets

      macOS Tahoe 26.3.1 update will “upgrade” your M5’s CPU to new “super” cores

      Lenovo Shows Off a ThinkBook Modular AI PC Concept With Swappable Ports and Detachable Displays at MWC 2026

      POCO M8 Review: The Ultimate Budget Smartphone With Some Cons

      The Mission: Impossible of SSDs has arrived with a fingerprint lock

      6 Best Phones With Headphone Jacks (2026), Tested and Reviewed

    • Mobile

      Android’s March update is all about finding people, apps, and your missing bags

      Watch Xiaomi’s global launch event live here

      Our poll shows what buyers actually care about in new smartphones (Hint: it’s not AI)

      Is Strava down for you? You’re not alone

      The Motorola Razr FIFA World Cup 2026 Edition was literally just unveiled, and Verizon is already giving them away

    • Science

      Big Tech Signs White House Data Center Pledge With Good Optics and Little Substance

      Inside the best dark matter detector ever built

      NASA’s Artemis moon exploration programme is getting a major makeover

      Scientists crack the case of “screeching” Scotch tape

      Blue-faced, puffy-lipped monkey scores a rare conservation win

    • AI

      Online harassment is entering its AI era

      Meet NullClaw: The 678 KB Zig AI Agent Framework Running on 1 MB RAM and Booting in Two Milliseconds

      New method could increase LLM training efficiency | Ztoog

      The human work behind humanoid robots is being hidden

      NVIDIA Releases DreamDojo: An Open-Source Robot World Model Trained on 44,711 Hours of Real-World Human Video Data

    • Crypto

      Google paid startup Form Energy $1B for its massive 100-hour battery

      Ethereum Breakout Alert: Corrective Channel Flip Sparks Impulsive Wave

      Show Your ID Or No Deal

      Jane Street sued for alleged front-running trades that accelerated Terraform Labs meltdown

      Bitcoin Trades Below ETF Cost-Basis As MVRV Signals Mounting Pressure

    Ztoog
    Home » This AI Research Introduces a Novel Two-Stage Pose Distillation for Whole-Body Pose Estimation
    AI

    This AI Research Introduces a Novel Two-Stage Pose Distillation for Whole-Body Pose Estimation

    Facebook Twitter Pinterest WhatsApp
    This AI Research Introduces a Novel Two-Stage Pose Distillation for Whole-Body Pose Estimation
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Numerous human-centric notion, comprehension, and creation duties rely on whole-body pose estimation, together with 3D whole-body mesh restoration, human-object interplay, and posture-conditioned human picture and movement manufacturing. Furthermore, utilizing user-friendly algorithms like OpenPose and MediaPipe, recording human postures for digital content material growth and VR/AR has considerably elevated in reputation. Although these instruments are handy, their efficiency nonetheless wants to enhance, which limits their potential. Therefore, extra developments in human pose evaluation applied sciences are important to realizing the promise of user-driven content material manufacturing. 

    Comparatively talking, whole-body pose estimation presents extra difficulties than human pose estimation with body-only key factors detection as a result of following elements:

    1. The hierarchical constructions of the human physique for fine-grained key factors localization.
    2. The small resolutions of the hand and face.
    3. The complicated physique components match a number of individuals in a picture, particularly for occlusion and troublesome hand poses.
    4. Data limitation, significantly for the whole-body photos’ numerous hand pose and head pose.

    Additionally, a mannequin should be compressed into a skinny community earlier than deployment. Distillation, trimming, and quantization make up the basic compression strategies. 

    Knowledge distillation (KD) can increase a compact mannequin’s effectiveness with out including pointless bills to the inference course of. This technique, which has broad use in varied duties like categorization, detection, and segmentation, permits college students to select up information from a extra skilled instructor. A set of real-time pose estimators with good efficiency and effectivity are produced as a consequence of the investigation of KD for whole-body pose estimation on this work. Researchers from Tsinghua Shenzhen International Graduate School and International Digital Economy Academy particularly recommend a revolutionary two-stage pose distillation structure referred to as DWPose, which, as demonstrated in Fig. 1, supplies cutting-edge efficiency. They use the latest pose estimator, RTMPose, skilled on COCO-WholeBody, as their elementary mannequin. 

    Figure 1 reveals a comparability between their mannequin and comparable fashions for COCO-WholeBody’s whole-body posture estimation.

    They natively use the instructor’s (e.g., RTMPose-x) intermediate layer and ultimate logits within the first stage distillation to direct the coed mannequin (e.g., RTMPose-l). Keypoints could also be distinguished in earlier posture coaching by their visibility, and solely seen key factors are used for monitoring. Instead, they make use of the instructor’s total outputs which embody each seen and invisible key factors—as ultimate logits, which can convey correct and thorough values to assist within the studying course of for the scholars. They additionally use a weight-decay strategy to extend effectiveness, which progressively lowers the gadget’s weight all through the coaching session. The second stage, distillation, suggests a head-aware self-KD to extend the capability of the pinnacle since a higher head would determine a extra correct localization. 

    They construct two similar fashions, selecting one as the coed to be up to date and the opposite as the trainer. Only the pinnacle of the coed is up to date by the logit-based distillation, leaving the remainder of the physique frozen. Notably, this plug-and-play technique works with dense prediction heads and allows the coed to get higher outcomes with 20% much less coaching time, whether or not skilled from the beginning with distillation or with out. The quantity and number of information addressing completely different sizes of human physique components will influence the mannequin’s efficiency. Due to the datasets ‘ want for complete annotated key factors, present estimators should assist precisely localize the fine-grained finger and facial landmarks. 

    Therefore, they incorporate an additional UBody dataset comprising quite a few face and hand key factors photographed in varied real-life settings to look at the information impact. Thus, the next could also be stated about their contributions: 

    • To overcome the whole-body information limitation, they discover extra complete coaching information, particularly on numerous and expressive hand gestures and facial expressions, making it relevant to real-life functions. 

    • They introduce a two-stage pose information distillation technique, pursuing environment friendly and exact whole-body pose estimation. 

    • Their steered distillation and information strategies might significantly improve RTMPose-l from 64.8% to 66.5% AP, even exceeding RTMPose-x teacher with 65.3% AP, utilizing the latest RTMPose as their base mannequin. Additionally, they verify DWPose’s robust efficacy and effectivity in producing work.


    Check out the Paper and GitHub. All Credit For This Research Goes To the Researchers on This Project. Also, don’t overlook to hitch our 27k+ ML SubReddit, 40k+ Facebook Community, Discord Channel, and Email Newsletter, the place we share the newest AI analysis information, cool AI tasks, and extra.


    Aneesh Tickoo is a consulting intern at MarktechPost. He is at present pursuing his undergraduate diploma in Data Science and Artificial Intelligence from the Indian Institute of Technology(IIT), Bhilai. He spends most of his time engaged on tasks aimed toward harnessing the ability of machine studying. His analysis curiosity is picture processing and is enthusiastic about constructing options round it. He loves to attach with individuals and collaborate on attention-grabbing tasks.


    🔥 Use SQL to foretell the long run (Sponsored)

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    Online harassment is entering its AI era

    AI

    Meet NullClaw: The 678 KB Zig AI Agent Framework Running on 1 MB RAM and Booting in Two Milliseconds

    AI

    New method could increase LLM training efficiency | Ztoog

    AI

    The human work behind humanoid robots is being hidden

    AI

    NVIDIA Releases DreamDojo: An Open-Source Robot World Model Trained on 44,711 Hours of Real-World Human Video Data

    AI

    Personalization features can make LLMs more agreeable | Ztoog

    AI

    AI is already making online crimes easier. It could get much worse.

    AI

    NVIDIA Researchers Introduce KVTC Transform Coding Pipeline to Compress Key-Value Caches by 20x for Efficient LLM Serving

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    Crypto

    Will This Breakthrough Lead To A New Market Phase?

    Ethereum (ETH), the second-largest cryptocurrency by market capitalization, has exhibited a promising technical growth, igniting…

    AI

    The Representative Capacity of Transformer Language Models LMs with n-gram Language Models LMs: Capturing the Parallelizable Nature of n-gram LMs

    Neural language fashions (LMs) have grow to be fashionable attributable to their intensive theoretical work…

    Crypto

    Bitcoin Enthusiast Javier Milei Secures Win In Argentina’s Presidential Contest

    Javier Milei, who’s well-known for his ardent advocacy of Bitcoin, grew to become victorious within…

    Science

    3D muscle reconstruction shows 3.2 million-year-old “Lucy” walked upright

    3D reconstruction of decrease limb muscular tissues of Australopithecus afarensis fossil AL 288-1, aka “Lucy.”…

    Mobile

    CMF Buds by Nothing in for review

    If you are looking out for succesful wi-fi earbuds on a budget, you most likely…

    Our Picks
    Science

    Moon rocks reveal hidden lunar history

    Crypto

    Libra’s co-creator had geopolitical motivations to build the digital currency

    Crypto

    Ethereum (ETH) Price Drops Due Whale Selling, Key Levels

    Categories
    • AI (1,560)
    • Crypto (1,826)
    • Gadgets (1,870)
    • Mobile (1,910)
    • Science (1,939)
    • Technology (1,862)
    • The Future (1,716)
    Most Popular
    Technology

    Beginner’s Guide to Online Gambling: What to Know Before You Play

    Gadgets

    A giant list of Labor Day deals to peruse while you mourn the end of summer

    The Future

    Risk algorithm used widely in US courts is harsher than human judges

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2026 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.