Close Menu
Ztoog
    What's Hot
    AI

    Understanding the nuances of human-like intelligence | Ztoog

    Gadgets

    Toyota And Idemitsu Pioneer Solid-State Batteries For 745-Mile Range EVs

    The Future

    Haunting ‘Demon Faces’ Show What It’s Like to Have Rare Distorted Face Syndrome

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      What is Project Management? 5 Best Tools that You Can Try

      Operational excellence strategy and continuous improvement

      Hannah Fry: AI isn’t as powerful as we think

      FanDuel goes all in on responsible gaming push with new Play with a Plan campaign

      Gettyimages.com Is the Best Website on the Internet Right Now

    • Technology

      Iran war: How could it end?

      Democratic senators question CFTC staffing cuts in Chicago enforcement office

      Google’s Cloud AI lead on the three frontiers of model capability

      AMD agrees to backstop a $300M loan from Goldman Sachs for Crusoe to buy AMD AI chips, the first known case of AMD chips used as debt collateral (The Information)

      Productivity apps failed me when I needed them most

    • Gadgets

      macOS Tahoe 26.3.1 update will “upgrade” your M5’s CPU to new “super” cores

      Lenovo Shows Off a ThinkBook Modular AI PC Concept With Swappable Ports and Detachable Displays at MWC 2026

      POCO M8 Review: The Ultimate Budget Smartphone With Some Cons

      The Mission: Impossible of SSDs has arrived with a fingerprint lock

      6 Best Phones With Headphone Jacks (2026), Tested and Reviewed

    • Mobile

      Android’s March update is all about finding people, apps, and your missing bags

      Watch Xiaomi’s global launch event live here

      Our poll shows what buyers actually care about in new smartphones (Hint: it’s not AI)

      Is Strava down for you? You’re not alone

      The Motorola Razr FIFA World Cup 2026 Edition was literally just unveiled, and Verizon is already giving them away

    • Science

      Big Tech Signs White House Data Center Pledge With Good Optics and Little Substance

      Inside the best dark matter detector ever built

      NASA’s Artemis moon exploration programme is getting a major makeover

      Scientists crack the case of “screeching” Scotch tape

      Blue-faced, puffy-lipped monkey scores a rare conservation win

    • AI

      Online harassment is entering its AI era

      Meet NullClaw: The 678 KB Zig AI Agent Framework Running on 1 MB RAM and Booting in Two Milliseconds

      New method could increase LLM training efficiency | Ztoog

      The human work behind humanoid robots is being hidden

      NVIDIA Releases DreamDojo: An Open-Source Robot World Model Trained on 44,711 Hours of Real-World Human Video Data

    • Crypto

      SEC Vs. Justin Sun Case Ends In $10M Settlement

      Google paid startup Form Energy $1B for its massive 100-hour battery

      Ethereum Breakout Alert: Corrective Channel Flip Sparks Impulsive Wave

      Show Your ID Or No Deal

      Jane Street sued for alleged front-running trades that accelerated Terraform Labs meltdown

    Ztoog
    Home » CMU Researchers Developed a Simple Distance Learning AI Method to Transfer Visual Priors to Robotics Tasks: Improving Policy Learning by 20% Over Baselines
    AI

    CMU Researchers Developed a Simple Distance Learning AI Method to Transfer Visual Priors to Robotics Tasks: Improving Policy Learning by 20% Over Baselines

    Facebook Twitter Pinterest WhatsApp
    CMU Researchers Developed a Simple Distance Learning AI Method to Transfer Visual Priors to Robotics Tasks: Improving Policy Learning by 20% Over Baselines
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    A big barrier to progress in robotic studying is the dearth of adequate, large-scale knowledge units. Data units in robotics have points with being (a) arduous to scale, (b) collected in sterile, non-realistic environment (reminiscent of a robotics lab), and (c) too homogeneous (reminiscent of toy gadgets with preset backgrounds and lighting). Vision knowledge units, however, embody a huge number of duties, objects, and environments. Therefore, trendy strategies have investigated the feasibility of bringing priors developed to be used with large imaginative and prescient datasets into robotics functions.

    Pre-trained representations encoding image observations as state vectors are utilized in earlier work that makes use of imaginative and prescient knowledge units. This graphical illustration is then merely despatched into a controller skilled utilizing knowledge collected from robots. Since the latent house of pre-trained networks already incorporates semantic, task-level data, the workforce counsel that they’ll do extra than simply signify states.

    New work by a analysis workforce from Carnegie Mellon University CMU exhibits that neural image representations may be greater than merely state representations since they can be utilized to infer robotic actions with using a easy metric created throughout the embedding house. The researchers use this understanding to be taught a distance perform and a dynamics perform with little or no low-cost human knowledge. These modules specify a robotic planner that has been examined on 4 typical manipulation jobs.

    This is achieved by splitting a pre-trained illustration into two distinct modules: (a) a one-step dynamics module, which predicts the robotic’s subsequent state primarily based on its present state/motion, and (b) a “functional distance module,” which determines how shut the robotic is to attaining its purpose within the present state. Using a contrastive studying goal, the space perform is realized with solely a small quantity of information from human demonstrations. 

    Despite its obvious ease of use, the proposed system has been proven to outperform each conventional imitation studying and offline RL approaches to robotic studying. When in contrast to a customary BC baseline, this method performs considerably higher when coping with multi-modal motion distributions. The outcomes of the ablation investigation present that higher representations lead to higher management efficiency and that dynamical grounding is important for the system to be efficient in the actual world.

    Since the pre-trained illustration itself does the arduous lifting (due to its construction), and fully avoids the problem of multi-modal, sequential motion prediction, the findings present that this methodology outperforms coverage studying (via Behavior Cloning). Additionally, the earned distance perform is steady and simple to practice, making it extremely scalable and generalizable.

    The workforce hopes that their work will spark new analysis within the fields of robotics and illustration studying. Following this,  future analysis ought to refine visible representations for robotics even additional by higher portraying the granular interactions between the gripper/hand and the issues being dealt with. This has the potential to improve efficiency on actions like knob turning, the place the pre-trained R3M encoder has bother detecting delicate adjustments in grip place in regards to the knob. They hope that research would use their method additionally to be taught fully within the absence of motion labels. Finally, regardless of the area hole, it might be fantastic if the data gathered with their cheap stick may very well be employed with a stronger, extra reliable (industrial) gripper.


    Check out the Paper, GitHub, and Project. All Credit For This Research Goes To the Researchers on This Project. Also, don’t neglect to be part of our 28k+ ML SubReddit, 40k+ Facebook Community, Discord Channel, and Email Newsletter, the place we share the newest AI analysis information, cool AI initiatives, and extra.


    Dhanshree Shenwai is a Computer Science Engineer and has a good expertise in FinTech corporations overlaying Financial, Cards & Payments and Banking area with eager curiosity in functions of AI. She is passionate about exploring new applied sciences and developments in in the present day’s evolving world making everybody’s life simple.


    🔥 Use SQL to predict the long run (Sponsored)

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    Online harassment is entering its AI era

    AI

    Meet NullClaw: The 678 KB Zig AI Agent Framework Running on 1 MB RAM and Booting in Two Milliseconds

    AI

    New method could increase LLM training efficiency | Ztoog

    AI

    The human work behind humanoid robots is being hidden

    AI

    NVIDIA Releases DreamDojo: An Open-Source Robot World Model Trained on 44,711 Hours of Real-World Human Video Data

    AI

    Personalization features can make LLMs more agreeable | Ztoog

    AI

    AI is already making online crimes easier. It could get much worse.

    AI

    NVIDIA Researchers Introduce KVTC Transform Coding Pipeline to Compress Key-Value Caches by 20x for Efficient LLM Serving

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    AI

    Bill Gates isn’t too scared about AI

    The billionaire enterprise magnate and philanthropist made his case in a submit on his private…

    Crypto

    Ex-Meta employees’ Aptos tests Hong Kong’s crypto appetite

    Ever since Hong Kong legalized cryptocurrency buying and selling final June, blockchain tasks from the…

    Gadgets

    Clean up holiday messes with almost 30% off the Bissell Little Green carpet and upholstery cleaner at Amazon

    We could earn income from the merchandise out there on this web page and take…

    Technology

    Canonical wants better Snap support outside Ubuntu, based on latest hires

    Canonical/Ubuntu Snaps, the self-contained utility packages that Ubuntu has lengthy seen as an easier app…

    Gadgets

    Samsung Unveils New OLED Gaming Monitors At CES 2024

    Samsung has simply launched its newest additions to the Odyssey gaming monitor lineup with the…

    Our Picks
    Crypto

    44.2% Of Ethereum Holders Now In Loss, Is This The Bottom?

    Crypto

    Will Bitcoin Price Crash To $10,000? Bloomberg Expert Reveals When

    AI

    Meet LLM Surgeon: A New Machine Learning Framework for Unstructured, Semi-Structured, and Structured Pruning of Large Language Models (LLMs)

    Categories
    • AI (1,560)
    • Crypto (1,827)
    • Gadgets (1,870)
    • Mobile (1,910)
    • Science (1,939)
    • Technology (1,862)
    • The Future (1,716)
    Most Popular
    AI

    Unlocking the Secrets of Human-Machine Interaction: This AI Research from Spain Introduces a Comprehensive Dataset for Advancing Adaptive Interface Design

    The Future

    Using Augmented Reality to Level up your E-Commerce Business

    Crypto

    SBF’s trial is coming to a close — here’s what you missed

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2026 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.