Close Menu
Ztoog
    What's Hot
    Gadgets

    Google Store Black Friday 2023: Pixel 8 Pro At $799, $400 Off Pixel Fold, And More

    Crypto

    Inside The Bitcoin Surge Of A Tiny Himalayan Kingdom

    Gadgets

    I’m a New Homeowner, and Here’s How to BYO Smart Home

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      Disneyland’s 70th Anniversary Brings Cartoony Chaos to This Summer’s Celebration

      Story of military airfield in Afghanistan that Biden left in 2021

      Tencent hires WizardLM team, a Microsoft AI group with an odd history

      Today’s NYT Connections Hints, Answers for May 12, #701

      OPPO launches A5 Pro 5G: Premium features at a budget price

    • Technology

      Deep dive on the evolution of Microsoft's relationship with OpenAI, from its $1B investment in 2019 through Copilot rollouts and ChatGPT's launch to present day (Bloomberg)

      New leak reveals iPhone Fold won’t look like the Galaxy Z Fold 6 at all

      Apple will use AI and user data in iOS 19 to extend iPhone battery life

      Today’s NYT Wordle Hints, Answer and Help for May 12, #1423

      What It Is and Why It Matters—Part 1 – O’Reilly

    • Gadgets

      We Hand-Picked the 24 Best Deals From the 2025 REI Anniversary Sale

      “Google wanted that”: Nextcloud decries Android permissions as “gatekeeping”

      Google Tests Automatic Password-to-Passkey Conversion On Android

      Maono Caster G1 Neo & PD200X Review: Budget Streaming Gear for Aspiring Creators

      Apple plans to split iPhone 18 launch into two phases in 2026

    • Mobile

      The iPhone Fold is now being tested with an under-display camera

      T-Mobile takes over one of golf’s biggest events, unleashes unique experiences

      Fitbit’s AI experiments just leveled up with 3 new health tracking features

      Motorola’s Moto Watch needs to start living up to the brand name

      Samsung Galaxy S25 Edge promo materials leak

    • Science

      Do these Buddhist gods hint at the purpose of China’s super-secret satellites?

      From Espresso to Eco-Brick: How Coffee Waste Fuels 3D-Printed Design

      Ancient three-eyed ‘sea moth’ used its butt to breathe

      Intelligence on Earth Evolved Independently at Least Twice

      Nothing is stronger than quantum connections – and now we know why

    • AI

      Google DeepMind’s new AI agent cracks real-world problems better than humans can

      Study shows vision-language models can’t handle queries with negation words | Ztoog

      How a new type of AI is helping police skirt facial recognition bans

      Hybrid AI model crafts smooth, high-quality videos in seconds | Ztoog

      How to build a better AI benchmark

    • Crypto

      Is Bitcoin Bull Run Back? Daily RSI Shows Only Mild Bullish Momentum

      Robinhood grows its footprint in Canada by acquiring WonderFi

      HashKey Group Announces Launch of HashKey Global MENA with VASP License in UAE

      Ethereum Breaks Key Resistance In One Massive Move – Higher High Confirms Momentum

      ‘The Big Short’ Coming For Bitcoin? Why BTC Will Clear $110,000

    Ztoog
    Home » A technique for more effective multipurpose robots | Ztoog
    AI

    A technique for more effective multipurpose robots | Ztoog

    Facebook Twitter Pinterest WhatsApp
    A technique for more effective multipurpose robots | Ztoog
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Let’s say you need to prepare a robotic so it understands easy methods to use instruments and might then shortly study to make repairs round your home with a hammer, wrench, and screwdriver. To do this, you would wish an infinite quantity of information demonstrating software use.

    Existing robotic datasets differ broadly in modality — some embrace coloration pictures whereas others are composed of tactile imprints, for occasion. Data may be collected in several domains, like simulation or human demos. And every dataset could seize a singular activity and setting.

    It is tough to effectively incorporate knowledge from so many sources in a single machine-learning mannequin, so many strategies use only one sort of information to coach a robotic. But robots skilled this manner, with a comparatively small quantity of task-specific knowledge, are sometimes unable to carry out new duties in unfamiliar environments.

    In an effort to coach higher multipurpose robots, MIT researchers developed a technique to mix a number of sources of information throughout domains, modalities, and duties utilizing a sort of generative AI often known as diffusion fashions.

    They prepare a separate diffusion mannequin to study a technique, or coverage, for finishing one activity utilizing one particular dataset. Then they mix the insurance policies realized by the diffusion fashions right into a basic coverage that allows a robotic to carry out a number of duties in numerous settings.

    In simulations and real-world experiments, this coaching strategy enabled a robotic to carry out a number of tool-use duties and adapt to new duties it didn’t see throughout coaching. The technique, often known as Policy Composition (PoCo), led to a 20 % enchancment in activity efficiency when in comparison with baseline methods.

    “Addressing heterogeneity in robotic datasets is like a chicken-egg problem. If we want to use a lot of data to train general robot policies, then we first need deployable robots to get all this data. I think that leveraging all the heterogeneous data available, similar to what researchers have done with ChatGPT, is an important step for the robotics field,” says Lirui Wang, {an electrical} engineering and pc science (EECS) graduate scholar and lead creator of a paper on PoCo.     

    Wang’s coauthors embrace Jialiang Zhao, a mechanical engineering graduate scholar; Yilun Du, an EECS graduate scholar; Edward Adelson, the John and Dorothy Wilson Professor of Vision Science within the Department of Brain and Cognitive Sciences and a member of the Computer Science and Artificial Intelligence Laboratory (CSAIL); and senior creator Russ Tedrake, the Toyota Professor of EECS, Aeronautics and Astronautics, and Mechanical Engineering, and a member of CSAIL. The analysis will probably be introduced on the Robotics: Science and Systems Conference.

    Combining disparate datasets

    A robotic coverage is a machine-learning mannequin that takes inputs and makes use of them to carry out an motion. One method to consider a coverage is as a technique. In the case of a robotic arm, that technique could be a trajectory, or a sequence of poses that transfer the arm so it picks up a hammer and makes use of it to pound a nail.

    Datasets used to study robotic insurance policies are sometimes small and centered on one explicit activity and setting, like packing objects into containers in a warehouse.

    “Every single robotic warehouse is generating terabytes of data, but it only belongs to that specific robot installation working on those packages. It is not ideal if you want to use all of these data to train a general machine,” Wang says.

    The MIT researchers developed a technique that may take a sequence of smaller datasets, like these gathered from many robotic warehouses, study separate insurance policies from each, and mix the insurance policies in a method that allows a robotic to generalize to many duties.

    They symbolize every coverage utilizing a sort of generative AI mannequin often known as a diffusion mannequin. Diffusion fashions, typically used for picture era, study to create new knowledge samples that resemble samples in a coaching dataset by iteratively refining their output.

    But slightly than educating a diffusion mannequin to generate pictures, the researchers educate it to generate a trajectory for a robotic. They do that by including noise to the trajectories in a coaching dataset. The diffusion mannequin regularly removes the noise and refines its output right into a trajectory.

    This technique, often known as Diffusion Policy, was beforehand launched by researchers at MIT, Columbia University, and the Toyota Research Institute. PoCo builds off this Diffusion Policy work. 

    The staff trains every diffusion mannequin with a unique sort of dataset, equivalent to one with human video demonstrations and one other gleaned from teleoperation of a robotic arm.

    Then the researchers carry out a weighted mixture of the person insurance policies realized by all of the diffusion fashions, iteratively refining the output so the mixed coverage satisfies the targets of every particular person coverage.

    Greater than the sum of its elements

    “One of the benefits of this approach is that we can combine policies to get the best of both worlds. For instance, a policy trained on real-world data might be able to achieve more dexterity, while a policy trained on simulation might be able to achieve more generalization,” Wang says.

    With coverage composition, researchers are in a position to mix datasets from a number of sources to allow them to educate a robotic to successfully use a variety of instruments, like a hammer, screwdriver, or this spatula.

    Image: Courtesy of the researchers

    Because the insurance policies are skilled individually, one might combine and match diffusion insurance policies to attain higher outcomes for a sure activity. A consumer might additionally add knowledge in a brand new modality or area by coaching an extra Diffusion Policy with that dataset, slightly than beginning your entire course of from scratch.

    Animation of robot arm using toy hammer as objects are being placed randomly next around it.
    The coverage composition technique the researchers developed can be utilized to successfully educate a robotic to make use of instruments even when objects are positioned round it to attempt to distract it from its activity, as seen right here.

    Image: Courtesy of the researchers

    The researchers examined PoCo in simulation and on actual robotic arms that carried out a wide range of instruments duties, equivalent to utilizing a hammer to pound a nail and flipping an object with a spatula. PoCo led to a 20 % enchancment in activity efficiency in comparison with baseline strategies.

    “The striking thing was that when we finished tuning and visualized it, we can clearly see that the composed trajectory looks much better than either one of them individually,” Wang says.

    In the long run, the researchers need to apply this technique to long-horizon duties the place a robotic would decide up one software, use it, then swap to a different software. They additionally need to incorporate bigger robotics datasets to enhance efficiency.

    “We will need all three kinds of data to succeed for robotics: internet data, simulation data, and real robot data. How to combine them effectively will be the million-dollar question. PoCo is a solid step on the right track,” says Jim Fan, senior analysis scientist at NVIDIA and chief of the AI Agents Initiative, who was not concerned with this work.

    This analysis is funded, partly, by Amazon, the Singapore Defense Science and Technology Agency, the U.S. National Science Foundation, and the Toyota Research Institute.

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    Google DeepMind’s new AI agent cracks real-world problems better than humans can

    AI

    Study shows vision-language models can’t handle queries with negation words | Ztoog

    AI

    How a new type of AI is helping police skirt facial recognition bans

    AI

    Hybrid AI model crafts smooth, high-quality videos in seconds | Ztoog

    AI

    How to build a better AI benchmark

    AI

    Q&A: A roadmap for revolutionizing health care through data-driven innovation | Ztoog

    AI

    This data set helps researchers spot harmful stereotypes in LLMs

    AI

    Making AI models more trustworthy for high-stakes settings | Ztoog

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    Science

    Planned moon landings could pelt orbiting spacecraft with dusty debris

    Artist’s depiction of the Blue Origin’s Blue Moon lander, which NASA has chosen for its…

    AI

    Now you can chat with ChatGPT using your voice

    In final week’s demo, Raul Puri, a scientist who works on GPT-4, gave me a…

    Crypto

    Bitcoin Retests $95,000, Is A New Year Rebound Coming?

    Este artículo también está disponible en español. As the yr ends, a famend analyst recommended…

    Technology

    How the shoplifting scare is undermining criminal justice reform

    Over the final couple of years, it appeared that America was experiencing a shoplifting epidemic.…

    AI

    Optimizing Computational Costs with AutoMix: An AI Strategic Approach to Leveraging Large Language Models from the Cloud

    AutoMix is an revolutionary method that optimises the allocation of queries to bigger language fashions…

    Our Picks
    Technology

    Five Great Microsoft Forms Features for Teachers

    Science

    These Rogue Worlds Upend the Theory of How Planets Form

    The Future

    AI news recap for July: While Hollywood strikes, is ChatGPT getting worse?

    Categories
    • AI (1,486)
    • Crypto (1,748)
    • Gadgets (1,799)
    • Mobile (1,843)
    • Science (1,858)
    • Technology (1,794)
    • The Future (1,640)
    Most Popular
    Technology

    What We Learned from a Year of Building with LLMs (Part III): Strategy – O’Reilly

    Science

    AI could assemble a record-breaking quantum computer out of cold atoms

    Crypto

    ‘Vitalik Slept On My Couch & Copied My Inventions’ Ethereum Insider Says

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.