Close Menu
Ztoog
    What's Hot
    Technology

    Best iPhone in 2023: Which Apple Phone Should You Buy?

    Crypto

    Why Bitcoin ATHs In December Could Be A “Sure Thing”

    Mobile

    Google Maps’ latest test lets you see where you can enter a building

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      Can work-life balance tracking improve well-being?

      Any wall can be turned into a camera to see around corners

      JD Vance and President Trump’s Sons Hype Bitcoin at Las Vegas Conference

      AI may already be shrinking entry-level jobs in tech, new research suggests

      Today’s NYT Strands Hints, Answer and Help for May 26 #449

    • Technology

      Elon Musk tries to stick to spaceships

      A Replit employee details a critical security flaw in web apps created using AI-powered app builder Lovable that exposes API keys and personal info of app users (Reed Albergotti/Semafor)

      Gemini in Google Drive can now help you skip watching that painfully long Zoom meeting

      Apple iPhone exports from China to the US fall 76% as India output surges

      Today’s NYT Wordle Hints, Answer and Help for May 26, #1437

    • Gadgets

      Future-proof your career by mastering AI skills for just $20

      8 Best Vegan Meal Delivery Services and Kits (2025), Tested and Reviewed

      Google Home is getting deeper Gemini integration and a new widget

      Google Announces AI Ultra Subscription Plan With Premium Features

      Google shows off Android XR-based glasses, announces Warby Parker team-up

    • Mobile

      Deals: the Galaxy S25 series comes with a free tablet, Google Pixels heavily discounted

      Microsoft is done being subtle – this new tool screams “upgrade now”

      Wallpaper Wednesday: Android wallpapers 2025-05-28

      Google can make smart glasses accessible with Warby Parker, Gentle Monster deals

      vivo T4 Ultra specs leak

    • Science

      Analysts Say Trump Trade Wars Would Harm the Entire US Energy Sector, From Oil to Solar

      Do we have free will? Quantum experiments may soon reveal the answer

      Was Planet Nine exiled from the solar system as a baby?

      How farmers can help rescue water-loving birds

      A trip to the farm where loofahs grow on vines

    • AI

      Rationale engineering generates a compact new tool for gene therapy | Ztoog

      The AI Hype Index: College students are hooked on ChatGPT

      Learning how to predict rare kinds of failures | Ztoog

      Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

      AI learns how vision and sound are connected, without human intervention | Ztoog

    • Crypto

      Bitcoin Maxi Isn’t Buying Hype Around New Crypto Holding Firms

      GameStop bought $500 million of bitcoin

      CoinW Teams Up with Superteam Europe to Conclude Solana Hackathon and Accelerate Web3 Innovation in Europe

      Ethereum Net Flows Turn Negative As Bulls Push For $3,500

      Bitcoin’s Power Compared To Nuclear Reactor By Brazilian Business Leader

    Ztoog
    Home » Watch and Learn Little Robot: This AI Approach Teaches Robots Generalizable Manipulation Using Human Video Demonstrations
    AI

    Watch and Learn Little Robot: This AI Approach Teaches Robots Generalizable Manipulation Using Human Video Demonstrations

    Facebook Twitter Pinterest WhatsApp
    Watch and Learn Little Robot: This AI Approach Teaches Robots Generalizable Manipulation Using Human Video Demonstrations
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Robots have all the time been on the focal point within the tech panorama. They all the time discovered a spot in sci-fi films, child exhibits, books, dystopian novels, and many others. Not so way back, they have been simply sci-fi goals, however now they’re everywhere, reshaping industries and giving us a glimpse into the long run. From factories to outer area, robots are taking middle stage, displaying off their precision and adaptability like by no means earlier than. 

    The foremost objective within the panorama of robotics has all the time been the identical: mirror human dexterity. The quest for refining manipulation capabilities to reflect people has led to thrilling developments. Significant development has been made via the mixing of eye-in-hand cameras, both as enhances or substitutes for typical static third-person cameras.

    While eye-in-hand cameras maintain immense potential, they don’t assure error-free outcomes. Vision-based fashions usually wrestle with the true world’s fluctuations, resembling altering backgrounds, variable lighting, and altering object appearances, resulting in fragility. 

    To deal with this problem, a brand new set of generalization strategies have emerged not too long ago. Instead of counting on imaginative and prescient knowledge, educate robots sure motion insurance policies utilizing numerous robotic demonstration datasets. It works to some extent, however there’s a main catch. It’s costly, actually costly. Collecting such knowledge in an actual robotic setup means time-consuming duties like kinesthetic educating or robotic teleoperation via VR headsets or joysticks.

    Do we actually must depend on this costly dataset? Since the principle objective of robots is to imitate people, why can we not simply use human demonstration movies? These movies of people doing duties supply a more cost effective answer because of the agility of people. Doing so allows capturing a number of demos with out fixed robotic resets, {hardware} debugging, or arduous repositioning. This raises the intriguing chance of leveraging human video demonstrations to reinforce the generalization talents of vision-centric robotic manipulators, at scale. 

    However, bridging the hole between human and robotic realms isn’t a stroll within the park. The dissimilarities in look between people and robots introduce a distribution shift that wants cautious consideration. Let us meet with new analysis, Giving Robots a Hand, that bridges this hole. 

    Existing strategies, using third-person digital camera viewpoints, have tackled this problem with area adaptation methods involving picture translations, domain-invariant visible representations, and even leveraging keypoint details about human and robotic states.

    In distinction, Giving Robots a Hand takes a refreshingly simple route: masking a constant portion of every picture, successfully concealing the human hand or robotic end-effector. This simple methodology sidesteps the necessity for elaborate area adaptation strategies, permitting robots to study manipulation insurance policies from human movies straight. Consequently, it solves points arising from specific area adaptation strategies, like evident visible inconsistencies stemming from human-to-robot picture translations.

    The key side of Giving Robots a Hand lies within the methodology’s exploration. A way that integrates the wide-ranging eye-in-hand human video demonstrations to reinforce each surroundings and job generalization. It achieves superb efficiency throughout a spread of real-world robotic manipulation duties, encompassing reaching, greedy, pick-and-place, dice stacking, plate clearing, toy packing, and many others. The proposed methodology improves the generalization considerably. It empowers insurance policies to adapt to unfamiliar environments and novel duties that weren’t witnessed throughout robotic demonstrations. An common surge of 58% in absolute success charges in uncharted environments and duties turns into evident, as in comparison with insurance policies solely skilled on robotic demonstrations.


    Check out the Paper. All Credit For This Research Goes To the Researchers on This Project. Also, don’t neglect to hitch our 29k+ ML SubReddit, 40k+ Facebook Community, Discord Channel, and Email Newsletter, the place we share the newest AI analysis information, cool AI tasks, and extra.

    If you want our work, please comply with us on Twitter


    Ekrem Çetinkaya acquired his B.Sc. in 2018, and M.Sc. in 2019 from Ozyegin University, Istanbul, Türkiye. He wrote his M.Sc. thesis about picture denoising utilizing deep convolutional networks. He acquired his Ph.D. diploma in 2023 from the University of Klagenfurt, Austria, along with his dissertation titled “Video Coding Enhancements for HTTP Adaptive Streaming Using Machine Learning.” His analysis pursuits embody deep studying, laptop imaginative and prescient, video encoding, and multimedia networking.


    🚀 CodiumAI allows busy builders to generate significant checks (Sponsored)

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    Rationale engineering generates a compact new tool for gene therapy | Ztoog

    AI

    The AI Hype Index: College students are hooked on ChatGPT

    AI

    Learning how to predict rare kinds of failures | Ztoog

    AI

    Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

    AI

    AI learns how vision and sound are connected, without human intervention | Ztoog

    AI

    How AI is introducing errors into courtrooms

    AI

    With AI, researchers predict the location of virtually any protein within a human cell | Ztoog

    AI

    Google DeepMind’s new AI agent cracks real-world problems better than humans can

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    Crypto

    Comparing The Profitability Of The Top Meme Coins

    Holders of the highest meme cash have witnessed tough value motion up to now few…

    Crypto

    Crypto VC exits were low in Q4 2023, Phantom MAU’s reach new highs and spot bitcoin ETF volumes are still rising

    Welcome to Ztoog Crypto, previously often called Chain Reaction. To get a roundup of Ztoog’s…

    Science

    Neanderthals and modern humans intermingled in Europe 45,000 years ago

    About a decade ago, the speculation that Neanderthals had bred with Homo sapiens exterior of…

    Science

    Why dinosaur footprints inspired paleontologist Martin Lockley

    On November 25, paleontologist Martin Lockley handed on the age of 73. PopSci spoke with…

    Mobile

    Galaxy S24 breaks pre-order record as Samsung sales surge in a week

    What you must knowSamsung reportedly states its Galaxy S24 sequence has shattered its pre-order record…

    Our Picks
    Technology

    The Amazon strikes, explained | Vox

    Gadgets

    Wyze outage leaves customers without camera coverage overnight

    Gadgets

    Unicomp Mini-M Model F Keyboard Review

    Categories
    • AI (1,493)
    • Crypto (1,754)
    • Gadgets (1,805)
    • Mobile (1,851)
    • Science (1,866)
    • Technology (1,803)
    • The Future (1,649)
    Most Popular
    Crypto

    Ukraine and Binance Boast Transformed Lives with Tech Education Project

    Gadgets

    Oneplus Bullets Wireless Z2 ANC Review: Same Device With a New Trick

    Mobile

    Leaked Chromecast with Google TV remote suggests a successor may be on the way

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.