Close Menu
Ztoog
    What's Hot
    The Future

    How Implementing No-Code Automation Eliminates Employees Pain Points

    Technology

    Get to Know the IEEE Board of Directors – June 2024

    Gadgets

    Indoors or out, this $129 Kapsule Wireless Speaker is your ultimate audio companion

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      How to Get Bot Lobbies in Fortnite? (2025 Guide)

      Can work-life balance tracking improve well-being?

      Any wall can be turned into a camera to see around corners

      JD Vance and President Trump’s Sons Hype Bitcoin at Las Vegas Conference

      AI may already be shrinking entry-level jobs in tech, new research suggests

    • Technology

      What does a millennial midlife crisis look like?

      Elon Musk tries to stick to spaceships

      A Replit employee details a critical security flaw in web apps created using AI-powered app builder Lovable that exposes API keys and personal info of app users (Reed Albergotti/Semafor)

      Gemini in Google Drive can now help you skip watching that painfully long Zoom meeting

      Apple iPhone exports from China to the US fall 76% as India output surges

    • Gadgets

      Watch Apple’s WWDC 2025 keynote right here

      Future-proof your career by mastering AI skills for just $20

      8 Best Vegan Meal Delivery Services and Kits (2025), Tested and Reviewed

      Google Home is getting deeper Gemini integration and a new widget

      Google Announces AI Ultra Subscription Plan With Premium Features

    • Mobile

      YouTube is testing a leaderboard to show off top live stream fans

      Deals: the Galaxy S25 series comes with a free tablet, Google Pixels heavily discounted

      Microsoft is done being subtle – this new tool screams “upgrade now”

      Wallpaper Wednesday: Android wallpapers 2025-05-28

      Google can make smart glasses accessible with Warby Parker, Gentle Monster deals

    • Science

      June skygazing: A strawberry moon, the summer solstice… and Asteroid Day!

      Analysts Say Trump Trade Wars Would Harm the Entire US Energy Sector, From Oil to Solar

      Do we have free will? Quantum experiments may soon reveal the answer

      Was Planet Nine exiled from the solar system as a baby?

      How farmers can help rescue water-loving birds

    • AI

      Fueling seamless AI at scale

      Rationale engineering generates a compact new tool for gene therapy | Ztoog

      The AI Hype Index: College students are hooked on ChatGPT

      Learning how to predict rare kinds of failures | Ztoog

      Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

    • Crypto

      Bitcoin Maxi Isn’t Buying Hype Around New Crypto Holding Firms

      GameStop bought $500 million of bitcoin

      CoinW Teams Up with Superteam Europe to Conclude Solana Hackathon and Accelerate Web3 Innovation in Europe

      Ethereum Net Flows Turn Negative As Bulls Push For $3,500

      Bitcoin’s Power Compared To Nuclear Reactor By Brazilian Business Leader

    Ztoog
    Home » MIT researchers develop an efficient way to train more reliable AI agents | Ztoog
    AI

    MIT researchers develop an efficient way to train more reliable AI agents | Ztoog

    Facebook Twitter Pinterest WhatsApp
    MIT researchers develop an efficient way to train more reliable AI agents | Ztoog
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Fields starting from robotics to drugs to political science are trying to train AI programs to make significant choices of every kind. For instance, utilizing an AI system to intelligently management site visitors in a congested metropolis might assist motorists attain their locations sooner, whereas bettering security or sustainability.

    Unfortunately, instructing an AI system to make good choices is not any simple job.

    Reinforcement studying fashions, which underlie these AI decision-making programs, nonetheless typically fail when confronted with even small variations within the duties they’re educated to carry out. In the case of site visitors, a mannequin would possibly battle to management a set of intersections with completely different pace limits, numbers of lanes, or site visitors patterns.

    To enhance the reliability of reinforcement studying fashions for complicated duties with variability, MIT researchers have launched a more efficient algorithm for coaching them.

    The algorithm strategically selects the most effective duties for coaching an AI agent so it will probably successfully carry out all duties in a set of associated duties. In the case of site visitors sign management, every job might be one intersection in a job house that features all intersections within the metropolis.

    By specializing in a smaller variety of intersections that contribute probably the most to the algorithm’s total effectiveness, this technique maximizes efficiency whereas conserving the coaching price low.

    The researchers discovered that their method was between 5 and 50 occasions more efficient than customary approaches on an array of simulated duties. This acquire in effectivity helps the algorithm study a greater resolution in a sooner method, in the end bettering the efficiency of the AI agent.

    “We were able to see incredible performance improvements, with a very simple algorithm, by thinking outside the box. An algorithm that is not very complicated stands a better chance of being adopted by the community because it is easier to implement and easier for others to understand,” says senior writer Cathy Wu, the Thomas D. and Virginia W. Cabot Career Development Associate Professor in Civil and Environmental Engineering (CEE) and the Institute for Data, Systems, and Society (IDSS), and a member of the Laboratory for Information and Decision Systems (LIDS).

    She is joined on the paper by lead writer Jung-Hoon Cho, a CEE graduate pupil; Vindula Jayawardana, a graduate pupil within the Department of Electrical Engineering and Computer Science (EECS); and Sirui Li, an IDSS graduate pupil. The analysis will likely be introduced on the Conference on Neural Information Processing Systems.

    Finding a center floor

    To train an algorithm to management site visitors lights at many intersections in a metropolis, an engineer would sometimes select between two predominant approaches. She can train one algorithm for every intersection independently, utilizing solely that intersection’s information, or train a bigger algorithm utilizing information from all intersections after which apply it to each.

    But every strategy comes with its share of downsides. Training a separate algorithm for every job (equivalent to a given intersection) is a time-consuming course of that requires an monumental quantity of information and computation, whereas coaching one algorithm for all duties typically leads to subpar efficiency.

    Wu and her collaborators sought a candy spot between these two approaches.

    For their technique, they select a subset of duties and train one algorithm for every job independently. Importantly, they strategically choose particular person duties that are almost certainly to enhance the algorithm’s total efficiency on all duties.

    They leverage a standard trick from the reinforcement studying discipline referred to as zero-shot switch studying, wherein an already educated mannequin is utilized to a brand new job with out being additional educated. With switch studying, the mannequin typically performs remarkably effectively on the brand new neighbor job.

    “We know it would be ideal to train on all the tasks, but we wondered if we could get away with training on a subset of those tasks, apply the result to all the tasks, and still see a performance increase,” Wu says.

    To establish which duties they need to choose to maximize anticipated efficiency, the researchers developed an algorithm referred to as Model-Based Transfer Learning (MBTL).

    The MBTL algorithm has two items. For one, it fashions how effectively every algorithm would carry out if it have been educated independently on one job. Then it fashions how a lot every algorithm’s efficiency would degrade if it have been transferred to one another job, an idea referred to as generalization efficiency.

    Explicitly modeling generalization efficiency permits MBTL to estimate the worth of coaching on a brand new job.

    MBTL does this sequentially, selecting the duty which leads to the best efficiency acquire first, then choosing further duties that present the most important subsequent marginal enhancements to total efficiency.

    Since MBTL solely focuses on probably the most promising duties, it will probably dramatically enhance the effectivity of the coaching course of.

    Reducing coaching prices

    When the researchers examined this method on simulated duties, together with controlling site visitors alerts, managing real-time pace advisories, and executing a number of basic management duties, it was 5 to 50 occasions more efficient than different strategies.

    This means they may arrive on the similar resolution by coaching on far much less information. For occasion, with a 50x effectivity enhance, the MBTL algorithm might train on simply two duties and obtain the identical efficiency as a regular technique which makes use of information from 100 duties.

    “From the perspective of the two main approaches, that means data from the other 98 tasks was not necessary or that training on all 100 tasks is confusing to the algorithm, so the performance ends up worse than ours,” Wu says.

    With MBTL, including even a small quantity of further coaching time may lead to significantly better efficiency.

    In the longer term, the researchers plan to design MBTL algorithms that may prolong to more complicated issues, equivalent to high-dimensional job areas. They are additionally desirous about making use of their strategy to real-world issues, particularly in next-generation mobility programs.

    The analysis is funded, partly, by a National Science Foundation CAREER Award, the Kwanjeong Educational Foundation PhD Scholarship Program, and an Amazon Robotics PhD Fellowship.

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    Fueling seamless AI at scale

    AI

    Rationale engineering generates a compact new tool for gene therapy | Ztoog

    AI

    The AI Hype Index: College students are hooked on ChatGPT

    AI

    Learning how to predict rare kinds of failures | Ztoog

    AI

    Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

    AI

    AI learns how vision and sound are connected, without human intervention | Ztoog

    AI

    How AI is introducing errors into courtrooms

    AI

    With AI, researchers predict the location of virtually any protein within a human cell | Ztoog

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    Crypto

    Why Is The Bitcoin Price Up Today?

    After the Bitcoin worth reached a three-month low of $24.835 final week, the bulls at…

    Mobile

    Fortnite said to return to the US iOS App Store next week following court verdict

    Following as we speak’s court verdict that blew a relatively massive gap via Apple’s grubby…

    AI

    Top Python Programming Books to Read in 2024

    Python is a general-purpose programming language and is without doubt one of the hottest languages…

    Mobile

    Google Chrome is getting five big address bar updates

    The most used net browser on Earth is getting some vital updates for its address…

    The Future

    Anthony Carrigan Joins Superman: Legacy as Metamorpho

    Image: Frazer Harrison/Getty Images (Getty Images)Superman: Legacy is including one other superhero to its roster,…

    Our Picks
    Science

    X-37B: Space Force’s secretive space plane is making its highest flight yet

    Crypto

    Bitcoin Insider Trading Suspicions Take Root Following Grayscale Win, What’s Happening?

    AI

    A better way to study ocean currents | Ztoog

    Categories
    • AI (1,494)
    • Crypto (1,754)
    • Gadgets (1,806)
    • Mobile (1,852)
    • Science (1,867)
    • Technology (1,804)
    • The Future (1,650)
    Most Popular
    Crypto

    Make or Break Season? Crypto Analyst Predicts A Fall To $1,200 If Ethereum Stays Beneath This Level

    AI

    Graph neural networks in TensorFlow – Google Research Blog

    Science

    Hippos Are in Trouble. Will ‘Endangered’ Status Save Them?

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.