    MIT researchers develop an efficient way to train more reliable AI agents


    Fields ranging from robotics to medicine to political science are attempting to train AI systems to make meaningful decisions of all kinds. For example, using an AI system to intelligently control traffic in a congested city could help motorists reach their destinations faster, while improving safety or sustainability.

    Unfortunately, teaching an AI system to make good decisions is no easy task.

    Reinforcement learning models, which underlie these AI decision-making systems, still often fail when faced with even small variations in the tasks they are trained to perform. In the case of traffic, a model might struggle to control a set of intersections with different speed limits, numbers of lanes, or traffic patterns.

    To boost the reliability of reinforcement learning models for complex tasks with variability, MIT researchers have introduced a more efficient algorithm for training them.

    The algorithm strategically selects the best tasks for training an AI agent so it can effectively perform all tasks in a collection of related tasks. In the case of traffic signal control, each task could be one intersection in a task space that includes all intersections in the city.

    By focusing on a smaller number of intersections that contribute the most to the algorithm’s overall effectiveness, this method maximizes performance while keeping the training cost low.

    The researchers found that their technique was between five and 50 times more efficient than standard approaches on an array of simulated tasks. This gain in efficiency helps the algorithm learn a better solution faster, ultimately improving the performance of the AI agent.

    “We were able to see incredible performance improvements, with a very simple algorithm, by thinking outside the box. An algorithm that is not very complicated stands a better chance of being adopted by the community because it is easier to implement and easier for others to understand,” says senior author Cathy Wu, the Thomas D. and Virginia W. Cabot Career Development Associate Professor in Civil and Environmental Engineering (CEE) and the Institute for Data, Systems, and Society (IDSS), and a member of the Laboratory for Information and Decision Systems (LIDS).

    She is joined on the paper by lead author Jung-Hoon Cho, a CEE graduate student; Vindula Jayawardana, a graduate student in the Department of Electrical Engineering and Computer Science (EECS); and Sirui Li, an IDSS graduate student. The research will be presented at the Conference on Neural Information Processing Systems.

    Finding a middle ground

    To train an algorithm to control traffic lights at many intersections in a city, an engineer would typically choose between two main approaches. She can train one algorithm for each intersection independently, using only that intersection’s data, or train a larger algorithm using data from all intersections and then apply it to each one.

    But each approach comes with its share of downsides. Training a separate algorithm for each task (such as a given intersection) is a time-consuming process that requires an enormous amount of data and computation, while training one algorithm for all tasks often leads to subpar performance.

    Wu and her collaborators sought a sweet spot between these two approaches.

    For their method, they choose a subset of tasks and train one algorithm for each task independently. Importantly, they strategically select the individual tasks that are most likely to improve the algorithm’s overall performance on all tasks.

    They leverage a common trick from the reinforcement learning field called zero-shot transfer learning, in which an already trained model is applied to a new task without being further trained. With transfer learning, the model often performs remarkably well on the new neighbor task.
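    To make zero-shot transfer concrete, here is a minimal toy sketch, not the researchers' setup. Everything in it is an assumption for illustration: each task is identified by a single number (imagine a speed limit), the hypothetical `reward` function is highest when the chosen action matches that number, and "training" is just a search over candidate actions on the source task.

```python
from typing import Sequence


def reward(action: float, context: float) -> float:
    """Toy reward (an assumption): best when the action equals the task's context value."""
    return -(action - context) ** 2


def train_on_task(context: float, candidates: Sequence[float]) -> float:
    """'Train' on one task by picking the candidate action with the highest reward there."""
    return max(candidates, key=lambda a: reward(a, context))


def zero_shot_eval(action: float, target_context: float) -> float:
    """Apply the already trained action to a different task with no further training."""
    return reward(action, target_context)


candidates = [float(a) for a in range(0, 101)]
policy = train_on_task(30.0, candidates)  # trained on the source task only

for target in (30.0, 35.0, 50.0):
    # Performance degrades as the target task moves away from the source task.
    print(target, zero_shot_eval(policy, target))
```

    In this toy, the trained action still scores reasonably on a nearby task and worse on a distant one, which is the pattern of degradation that the next step tries to model.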

    “We know it would be ideal to train on all the tasks, but we wondered if we could get away with training on a subset of those tasks, apply the result to all the tasks, and still see a performance increase,” Wu says.

    To identify which tasks they should select to maximize expected performance, the researchers developed an algorithm called Model-Based Transfer Learning (MBTL).

    The MBTL algorithm has two pieces. First, it models how well each algorithm would perform if it were trained independently on one task. Then it models how much each algorithm’s performance would degrade if it were transferred to each other task, a concept known as generalization performance.

    Explicitly modeling generalization performance allows MBTL to estimate the value of training on a new task.

    MBTL does this sequentially, first choosing the task that leads to the highest performance gain, then selecting additional tasks that provide the biggest subsequent marginal improvements to overall performance.
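    The sequential selection described above can be sketched as a greedy loop. This is an illustrative sketch under stated assumptions, not the authors' implementation: tasks are placed on a one-dimensional context axis, `train_perf` is a hypothetical estimate of how well a model trained on each task performs on that same task, and the drop in performance under transfer is assumed to grow linearly with the distance between tasks at a rate `slope`. The function names are invented for this example.

```python
from typing import List, Sequence, Tuple


def transfer_estimate(train_perf: Sequence[float], contexts: Sequence[float],
                      slope: float, source: int, target: int) -> float:
    """Estimated performance on `target` of a model trained on `source`.

    Assumption: performance degrades linearly with context distance.
    """
    gap = slope * abs(contexts[source] - contexts[target])
    return train_perf[source] - gap


def greedy_select(contexts: Sequence[float], train_perf: Sequence[float],
                  slope: float, budget: int) -> Tuple[List[int], List[float]]:
    """Pick up to `budget` training tasks, one at a time.

    Each round adds the task whose model most improves the estimated total
    performance, where every target task is served by the best selected model.
    """
    n = len(contexts)
    best = [float("-inf")] * n  # best estimated performance per target so far
    selected: List[int] = []
    for _ in range(min(budget, n)):
        pick, pick_total = -1, float("-inf")
        for s in range(n):
            if s in selected:
                continue
            total = sum(
                max(best[t], transfer_estimate(train_perf, contexts, slope, s, t))
                for t in range(n)
            )
            if total > pick_total:
                pick, pick_total = s, total
        selected.append(pick)
        best = [
            max(best[t], transfer_estimate(train_perf, contexts, slope, pick, t))
            for t in range(n)
        ]
    return selected, best
```

    For example, with five evenly spaced tasks, equal `train_perf` values, and a budget of one, this sketch picks the middle task, since a model trained there is estimated to lose the least when transferred to all the others.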

    Since MBTL focuses only on the most promising tasks, it can dramatically improve the efficiency of the training process.

    Reducing training costs

    When the researchers tested this technique on simulated tasks, including controlling traffic signals, managing real-time speed advisories, and executing several classic control tasks, it was five to 50 times more efficient than other methods.

    This means they could arrive at the same solution by training on far less data. For instance, with a 50x efficiency boost, the MBTL algorithm could train on just two tasks and achieve the same performance as a standard method that uses data from 100 tasks.
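    The arithmetic behind that example is a single division. The helper below is a hypothetical illustration that assumes, as the example does, that an efficiency factor translates directly into proportionally fewer training tasks.

```python
import math


def tasks_needed(baseline_tasks: int, efficiency: float) -> int:
    """Tasks to train on to match a baseline that uses `baseline_tasks` tasks.

    Assumption: an efficiency factor of k means k times fewer tasks are needed.
    """
    return math.ceil(baseline_tasks / efficiency)


print(tasks_needed(100, 50))  # the 50x case from the text: 2 tasks
print(tasks_needed(100, 5))   # the low end of the reported range: 20 tasks
```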

    “From the perspective of the two main approaches, that means data from the other 98 tasks was not necessary or that training on all 100 tasks is confusing to the algorithm, so the performance ends up worse than ours,” Wu says.

    With MBTL, adding even a small amount of additional training time could lead to much better performance.

    In the future, the researchers plan to design MBTL algorithms that can extend to more complex problems, such as high-dimensional task spaces. They are also interested in applying their approach to real-world problems, especially in next-generation mobility systems.

    The research is funded, in part, by a National Science Foundation CAREER Award, the Kwanjeong Educational Foundation PhD Scholarship Program, and an Amazon Robotics PhD Fellowship.
