Close Menu
Ztoog
    What's Hot
    AI

    Bacterial injection system delivers proteins in mice and human cells | Ztoog

    Crypto

    Galaxy Digital and Invesco Bitcoin Spot ETF Join BlackRock On The DTCC

    Science

    A telescope happened to be pointing at the brightest supernova yet observed

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      What time tracking metrics should you track and why?

      Are entangled qubits following a quantum Moore’s law?

      Disneyland’s 70th Anniversary Brings Cartoony Chaos to This Summer’s Celebration

      Story of military airfield in Afghanistan that Biden left in 2021

      Tencent hires WizardLM team, a Microsoft AI group with an odd history

    • Technology

      Are Democrats fumbling a golden opportunity?

      Crypto elite increasingly worried about their personal safety

      Deep dive on the evolution of Microsoft's relationship with OpenAI, from its $1B investment in 2019 through Copilot rollouts and ChatGPT's launch to present day (Bloomberg)

      New leak reveals iPhone Fold won’t look like the Galaxy Z Fold 6 at all

      Apple will use AI and user data in iOS 19 to extend iPhone battery life

    • Gadgets

      The market’s down, but this OpenAI for the stock market can help you trade up

      We Hand-Picked the 24 Best Deals From the 2025 REI Anniversary Sale

      “Google wanted that”: Nextcloud decries Android permissions as “gatekeeping”

      Google Tests Automatic Password-to-Passkey Conversion On Android

      Maono Caster G1 Neo & PD200X Review: Budget Streaming Gear for Aspiring Creators

    • Mobile

      Android 16 QPR1 lets you check what fingerprints you’ve enrolled on your Pixel phone

      The Forerunner 570 & 970 have made Garmin’s tiered strategy clearer than ever

      The iPhone Fold is now being tested with an under-display camera

      T-Mobile takes over one of golf’s biggest events, unleashes unique experiences

      Fitbit’s AI experiments just leveled up with 3 new health tracking features

    • Science

      Risk of a star destroying the solar system is higher than expected

      Do these Buddhist gods hint at the purpose of China’s super-secret satellites?

      From Espresso to Eco-Brick: How Coffee Waste Fuels 3D-Printed Design

      Ancient three-eyed ‘sea moth’ used its butt to breathe

      Intelligence on Earth Evolved Independently at Least Twice

    • AI

      With AI, researchers predict the location of virtually any protein within a human cell | Ztoog

      Google DeepMind’s new AI agent cracks real-world problems better than humans can

      Study shows vision-language models can’t handle queries with negation words | Ztoog

      How a new type of AI is helping police skirt facial recognition bans

      Hybrid AI model crafts smooth, high-quality videos in seconds | Ztoog

    • Crypto

      Senate advances GENIUS Act after cloture vote passes

      Is Bitcoin Bull Run Back? Daily RSI Shows Only Mild Bullish Momentum

      Robinhood grows its footprint in Canada by acquiring WonderFi

      HashKey Group Announces Launch of HashKey Global MENA with VASP License in UAE

      Ethereum Breaks Key Resistance In One Massive Move – Higher High Confirms Momentum

    Ztoog
    Home » Researchers from Google and the University of Toronto Introduce Groundbreaking Zero-Shot Agent for Autonomous Learning and Task Execution in Live Computer Environments
    AI

    Researchers from Google and the University of Toronto Introduce Groundbreaking Zero-Shot Agent for Autonomous Learning and Task Execution in Live Computer Environments

    Facebook Twitter Pinterest WhatsApp
    Researchers from Google and the University of Toronto Introduce Groundbreaking Zero-Shot Agent for Autonomous Learning and Task Execution in Live Computer Environments
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Large language fashions (LLMs) for motion manufacturing in varied reside contexts, equivalent to ALFWORLD and ALPHACODE, have proven promise in earlier efforts. Examples embrace SAYCAN, REACT, TOOLFORMER, and SWIFTSAGE. LLMs are used equally to comply with knowledgeable trails, perceive environmental adjustments, plan and perform future actions, and compose API requests. Several research, together with REFLEXION and SELF-REFINE, have demonstrated that repeatedly performing a process with quite a few rounds of self-reflection could considerably improve process completion. LLMs are requested to change a earlier execution plan in gentle of environmental suggestions. Such changes are integrated into the motion generator’s immediate for the subsequent spherical. 

    MINIWOB++ has lately been utilized as a testbed to guage LLM’s efficiency on modularized computing workloads. Using complete hint examples of the process for direct supervision (WebGUM), self-supervision, or few/many shot prompting (SYNAPSE) are commonplace strategies for studying a process. They have accomplished dozens of pc jobs with a process completion fee higher than 90%, seemingly fixing the pc management subject. Nonetheless, the want for knowledgeable traces constrains the agent’s capability to be taught new jobs. Can an agent independently know and improve its management over a pc with out using well-chosen traces as steerage? Researchers from Google Research and the University of Toronto recommend a zero-shot agent to reply this question. 

    Their agent is constructed on prime of PaLM2, a latest LLM, and it makes use of a single set of instruction prompts for all actions fairly than task-specific prompts. Additionally, modern efforts like RCI, ADAPLANNER, and SYNAPSE use display representations that may embrace much more knowledge than what’s exhibited to the person on the display. For occasion, Fig. 1 illustrates objects which are contained in the HTML which are supplied to the LLM however should not displayed on the display. Arbitrarily, utilizing this new information makes the agent’s capability to finish the process simpler. However, in typical utilization eventualities, such data may not be simply accessible and, relying on it, may restrict how broadly the agent may be utilized. 

    Figure 1 reveals disparate shows on screens. Fig. 1a–1c reveals the social media process earlier than and after urgent the “more” button (seed=2). HTML has already made the materials seen earlier than clicking. Fig. 1d-1e: The click-tab-2 (seed=0) has the same downside.

    13 fairly troublesome jobs on MINIWOB++ that should span many screens had been rigorously evaluated, and they found that 5 of them included HTML that contained such data—multi-screen data in a single commentary. These are the contributions they made: First, in comparability to earlier research, they undertake a condensed display depiction, which makes the check surroundings extra all-encompassing and life like. Second, they supply a simple however efficient motion planner that, in a single go, exactly plans out executable operations on a state. They display that such a “naive” method can full practically all the easy duties on the MINIWOB++ benchmark utilizing the most up-to-date LLM capability. 

    To assist the agent efficiently be taught from exploratory failures and advance in harder duties, they recommend a scientific thought administration method that pulls affect from Reflexion. Their agent achieves efficiency equal to previous couple of/many-shot state-of-the-art after just a few rounds of tries. Their agent is the first zero-shot design for pc management duties that they’re conscious of, in line with analysis.


    Check out the Paper. All Credit For This Research Goes To the Researchers on This Project. Also, don’t neglect to hitch our 31k+ ML SubReddit, 40k+ Facebook Community, Discord Channel, and Email Newsletter, the place we share the newest AI analysis information, cool AI initiatives, and extra.

    If you want our work, you’ll love our publication..

    We are additionally on WhatsApp. Join our AI Channel on Whatsapp..


    Aneesh Tickoo is a consulting intern at MarktechPost. He is at present pursuing his undergraduate diploma in Data Science and Artificial Intelligence from the Indian Institute of Technology(IIT), Bhilai. He spends most of his time engaged on initiatives aimed toward harnessing the energy of machine studying. His analysis curiosity is picture processing and is keen about constructing options round it. He loves to attach with individuals and collaborate on attention-grabbing initiatives.


    ▶️ Now Watch AI Research Updates On Our Youtube Channel [Watch Now]

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    With AI, researchers predict the location of virtually any protein within a human cell | Ztoog

    AI

    Google DeepMind’s new AI agent cracks real-world problems better than humans can

    AI

    Study shows vision-language models can’t handle queries with negation words | Ztoog

    AI

    How a new type of AI is helping police skirt facial recognition bans

    AI

    Hybrid AI model crafts smooth, high-quality videos in seconds | Ztoog

    AI

    How to build a better AI benchmark

    AI

    Q&A: A roadmap for revolutionizing health care through data-driven innovation | Ztoog

    AI

    This data set helps researchers spot harmful stereotypes in LLMs

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    Gadgets

    The market’s down, but this OpenAI for the stock market can help you trade up

    We might earn income from the merchandise accessible on this web page and take part…

    Technology

    OnePlus 11R 5G, Samsung Galaxy S22 5G to Motorola Edge 30 Ultra- Technology News, Firstpost

    Ameya DalviJun 12, 2023 09:34:24 ISTIf you could have a finances of Rs 50,000 for…

    Science

    Study: Carbon offsets aren’t doing their job, overstate impact

    Enlarge / Paiter-Surui volunteers alongside “forest engineers” from a Brazillian Government help program utilizing GPS…

    Technology

    War Thunder devs apologize for accidental use of image of Space Shuttle disaster in latest update

    Gaijin Entertainment has reacted shortly to apologize for the use of imagery from the explosion…

    Gadgets

    Sticky GPS Trackers Enhance Police Tactics For Safer Suspect Apprehension

    The Police in Ramsey County, Minnesota, are utilizing Sticky GPS trackers to apprehend fleeing suspects…

    Our Picks
    Gadgets

    The best fat tire electric bikes for 2024, tested and reviewed

    The Future

    Naughty Dog Teases The Last of Us 3 Will (Eventually) Happen

    Gadgets

    98Q80C: Samsung Unveils Affordable 98-Inch QLED TV

    Categories
    • AI (1,487)
    • Crypto (1,749)
    • Gadgets (1,800)
    • Mobile (1,845)
    • Science (1,859)
    • Technology (1,796)
    • The Future (1,642)
    Most Popular
    The Future

    Solana’s price rises to $160, highest level since January 2022 as memecoin mania rises

    Gadgets

    Save $250 on an Apple M1 MacBook Air at Best Buy and Amazon before it sells out

    Technology

    One UI 6.1.1 could be coming soon with ‘video AI’ feature

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.