
    Researchers from Google and the University of Toronto Introduce Groundbreaking Zero-Shot Agent for Autonomous Learning and Task Execution in Live Computer Environments


    Earlier work, including SAYCAN, REACT, TOOLFORMER, and SWIFTSAGE, has shown that large language models (LLMs) hold promise for generating actions in a variety of live environments, such as ALFWORLD and ALPHACODE. In these systems, LLMs are used in similar ways: to follow expert trajectories, understand changes in the environment, plan and carry out future actions, and compose API requests. Several studies, including REFLEXION and SELF-REFINE, have demonstrated that repeating a task over multiple rounds of self-reflection can significantly improve task completion. The LLM is asked to revise its previous execution plan in light of feedback from the environment, and those revisions are incorporated into the action generator's prompt for the next round.

    MINIWOB++ has recently been used as a testbed to evaluate LLMs' performance on modularized computer tasks. The standard ways of learning a task rely on complete trace examples of that task, used for direct supervision (WebGUM), self-supervision, or few/many-shot prompting (SYNAPSE). These methods have completed dozens of computer tasks with a task completion rate above 90%, seemingly solving the computer control problem. Nonetheless, the need for expert traces constrains the agent's ability to learn new tasks. Can an agent independently learn and improve its control over a computer without well-chosen traces as guidance? Researchers from Google Research and the University of Toronto propose a zero-shot agent to answer this question.

    Their agent is built on top of PaLM2, a recent LLM, and it uses a single set of instruction prompts for all tasks rather than task-specific prompts. In addition, contemporary efforts such as RCI, ADAPLANNER, and SYNAPSE use screen representations that may include much more data than what is displayed to the user on the screen. For instance, Fig. 1 illustrates items that are contained in the HTML provided to the LLM but are not displayed on the screen. Access to this extra information makes the task easier for the agent to complete. However, in typical usage scenarios such information might not be readily accessible, and relying on it could limit how broadly the agent can be applied.

    Figure 1: Discrepancies between the HTML and what is displayed on screen. Fig. 1a–1c show the social media task before and after pressing the "more" button (seed=2); the HTML already exposes the content before the click. Fig. 1d–1e: the click-tab-2 task (seed=0) has a similar problem.
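The gap between the full HTML and the rendered screen can be illustrated with a small Python sketch. Everything here is hypothetical: the `Element` type, its `visible` flag, and the two helper functions are assumptions made for illustration and are not taken from the paper or from MINIWOB++.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Element:
    """One UI element from a hypothetical parsed page."""
    tag: str
    text: str
    visible: bool  # assumed to be reported by the renderer


def full_representation(elements: List[Element]) -> str:
    """Serialize every element in the HTML, including hidden ones."""
    return "\n".join(f"<{e.tag}> {e.text}" for e in elements)


def compact_representation(elements: List[Element]) -> str:
    """Serialize only the elements that are rendered on the current screen."""
    return "\n".join(f"<{e.tag}> {e.text}" for e in elements if e.visible)


# Toy page loosely modeled on the social media example: the menu items
# exist in the HTML but appear only after the "more" button is clicked.
page = [
    Element("button", "more", True),
    Element("span", "Share", False),
    Element("span", "Block", False),
]
```

With the full representation, a model can see "Share" before any click happens; with the compact one, it has to discover that clicking "more" is the first step.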

    The researchers carefully evaluated 13 fairly difficult MINIWOB++ tasks that are meant to span multiple screens and found that 5 of them included HTML containing such information, that is, multi-screen information in a single observation. Their contributions are as follows. First, in contrast to earlier studies, they adopt a compact screen representation, which makes the test environment more general and realistic. Second, they provide a simple yet efficient action planner that, in a single pass, accurately plans out executable operations on a state. They demonstrate that such a "naive" approach can complete nearly all the easy tasks on the MINIWOB++ benchmark using the capabilities of the most recent LLMs.
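As a rough illustration of planning executable operations on a state in a single pass, the sketch below asks a model once for a full list of actions and parses the reply. The prompt wording, the `verb target` action format, and the `fake_llm` stub are all assumptions made for this example; the paper's actual prompts and action space are not reproduced here.

```python
from typing import Callable, List, Tuple

Action = Tuple[str, str]  # (verb, target): a hypothetical action format


def plan_actions(llm: Callable[[str], str], instruction: str, screen: str) -> List[Action]:
    """Ask the model once for a whole plan and parse it into actions."""
    prompt = (
        f"Task: {instruction}\n"
        f"Screen:\n{screen}\n"
        "List one action per line as 'verb target'."
    )
    actions: List[Action] = []
    for line in llm(prompt).splitlines():
        parts = line.strip().split(maxsplit=1)
        if len(parts) == 2:  # skip blank or malformed lines
            actions.append((parts[0].lower(), parts[1]))
    return actions


def fake_llm(prompt: str) -> str:
    """Stand-in for a real model call; returns a fixed two-step plan."""
    return "click more\nclick Share\n"


plan = plan_actions(fake_llm, "Share the post", "<button> more")
```

A real agent would execute each parsed action against the environment and stop as soon as the screen no longer matches what the plan expected.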

    To help the agent learn from exploratory failures and make progress on harder tasks, they propose a systematic thought management approach that draws on Reflexion. After just a few rounds of attempts, their agent achieves performance comparable to the prior few/many-shot state of the art. According to the researchers, it is the first zero-shot design for computer control tasks that they are aware of.
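The retry-and-reflect idea can be sketched in Python as a loop that carries notes from failed attempts into the next one. This is a minimal sketch of a Reflexion-style loop in general, not the authors' thought management method; `run_with_reflection`, `toy_attempt`, and `toy_reflect` are hypothetical names, and the toy functions stand in for real environment and model calls.

```python
from typing import Callable, List, Tuple


def run_with_reflection(
    attempt: Callable[[List[str]], Tuple[bool, str]],
    reflect: Callable[[str], str],
    max_rounds: int = 3,
) -> Tuple[bool, List[str]]:
    """Retry a task, feeding reflections on past failures into each new try."""
    notes: List[str] = []
    for _ in range(max_rounds):
        success, feedback = attempt(notes)
        if success:
            return True, notes
        # Turn the environment feedback into a note for the next round.
        notes.append(reflect(feedback))
    return False, notes


def toy_attempt(notes: List[str]) -> Tuple[bool, str]:
    """Succeeds only once a note tells the agent to click 'more' first."""
    if any("more" in note for note in notes):
        return True, "task completed"
    return False, "Share button not found on screen"


def toy_reflect(feedback: str) -> str:
    """Stand-in for a model call that explains the failure."""
    return f"Last try failed ({feedback}); click 'more' first."


ok, notes = run_with_reflection(toy_attempt, toy_reflect)
```

In this toy run the first attempt fails, the reflection is stored, and the second attempt succeeds, which mirrors the article's point that a few rounds of attempts can be enough.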


    Check out the Paper. All credit for this research goes to the researchers on this project.



    Aneesh Tickoo is a consulting intern at MarktechPost. He is currently pursuing his undergraduate degree in Data Science and Artificial Intelligence at the Indian Institute of Technology (IIT), Bhilai. He spends most of his time working on projects aimed at harnessing the power of machine learning. His research interest is image processing, and he is passionate about building solutions around it. He loves to connect with people and collaborate on interesting projects.


