    LUMOS: An Open-Source Generalizable Language Agent Training Framework


    Imagine a digital assistant that can not only answer your questions but also navigate the web, solve complex math problems, write code, and even reason about images and text-based games. Sound too good to be true? Well, brace yourselves, because the future of artificial intelligence just got a whole lot more accessible and transparent with the introduction of LUMOS.

    Researchers from the Allen Institute for AI, UCLA, and the University of Washington have unveiled LUMOS, an open-source framework for training language agents. Unlike existing closed-source solutions that often feel like black boxes, LUMOS is built for affordability, transparency, and reproducibility.

    But what exactly is LUMOS, and why is it causing such a stir in the AI community? Buckle up, because we're about to dive into the nitty-gritty details: how it works, what it can do, and why it matters more than you might think.

    Current language agents often rely on large, closed-source language models like GPT-4 or ChatGPT as the core component. While powerful, these models are expensive, lack transparency, and offer limited reproducibility and controllability.

    The LUMOS framework takes a different approach by using open-source large language models (LLMs) as the base models. It employs a unified and modular architecture consisting of three key components: a planning module, a grounding module, and an execution module.

    The planning module decomposes complex tasks into a sequence of high-level subgoals expressed in natural language. For example, for a multimodal question like “The device in her hand is from which country?”, the planning module might generate two subgoals: “Identify the brand of the device” and “Answer the country of the device brand.”

    The grounding module then translates these high-level subgoals into executable low-level actions that can be run by various tools in the execution module. For instance, the first subgoal might be grounded into an action like “VQA(<img>, What is the brand..?)” to identify the device brand from the image using a visual question-answering tool.
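To make the idea of a grounded action concrete, here is a minimal Python sketch that splits an action string of the form shown above into a tool name and its arguments. The `Action` type, the `parse_action` helper, and the `ToolName(arg1, arg2, ...)` syntax rule are assumptions made for illustration; the article does not describe how LUMOS itself parses actions.

```python
import re
from typing import List, NamedTuple


class Action(NamedTuple):
    """A grounded action: which tool to call and with what arguments."""
    tool: str
    args: List[str]


# Assumed syntax (hypothetical): ToolName(arg1, arg2, ...)
_ACTION_RE = re.compile(r"^\s*(\w+)\((.*)\)\s*$", re.DOTALL)


def parse_action(text: str) -> Action:
    """Split an action string into a tool name and a list of arguments."""
    match = _ACTION_RE.match(text)
    if match is None:
        raise ValueError(f"not a grounded action: {text!r}")
    tool, raw_args = match.groups()
    # Naive comma split: an argument that itself contains a comma
    # would need a stricter grammar than this sketch provides.
    args = [part.strip() for part in raw_args.split(",")] if raw_args.strip() else []
    return Action(tool, args)


action = parse_action("VQA(<img>, What is the brand..?)")
print(action.tool, action.args)
```

Once an action is in this structured form, an execution layer can look up the tool by name and pass it the arguments.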

    The execution module contains a collection of off-the-shelf tools, including APIs, neural models, and virtual simulators, that can execute the grounded actions. The results of these executed actions are then fed back into the planning and grounding modules, enabling iterative and adaptive agent behavior.
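The plan, ground, execute loop described above can be sketched in a few lines of Python. This is an illustrative outline under stated assumptions, not the LUMOS implementation: `run_agent`, the toy planner and grounder, the `QA` tool name, and the placeholder results are all hypothetical stand-ins for what would be fine-tuned LLMs and real tools.

```python
from typing import Callable, Dict, List, Optional, Tuple

History = List[Tuple[str, str]]  # (subgoal, execution result) pairs


def run_agent(
    task: str,
    plan: Callable[[str, History], Optional[str]],
    ground: Callable[[str, History], Tuple[str, str]],
    tools: Dict[str, Callable[[str], str]],
    max_steps: int = 8,
) -> History:
    """Iterate plan -> ground -> execute, feeding results back each round."""
    history: History = []
    for _ in range(max_steps):
        subgoal = plan(task, history)            # planning module
        if subgoal is None:                      # planner signals completion
            break
        tool_name, tool_input = ground(subgoal, history)  # grounding module
        result = tools[tool_name](tool_input)    # execution module
        history.append((subgoal, result))
    return history


# Toy stand-ins (hypothetical) for the device-brand example.
SUBGOALS = [
    "Identify the brand of the device",
    "Answer the country of the device brand",
]


def toy_plan(task: str, history: History) -> Optional[str]:
    return SUBGOALS[len(history)] if len(history) < len(SUBGOALS) else None


def toy_ground(subgoal: str, history: History) -> Tuple[str, str]:
    if not history:
        return "VQA", "What is the brand..?"
    # Uses the previous result, showing why feedback matters.
    return "QA", f"Which country is {history[-1][1]} from?"


toy_tools = {
    "VQA": lambda question: "ExampleBrand",    # placeholder answer
    "QA": lambda question: "ExampleCountry",   # placeholder answer
}

steps = run_agent(
    "The device in her hand is from which country?",
    toy_plan, toy_ground, toy_tools,
)
print(steps)
```

Because the loop only depends on the three callables and the tool table, any one of them can be replaced without touching the others, which mirrors the modularity the framework emphasizes.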

    One of the key advantages of LUMOS is its modular design, which allows for easy upgrades and wider applicability to diverse interactive tasks. By separating the planning, grounding, and execution components, researchers can improve or replace individual modules without affecting the others.

    To train LUMOS, the researchers curated a large-scale, high-quality dataset of over 56,000 annotations derived from diverse ground-truth reasoning rationales across various complex interactive tasks, including question answering, mathematics, coding, web browsing, and multimodal reasoning. These annotations were obtained by using GPT-4 and other advanced language models to convert existing benchmarks into a unified format compatible with the LUMOS architecture. The resulting dataset is one of the largest open-source resources for agent fine-tuning, enabling smaller language models to be trained effectively as language agents.

    In evaluations across 9 datasets, LUMOS exhibited several key advantages. It outperformed multiple larger open-source agents on held-out datasets for each task type, even surpassing GPT agents on question-answering and web tasks in some cases. LUMOS also outperformed agents produced by other training methods, such as chain-of-thought and unmodularized integrated training. Notably, LUMOS demonstrated impressive generalization capabilities, significantly outperforming 30B-scale agents (WizardLM-30B and Vicuna-v1.3-33B) and domain-specific agents on unseen tasks involving new environments and actions.

    With its open-source nature, competitive performance, and strong generalization abilities, LUMOS represents a significant step forward in developing affordable, transparent, and reproducible language agents for complex interactive tasks.





    Vibhanshu Patidar is a consulting intern at MarktechPost. He is currently pursuing a B.S. at the Indian Institute of Technology (IIT) Kanpur. He is a robotics and machine learning enthusiast with a knack for unraveling the complexities of algorithms that bridge theory and practical applications.

