Ztoog

    LUMOS: An Open-Source Generalizable Language Agent Training Framework

    Imagine having a digital assistant that can not only answer your questions but also navigate the web, solve complex math problems, write code, and even reason about images and text-based games. Sound too good to be true? Well, brace yourselves, because the future of artificial intelligence just got a whole lot more accessible and transparent with the introduction of LUMOS.

    In a groundbreaking development, researchers from the Allen Institute for AI, UCLA, and the University of Washington have unveiled LUMOS, an open-source framework that promises to revolutionize the way we interact with language agents. Unlike existing closed-source solutions that often feel like black boxes, LUMOS offers an unprecedented level of affordability, transparency, and reproducibility, making it a game-changer in the world of AI.

    But what exactly is LUMOS, and why is it causing such a stir in the AI community? Buckle up, because we’re about to dive into the nitty-gritty details of this remarkable innovation, exploring how it works, what it can do, and why it matters more than you might think.

    Current language agents often rely on large, closed-source language models like GPT-4 or ChatGPT as the core component. While powerful, these models are expensive, lack transparency, and offer limited reproducibility and controllability.

    The LUMOS framework takes a different approach by using open-source large language models (LLMs) as the base models. It employs a unified and modular architecture consisting of three key components: a planning module, a grounding module, and an execution module.

    The planning module decomposes complex tasks into a sequence of high-level subgoals expressed in natural language. For example, for a multimodal question like “The device in her hand is from which country?”, the planning module might generate two subgoals: “Identify the brand of the device” and “Answer the country of the device brand.”

    The grounding module then translates these high-level subgoals into executable low-level actions that can be carried out by various tools in the execution module. For instance, the first subgoal might be grounded into an action like “VQA(<img>, What is the brand..?)” to identify the device brand from the image using a visual question-answering tool.
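
    To make the idea of a grounded action concrete, here is a minimal Python sketch that splits an action string like the one above into a tool name and its arguments. The `ToolName(arg1, arg2)` syntax is inferred from the article's single example, and `Action`, `ACTION_PATTERN`, and `parse_action` are hypothetical names; the actual LUMOS action grammar may differ.

```python
import re
from typing import NamedTuple


class Action(NamedTuple):
    tool: str        # tool name, e.g. "VQA"
    args: list[str]  # raw argument strings, in order


# Hypothetical action syntax: ToolName(arg1, arg2, ...), modeled on the
# article's example. The real LUMOS action grammar may differ.
ACTION_PATTERN = re.compile(r"^\s*(\w+)\((.*)\)\s*$", re.DOTALL)


def parse_action(text: str) -> Action:
    """Split a grounded action string into a tool name and its arguments."""
    match = ACTION_PATTERN.match(text)
    if match is None:
        raise ValueError(f"not a valid action: {text!r}")
    tool, raw_args = match.groups()
    # Naive split on commas; arguments that themselves contain commas
    # would need a real parser.
    args = [arg.strip() for arg in raw_args.split(",")] if raw_args.strip() else []
    return Action(tool, args)


action = parse_action("VQA(<img>, What is the brand..?)")
print(action.tool)  # VQA
print(action.args)  # ['<img>', 'What is the brand..?']
```

    Keeping actions as plain strings with a fixed shape is what lets a language model emit them and a small parser hand them to the right tool.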

    The execution module contains a collection of off-the-shelf tools, including APIs, neural models, and virtual simulators, that can execute the grounded actions. The results of these executed actions are then fed back into the planning and grounding modules, enabling iterative and adaptive agent behavior.
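
    The plan, ground, execute cycle described above can be sketched as a short loop. This is an illustration under stated assumptions, not the LUMOS implementation: `plan`, `ground`, `TOOLS`, and `run_agent` are hypothetical names, the planner and grounder are hard-coded stubs standing in for fine-tuned models, and the tools return canned placeholder answers.

```python
from typing import Callable, Optional

# All names here are hypothetical stand-ins, not the LUMOS API.
Tool = Callable[[str], str]
History = list[tuple[str, str]]  # (subgoal, result) pairs


def plan(task: str, history: History) -> Optional[str]:
    """Stub planner: return the next subgoal, or None when done.

    A real planning module would be a fine-tuned LLM conditioned on the
    task and on earlier results; this stub walks a fixed list.
    """
    subgoals = [
        "Identify the brand of the device",
        "Answer the country of the device brand",
    ]
    return subgoals[len(history)] if len(history) < len(subgoals) else None


def ground(subgoal: str, history: History) -> tuple[str, str]:
    """Stub grounder: map a subgoal to (tool name, tool argument)."""
    if subgoal.startswith("Identify"):
        return "VQA", "What is the brand of the device?"
    # Reuse the previous execution result, as the feedback loop allows.
    previous_result = history[-1][1]
    return "QA", f"Which country is {previous_result} from?"


# Execution module: a registry of tools. These return canned answers.
TOOLS: dict[str, Tool] = {
    "VQA": lambda question: "ExampleBrand",
    "QA": lambda question: "ExampleCountry",
}


def run_agent(task: str, max_steps: int = 10) -> History:
    """Alternate plan -> ground -> execute, feeding each result back in."""
    history: History = []
    for _ in range(max_steps):
        subgoal = plan(task, history)
        if subgoal is None:
            break
        tool_name, argument = ground(subgoal, history)
        result = TOOLS[tool_name](argument)
        history.append((subgoal, result))
    return history


for subgoal, result in run_agent("The device in her hand is from which country?"):
    print(f"{subgoal} -> {result}")
```

    Because each stage only sees the task and the shared history, any one of the three pieces can be swapped for a different model or tool set without touching the other two.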

    One of the key advantages of LUMOS is its modular design, which allows for easy upgrades and wider applicability to diverse interactive tasks. By separating the planning, grounding, and execution components, researchers can improve or replace individual modules without affecting the others.

    To train LUMOS, the researchers curated a large-scale, high-quality dataset of over 56,000 annotations derived from diverse ground-truth reasoning rationales across various complex interactive tasks, including question answering, mathematics, coding, web browsing, and multimodal reasoning. These annotations were obtained by using GPT-4 and other advanced language models to convert existing benchmarks into a unified format compatible with the LUMOS architecture. The resulting dataset is one of the largest open-source resources for agent fine-tuning, enabling smaller language models to be trained effectively as language agents.
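
    The article does not show the unified format itself, so the following Python sketch only illustrates what such a record could look like and how one annotation might yield separate training examples for the planning and grounding modules. Every field name, the `Calculator(...)` and `Finish(...)` actions, and the sample task are assumptions made for illustration, not taken from the LUMOS dataset.

```python
import json
from dataclasses import dataclass


@dataclass
class Step:
    subgoal: str  # high-level step in natural language
    action: str   # low-level executable action for that step


@dataclass
class Annotation:
    task_type: str  # e.g. "math", "qa", "web" (hypothetical labels)
    task: str
    steps: list[Step]


def to_training_pairs(annotation: Annotation) -> dict[str, list[dict[str, str]]]:
    """Derive separate planning and grounding examples from one annotation."""
    planning: list[dict[str, str]] = []
    grounding: list[dict[str, str]] = []
    completed: list[str] = []
    for step in annotation.steps:
        # The planner sees the task plus the subgoals already completed.
        context = annotation.task + "".join(f"\nDone: {s}" for s in completed)
        planning.append({"input": context, "output": step.subgoal})
        # The grounder maps one subgoal to one executable action.
        grounding.append({"input": step.subgoal, "output": step.action})
        completed.append(step.subgoal)
    return {"planning": planning, "grounding": grounding}


# Illustrative record, invented for this sketch.
example = Annotation(
    task_type="math",
    task="Tom has 3 boxes with 4 apples each. How many apples does he have?",
    steps=[
        Step("Calculate the total number of apples", "Calculator(3 * 4)"),
        Step("Report the total", "Finish(12)"),
    ],
)

print(json.dumps(to_training_pairs(example), indent=2))
```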

    In evaluations across nine datasets, LUMOS exhibited several key advantages. It outperformed multiple larger open-source agents on held-out datasets for each task type, even surpassing GPT agents on question-answering and web tasks in some cases. LUMOS also outperformed agents produced by other training methods, such as chain-of-thought and unmodularized integrated training. Notably, LUMOS demonstrated impressive generalization capabilities, significantly outperforming 30B-scale agents (WizardLM-30B and Vicuna-v1.3-33B) and domain-specific agents on unseen tasks involving new environments and actions.

    With its open-source nature, competitive performance, and strong generalization abilities, LUMOS represents a significant step forward in developing affordable, transparent, and reproducible language agents for complex interactive tasks.


    Check out the Paper, HF Page, and GitHub. All credit for this research goes to the researchers of this project.

    Vibhanshu Patidar is a consulting intern at MarktechPost. He is currently pursuing a B.S. at the Indian Institute of Technology (IIT) Kanpur. He is a robotics and machine learning enthusiast with a knack for unraveling the complexities of algorithms that bridge theory and practical applications.

