Close Menu
Ztoog
    What's Hot
    Crypto

    Ethereum Bears Gain Upper Hand With Escalating Sell-Off

    AI

    Rethinking calibration for in-context learning and prompt engineering – Google Research Blog

    Technology

    GameScent uses audio cues and AI to bring smells to video games

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      How I Turn Unstructured PDFs into Revenue-Ready Spreadsheets

      Is it the best tool for 2025?

      The clocks that helped define time from London’s Royal Observatory

      Summer Movies Are Here, and So Are the New Popcorn Buckets

      India-Pak conflict: Pak appoints ISI chief, appointment comes in backdrop of the Pahalgam attack

    • Technology

      Ensure Hard Work Is Recognized With These 3 Steps

      Cicada map 2025: Where will Brood XIV cicadas emerge this spring?

      Is Duolingo the face of an AI jobs crisis?

      The US DOD transfers its AI-based Open Price Exploration for National Security program to nonprofit Critical Minerals Forum to boost Western supply deals (Ernest Scheyder/Reuters)

      The more Google kills Fitbit, the more I want a Fitbit Sense 3

    • Gadgets

      Maono Caster G1 Neo & PD200X Review: Budget Streaming Gear for Aspiring Creators

      Apple plans to split iPhone 18 launch into two phases in 2026

      Upgrade your desk to Starfleet status with this $95 USB-C hub

      37 Best Graduation Gift Ideas (2025): For College Grads

      Backblaze responds to claims of “sham accounting,” customer backups at risk

    • Mobile

      Samsung Galaxy S25 Edge promo materials leak

      What are people doing with those free T-Mobile lines? Way more than you’d expect

      Samsung doesn’t want budget Galaxy phones to use exclusive AI features

      COROS’s charging adapter is a neat solution to the smartwatch charging cable problem

      Fortnite said to return to the US iOS App Store next week following court verdict

    • Science

      Failed Soviet probe will soon crash to Earth – and we don’t know where

      Trump administration cuts off all future federal funding to Harvard

      Does kissing spread gluten? New research offers a clue.

      Why Balcony Solar Panels Haven’t Taken Off in the US

      ‘Dark photon’ theory of light aims to tear up a century of physics

    • AI

      How to build a better AI benchmark

      Q&A: A roadmap for revolutionizing health care through data-driven innovation | Ztoog

      This data set helps researchers spot harmful stereotypes in LLMs

      Making AI models more trustworthy for high-stakes settings | Ztoog

      The AI Hype Index: AI agent cyberattacks, racing robots, and musical models

    • Crypto

      ‘The Big Short’ Coming For Bitcoin? Why BTC Will Clear $110,000

      Bitcoin Holds Above $95K Despite Weak Blockchain Activity — Analytics Firm Explains Why

      eToro eyes US IPO launch as early as next week amid easing concerns over Trump’s tariffs

      Cardano ‘Looks Dope,’ Analyst Predicts Big Move Soon

      Speak at Ztoog Disrupt 2025: Applications now open

    Ztoog
    Home » This AI Paper Introduces Lemur and Lemur Chat For Harmonizing Natural Language and Code For Language Agents
    AI

    This AI Paper Introduces Lemur and Lemur Chat For Harmonizing Natural Language and Code For Language Agents

    Facebook Twitter Pinterest WhatsApp
    This AI Paper Introduces Lemur and Lemur Chat For Harmonizing Natural Language and Code For Language Agents
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    In a broad sense, clever brokers are autonomous drawback solvers endowed with notion, judgment, and motion capabilities based mostly on knowledge gathered from their environment. Recent functions of this concept have proven promise in growing language brokers that may use pure language to do a variety of complicated duties in varied contexts. This is very true when these brokers are constructed utilizing massive language fashions (LLMs). Agents of this kind can mimic human thought and language as a result of they draw on human experience within the type of LLMs. This permits folks to be versatile of their use of instruments, adapt to new conditions, purpose linguistically, and develop multi-agent programs on the fly. 

    LLMs ought to grasp human interplay, reasoning, and planning and guarantee grounding within the obligatory contexts to correctly assemble the inspiration of language brokers. LLMs’ pure language capabilities permit them to carefully mimic human dialog, considering, and planning. However, environment-based execution is usually completed by way of general-purpose code or domain-specific APIs, similar to these used to handle net browsers, talk with working system command line interface terminals, and management robotic arms.

    To fill this hole, a brand new research by the University of Hong Kong, XLang Lab, Salesforce Research, Sea AI Lab, University of Washington, and MIT CSAIL current Lemur and Lemur-Chat, two state-of-the-art, publicly obtainable fashions which were pre-trained and fine-tuned to attain concord between textual content and code. Through fastidiously crafted pre-training and instruction fine-tuning steps, the researchers improved the unique Llama-2-70B. To guarantee enhanced capabilities in coding skill whereas retaining efficiency in pure language skill, they constructed a code-centric corpus based mostly on The Stack, together with 90 billion tokens with a ten:1 text-to-code ratio. This prototype is called Lemur. To create the instruction-following mannequin, Lemur-Chat, they first pretrained it utilizing round 100K situations from each textual content and code. Lemur and Lemur-Chat have been confirmed to be essentially the most well-rounded open-source fashions after present process intensive examinations throughout 8 textual and coding benchmarks. 

    In addition, this effort units out to supply agent requirements for evaluating the core competencies of linguistic brokers in varied settings. The staff focuses notably on their talent with instruments and their skill to root themselves in each environmental and social suggestions. They additionally examine the difficulties inherent in real-world, partially seen conditions, the place the agent should function based mostly on incomplete data and carry out extra actions to fill within the gaps. Experiments present that Lemur-Chat performs higher in 12 of the 13 agent benchmarks in comparison with different open-source fashions. This exemplifies how Lemur-Chat can outperform current open-source fashions for language brokers by bridging the efficiency hole between open-source and business alternate options by combining pure and coding abilities. 

    The outcomes of those assessments show the significance of mixing linguistic and computational abilities in agent-based settings. Models like Llama-2-70B-Chat, which excel in pure language processing however battle with coding, can effectively use primary instruments to assist reasoning as a result of the motion area is constrained, and the hassle of using such instruments is low. In distinction, the motion area is usually monumental when confronted with subtle decision-making eventualities like net shopping and dwelling navigation, and fashions with excessive coding skills have an edge when establishing complicated executable motion sequences. In sum, Lemur’s superior efficiency could be attributed to its pure language processing and programming superiority. This research lays the groundwork for creating subtle language brokers that may perform properly in a variety of settings by shedding mild on optimizing the synergy between pure and programming languages. 


    Check out the Paper and Github. All Credit For This Research Goes To the Researchers on This Project. Also, don’t overlook to hitch our 31k+ ML SubReddit, 40k+ Facebook Community, Discord Channel, and Email Newsletter, the place we share the newest AI analysis information, cool AI tasks, and extra.

    If you want our work, you’ll love our publication..

    We are additionally on WhatsApp. Join our AI Channel on Whatsapp..


    Dhanshree Shenwai is a Computer Science Engineer and has expertise in FinTech firms masking Financial, Cards & Payments and Banking area with eager curiosity in functions of AI. She is obsessed with exploring new applied sciences and developments in at present’s evolving world making everybody’s life simple.


    ▶️ Now Watch AI Research Updates On Our Youtube Channel [Watch Now]

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    How to build a better AI benchmark

    AI

    Q&A: A roadmap for revolutionizing health care through data-driven innovation | Ztoog

    AI

    This data set helps researchers spot harmful stereotypes in LLMs

    AI

    Making AI models more trustworthy for high-stakes settings | Ztoog

    AI

    The AI Hype Index: AI agent cyberattacks, racing robots, and musical models

    AI

    Novel method detects microbial contamination in cell cultures | Ztoog

    AI

    Seeing AI as a collaborator, not a creator

    AI

    “Periodic table of machine learning” could fuel AI discovery | Ztoog

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    The Future

    Couples collide with fewer people on walks than pairs of friends do

    Couples are much less prone to bodily collide with others than friends areShutterstock/Shawn.ccf Couples are…

    Gadgets

    You can’t get this discreet WiFi camera on anywhere else for $49.97

    We might earn income from the merchandise out there on this web page and take…

    The Future

    Keyword Rank Tracking With SEO PowerSuite

    The monitoring and evaluation of the positions of the supposed key phrases to your web…

    Science

    Lunar eclipse 2023: October blood moon captured in stunning images around the world

    The partial lunar eclipse on 28 October in Munich, GermanyImago/Alamy Photographers around the world captured…

    Technology

    5 Android apps you shouldn’t miss this week

    Welcome to the 509th version of Android Apps Weekly. Once once more we carry you…

    Our Picks
    AI

    What’s next for AI in 2024

    Gadgets

    6 Best Electric Scooters (2023): Affordable, Lightweight, Long-Range, Fast

    The Future

    120+ best Amazon Prime Day tech deals: earbuds, laptops, and more

    Categories
    • AI (1,482)
    • Crypto (1,744)
    • Gadgets (1,796)
    • Mobile (1,839)
    • Science (1,853)
    • Technology (1,789)
    • The Future (1,635)
    Most Popular
    Crypto

    Did This Historical Line Act As Support Again?

    Mobile

    Galaxy S24 Ultra’s rumored pricing might hit you in the wallet

    Crypto

    Bitcoin Marked For Death Cross: What Data Says About The Ominous Signal

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.