Close Menu
Ztoog
    What's Hot
    Technology

    Can the U.S. Make Solar Panels? This Company Thinks So.

    The Future

    SmartLess is leaving Amazon for $100 million

    Science

    Most newborn black holes spew gas so hard they almost stop spinning

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      Any wall can be turned into a camera to see around corners

      JD Vance and President Trump’s Sons Hype Bitcoin at Las Vegas Conference

      AI may already be shrinking entry-level jobs in tech, new research suggests

      Today’s NYT Strands Hints, Answer and Help for May 26 #449

      LiberNovo Omni: The World’s First Dynamic Ergonomic Chair

    • Technology

      A Replit employee details a critical security flaw in web apps created using AI-powered app builder Lovable that exposes API keys and personal info of app users (Reed Albergotti/Semafor)

      Gemini in Google Drive can now help you skip watching that painfully long Zoom meeting

      Apple iPhone exports from China to the US fall 76% as India output surges

      Today’s NYT Wordle Hints, Answer and Help for May 26, #1437

      5 Skills Kids (and Adults) Need in an AI World – O’Reilly

    • Gadgets

      Future-proof your career by mastering AI skills for just $20

      8 Best Vegan Meal Delivery Services and Kits (2025), Tested and Reviewed

      Google Home is getting deeper Gemini integration and a new widget

      Google Announces AI Ultra Subscription Plan With Premium Features

      Google shows off Android XR-based glasses, announces Warby Parker team-up

    • Mobile

      Deals: the Galaxy S25 series comes with a free tablet, Google Pixels heavily discounted

      Microsoft is done being subtle – this new tool screams “upgrade now”

      Wallpaper Wednesday: Android wallpapers 2025-05-28

      Google can make smart glasses accessible with Warby Parker, Gentle Monster deals

      vivo T4 Ultra specs leak

    • Science

      Analysts Say Trump Trade Wars Would Harm the Entire US Energy Sector, From Oil to Solar

      Do we have free will? Quantum experiments may soon reveal the answer

      Was Planet Nine exiled from the solar system as a baby?

      How farmers can help rescue water-loving birds

      A trip to the farm where loofahs grow on vines

    • AI

      Rationale engineering generates a compact new tool for gene therapy | Ztoog

      The AI Hype Index: College students are hooked on ChatGPT

      Learning how to predict rare kinds of failures | Ztoog

      Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

      AI learns how vision and sound are connected, without human intervention | Ztoog

    • Crypto

      GameStop bought $500 million of bitcoin

      CoinW Teams Up with Superteam Europe to Conclude Solana Hackathon and Accelerate Web3 Innovation in Europe

      Ethereum Net Flows Turn Negative As Bulls Push For $3,500

      Bitcoin’s Power Compared To Nuclear Reactor By Brazilian Business Leader

      Senate advances GENIUS Act after cloture vote passes

    Ztoog
    Home » Meet LIMA: A New 65B Parameter LLaMa Model Fine-Tuned On 1000 Carefully Curated Prompts And Responses
    AI

    Meet LIMA: A New 65B Parameter LLaMa Model Fine-Tuned On 1000 Carefully Curated Prompts And Responses

    Facebook Twitter Pinterest WhatsApp
    Meet LIMA: A New 65B Parameter LLaMa Model Fine-Tuned On 1000 Carefully Curated Prompts And Responses
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Language fashions develop general-purpose representations transferable to nearly any language interpretation or producing job by being pretrained to anticipate the subsequent token at an astounding scale. Different approaches to aligning language fashions have thus been put forth to facilitate this switch, with a specific emphasis on instruction tuning over sizable datasets with tens of millions of examples and, extra lately, reinforcement studying from human suggestions (RLHF) gathered over tens of millions of interactions with human annotators, for current alignment methods to perform at ChatGPT ranges, massive computing, and specialised information sources are wanted. 

    However, they present that with language mannequin already skilled, superb efficiency could also be obtained by simply tweaking 1,000 correctly chosen coaching situations. According to their speculation, alignment could also be a fast and straightforward process the place the mannequin learns the format or fashion of partaking customers to reveal the abilities and knowledge already realized throughout pretraining. They acquire 1,000 situations that resemble genuine consumer cues and glorious replies to confirm this concept. They select 750 of one of the best questions and responses from on-line dialogue boards like Stack Exchange and wikiHow, evaluating them for high quality and selection.

    They additionally manually compose 250 situations of questions and solutions whereas emphasizing a constant response fashion within the vein of an AI assistant and optimizing for activity range. Researchers from Meta AI, Carnegie Mellon University, University of Southern California and Tel Aviv University prepare LIMA, a 65B-parameter LLaMa mannequin beforehand skilled and improved on this assortment of 1,000 examples. Three hundred troublesome take a look at questions evaluate LIMA towards up to date language fashions and merchandise. LIMA surpasses RLHF-trained DaVinci003 from OpenAI, which was skilled with RLHF, in addition to a 65B-parameter duplicate of Alpaca, which was launched on 52,000 samples, in a research of human desire. 

    🚀 JOIN the quickest ML Subreddit Community

    Although people often favor GPT-4, Claude, and Bard replies over LIMA responses, this isn’t at all times the case; LIMA constantly yields equal or preferable ends in 43%, 46%, and 58% of the conditions, respectively. They repeat the annotations of human preferences utilizing GPT-4 because the annotator confirms their findings. When LIMA replies are evaluated on an absolute scale, 88% fulfill the immediate’s necessities, and 50% are rated excellent. Ablation checks present important enhancements when bettering information high quality and considerably falling returns when growing information quantity with out concurrently growing immediate selection. 

    Furthermore, they uncover that LIMA can stick with it coherent multi-turn discourse regardless of having no dialogue examples. Including 30 hand-crafted dialogue chains in coaching might improve this capability. Overall, these wonderful outcomes present the effectiveness of pretraining and its relative worth over approaches to reinforcement studying and large-scale instruction tailoring. They display how a strong pretrained language mannequin could also be tuned to supply excellent, aggressive outcomes on numerous prompts utilizing 1,000 well-picked samples. There are, nonetheless, drawbacks to this technique. 

    The psychological work required to create such situations is gigantic and difficult to scale up. Second, whereas LIMA usually supplies robust replies, an unlucky pattern throughout decoding or an aggressive immediate can often lead to a weak response. LIMA is much less resilient than product-grade fashions. Nevertheless, the information supplied on this work reveals that it’s potential to handle the troublesome alignment issues straightforwardly.


    Check out the Pre-Print Paper. Don’t overlook to affix our 22k+ ML SubReddit, Discord Channel, and Email Newsletter, the place we share the newest AI analysis information, cool AI initiatives, and extra. If you’ve got any questions relating to the above article or if we missed something, be happy to e-mail us at Asif@marktechpost.com

    🚀 Check Out 100’s AI Tools in AI Tools Club


    Aneesh Tickoo is a consulting intern at MarktechPost. He is at the moment pursuing his undergraduate diploma in Data Science and Artificial Intelligence from the Indian Institute of Technology(IIT), Bhilai. He spends most of his time engaged on initiatives geared toward harnessing the facility of machine studying. His analysis curiosity is picture processing and is keen about constructing options round it. He loves to attach with individuals and collaborate on fascinating initiatives.


    ➡️ Ultimate Guide to Data Labeling in Machine Learning

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    Rationale engineering generates a compact new tool for gene therapy | Ztoog

    AI

    The AI Hype Index: College students are hooked on ChatGPT

    AI

    Learning how to predict rare kinds of failures | Ztoog

    AI

    Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

    AI

    AI learns how vision and sound are connected, without human intervention | Ztoog

    AI

    How AI is introducing errors into courtrooms

    AI

    With AI, researchers predict the location of virtually any protein within a human cell | Ztoog

    AI

    Google DeepMind’s new AI agent cracks real-world problems better than humans can

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    The Future

    Astra’s Apollo Fusion acquisition followed by delays and desertion

    Two years in the past, Astra hailed its acquisition of satellite tv for pc propulsion…

    Technology

    Open-Source AI Is Good for Us

    This is a visitor publish. For the opposite aspect of the argument about open-source AI,…

    AI

    Roblox is launching a generative AI that builds 3D environments in a snap

    Roblox’s new software works by “tokenizing” the 3D blocks that make up its hundreds of…

    Gadgets

    SpaceX’s Satellite Cellular Service To Launch In 2024

    SpaceX has launched a brand new webpage to advertise its upcoming “Starlink Direct to Cell”…

    Crypto

    Initia raises $7.5M seed round to simplify blockchain development

    It’s laborious to preserve monitor of crypto’s technical development, however one factor hasn’t modified a…

    Our Picks
    Gadgets

    You’ll Be Able Buy Cars on Amazon Next Year

    Science

    Seeing a corpse makes fruit flies age faster

    Science

    Students search desert for lost rocket after attempted launch to space

    Categories
    • AI (1,493)
    • Crypto (1,753)
    • Gadgets (1,805)
    • Mobile (1,851)
    • Science (1,866)
    • Technology (1,802)
    • The Future (1,648)
    Most Popular
    Gadgets

    Save big on cleaning with this open-box Roomba 675 robot vacuum on sale for $29 off

    Mobile

    Save over $200 on the Samsung Galaxy Tab S9 Plus in record deal

    Gadgets

    Circular will pay competitor Oura royalties to sell its smart ring in the US

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.