Close Menu
Ztoog
    What's Hot
    Mobile

    Honor Magic 6 Pro camera review

    Science

    Studying Marine Life with Seaweed-Based Soft Robots

    Gadgets

    21 Best Deals From the Amazon Big Spring Sale: Phones, Chargers, and More

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      How I Turn Unstructured PDFs into Revenue-Ready Spreadsheets

      Is it the best tool for 2025?

      The clocks that helped define time from London’s Royal Observatory

      Summer Movies Are Here, and So Are the New Popcorn Buckets

      India-Pak conflict: Pak appoints ISI chief, appointment comes in backdrop of the Pahalgam attack

    • Technology

      Ensure Hard Work Is Recognized With These 3 Steps

      Cicada map 2025: Where will Brood XIV cicadas emerge this spring?

      Is Duolingo the face of an AI jobs crisis?

      The US DOD transfers its AI-based Open Price Exploration for National Security program to nonprofit Critical Minerals Forum to boost Western supply deals (Ernest Scheyder/Reuters)

      The more Google kills Fitbit, the more I want a Fitbit Sense 3

    • Gadgets

      Maono Caster G1 Neo & PD200X Review: Budget Streaming Gear for Aspiring Creators

      Apple plans to split iPhone 18 launch into two phases in 2026

      Upgrade your desk to Starfleet status with this $95 USB-C hub

      37 Best Graduation Gift Ideas (2025): For College Grads

      Backblaze responds to claims of “sham accounting,” customer backups at risk

    • Mobile

      Samsung Galaxy S25 Edge promo materials leak

      What are people doing with those free T-Mobile lines? Way more than you’d expect

      Samsung doesn’t want budget Galaxy phones to use exclusive AI features

      COROS’s charging adapter is a neat solution to the smartwatch charging cable problem

      Fortnite said to return to the US iOS App Store next week following court verdict

    • Science

      Failed Soviet probe will soon crash to Earth – and we don’t know where

      Trump administration cuts off all future federal funding to Harvard

      Does kissing spread gluten? New research offers a clue.

      Why Balcony Solar Panels Haven’t Taken Off in the US

      ‘Dark photon’ theory of light aims to tear up a century of physics

    • AI

      How to build a better AI benchmark

      Q&A: A roadmap for revolutionizing health care through data-driven innovation | Ztoog

      This data set helps researchers spot harmful stereotypes in LLMs

      Making AI models more trustworthy for high-stakes settings | Ztoog

      The AI Hype Index: AI agent cyberattacks, racing robots, and musical models

    • Crypto

      ‘The Big Short’ Coming For Bitcoin? Why BTC Will Clear $110,000

      Bitcoin Holds Above $95K Despite Weak Blockchain Activity — Analytics Firm Explains Why

      eToro eyes US IPO launch as early as next week amid easing concerns over Trump’s tariffs

      Cardano ‘Looks Dope,’ Analyst Predicts Big Move Soon

      Speak at Ztoog Disrupt 2025: Applications now open

    Ztoog
    Home » Meet OpenLLaMA: An Open-Source Reproduction of Meta AI’s LLaMA Large Language Model
    AI

    Meet OpenLLaMA: An Open-Source Reproduction of Meta AI’s LLaMA Large Language Model

    Facebook Twitter Pinterest WhatsApp
    Meet OpenLLaMA: An Open-Source Reproduction of Meta AI’s LLaMA Large Language Model
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    A brand new growth in massive language fashions has emerged with the discharge of OpenLLaMA, an open-source copy of Meta AI’s LLaMA mannequin. The creators of OpenLLaMA have made the permissively licensed mannequin publicly out there as a 7B OpenLLaMA mannequin that has been educated with 200 billion tokens. The launch consists of PyTorch and Jax weights of pre-trained OpenLLaMA fashions, analysis outcomes, and a comparability towards the unique LLaMA fashions. This growth has vital implications for machine studying, notably for researchers who require massive language fashions however face challenges accessing proprietary fashions. 

    The creators of OpenLLaMA have shared particulars on how they educated their fashions on the RedPajama dataset, which is a copy of the LLaMA coaching dataset containing over 1.2 trillion tokens. They adopted the identical preprocessing and coaching hyperparameters as the unique LLaMA paper, together with mannequin structure, context size, coaching steps, studying price schedule, and optimizer. The solely distinction between their method and the unique one is the dataset used: OpenLLaMA employs the RedPajama dataset moderately than the one utilized by the unique LLaMA.

    The fashions had been educated on cloud TPU-v4s utilizing EasyLM, a JAX-based coaching pipeline developed for coaching and fine-tuning language fashions. They employed a mix of regular knowledge parallelism and absolutely sharded knowledge parallelism (also referred to as ZeRO stage 3) to stability the coaching throughput and reminiscence utilization. Overall, their coaching run achieved a throughput of over 1900 tokens/second / TPU-v4 chip. 

    🚀 JOIN the quickest ML Subreddit Community

    The efficiency of OpenLLaMA was evaluated on a number of duties utilizing the lm-evaluation-harness. The outcomes had been in contrast towards the unique LLaMA mannequin and GPT-J, a 6B parameter mannequin educated on the Pile dataset by EleutherAI. The analysis metrics for the unique LLaMA mannequin had been generated by working it on the identical duties. The outcomes for the LLaMA mannequin barely differed from these reported within the unique LLaMA paper, which can be resulting from variations in analysis protocols. However, OpenLLaMA exhibited comparable or higher efficiency than the unique LLaMA and GPT-J throughout most duties, in accordance with the offered outcomes. Although OpenLLaMA was educated on 200 billion tokens as a substitute of the 1 trillion tokens used for the unique LLaMA and 500 billion tokens used for GPT-J, its efficiency is predicted to enhance even additional upon finishing its coaching on 1 trillion tokens.

    To encourage suggestions and collaboration from the group, the staff behind OpenLLaMA has launched a preview checkpoint of their weights. These weights can be found in two codecs: an EasyLM format to be used with their EasyLM framework and a PyTorch format to be used with the Huggingface transformers library. Unlike the unique LLaMA mannequin, OpenLLaMA’s tokenizer and weights are educated fully from scratch, so acquiring the unique LLaMA tokenizer and weights is not crucial. However, it’s important to notice that OpenLLaMA makes use of the BOS (starting of a sentence) token (id=1) throughout coaching, so this token must be prepended for optimum efficiency throughout a few-shot analysis. The preview checkpoint weights and EasyLM framework are permissively underneath the Apache 2.0 license. The staff is at the moment centered on finishing the coaching course of on all the RedPajama dataset to permit for an apple-to-apple comparability between the unique LLaMA and OpenLLaMA. Additionally, they’re engaged on coaching a smaller 3B mannequin for low-resource use instances. The staff plans to launch extra updates quickly.


    Check out the Github Link. Don’t neglect to hitch our 20k+ ML SubReddit, Discord Channel, and Email Newsletter, the place we share the newest AI analysis information, cool AI initiatives, and extra. If you’ve any questions concerning the above article or if we missed something, be happy to e mail us at Asif@marktechpost.com

    🚀 Check Out 100’s AI Tools in AI Tools Club


    Niharika is a Technical consulting intern at Marktechpost. She is a 3rd 12 months undergraduate, at the moment pursuing her B.Tech from Indian Institute of Technology(IIT), Kharagpur. She is a extremely enthusiastic particular person with a eager curiosity in Machine studying, Data science and AI and an avid reader of the newest developments in these fields.


    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    How to build a better AI benchmark

    AI

    Q&A: A roadmap for revolutionizing health care through data-driven innovation | Ztoog

    AI

    This data set helps researchers spot harmful stereotypes in LLMs

    AI

    Making AI models more trustworthy for high-stakes settings | Ztoog

    AI

    The AI Hype Index: AI agent cyberattacks, racing robots, and musical models

    AI

    Novel method detects microbial contamination in cell cultures | Ztoog

    AI

    Seeing AI as a collaborator, not a creator

    AI

    “Periodic table of machine learning” could fuel AI discovery | Ztoog

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    Gadgets

    Google might already be replacing some Ad sales jobs with AI

    Google is wrapping its head across the thought of being a generative AI firm. The…

    The Future

    Social media companies change their policies in the wake of bad press

    Social media companies seem like delicate to criticismShutterstock/straightforward digital camera Negative information tales about social…

    Mobile

    Top Chinese smartphone manufacturer during the third-quarter is planning an IPO

    In November 2020, simply as the U.S. sanctions on Huawei have been starting to chew,…

    Science

    Researchers get primate embryos to start organ development in culture dishes

    Enlarge / Computer-generated picture of an early stage in embryonic development, earlier than organ formation…

    Mobile

    Galaxy Tab S9 FE and S9 FE Plus leak reveals prices, colors, and versions

    Ryan Haines / Android AuthorityTL;DR Samsung will launch a Galaxy Tab S9 FE and S9…

    Our Picks
    Science

    Can we smash together all of the asteroids to build a new planet?

    Gadgets

    A Gaming Powerhouse! Lenovo Launches Updated Legion 9i At CES 2024

    Science

    How soap operas can help us understand special relativity

    Categories
    • AI (1,482)
    • Crypto (1,744)
    • Gadgets (1,796)
    • Mobile (1,839)
    • Science (1,853)
    • Technology (1,789)
    • The Future (1,635)
    Most Popular
    AI

    List of Artificial Intelligence Models for Medical Landscape (2023)

    Technology

    NFL Draft 2024: How to Watch Tonight, Full Weekend TV Schedule, First Round Order

    Science

    Inside the Beef Industry’s Campaign to Influence Schoolchildren

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.