Close Menu
Ztoog
    What's Hot
    Mobile

    New Gboard feature helps Android users who type in landscape mode

    Mobile

    Samsung tests 2x portrait mode option for the Galaxy S23 Ultra

    AI

    3 Questions: How to prove humanity online | Ztoog

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      Can work-life balance tracking improve well-being?

      Any wall can be turned into a camera to see around corners

      JD Vance and President Trump’s Sons Hype Bitcoin at Las Vegas Conference

      AI may already be shrinking entry-level jobs in tech, new research suggests

      Today’s NYT Strands Hints, Answer and Help for May 26 #449

    • Technology

      Elon Musk tries to stick to spaceships

      A Replit employee details a critical security flaw in web apps created using AI-powered app builder Lovable that exposes API keys and personal info of app users (Reed Albergotti/Semafor)

      Gemini in Google Drive can now help you skip watching that painfully long Zoom meeting

      Apple iPhone exports from China to the US fall 76% as India output surges

      Today’s NYT Wordle Hints, Answer and Help for May 26, #1437

    • Gadgets

      Future-proof your career by mastering AI skills for just $20

      8 Best Vegan Meal Delivery Services and Kits (2025), Tested and Reviewed

      Google Home is getting deeper Gemini integration and a new widget

      Google Announces AI Ultra Subscription Plan With Premium Features

      Google shows off Android XR-based glasses, announces Warby Parker team-up

    • Mobile

      Deals: the Galaxy S25 series comes with a free tablet, Google Pixels heavily discounted

      Microsoft is done being subtle – this new tool screams “upgrade now”

      Wallpaper Wednesday: Android wallpapers 2025-05-28

      Google can make smart glasses accessible with Warby Parker, Gentle Monster deals

      vivo T4 Ultra specs leak

    • Science

      June skygazing: A strawberry moon, the summer solstice… and Asteroid Day!

      Analysts Say Trump Trade Wars Would Harm the Entire US Energy Sector, From Oil to Solar

      Do we have free will? Quantum experiments may soon reveal the answer

      Was Planet Nine exiled from the solar system as a baby?

      How farmers can help rescue water-loving birds

    • AI

      Rationale engineering generates a compact new tool for gene therapy | Ztoog

      The AI Hype Index: College students are hooked on ChatGPT

      Learning how to predict rare kinds of failures | Ztoog

      Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

      AI learns how vision and sound are connected, without human intervention | Ztoog

    • Crypto

      Bitcoin Maxi Isn’t Buying Hype Around New Crypto Holding Firms

      GameStop bought $500 million of bitcoin

      CoinW Teams Up with Superteam Europe to Conclude Solana Hackathon and Accelerate Web3 Innovation in Europe

      Ethereum Net Flows Turn Negative As Bulls Push For $3,500

      Bitcoin’s Power Compared To Nuclear Reactor By Brazilian Business Leader

    Ztoog
    Home » Researchers from Tsinghua University and Microsoft AI Unveil a Breakthrough in Language Model Training: The Path to Optimal Learning Efficiency
    AI

    Researchers from Tsinghua University and Microsoft AI Unveil a Breakthrough in Language Model Training: The Path to Optimal Learning Efficiency

    Facebook Twitter Pinterest WhatsApp
    Researchers from Tsinghua University and Microsoft AI Unveil a Breakthrough in Language Model Training: The Path to Optimal Learning Efficiency
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    With the rise of language fashions, there was an infinite concentrate on enhancing the educational of  LMs to speed up the educational velocity and obtain a sure mannequin efficiency with as few coaching steps as attainable. This emphasis aids people in understanding the boundaries of LMs amidst their escalating computational necessities. It additionally advances the democratization of huge language fashions (LLMs), benefiting analysis and business communities.

    Prior works like Pre-Trained Models, Past, Present, and Future, concentrate on designing efficient architectures, using wealthy contexts, and enhancing computational effectivity. In h2oGPT: Democratizing Large Language Models, the researchers have tried to create open-source alternate options to the closed-source approaches. In Large Batch Optimization for Deep Learning: Training BERT in 76 minutes, they tried to overcome the computational problem of LLMs.  These prior works discover sensible acceleration strategies on the mannequin, optimizer, or knowledge ranges.

    The researchers from the CoAI Group, Tsinghua University, and Microsoft Research have proposed a principle for optimizing LM studying, starting with maximizing the info compression ratio. They derive the Learning Law theorem to elucidate optimum studying dynamics. Validation experiments on linear classification and language modeling duties affirm the concept’s properties. Results point out that optimum LM studying enhances coefficients in LM scaling legal guidelines, providing promising implications for sensible studying acceleration strategies.

    In their technique (Optimal Learning of Language Models), the researchers demonstrated the rules of optimizing the LM studying velocity, together with the optimization goal, the property of optimum studying dynamics, and the important enchancment of the educational acceleration. For the optimization goal, they’ve proposed to reduce the world underneath the curve (AUC), a studying course of with the smallest loss AUC corresponds to the very best compression ratio. Then, they derived the Learning Law theorem that characterizes the property of dynamics in the LM studying course of that achieves the optimum of their goal. Here, a studying coverage induces a studying course of that determines which knowledge factors the LM learns because the coaching progresses.

    After conducting experiments on linear classification with Perceptron and language modeling with Transformer, researchers optimized studying insurance policies and validated them empirically. Near-optimal insurance policies considerably accelerated studying, enhancing loss AUC by 5.50× and 2.41× for Perceptron and Transformer, respectively. Results confirmed theoretical predictions, demonstrating improved scaling regulation coefficients by up to 96.6% and 21.2%, promising sooner LM coaching with sensible significance.

    In conclusion, researchers from the CoAI Group, Tsinghua University, and Microsoft Research have proposed a principle for optimizing LM studying to maximize compression ratio. They derive the Learning Law theorem, confirming that every one examples contribute equally to optimum studying, validated in experiments. The optimum course of improves LM scaling regulation coefficients, guiding future acceleration strategies. 


    Check out the Paper and Github. All credit score for this analysis goes to the researchers of this mission. Also, don’t neglect to observe us on Twitter and Google News. Join our 38k+ ML SubReddit, 41k+ Facebook Community, Discord Channel, and LinkedIn Group.

    If you want our work, you’ll love our e-newsletter..

    Don’t Forget to be part of our Telegram Channel

    You may like our FREE AI Courses….


    Asjad is an intern marketing consultant at Marktechpost. He is persuing B.Tech in mechanical engineering on the Indian Institute of Technology, Kharagpur. Asjad is a Machine studying and deep studying fanatic who’s at all times researching the functions of machine studying in healthcare.


    🐝 Join the Fastest Growing AI Research Newsletter Read by Researchers from Google + NVIDIA + Meta + Stanford + MIT + Microsoft and many others…

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    Rationale engineering generates a compact new tool for gene therapy | Ztoog

    AI

    The AI Hype Index: College students are hooked on ChatGPT

    AI

    Learning how to predict rare kinds of failures | Ztoog

    AI

    Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

    AI

    AI learns how vision and sound are connected, without human intervention | Ztoog

    AI

    How AI is introducing errors into courtrooms

    AI

    With AI, researchers predict the location of virtually any protein within a human cell | Ztoog

    AI

    Google DeepMind’s new AI agent cracks real-world problems better than humans can

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    Science

    CDC reports dips in flu, COVID-19, and RSV—though levels still very high

    Enlarge / The influenza virus from a picture produced from a picture taken with transmission…

    Gadgets

    The best space heaters in 2024

    We might earn income from the merchandise out there on this web page and take…

    Science

    Students search desert for lost rocket after attempted launch to space

    The Aurora rocket launched within the Mojave desert on 24 SeptemberKarman Space Programme A group…

    Crypto

    Whales Are Loading Up on Bitcoin Again, $3.6B in BTC Snapped Up in a Day

    Bitcoin has seen modest upward momentum in the previous 24 hours, climbing again above $83,000…

    Science

    A Gene-Edited Pig Kidney Was Just Transplanted Into a Person for the First Time

    Slayman acquired his first kidney transplant in 2018 from a human donor. The donor kidney…

    Our Picks
    Science

    Game on—the most metal of asteroid missions is back on the menu

    The Future

    Chevy Blazer EV models get price increases as it rolls into dealerships

    The Future

    X sets up team to generate $100m from political ads: reports

    Categories
    • AI (1,493)
    • Crypto (1,754)
    • Gadgets (1,805)
    • Mobile (1,851)
    • Science (1,867)
    • Technology (1,803)
    • The Future (1,649)
    Most Popular
    Mobile

    Honor Magic V2 RSR Porsche Design is official with sporty look

    Technology

    Jimmy and Rosalynn Carter show how much hospice care helps patients and families. But people with dementia may struggle to get it.

    AI

    How can the Effectiveness of Vision Transformers be Leveraged in Diffusion-based Generative Learning? This Paper from NVIDIA Introduces a Novel Artificial Intelligence Model Called Diffusion Vision Transformers (DiffiT)

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.