Ztoog
    The Future

    AI2 is developing a large language model optimized for science


    PaLM 2. GPT-4. The list of text-generating AI practically grows by the day.

    Most of these models are walled behind APIs, making it impossible for researchers to see exactly what makes them tick. But increasingly, community efforts are yielding open source AI that’s as sophisticated as, if not more so than, its commercial counterparts.

    The latest of these efforts is the Open Language Model, a large language model set to be released by the nonprofit Allen Institute for AI Research (AI2) sometime in 2024. Open Language Model, or OLMo for short, is being developed in collaboration with AMD and the Large Unified Modern Infrastructure consortium, which provides supercomputing power for training and education, as well as Surge AI and MosaicML (which are providing data and training code).

    “The research and technology communities need access to open language models to advance this science,” Hanna Hajishirzi, the senior director of NLP research at AI2, told Ztoog in an email interview. “With OLMo, we are working to close the gap between public and private research capabilities and knowledge by building a competitive language model.”

    One might wonder (this reporter included) why AI2 felt the need to develop an open language model when there are already several to choose from (see Bloom, Meta’s LLaMA, etc.). The way Hajishirzi sees it, while the open source releases to date have been valuable and even boundary-pushing, they’ve missed the mark in various ways.

    AI2 sees OLMo as a platform, not just a model: one that’ll allow the research community to take everything AI2 creates and either use it themselves or seek to improve it. Everything AI2 makes for OLMo will be openly available, Hajishirzi says, including a public demo, training data set and API, and documented with “very limited” exceptions under “suitable” licensing.

    “We’re building OLMo to create greater access for the AI research community to work directly on language models,” Hajishirzi said. “We believe the broad availability of all aspects of OLMo will enable the research community to take what we are creating and work to improve it. Our ultimate goal is to collaboratively build the best open language model in the world.”

    OLMo’s other differentiator, according to Noah Smith, senior director of NLP research at AI2, is a focus on enabling the model to better leverage and understand textbooks and academic papers as opposed to, say, code. There have been other attempts at this, like Meta’s infamous Galactica model. But Hajishirzi believes that AI2’s work in academia and the tools it’s developed for research, like Semantic Scholar, will help make OLMo “uniquely suited” for scientific and academic applications.

    “We believe OLMo has the potential to be something really special in the field, especially in a landscape where many are rushing to cash in on interest in generative AI models,” Smith said. “AI2’s unique ability to act as third party experts gives us an opportunity to work not only with our own world-class expertise but collaborate with the strongest minds in the industry. As a result, we think our rigorous, documented approach will set the stage for building the next generation of safe, effective AI technologies.”

    That’s a nice sentiment, to be sure. But what about the thorny ethical and legal issues around training (and releasing) generative AI? The debate’s raging around the rights of content owners (among other affected stakeholders), and countless nagging issues have yet to be settled in the courts.

    To allay concerns, the OLMo team plans to work with AI2’s legal department and to-be-determined outside experts, stopping at “checkpoints” in the model-building process to reassess privacy and intellectual property rights issues.

    “We hope that through an open and transparent dialogue about the model and its intended use, we can better understand how to mitigate bias, toxicity, and shine a light on outstanding research questions within the community, ultimately resulting in one of the strongest models available,” Smith said.

    What about the potential for misuse? Models, which are often toxic and biased to begin with, are ripe for bad actors intent on spreading disinformation and generating malicious code.

    Hajishirzi said that AI2 will use a combination of licensing, model design and selective access to the underlying components to “maximize the scientific benefits while reducing the risk of harmful use.” To guide policy, OLMo has an ethics review committee with internal and external advisors (AI2 wouldn’t say who, exactly) that’ll provide feedback throughout the model creation process.

    We’ll see to what extent that makes a difference. For now, a lot’s up in the air, including most of the model’s technical specs. (AI2 did reveal that it’ll have around 70 billion parameters, parameters being the parts of the model learned from historical training data.) Training’s set to begin in the coming months on LUMI’s supercomputer in Finland, the fastest supercomputer in Europe as of January.

    AI2 is inviting collaborators to help contribute to, and critique, the model development process. Those interested can contact the OLMo project organizers here.

    © 2025 Ztoog.
