    AI2 is developing a large language model optimized for science


    PaLM 2. GPT-4. The list of text-generating AI models grows practically by the day.

    Most of these models are walled behind APIs, making it impossible for researchers to see exactly what makes them tick. But increasingly, community efforts are yielding open source AI that’s as sophisticated as, if not more so than, its commercial counterparts.

    The latest of these efforts is the Open Language Model, a large language model set to be released by the nonprofit Allen Institute for AI Research (AI2) sometime in 2024. Open Language Model, or OLMo for short, is being developed in collaboration with AMD and the Large Unified Modern Infrastructure consortium, which provides supercomputing power for training and education, as well as Surge AI and MosaicML (which are providing data and training code).

    “The research and technology communities need access to open language models to advance this science,” Hanna Hajishirzi, the senior director of NLP research at AI2, told Ztoog in an email interview. “With OLMo, we are working to close the gap between public and private research capabilities and knowledge by building a competitive language model.”

    One might wonder, as this reporter did, why AI2 felt the need to develop an open language model when there are already several to choose from (see Bloom, Meta’s LLaMA, etc.). The way Hajishirzi sees it, while the open source releases to date have been valuable and even boundary-pushing, they’ve missed the mark in various ways.

    AI2 sees OLMo as a platform, not just a model: one that’ll allow the research community to take each component AI2 creates and either use it themselves or seek to improve it. Everything AI2 makes for OLMo will be openly available, Hajishirzi says, including a public demo, training data set and API, and documented with “very limited” exceptions under “suitable” licensing.

    “We’re building OLMo to create greater access for the AI research community to work directly on language models,” Hajishirzi said. “We believe the broad availability of all aspects of OLMo will enable the research community to take what we are creating and work to improve it. Our ultimate goal is to collaboratively build the best open language model in the world.”

    OLMo’s other differentiator, according to Noah Smith, senior director of NLP research at AI2, is a focus on enabling the model to better leverage and understand textbooks and academic papers as opposed to, say, code. There have been other attempts at this, like Meta’s infamous Galactica model. But Hajishirzi believes that AI2’s work in academia and the tools it’s developed for research, like Semantic Scholar, will help make OLMo “uniquely suited” for scientific and academic applications.

    “We believe OLMo has the potential to be something really special in the field, especially in a landscape where many are rushing to cash in on interest in generative AI models,” Smith said. “AI2’s unique ability to act as third party experts gives us an opportunity to work not only with our own world-class expertise but collaborate with the strongest minds in the industry. As a result, we think our rigorous, documented approach will set the stage for building the next generation of safe, effective AI technologies.”

    That’s a nice sentiment, to be sure. But what about the thorny ethical and legal issues around training (and releasing) generative AI? The debate’s raging around the rights of content owners (among other affected stakeholders), and countless nagging issues have yet to be settled in the courts.

    To allay concerns, the OLMo team plans to work with AI2’s legal department and to-be-determined outside experts, stopping at “checkpoints” in the model-building process to reassess privacy and intellectual property rights issues.

    “We hope that through an open and transparent dialogue about the model and its intended use, we can better understand how to mitigate bias, toxicity, and shine a light on outstanding research questions within the community, ultimately resulting in one of the strongest models available,” Smith said.

    What about the potential for misuse? Models, which are often toxic and biased to begin with, are ripe for bad actors intent on spreading disinformation and generating malicious code.

    Hajishirzi said that AI2 will use a combination of licensing, model design and selective access to the underlying components to “maximize the scientific benefits while reducing the risk of harmful use.” To guide policy, OLMo has an ethics review committee with internal and external advisors (AI2 wouldn’t say who, exactly) that’ll provide feedback throughout the model creation process.

    We’ll see to what extent that makes a difference. For now, a lot’s up in the air, including most of the model’s technical specs. (AI2 did reveal that it’ll have around 70 billion parameters, parameters being the parts of the model learned from historical training data.) Training’s set to begin on LUMI’s supercomputer in Finland (the fastest supercomputer in Europe, as of January) in the coming months.

    AI2 is inviting collaborators to help contribute to, and critique, the model development process. Those interested can contact the OLMo project organizers here.
