    ServiceNow AI Releases Apriel-1.5-15B-Thinker: An Open-Weights Multimodal Reasoning Model that Hits Frontier-Level Performance on a Single-GPU Budget

    ServiceNow AI Research Lab has released Apriel-1.5-15B-Thinker, a 15-billion-parameter open-weights multimodal reasoning model trained with a data-centric mid-training recipe (continual pretraining followed by supervised fine-tuning), without reinforcement learning or preference optimization. The model attains an Artificial Analysis Intelligence Index score of 52 with 8x cost savings compared to SOTA. The checkpoint ships under an MIT license on Hugging Face.

    So, what’s in it for me?

    • Frontier-level composite score at small scale. The model reports Artificial Analysis Intelligence Index (AAI) = 52, matching DeepSeek-R1-0528 on that combined metric while being dramatically smaller. AAI aggregates 10 third-party evaluations (MMLU-Pro, GPQA Diamond, Humanity’s Last Exam, LiveCodeBench, SciCode, AIME 2025, IFBench, AA-LCR, Terminal-Bench Hard, τ²-Bench Telecom).
    • Single-GPU deployability. The model card states the 15B checkpoint “fits on a single GPU,” targeting on-premises and air-gapped deployments with fixed memory and latency budgets.
    • Open weights and reproducible pipeline. Weights, training recipe, and evaluation protocol are public for independent verification.
    https://huggingface.co/ServiceNow-AI/Apriel-1.5-15b-Thinker
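
    As a back-of-the-envelope check on the single-GPU point, the sketch below estimates how much memory the weights alone would need at a few numeric precisions. It assumes a nominal 15 billion parameters and ignores the KV cache, activations, and runtime overhead. The source does not say which precision or which GPU the model card has in mind, so the precisions listed here are illustrative assumptions.

```python
# Rough weight-memory estimate for a model with a given parameter count.
# Assumption: only the weights are counted; KV cache, activations, and
# framework overhead are ignored, so real usage will be higher.
BYTES_PER_PARAM = {"fp32": 4.0, "bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(num_params: float, dtype: str) -> float:
    """Return the approximate size of the weights in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    # 15e9 is the nominal parameter count quoted in the article.
    for dtype in BYTES_PER_PARAM:
        print(f"{dtype}: {weight_memory_gb(15e9, dtype):.1f} GB")
```

    Under these assumptions, 16-bit weights come to roughly 30 GB, which is why a 15B checkpoint is plausible on one high-memory accelerator while much larger models are not.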

    OK, I got it, but what is its training mechanism?

    Base and upscaling. Apriel-1.5-15B-Thinker starts from Mistral’s Pixtral-12B-Base-2409 multimodal decoder-vision stack. The research team applies depth upscaling, increasing decoder layers from 40→48, then projection-network realignment to align the vision encoder with the enlarged decoder. This avoids pretraining from scratch while preserving single-GPU deployability.
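
    To make the 40→48 step concrete, here is a minimal sketch of one common way to do depth upscaling: copy a contiguous block of existing layers and splice the copies back into the stack. The article does not say which layers were duplicated or how they were initialized, so the function name `depth_upscale`, the `insert_at` index, and the use of plain dicts as stand-ins for decoder blocks are all hypothetical.

```python
import copy


def depth_upscale(layers, target_depth, insert_at):
    """Grow a layer stack by copying a contiguous block of existing layers.

    layers: list of layer objects (plain dicts stand in for decoder blocks here)
    target_depth: desired number of layers after upscaling
    insert_at: index of the first layer to duplicate (an illustrative choice)
    """
    extra = target_depth - len(layers)
    if extra < 0 or insert_at + extra > len(layers):
        raise ValueError("cannot pick that many layers to duplicate")
    # Deep-copy so the new layers can later be trained independently of their sources.
    block = layers[insert_at:insert_at + extra]
    duplicated = [copy.deepcopy(layer) for layer in block]
    # Place the copies directly after the block they were copied from.
    return layers[:insert_at + extra] + duplicated + layers[insert_at + extra:]


base = [{"source_layer": i} for i in range(40)]
upscaled = depth_upscale(base, target_depth=48, insert_at=16)
print(len(upscaled))
```

    The copies begin as exact duplicates, so the enlarged stack inherits the pretrained weights and only needs further training rather than pretraining from scratch.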

    CPT (Continual Pretraining). Two stages: (1) mixed text+image data to build foundational reasoning and document/diagram understanding; (2) targeted synthetic visual tasks (reconstruction, matching, detection, counting) to sharpen spatial and compositional reasoning. Sequence lengths extend to 32k and 16k tokens respectively, with selective loss placement on response tokens for instruction-formatted samples.
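
    "Selective loss placement on response tokens" usually means that prompt positions are excluded from the training loss. The sketch below shows the idea in plain Python, assuming the prompt/response boundary is known for each sample. The helper names and the `IGNORE` marker are hypothetical; training frameworks typically use a sentinel label instead (PyTorch's cross-entropy loss skips targets equal to its `ignore_index`, which defaults to -100).

```python
import math

IGNORE = None  # marker for positions excluded from the loss (illustrative)


def response_only_targets(token_ids, response_start):
    """Keep targets only for response tokens; prompt positions get IGNORE."""
    return [tok if i >= response_start else IGNORE for i, tok in enumerate(token_ids)]


def masked_nll(target_probs, targets):
    """Mean negative log-likelihood over positions whose target is not IGNORE.

    target_probs[i] is the probability the model assigns to the correct token
    at position i.
    """
    losses = [-math.log(p) for p, t in zip(target_probs, targets) if t is not IGNORE]
    return sum(losses) / len(losses) if losses else 0.0


tokens = [11, 12, 13, 21, 22]         # first three are prompt, last two are response
probs = [0.1, 0.1, 0.1, 0.5, 0.25]    # made-up model probabilities for illustration
targets = response_only_targets(tokens, response_start=3)
loss = masked_nll(probs, targets)
print(targets, round(loss, 4))
```

    Only the last two positions contribute to the loss, so the poorly predicted prompt tokens have no effect on it.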

    SFT (Supervised Fine-Tuning). High-quality, reasoning-trace instruction data for math, coding, science, tool use, and instruction following; two additional SFT runs (stratified subset; longer-context) are weight-merged to form the final checkpoint. No RL (reinforcement learning) or RLAIF (reinforcement learning from AI feedback).
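
    Weight merging combines several fine-tuned checkpoints of the same architecture into one set of parameters. The article does not state which merging scheme was used, so the sketch below assumes the simplest option, a (weighted) element-wise average. The function `merge_checkpoints` and the toy parameter names are hypothetical, and flat Python lists stand in for tensors.

```python
def merge_checkpoints(checkpoints, weights=None):
    """Weighted element-wise average of parameters that share names and shapes.

    checkpoints: list of dicts mapping parameter name -> flat list of floats
    weights: optional per-checkpoint coefficients; defaults to a uniform average
    """
    if weights is None:
        weights = [1.0 / len(checkpoints)] * len(checkpoints)
    if len(weights) != len(checkpoints):
        raise ValueError("need one weight per checkpoint")
    merged = {}
    for name in checkpoints[0]:
        # Walk the same parameter across all checkpoints, one element at a time.
        columns = zip(*(ckpt[name] for ckpt in checkpoints))
        merged[name] = [sum(w * v for w, v in zip(weights, col)) for col in columns]
    return merged


# Toy stand-ins for two SFT runs of the same model.
run_a = {"layer.weight": [1.0, 2.0], "layer.bias": [0.0]}
run_b = {"layer.weight": [3.0, 4.0], "layer.bias": [1.0]}
merged = merge_checkpoints([run_a, run_b])
print(merged)
```

    Averaging only makes sense when the runs start from the same base checkpoint, which is the case for the SFT variants described above.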

    Data note. ~25% of the depth-upscaling text mix derives from NVIDIA’s Nemotron collection.

    Wow! Tell me about its results, then.

    Key text benchmarks (pass@1 / accuracy).

    • AIME 2025 (American Invitational Mathematics Examination 2025): 87.5–88%
    • GPQA Diamond (Graduate-Level Google-Proof Question Answering, Diamond split): ≈71%
    • IFBench (Instruction-Following Benchmark): ~62
    • τ²-Bench (Tau-squared Bench) Telecom: ~68
    • LiveCodeBench (functional code correctness): ~72.8
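
    For readers unfamiliar with pass@1: it is the probability that a single sampled answer to a problem is correct. When several samples per problem are available, a widely used unbiased estimator for pass@k is 1 − C(n−c, k)/C(n, k), where n is the number of samples and c the number that pass. The sketch below implements that estimator; the article does not say how many samples or which estimator produced the figures above, so treat this as background, not a description of the evaluation protocol.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate from n samples of which c are correct."""
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    if n - c < k:
        # Fewer than k incorrect samples: every size-k draw contains a correct one.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


# With k = 1 the estimator reduces to the plain fraction of correct samples.
print(pass_at_k(n=10, c=3, k=1))
print(pass_at_k(n=10, c=3, k=5))
```

    A benchmark-level score is then the mean of this per-problem estimate over all problems.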

    Using VLMEvalKit for reproducibility, Apriel scores competitively across MMMU / MMMU-Pro (Massive Multi-discipline Multimodal Understanding), LogicVista, MathVision, MathVista, MathVerse, MMStar, CharXiv, AI2D, and BLINK, with stronger results on documents/diagrams and text-dominant math imagery.

    https://huggingface.co/ServiceNow-AI/Apriel-1.5-15b-Thinker/blob/main/Apriel-1.5-Thinker.pdf

    Let’s summarize everything

    Apriel-1.5-15B-Thinker demonstrates that careful mid-training (continual pretraining + supervised fine-tuning, no reinforcement learning) can deliver a 52 on the Artificial Analysis Intelligence Index (AAI) while remaining deployable on a single graphics processing unit. Reported task-level scores (for example, AIME 2025 ≈88, GPQA Diamond ≈71, IFBench ≈62, Tau-squared Bench Telecom ≈68) align with the model card and place the 15-billion-parameter checkpoint in the most cost-efficient band of current open-weights reasoners. For enterprises, that combination of open weights, a reproducible recipe, and single-GPU latency makes Apriel a practical baseline to evaluate before considering larger closed systems.


    Asif Razzaq is the CEO of Marktechpost Media Inc. As a visionary entrepreneur and engineer, Asif is committed to harnessing the potential of Artificial Intelligence for social good. His most recent endeavor is the launch of an Artificial Intelligence Media Platform, Marktechpost, which stands out for its in-depth coverage of machine learning and deep learning news that is both technically sound and easily understandable by a wide audience. The platform boasts over 2 million monthly views, illustrating its popularity among audiences.
