    AI “godfather” Yoshua Bengio joins UK project to prevent AI catastrophes


    Safeguarded AI’s goal is to build AI systems that can offer quantitative guarantees, such as a risk score, about their effect on the real world, says David “davidad” Dalrymple, the program director for Safeguarded AI at ARIA. The idea is to supplement human testing with mathematical analysis of new systems’ potential for harm.

    The project aims to build AI safety mechanisms by combining scientific world models, which are essentially simulations of the world, with mathematical proofs. These proofs would include explanations of the AI’s work, and humans would be tasked with verifying whether the AI model’s safety checks are correct.

    Bengio says he wants to help ensure that future AI systems cannot cause serious harm.

    “We’re currently racing toward a fog behind which might be a precipice,” he says. “We don’t know how far the precipice is, or if there even is one, so it might be years, decades, and we don’t know how serious it could be … We need to build up the tools to clear that fog and make sure we don’t cross into a precipice if there is one.”  

    Science and technology companies don’t have a way to give mathematical guarantees that AI systems are going to behave as programmed, he adds. This unreliability, he says, could lead to catastrophic outcomes.

    Dalrymple and Bengio argue that current techniques to mitigate the risk of advanced AI systems (such as red-teaming, where people probe AI systems for flaws) have serious limitations and can’t be relied on to ensure that critical systems don’t go off-piste.

    Instead, they hope the program will provide new ways to secure AI systems that rely less on human efforts and more on mathematical certainty. The vision is to build a “gatekeeper” AI, which is tasked with understanding and reducing the safety risks of other AI agents. This gatekeeper would ensure that AI agents functioning in high-stakes sectors, such as transport or energy systems, operate as we want them to. The idea is to collaborate with companies early on to understand how AI safety mechanisms could be useful for different sectors, says Dalrymple.

    The complexity of advanced systems means we have no choice but to use AI to safeguard AI, argues Bengio. “That’s the only way, because at some point these AIs are just too complicated. Even the ones that we have now, we can’t really break down their answers into human, understandable sequences of reasoning steps,” he says.

    © 2025 Ztoog.
