Close Menu
Ztoog
    What's Hot
    Technology

    Virtual Reality Helps Students Improve Their Math Literacy

    Gadgets

    The best light bulb security cameras of 2024

    AI

    Finding value in generative AI for financial services

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      Any wall can be turned into a camera to see around corners

      JD Vance and President Trump’s Sons Hype Bitcoin at Las Vegas Conference

      AI may already be shrinking entry-level jobs in tech, new research suggests

      Today’s NYT Strands Hints, Answer and Help for May 26 #449

      LiberNovo Omni: The World’s First Dynamic Ergonomic Chair

    • Technology

      A Replit employee details a critical security flaw in web apps created using AI-powered app builder Lovable that exposes API keys and personal info of app users (Reed Albergotti/Semafor)

      Gemini in Google Drive can now help you skip watching that painfully long Zoom meeting

      Apple iPhone exports from China to the US fall 76% as India output surges

      Today’s NYT Wordle Hints, Answer and Help for May 26, #1437

      5 Skills Kids (and Adults) Need in an AI World – O’Reilly

    • Gadgets

      Future-proof your career by mastering AI skills for just $20

      8 Best Vegan Meal Delivery Services and Kits (2025), Tested and Reviewed

      Google Home is getting deeper Gemini integration and a new widget

      Google Announces AI Ultra Subscription Plan With Premium Features

      Google shows off Android XR-based glasses, announces Warby Parker team-up

    • Mobile

      Deals: the Galaxy S25 series comes with a free tablet, Google Pixels heavily discounted

      Microsoft is done being subtle – this new tool screams “upgrade now”

      Wallpaper Wednesday: Android wallpapers 2025-05-28

      Google can make smart glasses accessible with Warby Parker, Gentle Monster deals

      vivo T4 Ultra specs leak

    • Science

      Analysts Say Trump Trade Wars Would Harm the Entire US Energy Sector, From Oil to Solar

      Do we have free will? Quantum experiments may soon reveal the answer

      Was Planet Nine exiled from the solar system as a baby?

      How farmers can help rescue water-loving birds

      A trip to the farm where loofahs grow on vines

    • AI

      Rationale engineering generates a compact new tool for gene therapy | Ztoog

      The AI Hype Index: College students are hooked on ChatGPT

      Learning how to predict rare kinds of failures | Ztoog

      Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

      AI learns how vision and sound are connected, without human intervention | Ztoog

    • Crypto

      GameStop bought $500 million of bitcoin

      CoinW Teams Up with Superteam Europe to Conclude Solana Hackathon and Accelerate Web3 Innovation in Europe

      Ethereum Net Flows Turn Negative As Bulls Push For $3,500

      Bitcoin’s Power Compared To Nuclear Reactor By Brazilian Business Leader

      Senate advances GENIUS Act after cloture vote passes

    Ztoog
    Home » AI “godfather” Yoshua Bengio joins UK project to prevent AI catastrophes
    AI

    AI “godfather” Yoshua Bengio joins UK project to prevent AI catastrophes

    Facebook Twitter Pinterest WhatsApp
    AI “godfather” Yoshua Bengio joins UK project to prevent AI catastrophes
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Safeguarded AI’s objective is to construct AI programs that may provide quantitative ensures, akin to a threat rating, about their impact on the actual world, says David “davidad” Dalrymple, this system director for Safeguarded AI at ARIA. The concept is to complement human testing with mathematical evaluation of recent programs’ potential for hurt. 

    The project goals to construct AI security mechanisms by combining scientific world fashions, that are primarily simulations of the world, with mathematical proofs. These proofs would come with explanations of the AI’s work, and people could be tasked with verifying whether or not the AI mannequin’s security checks are right. 

    Bengio says he needs to assist make sure that future AI programs can not trigger critical hurt. 

    “We’re currently racing toward a fog behind which might be a precipice,” he says. “We don’t know how far the precipice is, or if there even is one, so it might be years, decades, and we don’t know how serious it could be … We need to build up the tools to clear that fog and make sure we don’t cross into a precipice if there is one.”  

    Science and expertise corporations don’t have a method to give mathematical ensures that AI programs are going to behave as programmed, he provides. This unreliability, he says, could lead on to catastrophic outcomes. 

    Dalrymple and Bengio argue that present methods to mitigate the chance of superior AI programs—akin to red-teaming, the place individuals probe AI programs for flaws—have critical limitations and might’t be relied on to make sure that crucial programs don’t go off-piste. 

    Instead, they hope this system will present new methods to safe AI programs that rely much less on human efforts and extra on mathematical certainty. The imaginative and prescient is to construct a “gatekeeper” AI, which is tasked with understanding and decreasing the security dangers of different AI brokers. This gatekeeper would make sure that AI brokers functioning in high-stakes sectors, akin to transport or power programs, function as we would like them to. The concept is to collaborate with corporations early on to perceive how AI security mechanisms might be helpful for various sectors, says Dalrymple. 

    The complexity of superior programs means we now have no selection however to use AI to safeguard AI, argues Bengio. “That’s the only way, because at some point these AIs are just too complicated. Even the ones that we have now, we can’t really break down their answers into human, understandable sequences of reasoning steps,” he says. 

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    Rationale engineering generates a compact new tool for gene therapy | Ztoog

    AI

    The AI Hype Index: College students are hooked on ChatGPT

    AI

    Learning how to predict rare kinds of failures | Ztoog

    AI

    Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

    AI

    AI learns how vision and sound are connected, without human intervention | Ztoog

    AI

    How AI is introducing errors into courtrooms

    AI

    With AI, researchers predict the location of virtually any protein within a human cell | Ztoog

    AI

    Google DeepMind’s new AI agent cracks real-world problems better than humans can

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    Mobile

    The best Galaxy Tab S9 deal known to man is back for a little while

    Samsung has introduced back the superior launch time Galaxy Tab S9 deal, supplying you with…

    Gadgets

    18 Best Subscription Boxes to Gift (2023): Services We Love

    Between birthdays, anniversaries, and holidays, it is easy to run out of reward concepts. Don’t…

    Mobile

    The new Galaxy Watch 6 has a new chipset and Samsung is showing it off

    What you might want to knowSamsung just lately launched the Galaxy Watch 6 sequence, which…

    Mobile

    Galaxy S24 series pre-order benefits in Europe outed along with exclusive colors

    Samsung’s massive unveiling occasion for the Galaxy S24 household is happening this Wednesday, on January…

    Mobile

    Samsung Galaxy A35 to come with a camera upgrade

    Samsung unveiled the Galaxy A34 in March, and we anticipate the Galaxy A35 to arrive…

    Our Picks
    Mobile

    Google Pixel 8 review: Primed for success

    Crypto

    Bitcoin Network’s First-Ever BRC20 Stablecoin Launched: Stably USD

    AI

    How AI assistants are already changing the way code gets made

    Categories
    • AI (1,493)
    • Crypto (1,753)
    • Gadgets (1,805)
    • Mobile (1,851)
    • Science (1,866)
    • Technology (1,802)
    • The Future (1,648)
    Most Popular
    Mobile

    Google might be letting loose its own Pixel tablet with a pen and keyboard

    Crypto

    VanEck Eyes $1 Trillion Market Cap As ETH Stalls

    The Future

    Offering The Best Electric Vehicle Car Park Management Solutions For Your Business

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.