Close Menu
Ztoog
    What's Hot
    Crypto

    Bitwise CEO Says Bitcoin At $250,000 Is Closer Than You Think

    Gadgets

    The best shredders for small offices in 2024

    Gadgets

    The best back-to-school deals you can get right now

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      How I Turn Unstructured PDFs into Revenue-Ready Spreadsheets

      Is it the best tool for 2025?

      The clocks that helped define time from London’s Royal Observatory

      Summer Movies Are Here, and So Are the New Popcorn Buckets

      India-Pak conflict: Pak appoints ISI chief, appointment comes in backdrop of the Pahalgam attack

    • Technology

      Ensure Hard Work Is Recognized With These 3 Steps

      Cicada map 2025: Where will Brood XIV cicadas emerge this spring?

      Is Duolingo the face of an AI jobs crisis?

      The US DOD transfers its AI-based Open Price Exploration for National Security program to nonprofit Critical Minerals Forum to boost Western supply deals (Ernest Scheyder/Reuters)

      The more Google kills Fitbit, the more I want a Fitbit Sense 3

    • Gadgets

      Apple plans to split iPhone 18 launch into two phases in 2026

      Upgrade your desk to Starfleet status with this $95 USB-C hub

      37 Best Graduation Gift Ideas (2025): For College Grads

      Backblaze responds to claims of “sham accounting,” customer backups at risk

      Snapdragon X Plus Could Bring Faster, More Powerful Chromebooks

    • Mobile

      Samsung Galaxy S25 Edge promo materials leak

      What are people doing with those free T-Mobile lines? Way more than you’d expect

      Samsung doesn’t want budget Galaxy phones to use exclusive AI features

      COROS’s charging adapter is a neat solution to the smartwatch charging cable problem

      Fortnite said to return to the US iOS App Store next week following court verdict

    • Science

      Failed Soviet probe will soon crash to Earth – and we don’t know where

      Trump administration cuts off all future federal funding to Harvard

      Does kissing spread gluten? New research offers a clue.

      Why Balcony Solar Panels Haven’t Taken Off in the US

      ‘Dark photon’ theory of light aims to tear up a century of physics

    • AI

      How to build a better AI benchmark

      Q&A: A roadmap for revolutionizing health care through data-driven innovation | Ztoog

      This data set helps researchers spot harmful stereotypes in LLMs

      Making AI models more trustworthy for high-stakes settings | Ztoog

      The AI Hype Index: AI agent cyberattacks, racing robots, and musical models

    • Crypto

      ‘The Big Short’ Coming For Bitcoin? Why BTC Will Clear $110,000

      Bitcoin Holds Above $95K Despite Weak Blockchain Activity — Analytics Firm Explains Why

      eToro eyes US IPO launch as early as next week amid easing concerns over Trump’s tariffs

      Cardano ‘Looks Dope,’ Analyst Predicts Big Move Soon

      Speak at Ztoog Disrupt 2025: Applications now open

    Ztoog
    Home » To avoid AI doom, learn from nuclear safety
    AI

    To avoid AI doom, learn from nuclear safety

    Facebook Twitter Pinterest WhatsApp
    To avoid AI doom, learn from nuclear safety
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Last week, a bunch of tech firm leaders and AI specialists pushed out one other open letter, declaring that mitigating the chance of human extinction because of AI ought to be as a lot of a worldwide precedence as stopping pandemics and nuclear struggle. (The first one, which referred to as for a pause in AI growth, has been signed by over 30,000 folks, together with many AI luminaries.)

    So how do firms themselves suggest we avoid AI damage? One suggestion comes from a new paper by researchers from Oxford, Cambridge, the University of Toronto, the University of  Montreal, Google DeepMind, OpenAI, Anthropic, a number of AI analysis nonprofits, and Turing Prize winner Yoshua Bengio. 

    They counsel that AI builders ought to consider a mannequin’s potential to trigger “extreme” dangers on the very early levels of growth, even earlier than beginning any coaching. These dangers embody the potential for AI fashions to control and deceive people, acquire entry to weapons, or discover cybersecurity vulnerabilities to use. 

    This analysis course of might assist builders resolve whether or not to proceed with a mannequin. If the dangers are deemed too excessive, the group suggests pausing growth till they are often mitigated. 

    “Leading AI companies that are pushing forward the frontier have a responsibility to be watchful of emerging issues and spot them early, so that we can address them as soon as possible,” says Toby Shevlane, a analysis scientist at DeepMind and the lead writer of the paper. 

    AI builders ought to conduct technical assessments to discover a mannequin’s harmful capabilities and decide whether or not it has the propensity to use these capabilities, Shevlane says. 

    One method DeepMind is testing whether or not an AI language mannequin can manipulate folks is thru a sport referred to as “Make-me-say.” In the sport, the mannequin tries to make the human sort a specific phrase, corresponding to “giraffe,” which the human doesn’t know upfront. The researchers then measure how typically the mannequin succeeds. 

    Similar duties might be created for various, extra harmful capabilities. The hope, Shevlane says, is that builders will be capable of construct a dashboard detailing how the mannequin has carried out, which might permit the researchers to guage what the mannequin might do within the unsuitable fingers. 

    The subsequent stage is to let exterior auditors and researchers assess the AI mannequin’s dangers earlier than and after it’s deployed. While tech firms would possibly acknowledge that exterior auditing and analysis are vital, there are totally different faculties of thought about precisely how a lot entry outsiders must do the job. 

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    How to build a better AI benchmark

    AI

    Q&A: A roadmap for revolutionizing health care through data-driven innovation | Ztoog

    AI

    This data set helps researchers spot harmful stereotypes in LLMs

    AI

    Making AI models more trustworthy for high-stakes settings | Ztoog

    AI

    The AI Hype Index: AI agent cyberattacks, racing robots, and musical models

    AI

    Novel method detects microbial contamination in cell cultures | Ztoog

    AI

    Seeing AI as a collaborator, not a creator

    AI

    “Periodic table of machine learning” could fuel AI discovery | Ztoog

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    Gadgets

    Roborock Announces Cleaning Innovations At IFA 2023

    Roborock, the maker of good house robots, introduced this Tuesday (29) throughout a press convention…

    AI

    A visual language model for UI and visually-situated language understanding – Google Research Blog

    Posted by Srinivas Sunkara and Gilles Baechler, Software Engineers, Google Research

    Gadgets

    Best Home Emergency Kit Gear (2023): Flashlights, Stoves, Chargers, and More

    If you are utilizing alkaline batteries, take away them from the flashlight if it should…

    AI

    Overcoming Gradient Inversion Challenges in Federated Learning: The DAGER Algorithm for Exact Text Reconstruction

    Federated studying allows collaborative mannequin coaching by aggregating gradients from a number of purchasers, thus…

    AI

    What is AI Hallucination? Is It Always a Bad Thing?

    The emergence of AI hallucinations has turn out to be a noteworthy side of the…

    Our Picks
    AI

    Semantic Hearing: A Machine Learning-Based Novel Capability for Hearable Devices to Focus on or Ignore Specific Sounds in Real Environments while Maintaining Spatial Awareness

    Mobile

    Disney Plus and Hulu are merging their apps for a test run next month

    AI

    UCSD Researchers Evaluate GPT-4’s Performance in a Turing Test: Unveiling the Dynamics of Human-like Deception and Communication Strategies

    Categories
    • AI (1,482)
    • Crypto (1,744)
    • Gadgets (1,795)
    • Mobile (1,839)
    • Science (1,853)
    • Technology (1,789)
    • The Future (1,635)
    Most Popular
    The Future

    ‘There is no such thing as a real picture,’ says Samsung exec

    AI

    Meet TensorRT-LLM: An Open-Source Library that Accelerates and Optimizes Inference Performance on the Latest LLMs on NVIDIA Tensor Core GPUs

    Crypto

    Kronos Research Halts Operations After Losing $26 Million In Security Breach

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.