Ztoog
    Text-to-image AI models can be tricked into generating disturbing images


    Their work, which they will present at the IEEE Symposium on Security and Privacy in May next year, shines a light on how easy it is to force generative AI models into disregarding their own guardrails and policies, a practice known as “jailbreaking.” It also demonstrates how difficult it is to prevent these models from generating such content, because it is included in the vast troves of data they have been trained on, says Zico Kolter, an associate professor at Carnegie Mellon University. He demonstrated a similar form of jailbreaking on ChatGPT earlier this year but was not involved in this research.

    “We have to take into account the potential risks in releasing software and tools that have known security flaws into larger software systems,” he says.

    All major generative AI models have safety filters to prevent users from prompting them to produce pornographic, violent, or otherwise inappropriate images. The models won’t generate images from prompts that contain sensitive terms like “naked,” “murder,” or “sexy.”
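As a rough illustration of the simplest kind of filter described above, the sketch below rejects any prompt that contains a listed term. It is a minimal example under stated assumptions, not how any particular model's filter is implemented: the blocklist holds only the three terms quoted in the article, and the names `BLOCKED_TERMS` and `is_blocked` are hypothetical.

```python
import re

# Hypothetical blocklist: only the three example terms quoted in the article.
BLOCKED_TERMS = {"naked", "murder", "sexy"}


def is_blocked(prompt: str) -> bool:
    """Return True if any blocked term appears as a whole word in the prompt."""
    # Lowercase the prompt and pull out alphabetic words so that
    # punctuation and capitalization do not hide a listed term.
    words = re.findall(r"[a-z]+", prompt.lower())
    return any(word in BLOCKED_TERMS for word in words)
```

A filter of this kind matches only the exact terms on its list, so production systems typically combine it with other checks.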

    But this new jailbreaking method, dubbed “SneakyPrompt” by its creators from Johns Hopkins University and Duke University, uses reinforcement learning to create written prompts that look like garbled nonsense to us but that AI models learn to recognize as hidden requests for disturbing images. It essentially works by turning the way text-to-image AI models function against them.

    These models convert text-based requests into tokens, breaking words up into strings of words or characters, to process the command the prompt has given them. SneakyPrompt repeatedly tweaks a prompt’s tokens to try to force the model to generate banned images, adjusting its approach until it is successful. This technique makes it quicker and easier to generate such images than if somebody had to input each entry manually, and it can generate entries that humans wouldn’t imagine trying.
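The tokenization step mentioned above can be sketched with a toy example. This is only an illustration of splitting words into smaller pieces: real text-to-image models use learned tokenizers with far larger vocabularies, and the vocabulary, the greedy longest-match rule, and the function name `tokenize` here are all assumptions made for the example.

```python
def tokenize(text: str, vocab: set[str]) -> list[str]:
    """Split text into the longest vocabulary pieces, left to right."""
    tokens = []
    for word in text.lower().split():
        start = 0
        while start < len(word):
            # Try the longest candidate piece first, then shorter ones.
            for end in range(len(word), start, -1):
                piece = word[start:end]
                if piece in vocab:
                    tokens.append(piece)
                    start = end
                    break
            else:
                # No vocabulary piece matched: fall back to one character.
                tokens.append(word[start])
                start += 1
    return tokens


# Hypothetical toy vocabulary, chosen only for this example.
TOY_VOCAB = {"a", "cat", "rid", "ing", "bi", "cycle"}
```

With this vocabulary, `tokenize("a cat riding a bicycle", TOY_VOCAB)` splits "riding" into "rid" and "ing" and "bicycle" into "bi" and "cycle", while a word with no matching pieces falls back to single characters.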

    © 2025 Ztoog.
