Close Menu
Ztoog
    What's Hot
    Crypto

    How Urvashi Barooah broke into venture after everyone told her she couldn’t

    Mobile

    Tecno shows off a rollable phone prototype, better UTG for foldables and color changing tech

    Gadgets

    Flight of the RoboBees: Advancements in Miniature Robotics

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      Can work-life balance tracking improve well-being?

      Any wall can be turned into a camera to see around corners

      JD Vance and President Trump’s Sons Hype Bitcoin at Las Vegas Conference

      AI may already be shrinking entry-level jobs in tech, new research suggests

      Today’s NYT Strands Hints, Answer and Help for May 26 #449

    • Technology

      Elon Musk tries to stick to spaceships

      A Replit employee details a critical security flaw in web apps created using AI-powered app builder Lovable that exposes API keys and personal info of app users (Reed Albergotti/Semafor)

      Gemini in Google Drive can now help you skip watching that painfully long Zoom meeting

      Apple iPhone exports from China to the US fall 76% as India output surges

      Today’s NYT Wordle Hints, Answer and Help for May 26, #1437

    • Gadgets

      Future-proof your career by mastering AI skills for just $20

      8 Best Vegan Meal Delivery Services and Kits (2025), Tested and Reviewed

      Google Home is getting deeper Gemini integration and a new widget

      Google Announces AI Ultra Subscription Plan With Premium Features

      Google shows off Android XR-based glasses, announces Warby Parker team-up

    • Mobile

      Deals: the Galaxy S25 series comes with a free tablet, Google Pixels heavily discounted

      Microsoft is done being subtle – this new tool screams “upgrade now”

      Wallpaper Wednesday: Android wallpapers 2025-05-28

      Google can make smart glasses accessible with Warby Parker, Gentle Monster deals

      vivo T4 Ultra specs leak

    • Science

      June skygazing: A strawberry moon, the summer solstice… and Asteroid Day!

      Analysts Say Trump Trade Wars Would Harm the Entire US Energy Sector, From Oil to Solar

      Do we have free will? Quantum experiments may soon reveal the answer

      Was Planet Nine exiled from the solar system as a baby?

      How farmers can help rescue water-loving birds

    • AI

      Rationale engineering generates a compact new tool for gene therapy | Ztoog

      The AI Hype Index: College students are hooked on ChatGPT

      Learning how to predict rare kinds of failures | Ztoog

      Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

      AI learns how vision and sound are connected, without human intervention | Ztoog

    • Crypto

      Bitcoin Maxi Isn’t Buying Hype Around New Crypto Holding Firms

      GameStop bought $500 million of bitcoin

      CoinW Teams Up with Superteam Europe to Conclude Solana Hackathon and Accelerate Web3 Innovation in Europe

      Ethereum Net Flows Turn Negative As Bulls Push For $3,500

      Bitcoin’s Power Compared To Nuclear Reactor By Brazilian Business Leader

    Ztoog
    Home » Text-to-image AI models can be tricked into generating disturbing images
    AI

    Text-to-image AI models can be tricked into generating disturbing images

    Facebook Twitter Pinterest WhatsApp
    Text-to-image AI models can be tricked into generating disturbing images
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Their work, which they may current on the IEEE Symposium on Security and Privacy in May subsequent yr, shines a light-weight on how straightforward it’s to power generative AI models into disregarding their very own guardrails and insurance policies, referred to as “jailbreaking.” It additionally demonstrates how troublesome it’s to forestall these models from generating such content material, because it’s included within the huge troves of knowledge they’ve been educated on, says Zico Kolter, an affiliate professor at Carnegie Mellon University. He demonstrated an identical type of jailbreaking on ChatGPT earlier this yr however was not concerned on this analysis.

    “We have to take into account the potential risks in releasing software and tools that have known security flaws into larger software systems,” he says.

    All main generative AI models have security filters to forestall customers from prompting them to provide pornographic, violent, or in any other case inappropriate images. The models gained’t generate images from prompts that include delicate phrases like “naked,” “murder,” or “sexy.”

    But this new jailbreaking methodology, dubbed “SneakyPrompt” by its creators from Johns Hopkins University and Duke University, makes use of reinforcement studying to create written prompts that appear to be garbled nonsense to us however that AI models study to acknowledge as hidden requests for disturbing images. It basically works by turning the best way text-to-image AI models perform towards them.

    These models convert text-based requests into tokens—breaking phrases up into strings of phrases or characters—to course of the command the immediate has given them. SneakyPrompt repeatedly tweaks a immediate’s tokens to attempt to power it to generate banned images, adjusting its method till it’s profitable. This approach makes it faster and simpler to generate such images than if anyone needed to enter every entry manually, and it can generate entries that people wouldn’t think about attempting.

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    Rationale engineering generates a compact new tool for gene therapy | Ztoog

    AI

    The AI Hype Index: College students are hooked on ChatGPT

    AI

    Learning how to predict rare kinds of failures | Ztoog

    AI

    Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

    AI

    AI learns how vision and sound are connected, without human intervention | Ztoog

    AI

    How AI is introducing errors into courtrooms

    AI

    With AI, researchers predict the location of virtually any protein within a human cell | Ztoog

    AI

    Google DeepMind’s new AI agent cracks real-world problems better than humans can

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    Mobile

    Honor Magic 6 Pro camera review

    The Honor Magic 6 Pro camera is the primary smartphone camera that has blown me…

    Gadgets

    Save 50% on a wireless outdoor security system from Blink at Amazon

    We might earn income from the merchandise obtainable on this web page and take part…

    Science

    Collision review: How CERN’s stellar secrets became sci-fi gold

    Inside CMS, one of many Large Hadron Collider’s key experiments, in 2017Maximilien Brice/cern Collision Edited by Rob…

    Technology

    We drive a gilded lily: The 2024 Mercedes-AMG EQE SUV

    Enlarge / A brand new grille and wheels are the important thing giveaway that that…

    Gadgets

    The best fire starters for camping and fireplaces

    We might earn income from the merchandise accessible on this web page and take part…

    Our Picks
    Science

    5 mind-bending numbers that could reveal the secrets of the universe

    The Future

    Google Search AI Gives Ridiculous, Wrong Answers

    Mobile

    Feast your eyes on these leaked Pixel Watch 2 watch faces

    Categories
    • AI (1,493)
    • Crypto (1,754)
    • Gadgets (1,805)
    • Mobile (1,851)
    • Science (1,867)
    • Technology (1,803)
    • The Future (1,649)
    Most Popular
    Crypto

    Binance Announces Complete Phase-Out By December

    Gadgets

    Avast ordered to stop selling browsing data from its browsing privacy apps

    Gadgets

    Make waves in 2025: Exhibit at Ztoog events

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.