Ztoog
    Text-to-image AI models can be tricked into generating disturbing images


    Their work, which they will present at the IEEE Symposium on Security and Privacy in May next year, shines a light on how easy it is to force generative AI models into disregarding their own guardrails and policies, known as “jailbreaking.” It also demonstrates how difficult it is to prevent these models from generating such content, as it’s included in the vast troves of data they’ve been trained on, says Zico Kolter, an associate professor at Carnegie Mellon University. He demonstrated a similar form of jailbreaking on ChatGPT earlier this year but was not involved in this research.

    “We have to take into account the potential risks in releasing software and tools that have known security flaws into larger software systems,” he says.

    All major generative AI models have safety filters to prevent users from prompting them to produce pornographic, violent, or otherwise inappropriate images. The models won’t generate images from prompts that contain sensitive terms like “naked,” “murder,” or “sexy.”
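As a rough illustration of the term-based filtering described above, here is a minimal sketch of a keyword check. The article does not describe how production safety filters are built, so the function name `is_blocked` and the `BLOCKED_TERMS` set are assumptions for illustration only; the three terms are the examples quoted in the text.

```python
import re

# Hypothetical blocklist: the three example terms quoted in the article.
BLOCKED_TERMS = {"naked", "murder", "sexy"}


def is_blocked(prompt: str) -> bool:
    """Return True if any whole word in the prompt is on the blocklist."""
    # Lowercase the prompt and pull out alphabetic words only.
    words = re.findall(r"[a-z]+", prompt.lower())
    return any(word in BLOCKED_TERMS for word in words)


print(is_blocked("a watercolor of a lighthouse at dawn"))
print(is_blocked("a murder mystery book cover"))
```

A check like this only matches exact words, which is one reason a filter based on sensitive terms alone can be brittle.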

    But this new jailbreaking method, dubbed “SneakyPrompt” by its creators from Johns Hopkins University and Duke University, uses reinforcement learning to create written prompts that look like garbled nonsense to us but that AI models learn to recognize as hidden requests for disturbing images. It essentially works by turning the way text-to-image AI models function against them.

    These models convert text-based requests into tokens, breaking words up into strings of words or characters, to process the command the prompt has given them. SneakyPrompt repeatedly tweaks a prompt’s tokens to try to force it to generate banned images, adjusting its approach until it is successful. This technique makes it quicker and easier to generate such images than if somebody had to input each entry manually, and it can generate entries that humans wouldn’t imagine trying.
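The tokenization step mentioned above can be pictured with a toy example. This is only a sketch under stated assumptions: the `VOCAB` set and the `toy_tokenize` function are hypothetical, and real text-to-image models use learned subword vocabularies that are far larger and more sophisticated than this. The point it shows is that a word the vocabulary does not know is still broken into smaller pieces rather than rejected.

```python
# Hypothetical toy vocabulary, chosen for illustration only.
VOCAB = {"a", "photo", "of", "the", "cat"}


def toy_tokenize(prompt: str) -> list[str]:
    """Split a prompt into known words, falling back to single characters."""
    tokens: list[str] = []
    for word in prompt.lower().split():
        if word in VOCAB:
            tokens.append(word)  # known word: keep it as one token
        else:
            tokens.extend(word)  # unknown word: break into characters
    return tokens


print(toy_tokenize("a photo of a zebra"))
# ['a', 'photo', 'of', 'a', 'z', 'e', 'b', 'r', 'a']
```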

    © 2025 Ztoog.
