Close Menu
Ztoog
    What's Hot
    Mobile

    Meta Quest 3’s more affordable sibling may have leaked in new render

    Science

    A Brain Implant Helped Stroke Survivors Regain Movement

    Mobile

    Amazon is now offering an even better deal on Galaxy Tab S9 FE+

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      How I Turn Unstructured PDFs into Revenue-Ready Spreadsheets

      Is it the best tool for 2025?

      The clocks that helped define time from London’s Royal Observatory

      Summer Movies Are Here, and So Are the New Popcorn Buckets

      India-Pak conflict: Pak appoints ISI chief, appointment comes in backdrop of the Pahalgam attack

    • Technology

      Ensure Hard Work Is Recognized With These 3 Steps

      Cicada map 2025: Where will Brood XIV cicadas emerge this spring?

      Is Duolingo the face of an AI jobs crisis?

      The US DOD transfers its AI-based Open Price Exploration for National Security program to nonprofit Critical Minerals Forum to boost Western supply deals (Ernest Scheyder/Reuters)

      The more Google kills Fitbit, the more I want a Fitbit Sense 3

    • Gadgets

      Maono Caster G1 Neo & PD200X Review: Budget Streaming Gear for Aspiring Creators

      Apple plans to split iPhone 18 launch into two phases in 2026

      Upgrade your desk to Starfleet status with this $95 USB-C hub

      37 Best Graduation Gift Ideas (2025): For College Grads

      Backblaze responds to claims of “sham accounting,” customer backups at risk

    • Mobile

      Samsung Galaxy S25 Edge promo materials leak

      What are people doing with those free T-Mobile lines? Way more than you’d expect

      Samsung doesn’t want budget Galaxy phones to use exclusive AI features

      COROS’s charging adapter is a neat solution to the smartwatch charging cable problem

      Fortnite said to return to the US iOS App Store next week following court verdict

    • Science

      Failed Soviet probe will soon crash to Earth – and we don’t know where

      Trump administration cuts off all future federal funding to Harvard

      Does kissing spread gluten? New research offers a clue.

      Why Balcony Solar Panels Haven’t Taken Off in the US

      ‘Dark photon’ theory of light aims to tear up a century of physics

    • AI

      How to build a better AI benchmark

      Q&A: A roadmap for revolutionizing health care through data-driven innovation | Ztoog

      This data set helps researchers spot harmful stereotypes in LLMs

      Making AI models more trustworthy for high-stakes settings | Ztoog

      The AI Hype Index: AI agent cyberattacks, racing robots, and musical models

    • Crypto

      ‘The Big Short’ Coming For Bitcoin? Why BTC Will Clear $110,000

      Bitcoin Holds Above $95K Despite Weak Blockchain Activity — Analytics Firm Explains Why

      eToro eyes US IPO launch as early as next week amid easing concerns over Trump’s tariffs

      Cardano ‘Looks Dope,’ Analyst Predicts Big Move Soon

      Speak at Ztoog Disrupt 2025: Applications now open

    Ztoog
    Home » 3 Questions: What you need to know about audio deepfakes | Ztoog
    AI

    3 Questions: What you need to know about audio deepfakes | Ztoog

    Facebook Twitter Pinterest WhatsApp
    3 Questions: What you need to know about audio deepfakes | Ztoog
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Audio deepfakes have had a latest bout of unhealthy press after a synthetic intelligence-generated robocall purporting to be the voice of Joe Biden hit up New Hampshire residents, urging them not to solid ballots. Meanwhile, spear-phishers — phishing campaigns that concentrate on a particular individual or group, particularly utilizing data identified to be of curiosity to the goal — go fishing for cash, and actors goal to protect their audio likeness.

    What receives much less press, nevertheless, are among the makes use of of audio deepfakes that would truly profit society. In this Q&A ready for Ztoog, postdoc Nauman Dawalatabad addresses considerations in addition to potential upsides of the rising tech. A fuller model of this interview could be seen on the video beneath.

    Q: What moral concerns justify the concealment of the supply speaker’s identification in audio deepfakes, particularly when this expertise is used for creating revolutionary content material?

    A: The inquiry into why analysis is essential in obscuring the identification of the supply speaker, regardless of a big main use of generative fashions for audio creation in leisure, for instance, does increase moral concerns. Speech doesn’t comprise the knowledge solely about “who you are?” (identification) or (*3*) (content material); it encapsulates a myriad of delicate data together with age, gender, accent, present well being, and even cues about the upcoming future well being situations. For occasion, our latest analysis paper on “Detecting Dementia from Long Neuropsychological Interviews” demonstrates the feasibility of detecting dementia from speech with significantly excessive accuracy. Moreover, there are a number of fashions that may detect gender, accent, age, and different data from speech with very excessive accuracy. There is a need for developments in expertise that safeguard in opposition to the inadvertent disclosure of such personal knowledge. The endeavor to anonymize the supply speaker’s identification is just not merely a technical problem however an ethical obligation to protect particular person privateness within the digital age.

    Q: How can we successfully maneuver by means of the challenges posed by audio deepfakes in spear-phishing assaults, making an allowance for the related dangers, the event of countermeasures, and the development of detection methods?

    A: The deployment of audio deepfakes in spear-phishing assaults introduces a number of dangers, together with the propagation of misinformation and pretend information, identification theft, privateness infringements, and the malicious alteration of content material. The latest circulation of misleading robocalls in Massachusetts exemplifies the detrimental affect of such expertise. We additionally just lately spoke with the spoke with The Boston Globe about this expertise, and the way simple and cheap it’s to generate such deepfake audios.

    Anyone with no important technical background can simply generate such audio, with a number of obtainable instruments on-line. Such faux information from deepfake turbines can disturb monetary markets and even electoral outcomes. The theft of 1’s voice to entry voice-operated financial institution accounts and the unauthorized utilization of 1’s vocal identification for monetary achieve are reminders of the pressing need for strong countermeasures. Further dangers might embody privateness violation, the place an attacker can make the most of the sufferer’s audio with out their permission or consent. Further, attackers may alter the content material of the unique audio, which may have a severe affect.

    Two main and distinguished instructions have emerged in designing techniques to detect faux audio: artifact detection and liveness detection. When audio is generated by a generative mannequin, the mannequin introduces some artifact within the generated sign. Researchers design algorithms/fashions to detect these artifacts. However, there are some challenges with this strategy due to growing sophistication of audio deepfake turbines. In the longer term, we may additionally see fashions with very small or virtually no artifacts. Liveness detection, however, leverages the inherent qualities of pure speech, comparable to respiratory patterns, intonations, or rhythms, that are difficult for AI fashions to replicate precisely. Some firms like Pindrop are growing such options for detecting audio fakes. 

    Additionally, methods like audio watermarking function proactive defenses, embedding encrypted identifiers inside the unique audio to hint its origin and deter tampering. Despite different potential vulnerabilities, comparable to the danger of replay assaults, ongoing analysis and improvement on this enviornment provide promising options to mitigate the threats posed by audio deepfakes.

    Q: Despite their potential for misuse, what are some optimistic points and advantages of audio deepfake expertise? How do you think about the longer term relationship between AI and our experiences of audio notion will evolve?

    A: Contrary to the predominant concentrate on the nefarious purposes of audio deepfakes, the expertise harbors immense potential for optimistic affect throughout varied sectors. Beyond the realm of creativity, the place voice conversion applied sciences allow unprecedented flexibility in leisure and media, audio deepfakes maintain transformative promise in well being care and training sectors. My present ongoing work within the anonymization of affected person and physician voices in cognitive health-care interviews, as an illustration, facilitates the sharing of essential medical knowledge for analysis globally whereas guaranteeing privateness. Sharing this knowledge amongst researchers fosters improvement within the areas of cognitive well being care. The software of this expertise in voice restoration represents a hope for people with speech impairments, for instance, for ALS or dysarthric speech, enhancing communication skills and high quality of life.

    I’m very optimistic about the longer term affect of audio generative AI fashions. The future interaction between AI and audio notion is poised for groundbreaking developments, significantly by means of the lens of psychoacoustics — the examine of how people understand sounds. Innovations in augmented and digital actuality, exemplified by units just like the Apple Vision Pro and others, are pushing the boundaries of audio experiences in the direction of unparalleled realism. Recently we’ve seen an exponential enhance within the variety of subtle fashions arising virtually each month. This fast tempo of analysis and improvement on this subject guarantees not solely to refine these applied sciences but in addition to develop their purposes in ways in which profoundly profit society. Despite the inherent dangers, the potential for audio generative AI fashions to revolutionize well being care, leisure, training, and past is a testomony to the optimistic trajectory of this analysis subject.

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    How to build a better AI benchmark

    AI

    Q&A: A roadmap for revolutionizing health care through data-driven innovation | Ztoog

    AI

    This data set helps researchers spot harmful stereotypes in LLMs

    AI

    Making AI models more trustworthy for high-stakes settings | Ztoog

    AI

    The AI Hype Index: AI agent cyberattacks, racing robots, and musical models

    AI

    Novel method detects microbial contamination in cell cultures | Ztoog

    AI

    Seeing AI as a collaborator, not a creator

    AI

    “Periodic table of machine learning” could fuel AI discovery | Ztoog

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    Crypto

    Ethereum: Long-Term Holders Shape Its Future

    In the risky world of cryptocurrency, investor confidence is commonly gauged by the willingness to…

    The Future

    ‘Plastic’ rocks found on remote Brazil islands, scientists raise concerns

    On the island, located nearly 1,140 kilometres (708 miles) from the state of Espirito Santo within…

    The Future

    Rollable Phones and See-Through Laptops: What You Missed From MWC 2024

    If you want vaporware and taking a look at telephones and devices you’ll by no…

    Mobile

    You can now pin multiple messages in a WhatsApp chat

    Last December, WhatsApp rolled out the power to pin messages inside one-on-one conversations and group…

    Technology

    Stellantis CEO says there’s still life in Waymo deal for self-driving delivery vans

    Stellantis, the automaker that owns 14 manufacturers together with Chrysler, Jeep and Ram, and autonomous…

    Our Picks
    Science

    The gravitational waves that could shed light on the cosmic dark age

    AI

    Meet TensorRT-LLM: An Open-Source Library that Accelerates and Optimizes Inference Performance on the Latest LLMs on NVIDIA Tensor Core GPUs

    AI

    CMU Research Introduces CoVO-MPC (Covariance-Optimal MPC): A Novel Sampling-based MPC Algorithm that Optimizes the Convergence Rate

    Categories
    • AI (1,482)
    • Crypto (1,744)
    • Gadgets (1,796)
    • Mobile (1,839)
    • Science (1,853)
    • Technology (1,789)
    • The Future (1,635)
    Most Popular
    AI

    A Scene understanding, Accessibility, Navigation, Pathfinding, & Obstacle avoidance dataset – Google Research Blog

    Gadgets

    The best ratcheting screwdrivers in 2023

    Technology

    Getty Images subscribers to get access to AI image generator

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.