Close Menu
Ztoog
    What's Hot
    The Future

    OPPO Find N3 released at AU$2,699.00

    Mobile

    iPhone, iPad, and Mac users are getting locked out of Apple ID accounts

    Science

    Inside ALPHA-g: The detector measuring gravity’s effect on antimatter

    Important Pages:
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    Facebook X (Twitter) Instagram Pinterest
    Facebook X (Twitter) Instagram Pinterest
    Ztoog
    • Home
    • The Future

      Any wall can be turned into a camera to see around corners

      JD Vance and President Trump’s Sons Hype Bitcoin at Las Vegas Conference

      AI may already be shrinking entry-level jobs in tech, new research suggests

      Today’s NYT Strands Hints, Answer and Help for May 26 #449

      LiberNovo Omni: The World’s First Dynamic Ergonomic Chair

    • Technology

      A Replit employee details a critical security flaw in web apps created using AI-powered app builder Lovable that exposes API keys and personal info of app users (Reed Albergotti/Semafor)

      Gemini in Google Drive can now help you skip watching that painfully long Zoom meeting

      Apple iPhone exports from China to the US fall 76% as India output surges

      Today’s NYT Wordle Hints, Answer and Help for May 26, #1437

      5 Skills Kids (and Adults) Need in an AI World – O’Reilly

    • Gadgets

      Future-proof your career by mastering AI skills for just $20

      8 Best Vegan Meal Delivery Services and Kits (2025), Tested and Reviewed

      Google Home is getting deeper Gemini integration and a new widget

      Google Announces AI Ultra Subscription Plan With Premium Features

      Google shows off Android XR-based glasses, announces Warby Parker team-up

    • Mobile

      Microsoft is done being subtle – this new tool screams “upgrade now”

      Wallpaper Wednesday: Android wallpapers 2025-05-28

      Google can make smart glasses accessible with Warby Parker, Gentle Monster deals

      vivo T4 Ultra specs leak

      Forget screens: more details emerge on the mysterious Jony Ive + OpenAI device

    • Science

      Analysts Say Trump Trade Wars Would Harm the Entire US Energy Sector, From Oil to Solar

      Do we have free will? Quantum experiments may soon reveal the answer

      Was Planet Nine exiled from the solar system as a baby?

      How farmers can help rescue water-loving birds

      A trip to the farm where loofahs grow on vines

    • AI

      Rationale engineering generates a compact new tool for gene therapy | Ztoog

      The AI Hype Index: College students are hooked on ChatGPT

      Learning how to predict rare kinds of failures | Ztoog

      Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

      AI learns how vision and sound are connected, without human intervention | Ztoog

    • Crypto

      GameStop bought $500 million of bitcoin

      CoinW Teams Up with Superteam Europe to Conclude Solana Hackathon and Accelerate Web3 Innovation in Europe

      Ethereum Net Flows Turn Negative As Bulls Push For $3,500

      Bitcoin’s Power Compared To Nuclear Reactor By Brazilian Business Leader

      Senate advances GENIUS Act after cloture vote passes

    Ztoog
    Home » MusicMagus: Harnessing Diffusion Models for Zero-Shot Text-to-Music Editing
    AI

    MusicMagus: Harnessing Diffusion Models for Zero-Shot Text-to-Music Editing

    Facebook Twitter Pinterest WhatsApp
    MusicMagus: Harnessing Diffusion Models for Zero-Shot Text-to-Music Editing
    Share
    Facebook Twitter LinkedIn Pinterest WhatsApp

    Music era has lengthy been an interesting area, mixing creativity with know-how to provide compositions that resonate with human feelings. The course of entails producing music that aligns with particular themes or feelings conveyed by way of textual descriptions. While growing music from textual content has seen exceptional progress, a big problem stays: modifying the generated music to refine or alter particular components with out ranging from scratch. This job entails intricate changes to the music’s attributes, reminiscent of altering an instrument’s sound or the piece’s total temper, with out affecting its core construction.

    Models are primarily divided into autoregressive (AR) and diffusion-based classes. AR fashions produce longer, higher-quality audio at the price of longer inference instances, and diffusion fashions excel in parallel decoding regardless of challenges in producing prolonged sequences. The modern MagNet mannequin merges AR and diffusion benefits, optimizing high quality and effectivity. While fashions like InstructME and M2UGen display inter-stem and intra-stem modifying capabilities, Loop Copilot facilitates compositional modifying with out altering the unique fashions’ structure or interface.

    Researchers from QMU London, Sony AI, and MBZUAI have launched a novel method named MusicMagus. This method provides a classy but user-friendly answer for modifying music generated from textual content descriptions. By leveraging superior diffusion fashions, MusicMagus permits exact modifications to particular musical attributes whereas sustaining the integrity of the unique composition. 

    MusicMagus showcases its unparalleled means to edit and refine music by way of refined methodologies and modern use of datasets. The system’s spine is constructed upon the prowess of the AudioLDM 2 mannequin, which makes use of a variational autoencoder (VAE) framework for compressing music audio spectrograms right into a latent house. This house is then manipulated to generate or edit music primarily based on textual descriptions, bridging the hole between textual enter and musical output. The modifying mechanism of MusicMagus leverages the latent capacities of pre-trained diffusion-based fashions, a novel method that considerably enhances its modifying accuracy and suppleness.

    Researchers performed intensive experiments to validate MusicMagus’s effectiveness, which concerned crucial duties reminiscent of timbre and elegance switch, evaluating its efficiency in opposition to established baselines like AudioLDM 2, Transplayer, and MusicGen. These comparative analyses are grounded in using metrics reminiscent of CLAP Similarity and Chromagram Similarity for goal evaluations and Overall Quality (OVL), Relevance (REL), and Structural Consistency (CON) for subjective assessments. Results reveal MusicMagus outperforming baselines with a notable CLAP Similarity rating enhance of as much as 0.33 and Chromagram Similarity of 0.77, indicating a big development in sustaining music’s semantic integrity and structural consistency. The datasets employed in these experiments, together with POP909 and MAESTRO for the timbre switch job, have performed a vital position in demonstrating MusicMagus’s superior capabilities in altering musical semantics whereas preserving the unique composition’s essence.

    In conclusion, MusicMagus introduces a pioneering text-to-music modifying framework adept at manipulating particular musical points whereas preserving the integrity of the composition. Although it faces challenges with multi-instrument music era, editability versus constancy trade-offs, and sustaining construction throughout substantial modifications, it marks a big development in music modifying know-how. Despite its limitations in dealing with lengthy sequences and being confined to a 16kHz sampling price, MusicMagus considerably advances the state-of-the-art type and timbre switch, showcasing its modern method to music modifying.


    Check out the Paper. All credit score for this analysis goes to the researchers of this challenge. Also, don’t neglect to comply with us on Twitter. Join our 37k+ ML SubReddit, 41k+ Facebook Community, Discord Channel, and LinkedIn Group.

    If you want our work, you’ll love our publication..

    Don’t Forget to affix our Telegram Channel


    Nikhil is an intern advisor at Marktechpost. He is pursuing an built-in twin diploma in Materials on the Indian Institute of Technology, Kharagpur. Nikhil is an AI/ML fanatic who’s at all times researching purposes in fields like biomaterials and biomedical science. With a powerful background in Material Science, he’s exploring new developments and creating alternatives to contribute.


    🚀 LLMWare Launches SLIMs: Small Specialized Function-Calling Models for Multi-Step Automation [Check out all the models]

    Share. Facebook Twitter Pinterest LinkedIn WhatsApp

    Related Posts

    AI

    Rationale engineering generates a compact new tool for gene therapy | Ztoog

    AI

    The AI Hype Index: College students are hooked on ChatGPT

    AI

    Learning how to predict rare kinds of failures | Ztoog

    AI

    Anthropic’s new hybrid AI model can work on tasks autonomously for hours at a time

    AI

    AI learns how vision and sound are connected, without human intervention | Ztoog

    AI

    How AI is introducing errors into courtrooms

    AI

    With AI, researchers predict the location of virtually any protein within a human cell | Ztoog

    AI

    Google DeepMind’s new AI agent cracks real-world problems better than humans can

    Leave A Reply Cancel Reply

    Follow Us
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    Top Posts
    Mobile

    Samsung’s working on a cheap Galaxy Z Flip and a surprise for the Watch 8

    What you could knowTwo new Samsung units have already handed by means of the GSMA…

    Gadgets

    New 6GB version of the RTX 3050 may be Nvidia’s first sub-$200 GPU in over 4 years

    Gigabyte Nvidia launched three new GPUs final month, half of a Super overhaul of the…

    Mobile

    Spotify is bricking its Car Thing gadget and you can’t do anything about it

    TL;DR Spotify has discontinued its Car Thing gadget, used for listening to and controlling Spotify…

    Mobile

    Coros Pace 3 review: Should you buy it?

    Coros is coming after Garmin with a third-generation GPS watch that is smaller, lighter, and…

    Mobile

    Weekly deals roundup: 4K Sony Xperia 1 III and Moto edge+ 512GB at half off!

    (*1*) We’ve compiled one of the best telephone deals that popped up this week and…

    Our Picks
    Mobile

    Deals: Xiaomi 14 Ultra arrives, Samsung Galaxy A55 and A35 go on pre-order

    Technology

    Nest revival? Google’s smart speakers may be poised for a long-due refresh

    Mobile

    iPhone 17 and 17 Plus to get 120Hz displays with Always-on

    Categories
    • AI (1,493)
    • Crypto (1,753)
    • Gadgets (1,805)
    • Mobile (1,850)
    • Science (1,866)
    • Technology (1,802)
    • The Future (1,648)
    Most Popular
    The Future

    Story of military airfield in Afghanistan that Biden left in 2021

    Technology

    Build Your Own Tiny PC With This Motherboard

    Mobile

    The overclocked Snapdragon 8 Gen 2 is coming to more phones

    Ztoog
    Facebook X (Twitter) Instagram Pinterest
    • Home
    • About Us
    • Contact us
    • Privacy Policy
    • Terms & Conditions
    © 2025 Ztoog.

    Type above and press Enter to search. Press Esc to cancel.