Ztoog
    Meta releases open source AI audio tools, AudioCraft


    On Wednesday, Meta announced it is open-sourcing AudioCraft, a suite of generative AI tools for creating music and audio from text prompts. With the tools, content creators can input simple text descriptions to generate complex audio landscapes, compose melodies, or even simulate entire virtual orchestras.

    AudioCraft consists of three core components: AudioGen, a tool for generating various audio effects and soundscapes; MusicGen, which can create musical compositions and melodies from descriptions; and EnCodec, a neural network-based audio compression codec.

    In particular, Meta says that EnCodec, which we first covered in November, has recently been improved and allows for “higher quality music generation with fewer artifacts.” Also, AudioGen can create audio sound effects like a dog barking, a car horn honking, or footsteps on a wooden floor. And MusicGen can whip up songs of various genres from scratch, based on descriptions like “Pop dance track with catchy melodies, tropical percussions, and upbeat rhythms, perfect for the beach.”

    Meta has provided several audio samples on its website for evaluation. The results seem in line with their state-of-the-art labeling, but arguably they aren’t quite high quality enough to replace professionally produced commercial audio effects or music.

    Meta notes that while generative AI models centered around text and still images have received lots of attention (and are relatively easy for people to experiment with online), development in generative audio tools has lagged behind. “There’s some work out there, but it’s highly complicated and not very open, so people aren’t able to readily play with it,” they write. But they hope that AudioCraft’s release under the MIT License will contribute to the broader community by providing accessible tools for audio and musical experimentation.

    “The models are available for research purposes and to further people’s understanding of the technology. We’re excited to give researchers and practitioners access so they can train their own models with their own datasets for the first time and help advance the state of the art,” Meta said.

    Meta isn’t the first company to experiment with AI-powered audio and music generators. Among some of the more notable recent attempts, OpenAI debuted its Jukebox in 2020, Google debuted MusicLM in January, and last December, an independent research team created a text-to-music generation platform called Riffusion using a Stable Diffusion base.

    None of these generative audio projects have attracted as much attention as image synthesis models, but that doesn’t mean developing them is any simpler, as Meta notes on its website:

    Generating high-fidelity audio of any kind requires modeling complex signals and patterns at varying scales. Music is arguably the most challenging type of audio to generate because it’s composed of local and long-range patterns, from a suite of notes to a global musical structure with multiple instruments. Generating coherent music with AI has often been addressed through the use of symbolic representations like MIDI or piano rolls. However, these approaches are unable to fully grasp the expressive nuances and stylistic elements found in music. More recent advances leverage self-supervised audio representation learning and a number of hierarchical or cascaded models to generate music, feeding the raw audio into a complex system in order to capture long-range structures in the signal while generating quality audio. But we knew that more could be done in this field.

    Amid controversy over undisclosed and potentially unethical training material used to create image synthesis models such as Stable Diffusion, DALL-E, and Midjourney, it’s notable that Meta says that MusicGen was trained on “20,000 hours of music owned by Meta or licensed specifically for this purpose.” On its surface, that seems like a move in a more ethical direction that may please some critics of generative AI.

    It will be interesting to see how open source developers choose to integrate these Meta audio models in their work. It may result in some interesting and easy-to-use generative audio tools in the near future. For now, the more code-savvy among us can find model weights and code for the three AudioCraft tools on GitHub.
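
For readers who want to try the text-to-music workflow described above, here is a minimal sketch in Python. It assumes the `audiocraft` package and PyTorch are installed, and that the `facebook/musicgen-small` checkpoint name is available for download. The helper names `slugify_prompt` and `generate_clips` are hypothetical and exist only for this illustration; they are not part of Meta's library.

```python
import re
from pathlib import Path


def slugify_prompt(prompt: str, max_len: int = 40) -> str:
    """Turn a text prompt into a short, filesystem-safe file stem."""
    slug = re.sub(r"[^a-z0-9]+", "-", prompt.lower()).strip("-")
    return slug[:max_len].rstrip("-") or "clip"


def generate_clips(prompts, duration_s=8, out_dir="clips"):
    """Generate one audio file per prompt with MusicGen; return the file stems."""
    # Imported lazily so the helper above works without audiocraft installed.
    from audiocraft.models import MusicGen
    from audiocraft.data.audio import audio_write

    # Assumption: the small pretrained checkpoint; weights download on first use.
    model = MusicGen.get_pretrained("facebook/musicgen-small")
    model.set_generation_params(duration=duration_s)  # clip length in seconds

    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)

    prompts = list(prompts)
    wavs = model.generate(prompts)  # one waveform tensor per prompt
    stems = []
    for prompt, wav in zip(prompts, wavs):
        stem = out / slugify_prompt(prompt)
        # audio_write adds the file extension (".wav" by default).
        audio_write(str(stem), wav.cpu(), model.sample_rate, strategy="loudness")
        stems.append(stem)
    return stems
```

Calling `generate_clips(["Pop dance track with catchy melodies, tropical percussions, and upbeat rhythms, perfect for the beach"])` would then write a single clip under `clips/`. Generation is slow without a GPU, and two prompts that produce the same slug would overwrite each other in this sketch.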

    © 2025 Ztoog.
