Meta released a new open-source AI code called AudioCraft, which lets users create music and sounds entirely through generative AI. It consists of three AI models, all tackling different areas of sound generation. MusicGen takes text inputs to generate music. This model was trained on “20,000 hours of music owned by Meta or licensed specifically for this purpose.” AudioGen creates audio from written prompts, simulating barking dogs or footsteps, and was trained on public sound effects. An improved version of Meta’s EnCodec decoder lets users create sounds with fewer artifacts — which is what happens when you manipulate audio too much. The company let the media listen to some sample audio made with AudioCraft. The generated noise of whistling, sirens, and humming sounded pretty natural. While the guitar strings on the songs felt real, they still felt, well, artificial. Meta is just the latest to tackle combining music and AI. Google came up with MusicLM, a large language model that generated minutes of sounds based on text prompts and is only accessible to researchers.
About OODA Analyst
OODA is comprised of a unique team of international experts capable of providing advanced intelligence and analysis, strategy and planning support, risk and threat management, training, decision support, crisis response, and security services to global corporations and governments.