Stability AI launches Stable Audio Open - An open source model for audio samples and sound design

Advertisements

Stability AI, the company behind the popular AI art generator Stable Diffusion, has introduced a new open-source AI model, Stable Audio Open. This model is designed to generate sounds and songs, trained exclusively on royalty-free recordings to avoid copyright issues.

Key Takeaways

Stable Audio Open is an open source text-to-audio model for generating up to 47 seconds of samples and sound effects.
Users can create drum beats, instrument riffs, ambient sounds, foley and production elements.
The model enables audio variations and style transfer of audio samples.

Model capabilities and training

Stable Audio Open allows users to input text descriptions, such as “Rock beat played in a treated studio, session drumming on an acoustic kit,” to generate audio clips up to 47 seconds long. The model was trained using approximately 486,000 samples from free music libraries such as FreeSound and the Free Music Archive.

While it is adept at creating drum beats, instrument riffs, and ambient noises, it is not optimized to produce full songs, melodies, or vocals.

Controversies and business context

The release of Stable Audio Open comes amidst controversies surrounding Stability AI, especially regarding the use of copyrighted content in training AI models. This new model aims to shift focus by using only royalty-free sources. This move is also seen as part of Stability AI’s strategy to promote its premium services, like the Stable Audio service, which offers more advanced capabilities.

The launch of Stable Audio Open highlights ongoing concerns about copyright in the realm of AI-generated content. Recent actions by major music labels and new legislation, such as the law signed in Tennessee, underscore the industry’s focus on regulating AI applications in music to protect creators’ rights.

Stability AI’s introduction of an open-source, royalty-free audio model is a significant step towards addressing these legal and ethical challenges, providing a tool that encourages creative expression while respecting copyright laws.