Stability AI, the company behind the popular AI art generator Stable Diffusion, has introduced a new open-source AI model, Stable Audio Open. The model generates sounds and short music clips and was trained exclusively on royalty-free recordings to avoid copyright issues.
Key Takeaways
- Stable Audio Open is an open-source text-to-audio model that generates samples and sound effects up to 47 seconds long.
- Users can create drum beats, instrument riffs, ambient sounds, foley and production elements.
- The model enables audio variations and style transfer of audio samples.
![Stable Audio Open](https://i0.wp.com/nosisnews.com/wp-content/uploads/2024/06/image-10.png?resize=1024%2C507&ssl=1)
Model capabilities and training
Stable Audio Open takes a text description, such as “Rock beat played in a treated studio, session drumming on an acoustic kit,” and generates an audio clip up to 47 seconds long. The model was trained on approximately 486,000 samples from free music libraries such as Freesound and the Free Music Archive.
While it is adept at creating drum beats, instrument riffs, and ambient noises, it is not optimized to produce full songs, melodies, or vocals.
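To make the prompt-and-duration workflow concrete, here is a minimal Python sketch of how a client might prepare a request before handing it to the model. It does not call Stable Audio Open itself. The names `AudioRequest` and `build_request` are hypothetical, and the 44.1 kHz sample rate is an assumption that the article does not state; only the 47-second cap and the example prompt come from the text above.

```python
from dataclasses import dataclass

MAX_CLIP_SECONDS = 47.0        # clip-length limit stated in the article
ASSUMED_SAMPLE_RATE = 44_100   # assumption: sample rate is not given in the article


@dataclass(frozen=True)
class AudioRequest:
    """Hypothetical container for one text-to-audio request."""
    prompt: str
    seconds: float

    @property
    def sample_count(self) -> int:
        """Samples per channel at the assumed sample rate."""
        return round(self.seconds * ASSUMED_SAMPLE_RATE)


def build_request(prompt: str, seconds: float) -> AudioRequest:
    """Normalize the prompt and clamp the duration to the 47-second cap."""
    cleaned = " ".join(prompt.split())  # collapse stray whitespace
    if not cleaned:
        raise ValueError("prompt must not be empty")
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return AudioRequest(cleaned, min(float(seconds), MAX_CLIP_SECONDS))


# Asking for 60 seconds is clamped down to the 47-second limit.
request = build_request(
    "Rock beat played in a treated studio, session drumming on an acoustic kit",
    seconds=60,
)
print(request.seconds, request.sample_count)
```

Clamping rather than rejecting an over-long request is a design choice made here for illustration; a real client might prefer to raise an error so the user knows the clip was shortened.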
The model has limitations. Its terms of service prohibit commercial use. Its output quality also varies across musical styles and cultures, and it handles descriptions in non-English languages less reliably, a result of biases in the training data.
Controversies and business context
The release of Stable Audio Open comes amid controversies surrounding Stability AI, especially over the use of copyrighted content to train AI models. By using only royalty-free sources, the company aims to shift the focus away from those disputes. The release is also seen as part of Stability AI’s strategy to promote its premium offerings, such as the Stable Audio service, which has more advanced capabilities.
The launch of Stable Audio Open highlights ongoing concerns about copyright in the realm of AI-generated content. Recent actions by major music labels and new legislation, such as the law signed in Tennessee, underscore the industry’s focus on regulating AI applications in music to protect creators’ rights.
Stability AI’s introduction of an open-source, royalty-free audio model is a significant step towards addressing these legal and ethical challenges, providing a tool that encourages creative expression while respecting copyright laws.