Stability AI releases a new audio model that can create six-minute songs
Stability AI has launched a new family of audio models named Stability Audio 3.0, capable of generating professional-grade music over six minutes long. The new models include small, medium, and large variants, with the latter two able to create full compositions while maintaining musical structure. The company has also secured partnerships with major music labels to ensure the models are built on fully licensed data.
- ▪Stability Audio 3.0 includes four new models: small SFX, small, medium, and large.
- ▪The medium and large models can generate music compositions of up to 6 minutes and 20 seconds.
- ▪Stability AI has partnered with Warner Music Group and Universal Music Group for model development.
Opening excerpt (first ~120 words) tap to expand
Stability AI, the company behind Stable Diffusion, is releasing a new family of audio models, called Stability Audio 3.0. The top model can generate professional-grade music of more than six minutes long, the company claimed. The company is releasing four new models under the Stable Audio 3.0 name: small SFX (459M parameters), small (459M parameters), medium (1.4B parameters), and large (2.7B parameters). The duo of small models is suitable for on-device sound and music generation of up to two minutes. Both medium and large models can create full compositions of 6 minutes 20 seconds long that can maintain musical structure and melodic tone. This is more than double the length of what Stable Audio 2.0, released in 2024, was capable of generating.
…
Excerpt limited to ~120 words for fair-use compliance. The full article is at TechCrunch.