Stability AI’s Stable Audio update enables full-length song production from text or audio

foodvox

April 4, 2024

Artificial intelligence developer Stability AI has unveiled Stable Audio 2.0, the next iteration of its text-to-music generation system.

The latest version offers artists and musicians a wider range of creative tools and the ability to produce full-length music tracks “with conventional song structure and high audio quality” using natural language prompts, the company said Wednesday (April 3).

Stable Audio 1.0, released last September, captured attention with its ability to craft short audio clips based on textual descriptions. It was named one of TIME’s Best Inventions of 2023.

The new version expands on this foundation, allowing users to generate full songs up to three minutes long at 44.1 kHz stereo. This extended timeframe opens doors for a greater variety of musical creations, from full instrumentals to structured compositions with intros, development sections, and outros.
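To put those figures in perspective, the short Python sketch below works out how much raw audio a three-minute, 44.1 kHz stereo track represents. The sample rate, channel count, and duration come from the announcement; the 16-bit sample depth is an assumption made only for illustration, since no bit depth is stated.

```python
SAMPLE_RATE_HZ = 44_100   # from the announcement
CHANNELS = 2              # stereo, from the announcement
DURATION_S = 3 * 60       # the three-minute upper bound
BYTES_PER_SAMPLE = 2      # assumption: 16-bit PCM; no bit depth is stated


def raw_audio_size(duration_s: int = DURATION_S) -> tuple[int, int]:
    """Return (samples per channel, total uncompressed bytes) for one track."""
    samples_per_channel = SAMPLE_RATE_HZ * duration_s
    total_bytes = samples_per_channel * CHANNELS * BYTES_PER_SAMPLE
    return samples_per_channel, total_bytes


samples, size_bytes = raw_audio_size()
print(f"{samples:,} samples per channel, {size_bytes / 1e6:.1f} MB uncompressed")
# prints: 7,938,000 samples per channel, 31.8 MB uncompressed
```

Under these assumptions, a single full-length output is nearly eight million samples per channel, which is the scale of data the model has to keep coherent from intro to outro.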

“Stable Audio 2.0 sets a new standard in AI-generated audio,” Stability AI said in a blog post. “The new model introduces audio-to-audio generation by allowing users to upload and transform samples using natural language prompts.”

Beyond the increased length, Stable Audio 2.0 also offers other features, including new “audio-to-audio” capabilities that let users upload their own audio samples to set the style and sound of AI-generated outputs.

“Our most advanced audio model yet expands the creative toolkit for artists and musicians with its new functionalities. With both text-to-audio and audio-to-audio prompting, users can produce melodies, backing tracks, stems, and sound effects, thus enhancing the creative process,” Stability AI said.

The release of Stable Audio 2.0 comes amid a period of internal change at Stability AI. Ed Newton-Rex, the company’s former Vice President of Audio, recently departed due to disagreements over the use of copyrighted materials in training datasets.

“Companies worth billions of dollars are, without permission, training generative AI models on creators’ works, which are then being used to create new content that in many cases can compete with the original works. I don’t see how this can be acceptable in a society that has set up the economics of the creative arts such that creators rely on copyright,” Newton-Rex, who helped develop Stable Audio, said in a public resignation letter. He has since launched an initiative to evaluate and certify AI models based on their respect for creators’ rights.

Stability AI addressed copyright concerns about its AI development, saying, “Stable Audio 2.0 was exclusively trained on a licensed dataset from the AudioSparx music library, honoring opt-out requests and ensuring fair compensation for creators.”

The 1.0 model was also trained using data from AudioSparx, which consists of over 800,000 audio files containing music, sound effects, and single-instrument stems, along with corresponding text metadata.

The update also integrates Audible Magic to scan audio uploads for copyright infringement. Audible Magic offers content recognition technology that matches content in real time to prevent copyright infringement.

Stable Audio 2.0 also introduces features like Style Transfer to match generated or uploaded audio to existing tracks, SFX creation, and variations.

“Stable Audio 2.0 is one of the most powerful and versatile generative AI music tools available and makes it possible for musicians, producers, and other creators to use AI as a collaborative tool for music composition, audio experimentation, and content creation, like never before,” Stability AI said in a statement.

Stability AI also offers technical details about the model’s architecture, explaining its effectiveness in producing high-quality musical compositions.

“A new, highly compressed autoencoder compresses raw audio waveforms into much shorter representations. For the diffusion model, we employ a diffusion transformer (DiT), akin to that used in Stable Diffusion 3, in place of the previous U-Net, as it is more adept at manipulating data over long sequences.

“The combination of these two elements results in a model capable of recognizing and reproducing the large-scale structures that are essential for high-quality musical compositions.”
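A rough sketch can show why the compression step matters for a transformer. The Python below estimates how many positions the model would have to process for a three-minute track before and after compression. The downsampling factor of 2048 is a hypothetical value chosen for illustration, as the quoted statement gives no ratio. The pairwise count also assumes standard full self-attention, in which every position is compared with every other, and the statement does not say which attention variant the DiT uses.

```python
import math

SAMPLE_RATE_HZ = 44_100        # from the announcement
DURATION_S = 180               # three-minute upper bound
DOWNSAMPLING_FACTOR = 2048     # hypothetical: the real ratio is not stated


def latent_length(num_samples: int, factor: int) -> int:
    """Length of the compressed sequence if each latent covers `factor` samples."""
    return math.ceil(num_samples / factor)


def attention_pairs(seq_len: int) -> int:
    """Position pairs compared by full self-attention (grows quadratically)."""
    return seq_len * seq_len


num_samples = SAMPLE_RATE_HZ * DURATION_S
num_latents = latent_length(num_samples, DOWNSAMPLING_FACTOR)

print(f"raw waveform:   {num_samples:,} positions per channel")
print(f"latent space:   {num_latents:,} positions")
print(f"pair reduction: {attention_pairs(num_samples) // attention_pairs(num_latents):,}x")
```

Because the pairwise cost grows with the square of the sequence length, shortening the sequence by some factor cuts that cost by roughly the square of the same factor. That is one plausible reading of why a heavily compressed representation pairs well with a long-sequence model.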

The new model is available to use for free on the Stable Audio website and will soon be available on the Stable Audio API.

Stability AI has also launched Stable Radio, a 24/7 live stream that features tracks generated by Stable Audio.

Music Business Worldwide
