TheSequence

Edge 328: Inside AudioCraft: Meta AI’s New Family of Generative Audio Models

A review of Meta's EnCodec, AudioGen and MusicGen models.

Sep 21, 2023

Audio is rapidly becoming one of the new frontiers of generative AI. Generating high-fidelity audio requires modeling intricate signals and patterns at several scales at once. Among audio types, music is especially difficult because it combines local and long-range patterns, from individual notes to complex structures involving multiple instruments. Conventional approaches rely on symbolic representations such as MIDI or piano rolls, but these fall short of capturing the expressive nuances and stylistic richness intrinsic to music. Meta AI recently introduced AudioCraft, a family of generative AI models for high-quality audio generation.

© 2025 Jesus Rodriguez