StableAudio

Text-to-audio

Prompting the AI model
What is text-to-audio?

Overview

Text-to-audio is how you communicate with the Stable Audio models via text.

The generative AI model will output audio based on the natural language instructions you give it.

The Stable Audio model performs best when you give it musical descriptions based on genre, sub-genre, mood and instrument type.

The input text is called a ‘prompt’.

Text prompt

Prompts

Below are a few basic tips on how to prompt Stable Audio. We’ve broken the prompt down into four sections: detail, mood, instruments and BPM.

With Stable Audio, you describe the audio you want with your text prompt, and the generative model creates an audio output for you. This is what works for us - we encourage you to experiment and find out what works for you!

Add detail

If you have something specific in mind, include it. Genres, descriptive phrases, instruments and moods work particularly well. The more detail, the better.

For example, a detailed prompt might look something like this:

Cinematic, Soundtrack, Wild West, High Noon Shoot Out, Percussion, Whistles, Horses, Action Scene, SFX, Shaker, Guitar, Bass, Timpani, Strings, Tense, Climactic, Atmospheric, Moody

Set the mood

When including detail on the mood you want, try using a combination of musical and emotional terms.

Musical terms might be groovy or rhythmic. Emotional terms might be sad or beautiful.

Choose instruments

We’ve found that adding adjectives to instrument names is helpful.

For example, Reverberated Guitar, Powerful Choir, or Swelling Strings.

Set the BPM

Setting the beats per minute (BPM) is a great way to ensure your output is at the tempo you want, and can help keep it in time. The key here is to stick to BPM settings that are appropriate to the genre you’re generating.

For example, if you were generating a Drum and Bass track, you might want to add 170 BPM to your prompt.
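The four sections above can be combined mechanically. Here is a minimal Python sketch that joins them into one comma-separated prompt, following the order most of the examples in this guide use (genre and detail, then instruments, then mood, then BPM). The `build_prompt` helper and its parameter names are hypothetical and not part of Stable Audio; it only produces text that you would paste into the prompt field.

```python
from typing import Optional, Sequence


def build_prompt(
    details: Sequence[str],
    moods: Sequence[str] = (),
    instruments: Sequence[str] = (),
    bpm: Optional[int] = None,
) -> str:
    """Join detail, instrument, mood and BPM terms into one prompt string.

    Hypothetical helper for illustration only; Stable Audio itself just
    takes the resulting text.
    """
    parts = [*details, *instruments, *moods]
    # Trim whitespace and drop entries that are empty after trimming.
    parts = [p.strip() for p in parts if p and p.strip()]
    if bpm is not None:
        if bpm <= 0:
            raise ValueError("bpm must be a positive integer")
        parts.append(f"{bpm} BPM")
    return ", ".join(parts)


prompt = build_prompt(
    details=["Drum and Bass"],
    moods=["groovy", "sad"],
    instruments=["Reverberated Guitar", "Swelling Strings"],
    bpm=170,
)
print(prompt)
# Drum and Bass, Reverberated Guitar, Swelling Strings, groovy, sad, 170 BPM
```

Keeping the sections as separate lists makes it easy to swap one mood or instrument at a time and compare the results while you experiment.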

Text-to-audio examples:
Full instrumentals

Audio examples

Use Stable Audio to generate full musical audio encompassing a range of instruments. Include as much detail as you can!

1

Soulful Boom Bap Hip Hop instrumental, Solemn effected Piano, SP-1200, low-key swing drums, sine wave bass, Characterful, Peaceful, Interesting, well-arranged composition, 90 BPM

2

Trance, Ibiza, Beach, Sun, 4 AM, Progressive, Synthesizer, 909, Dramatic chords, Choir, Euphoric, Nostalgic, Dynamic, Flowing

3

Post Rock, echoing electric guitars with chorus, well recorded drum-kit, Electric Bass, occasional soaring harmonies, Moving, Epic, Climactic, 125 BPM

4

Nu-Disco, funky emotional Piano, lush string quartet, well layered Drum Machine, well-arranged composition, funky G-Funk bass, Synthesizers, Modern, Club-orientated, 115 BPM

5

Synthpop, Big Reverbed Synthesizer Pad Chords, Driving Gated Drum Machine, Atmospheric, Moody, Nostalgic, Cool, Club, Stripped-back, Pop Instrumental, 100 BPM

6

Post-Rock, Guitars, Drum Kit, Bass, Strings, Euphoric, Up-Lifting, Moody, Flowing, Raw, Epic, Sentimental, 125 BPM

7

Ambient Techno, meditation, Scandinavian Forest, 808 drum machine, 808 kick, claps, shaker, synthesizer, synth bass, Synth Drones, beautiful, peaceful, Ethereal, Natural, 122 BPM, Instrumental

8

Warm soft hug, comfort, low synths, twinkle, wind and leaves, ambient, peace, relaxed, water

9

Lofi hip hop beat, chillhop

10

Disco, Driving Drum Machine, Synthesizer, Bass, Piano, Guitars, Instrumental, Clubby, Euphoric, Chicago, New York, 115 BPM

11

Cyberpunk, Country Instrumental, Synthwave

12

Ambient house, 808 drum machine, 808 kick, claps, shaker, synthesizer, synth bass, modern, futuristic, Dancy, Euphoric, 125 BPM

13

Calm meditation music to play in a spa lobby

14

3/4, in 3, 3 beat, guitar, drums, bright, happy, claps

15

Cinematic synthwave

16

Trip Hop, drum kit, bass, electric guitar, bass guitar, synthesizer, cool, moody, atmospheric, dreamy, groovy, introspective, thoughtful, beautiful, well-arranged composition, expansive, epic, 85 BPM

17

Pop, pop-electronic, ballad, billboard, drum machine, bass, lush synthesizer pads, synthesizer arp, synth bass, percussion, honest, heart-felt, melancholic, vibe, cool, modern, atmospheric, well-arranged composition, 115 BPM

18

Electronica, instrumental, arcade, vintage drum machine, rhodes piano, brass stabs, inspiring, beautiful, up-lifting, epic, flowing, vibe, cool

Text-to-audio examples:
Individual stems

Audio examples

You can also use Stable Audio to generate individual stems featuring a single instrument or group of instruments.

Just specify what you want in your prompt.

1

Solo electric guitar, classic rock, clean, rhythm, soft

2

A beautiful piano arpeggio grows to a full beautiful orchestral piece

3

Drum solo

4

Piano, beautiful, clean, soft, building

5

Drums, Bass, 808 bass stabs

6

Computer, drums, electronics

7

Folk, live, atmospheric, soulful, acoustic guitar, smooth, soft

Text-to-audio examples:
Sound effects

Audio examples

You can also use Stable Audio to generate sound effects.

Just specify what you want in your prompt.

1

Ringtone

2

Explosion

3

Car passing by

4

Fireworks, 44.1k high fidelity