StableAudio

Audio-to-audio

40Prompting with audio
Return to User Guide

What is audio-to-audio?

41Overview

1Add audio into your AI generation process. You can upload, record or use existing AI generated audio for this.

2This audio will be added as Input audio.

3Paired with a text prompt Input audio helps guide the AI model to your output goals.

4Change the style, genres and mood to create variations.

Audio-to-audio

42Video

Types of input audio

43Types

Below are some examples of how you can add audio into the generation process to help guide your output.

Adding input audio alongside a text prompt is a great way to experiment with audio you have already created. You can make minimal to extreme changes based on the Input audio strength and Prompt strength sliders.

1
Generate audio

Generate audio

Use the audio you have already generated on Stable Audio.

2
Upload

Upload

Upload audio you have created outside of Stable Audio. This can be anything from stems, samples to full songs.

3
Record audio

Record

Use our record feature to instantly add audio.

4
Vocals

Vocals

Sing or hum what you want your generation to sound like.

Sample: Synth to ...

Samples
44Create samples

Here is an example of an uploaded synth sample that has been used as Input audio to create variations when paired with a text prompt.

Original audio:

1

Bass guitar

Input audio strength: 75%

2

Heavy metal guitar

Input audio strength: 45%

3

Upright bass

Input audio strength: 35%

Stems: Piano to ...

Stems
45Create samples

Here’s an example of an uploaded piano stem that has been used as Input audio. In each output, the piano stem has been modified to become a different instrument, used as an accompaniment and showcase a change in style.

Original audio:

1

format: solo | instruments: vibraphone

Steps: 50

Input audio strength: 65%

Prompt strength: 95%

2

Post rock, guitars, bass, strings, euphoric, up-lifting, moody, flowing, raw, epic

Steps: 50

Input audio strength: 40%

Prompt strength: 95%

3

Lofi hip hop beat, chillhop

Steps: 50

Input audio strength: 35%

Add the first generated output as input audio to generate similar results.

Vocals to ...

Voclas
46Create samples

Add or record your vocals as Input audio to guide how you want your audio output to sound. This is an effortless way to create quick backing tracks. This is a first edition beta that we are working further to optimize.

Original audio:

1

Lofi hip hop beat, chillhop

Input audio strength: 50%

2

Electronic, orchestral, relaxed, synth, soft, piano, bass, 808 bass stabs

Input audio strength: 60%

3

Genre: UK Bass | Instruments: 707 Drum Machine, Strings, 808 bass stabs, Beautiful Synths

Input audio strength: 56%

Steps: 60

Experiments

Experiments
47Create samples

This is a description on the approach you can take to sample creation. Stable Audio is really good for making, experimenting and creating variations of samples.

SFX: crisp packet

Original audio:

Racecar

Input audio strength: 35%

Vocals: hum

Original audio:

Drums

Input audio strength: 50%

Stem: synth

Original audio:

Choir

Input audio strength: 30%

SFX: racing car

Original audio:

Racing car

Input audio strength: 40%

Whistle

Original audio:

Guitar

Input audio strength: 30%

Stem: guitar

Original audio:

Instruments: Strings, Drum Kit, Electric Bass, Choir, String Section, Flute, Harp

Input audio strength: 50%

You have a monthly allowance of audio uploads.

48Monthly allowance

Depending on your subscription type you are able to upload a set amount of audio each month, so that we can check the audio you’ve uploaded against copyrighted works.

If you upload audio that infringes copyright, this upload will still count against your monthly allowance.

  • Free: 3 minutes a month with all uploads cropped at 30 seconds.
  • Pro: 30 minutes a month with all uploads cropped at 3 minutes.
  • Studio: 60 minutes a month with all uploads cropped at 3 minutes.
  • Max: 90 minutes a month with all uploads cropped at 3 minutes.

Upload details

49Upload details

The audio files you upload must belong to you, or you have been granted the right to upload to Stable Audio. There will be a copyright check on uploaded files.

Accepted file formats: MP3, WAV, MP4, AIFF.

Uploaded audio will be cropped to 3 minutes.

Min upload length is 1 seconds.

Your uploaded audio will be stored separately from generated audio and won’t be used to train any of our models.

Limited monthly uploads

410Limits

You are limited by the amount you’re able to upload each month. This is due to the 3rd part copyright check on all uploaded audio.

To find out more on how much you can upload, take a look at our pricing page.

If your uploaded audio is flagged as belonging to someone else the upload amount (e.g. 36 seconds) will come off of your monthly amount.

You can view your monthly usage within the add audio modal and on your account page.

Recording in browser

411Best

To get the best quality out of recording audio to add into the generation process on Stable Audio is by using a wired mic.

There is latency when using bluetooth headphones to record audio on the Stable Audio website of around half a second. Remember this when recording.

For the best experience use professional recording equipment and then upload.