> ## Documentation Index
> Fetch the complete documentation index at: https://dripart-mintlify-b90d3c69.mintlify.site/llms.txt
> Use this file to discover all available pages before exploring further.

# Stable Audio 3 ComfyUI workflow example

> Guide to using Stability AI's open-source Stable Audio 3 model in ComfyUI for text-to-audio generation with Qwen prompt expansion and category-aware reprompting.

<Tip>
  <Tabs>
    <Tab title="Portable or self deployed users">
      Make sure your ComfyUI is updated.

      * [Download ComfyUI](https://www.comfy.org/download)
      * [Update Guide](/installation/update_comfyui)

      Workflows in this guide can be found in the [Workflow Templates](/interface/features/template).
      If you can't find them in the template, your ComfyUI may be outdated. (Desktop version's update will delay sometime)

      If nodes are missing when loading a workflow, possible reasons:

      1. You are not using the latest ComfyUI version (Nightly version)
      2. Some nodes failed to import at startup
    </Tab>

    <Tab title="Desktop or Cloud users">
      * The Desktop is base on ComfyUI stable release, it will auto-update when there is a new Desktop stable release available.
      * [Cloud](https://cloud.comfy.org) will update after ComfyUI stable release.

      So, if you find any core node missing in this document, it might be because the new core nodes have not yet been released in the latest stable version. Please wait for the next stable release.
    </Tab>
  </Tabs>
</Tip>

**Stable Audio 3** ([GitHub](https://github.com/Stability-AI/stable-audio-3)) is Stability AI's latest open-source audio generation model, trained on fully licensed music data and licensed for commercial use. It uses a dedicated subgraph node to produce high-quality stereo audio — including music, sound effects, and instruments — from text descriptions, with optional Qwen-powered category-aware reprompting.

Stable Audio 3 comes in three variants:

* **Small-SFX** — Sound effects and short ambiance, up to 2:00. Small enough to run on CPU.
* **Small-Music** — Short music loops, on-device-friendly, up to 2:00.
* **Medium** — Longer tracks with stronger structure and musicality, up to \~6:20. Requires a GPU.

**Related links**:

* [GitHub: Stability-AI/stable-audio-3](https://github.com/Stability-AI/stable-audio-3)
* [Hugging Face (Comfy-Org/stable-audio-3)](https://huggingface.co/Comfy-Org/stable-audio-3)
* [Blog: Announcement](https://blog.comfy.org/p/stable-audio-3-day-0-support)

## Available workflows

### Stable Audio 3 Medium

<Card title="Download Workflow" icon="download" href="https://github.com/Comfy-Org/workflow_templates/blob/main/templates/audio_stable_audio_3_medium.json">
  Download JSON or search "Stable Audio 3 Medium" in Template Library
</Card>

<Card title="Run on Comfy Cloud" icon="cloud" href="https://cloud.comfy.org/?template=audio_stable_audio_3_medium&utm_source=docs&utm_medium=referral&utm_campaign=stable-audio-3">
  Open in Comfy Cloud
</Card>

![Stable Audio 3 Medium workflow](https://raw.githubusercontent.com/Comfy-Org/workflow_templates/main/templates/audio_stable_audio_3_medium-1.webp)

The **Stable Audio 3 Medium** workflow is a full-featured text-to-audio generation pipeline. You provide a short text idea, optional duration, seed, and category — the workflow expands your prompt using Qwen with a **category-aware reprompt template**, then generates stereo audio via the Stable Audio 3 checkpoint.

**How to use**:

1. **Text idea** — Enter a short description of the sound, music, or effect you want (e.g. "upbeat electronic dance track with heavy bass")
2. **Duration** — Set the desired clip length in seconds (default varies)
3. **Seed** — Control reproducibility by adjusting the seed value
4. **Category** — Choose a reprompt preset: **Music**, **Instrument**, **SFX**, or **One-shot**
5. **Enable reprompt** — Toggle `use_reprompt` on to let Qwen expand your short idea into a detailed prompt before generation
6. Click **Run** (`Ctrl/Cmd + Enter`) to generate. The audio will be saved to `ComfyUI/output/audio/`

### Stable Audio 3 Medium Base

<Card title="Download Workflow" icon="download" href="https://github.com/Comfy-Org/workflow_templates/blob/main/templates/audio_stable_audio_3_medium_base.json">
  Download JSON or search "Stable Audio 3 Medium Base" in Template Library
</Card>

<Card title="Run on Comfy Cloud" icon="cloud" href="https://cloud.comfy.org/?template=audio_stable_audio_3_medium_base&utm_source=docs&utm_medium=referral&utm_campaign=stable-audio-3">
  Open in Comfy Cloud
</Card>

![Stable Audio 3 Medium Base workflow](https://raw.githubusercontent.com/Comfy-Org/workflow_templates/main/templates/audio_stable_audio_3_medium_base-1.webp)

A simplified version of Stable Audio 3 without Qwen reprompt expansion. It expects a complete text prompt and passes it directly to the model. Use this when you already have a detailed prompt and want faster generation.

**How to use**:

1. **Text prompt** — Enter a detailed description of the audio you want
2. **Duration** — Set the clip length in seconds
3. **Seed** — Control reproducibility
4. Click **Run** (`Ctrl/Cmd + Enter`) to generate

## Model download

When loading the workflow, ComfyUI will prompt you with download links for any missing models. To set up manually, download the files below and place them in the correct folders.

### Checkpoints

<Card title="stable_audio_3_medium.safetensors" icon="download" href="https://huggingface.co/Comfy-Org/stable-audio-3/resolve/main/checkpoints/stable_audio_3_medium.safetensors">
  For the Medium workflow. Place in models/checkpoints/
</Card>

<Card title="stable_audio_3_medium_base.safetensors" icon="download" href="https://huggingface.co/Comfy-Org/stable-audio-3/resolve/main/checkpoints/stable_audio_3_medium_base.safetensors">
  For the Medium Base workflow. Place in models/checkpoints/
</Card>

Place checkpoints in:

```
📂 ComfyUI/
├── 📂 models/
│   └── 📂 checkpoints/
│       ├── stable_audio_3_medium.safetensors
│       └── stable_audio_3_medium_base.safetensors
```

### Text encoders

<Card title="t5gemma_b_b_ul2.safetensors" icon="download" href="https://huggingface.co/Comfy-Org/stable-audio-3/resolve/main/text_encoders/t5gemma_b_b_ul2.safetensors">
  Required for all Stable Audio 3 workflows. Place in models/text\_encoders/
</Card>

<Card title="qwen3.5_2b_bf16.safetensors" icon="download" href="https://huggingface.co/Comfy-Org/Qwen3.5/resolve/main/text_encoders/qwen3.5_2b_bf16.safetensors">
  Required for the Medium workflow (Qwen reprompt). Place in models/text\_encoders/
</Card>

Place text encoders in:

```
📂 ComfyUI/
├── 📂 models/
│   └── 📂 text_encoders/
│       ├── t5gemma_b_b_ul2.safetensors
│       └── qwen3.5_2b_bf16.safetensors
```

After placing the files, press **R** in ComfyUI to refresh nodes and load the latest models.
