Music

Generate, remix, extend, and analyze AI music with Mureka — songs, instrumentals, soundtracks, stems, and lyrics.

Generate original AI music from a prompt or lyrics, remix an existing song, score an image or video, isolate a single track, or analyze audio — all powered by the Mureka provider through the listenhub music CLI.

Trigger

Invoke this skill with /music, or use any of these phrases:

Phrase	Language
`music` / `generate music` / `compose`	English
`create a song` / `cover` / `remix`	English
`instrumental` / `soundtrack` / `stem`	English
`recognize lyrics` / `extend`	English
`音乐` / `生成音乐` / `作曲` / `做一首歌`	Chinese
`翻唱` / `混音` / `续写` / `纯音乐`	Chinese
`配乐` / `分轨` / `识别歌词`	Chinese

Requires ListenHub Skills to be installed — see Getting Started.

This skill does not use speakers — music generation has no speaker selection.

Quick Example

Make a song about a summer evening by the sea

The AI confirms the capability and parameters with you, submits the job, and notifies you when the track is ready. You get a listen link, duration, and credit cost, plus a local download when output mode is set to download or both.

Capabilities

The skill splits into two groups: generation commands run asynchronously and return a task to poll; analysis commands run synchronously and return results in the same call.

generate — text and/or lyrics into a new song. At least one of prompt or lyrics is required. Optional style, title, model, instrumental toggle, and a cloned vocal-id. Async.

remix — an existing song plus new lyrics into a re-creation. Provide exactly one input source: an audio file, an audio URL, or a Mureka provider-song-id. Both lyrics and prompt are required. Optional style, title, model. Async.

instrumental — a pure instrumental with no vocals. Provide exactly one of prompt or a reference-audio file. Optional title, model. Async.

soundtrack — music scored to an image or a video. Provide exactly one of image or video. Optional prompt, title, model. Async.

track — isolate or generate a single instrument or vocal track from a song. Provide exactly one input source (audio file or provider-song-id) plus a generate-type. When the type is Vocals, lyrics is required. Optional prompt, vocal-gender, and a generate-start/generate-end range in seconds. Async.

extend — make an existing song longer. Provide one input source (audio file or provider-song-id). Optional prompt describing how to continue, and model. Async.

recognize — extract lyrics with line-level timestamps from an audio file. Sync — results return immediately.

describe — analyze an audio file into a description, tags, genres, and instruments. Sync — results return immediately.

stem — split a song into separated stems and return ZIP download URLs. Choose a separation model (audio-separation-1 or audio-separation-2). Sync — results return immediately.

A cover capability also exists but is deprecated — use remix instead unless you specifically need the older cover flow.

Two task-management commands are available any time: list shows recent tasks, and get <taskId> returns the status or result of a single task.

Models

Generation commands accept a model parameter. auto is the default and lets the service pick.

Model	Notes
`auto`	Default — service selects the model
`mureka-7.6`	Mureka 7.6
`mureka-8`	Mureka 8
`mureka-9`	Mureka 9
`mureka-o2`	Mureka o2

Analysis is different. The stem command takes a separation model instead — audio-separation-1 or audio-separation-2. The recognize and describe commands take no model.

Parameters

Parameters apply per capability. Provide only what each command needs; the AI asks for required inputs and offers optional ones.

Parameter	Applies to	Notes
`prompt`	generate, remix, instrumental, soundtrack, track, extend	Free text describing the music or direction
`lyrics`	generate, remix, track (Vocals only)	Song lyrics
`style`	generate, remix	Genre or mood, e.g. `city pop`
`title`	generate, remix, instrumental, soundtrack	Track title; auto-generated if omitted
`model`	generation commands	One of the models above; defaults to `auto`
`instrumental`	generate	Toggle vocals off
`vocal-id`	generate	A cloned voice id
`audio` / `audio-url` / `provider-song-id`	remix, track, extend	The input song; supply exactly one
`reference-audio`	instrumental	Reference audio file (alternative to `prompt`)
`image` / `video`	soundtrack	Source media; supply exactly one
`generate-type`	track	One of `Vocals`, `Instrumental`, `Drums`, `Bass`, `Guitar`, `Keyboard`, `Percussion`, `Strings`, `Synth`, `FX`, `Brass`, `Woodwinds`
`vocal-gender`	track	`male` or `female`
`generate-start` / `generate-end`	track	Time range in seconds

File limits

All input files are capped at 10 MB. Accepted formats by type:

Type	Formats
Audio	`mp3`, `m4a` (the `track` command also accepts `wav`)
Image	`jpg`, `jpeg`, `png`, `webp`
Video	`mp4`, `mov`, `avi`, `mkv`, `webm`

You can also pass a URL instead of a local file where the command supports it; the CLI validates it on submission.

Music generation is slow — expect roughly 5 to 10 minutes per track. The AI submits the job in the background and notifies you on completion. If you only have a task id, check progress with listenhub music get <taskId> --json or browse listenhub music list --json.

Output

Output behavior follows the outputMode set during config:

inline or both — the audio URL is shown as a clickable listen link, alongside the title, duration, and credit cost.
download or both — the file is also saved to the current working directory with a friendly, topic-based name (e.g. summer-breeze.mp3). Names are de-duplicated automatically.

For stem, the result is one or more ZIP download URLs; in download or both mode they are fetched to the current directory. For recognize and describe, the result is shown directly in the conversation.

Each completed task reports its creditCost. To estimate or check credits, see the credits notes in the Music API reference and your balance via GET /v1/user/subscription.

CLI Commands

The skill drives the listenhub music command group. The common shape:

# Generate a song (async — polls until ready)
listenhub music generate --prompt "upbeat summer pop about the sea" --json

# Remix an existing file with new lyrics
listenhub music remix --audio demo.mp3 --lyrics "..." --prompt "rework as city pop" --json

# Pure instrumental
listenhub music instrumental --prompt "electronic track for a game intro" --json

# Soundtrack for a video
listenhub music soundtrack --video clip.mp4 --prompt "tense, suspenseful score" --json

# Analyze audio (sync)
listenhub music recognize --audio song.mp3 --json
listenhub music describe  --audio song.mp3 --json
listenhub music stem      --audio track.mp3 --model audio-separation-2 --json

# Task management
listenhub music list --json
listenhub music get <taskId> --json

Global flags apply: --json / -j for machine output, --no-wait to skip polling, --timeout <s> to bound it.

API Reference

See the Music API reference for endpoint paths, request parameters, response fields, and credit handling.

Music

On this page