Music
Generate, remix, extend, and analyze AI music with Mureka — songs, instrumentals, soundtracks, stems, and lyrics.
Generate original AI music from a prompt or lyrics, remix an existing song, score an image or video, isolate a single track, or analyze audio — all powered by the Mureka provider through the listenhub music CLI.
Trigger
Invoke this skill with /music, or use any of these phrases:
| Phrase | Language |
|---|---|
| music / generate music / compose | English |
| create a song / cover / remix | English |
| instrumental / soundtrack / stem | English |
| recognize lyrics / extend | English |
| 音乐 / 生成音乐 / 作曲 / 做一首歌 | Chinese |
| 翻唱 / 混音 / 续写 / 纯音乐 | Chinese |
| 配乐 / 分轨 / 识别歌词 | Chinese |
Requires ListenHub Skills to be installed — see Getting Started.
This skill does not use speakers — music generation has no speaker selection.
Quick Example
Make a song about a summer evening by the sea
The AI confirms the capability and parameters with you, submits the job, and notifies you when the track is ready. You get a listen link, duration, and credit cost, plus a local download when output mode is set to download or both.
Capabilities
The skill splits into two groups: generation commands run asynchronously and return a task to poll; analysis commands run synchronously and return results in the same call.
generate — text and/or lyrics into a new song. At least one of prompt or lyrics is required. Optional style, title, model, instrumental toggle, and a cloned vocal-id. Async.
remix — an existing song plus new lyrics into a re-creation. Provide exactly one input source: an audio file, an audio URL, or a Mureka provider-song-id. Both lyrics and prompt are required. Optional style, title, model. Async.
instrumental — a pure instrumental with no vocals. Provide exactly one of prompt or a reference-audio file. Optional title, model. Async.
soundtrack — music scored to an image or a video. Provide exactly one of image or video. Optional prompt, title, model. Async.
track — isolate or generate a single instrument or vocal track from a song. Provide exactly one input source (audio file or provider-song-id) plus a generate-type. When the type is Vocals, lyrics is required. Optional prompt, vocal-gender, and a generate-start/generate-end range in seconds. Async.
extend — make an existing song longer. Provide one input source (audio file or provider-song-id). Optional prompt describing how to continue, and model. Async.
recognize — extract lyrics with line-level timestamps from an audio file. Sync — results return immediately.
describe — analyze an audio file into a description, tags, genres, and instruments. Sync — results return immediately.
stem — split a song into separated stems and return ZIP download URLs. Choose a separation model (audio-separation-1 or audio-separation-2). Sync — results return immediately.
A cover capability also exists but is deprecated — use remix instead unless you specifically need the older cover flow.
Two task-management commands are available any time: list shows recent tasks, and get <taskId> returns the status or result of a single task.
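Several capabilities above require exactly one input source. A minimal sketch of that rule as a pre-flight check in a wrapper script follows. The helper name exactly_one and the variable names are hypothetical and not part of the listenhub CLI, which performs its own validation.

```shell
# exactly_one: hypothetical helper. Succeeds only when exactly one of its
# arguments is a non-empty string (an empty string means "not supplied").
exactly_one() {
  count=0
  for value in "$@"; do
    if [ -n "$value" ]; then
      count=$((count + 1))
    fi
  done
  [ "$count" -eq 1 ]
}

# Example: the three possible input sources for remix.
audio="demo.mp3"
audio_url=""
provider_song_id=""

if exactly_one "$audio" "$audio_url" "$provider_song_id"; then
  echo "ok: one input source"
else
  echo "error: supply exactly one of audio, audio-url, provider-song-id" >&2
fi
```

The same helper covers the prompt or reference-audio choice for instrumental and the image or video choice for soundtrack.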
Models
Generation commands accept a model parameter. auto is the default and lets the service pick.
| Model | Notes |
|---|---|
| auto | Default; the service selects the model |
| mureka-7.6 | Mureka 7.6 |
| mureka-8 | Mureka 8 |
| mureka-9 | Mureka 9 |
| mureka-o2 | Mureka o2 |
Analysis is different. The stem command takes a separation model instead — audio-separation-1 or audio-separation-2. The recognize and describe commands take no model.
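To keep the two model families apart in a script, one option is a pair of small lookup helpers. This is only a sketch: is_generation_model and is_separation_model are hypothetical names, and the accepted values are copied from this page rather than queried from the service.

```shell
# Hypothetical helpers; the value lists mirror the models named on this page.
is_generation_model() {
  case "$1" in
    auto|mureka-7.6|mureka-8|mureka-9|mureka-o2) return 0 ;;
    *) return 1 ;;
  esac
}

is_separation_model() {
  case "$1" in
    audio-separation-1|audio-separation-2) return 0 ;;
    *) return 1 ;;
  esac
}

# Classify a few sample values.
for m in auto mureka-9 audio-separation-1 something-else; do
  if is_generation_model "$m"; then
    echo "$m: generation model"
  elif is_separation_model "$m"; then
    echo "$m: separation model (stem only)"
  else
    echo "$m: unknown"
  fi
done
```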
Parameters
Parameters apply per capability. Provide only what each command needs; the AI asks for required inputs and offers optional ones.
| Parameter | Applies to | Notes |
|---|---|---|
| prompt | generate, remix, instrumental, soundtrack, track, extend | Free text describing the music or direction |
| lyrics | generate, remix, track (Vocals only) | Song lyrics |
| style | generate, remix | Genre or mood, e.g. city pop |
| title | generate, remix, instrumental, soundtrack | Track title; auto-generated if omitted |
| model | generation commands | One of the models above; defaults to auto |
| instrumental | generate | Toggle vocals off |
| vocal-id | generate | A cloned voice id |
| audio / audio-url / provider-song-id | remix, track, extend | The input song; supply exactly one |
| reference-audio | instrumental | Reference audio file (alternative to prompt) |
| image / video | soundtrack | Source media; supply exactly one |
| generate-type | track | One of Vocals, Instrumental, Drums, Bass, Guitar, Keyboard, Percussion, Strings, Synth, FX, Brass, Woodwinds |
| vocal-gender | track | male or female |
| generate-start / generate-end | track | Time range in seconds |
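The track parameters carry one conditional rule: lyrics becomes required when generate-type is Vocals. A rough sketch of checking this before submission is shown below. The function names are hypothetical, and matching the type names case-sensitively is an assumption; the CLI may be more lenient.

```shell
# Hypothetical helper: accepts only the generate-type values listed on this page.
is_generate_type() {
  case "$1" in
    Vocals|Instrumental|Drums|Bass|Guitar|Keyboard|Percussion|Strings|Synth|FX|Brass|Woodwinds)
      return 0 ;;
    *)
      return 1 ;;
  esac
}

# check_track_args GENERATE_TYPE LYRICS
# Fails when the type is unknown, or when Vocals is requested without lyrics.
check_track_args() {
  generate_type=$1
  lyrics=$2
  if ! is_generate_type "$generate_type"; then
    echo "unknown generate-type: $generate_type" >&2
    return 1
  fi
  if [ "$generate_type" = "Vocals" ] && [ -z "$lyrics" ]; then
    echo "lyrics is required when generate-type is Vocals" >&2
    return 1
  fi
  return 0
}

if check_track_args "Drums" ""; then
  echo "Drums without lyrics: ok"
fi
```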
File limits
All input files are capped at 10 MB. Accepted formats by type:
| Type | Formats |
|---|---|
| Audio | mp3, m4a (the track command also accepts wav) |
| Image | jpg, jpeg, png, webp |
| Video | mp4, mov, avi, mkv, webm |
You can also pass a URL instead of a local file where the command supports it; the CLI validates it on submission.
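A local pre-check can catch oversized or unsupported audio files before a job is submitted. The sketch below assumes that 10 MB means 10 × 1024 × 1024 bytes (the service may count differently) and judges the format by file extension only. check_audio_file is a hypothetical helper, not a CLI feature.

```shell
# Assumption: the 10 MB cap is 10 * 1024 * 1024 bytes.
MAX_BYTES=$((10 * 1024 * 1024))

# check_audio_file FILE COMMAND_NAME
# COMMAND_NAME is the music subcommand the file is meant for (e.g. remix, track).
check_audio_file() {
  file=$1
  command_name=$2

  if [ ! -f "$file" ]; then
    echo "not a file: $file" >&2
    return 1
  fi

  # Lower-case the extension so "Song.MP3" is treated like "song.mp3".
  ext=$(printf '%s' "${file##*.}" | tr '[:upper:]' '[:lower:]')
  case "$ext" in
    mp3|m4a) ;;
    wav)
      if [ "$command_name" != "track" ]; then
        echo "wav is only accepted by the track command" >&2
        return 1
      fi
      ;;
    *)
      echo "unsupported audio format: $ext" >&2
      return 1
      ;;
  esac

  # wc -c prints the size in bytes; strip any padding some platforms add.
  size=$(wc -c < "$file" | tr -d '[:space:]')
  if [ "$size" -gt "$MAX_BYTES" ]; then
    echo "file exceeds the 10 MB cap: $file" >&2
    return 1
  fi
  return 0
}

# Usage (with a real file in place):
#   check_audio_file demo.mp3 remix && echo "demo.mp3 looks acceptable"
```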
Music generation is slow — expect roughly 5 to 10 minutes per track. The AI submits the job in the background and notifies you on completion. If you only have a task id, check progress with listenhub music get <taskId> --json or browse listenhub music list --json.
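If you script the waiting yourself, a generic retry loop is one way to do it. The sketch below is not the CLI's own polling logic. poll_until is a hypothetical helper that re-runs any command until it succeeds or the attempts run out. The readiness check is left to you, because the JSON fields returned by listenhub music get are described in the Music API reference rather than on this page.

```shell
# poll_until MAX_ATTEMPTS INTERVAL_SECONDS COMMAND [ARGS...]
# Re-runs COMMAND until it exits 0; returns 1 if every attempt fails.
poll_until() {
  max_attempts=$1
  interval=$2
  shift 2
  attempt=1
  while [ "$attempt" -le "$max_attempts" ]; do
    if "$@"; then
      return 0
    fi
    attempt=$((attempt + 1))
    # Sleep only when another attempt will follow.
    if [ "$attempt" -le "$max_attempts" ]; then
      sleep "$interval"
    fi
  done
  return 1
}

# Usage sketch: task_is_done is a placeholder you would write yourself, e.g.
# a function that runs `listenhub music get "$1" --json` and inspects the result.
#   poll_until 40 30 task_is_done "$task_id"   # up to 40 checks, 30 s apart
```

Forty checks at 30-second intervals cover roughly 20 minutes, which leaves headroom over the 5 to 10 minutes a track usually takes.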
Output
Output behavior follows the outputMode set during config:
inline or both: the audio URL is shown as a clickable listen link, alongside the title, duration, and credit cost.
download or both: the file is also saved to the current working directory with a friendly, topic-based name (e.g. summer-breeze.mp3). Names are de-duplicated automatically.
For stem, the result is one or more ZIP download URLs; in download or both mode they are fetched to the current directory. For recognize and describe, the result is shown directly in the conversation.
Each completed task reports its creditCost. To estimate costs, see the credits notes in the Music API reference; to check your balance, call GET /v1/user/subscription.
CLI Commands
The skill drives the listenhub music command group. The common shape:
# Generate a song (async — polls until ready)
listenhub music generate --prompt "upbeat summer pop about the sea" --json
# Remix an existing file with new lyrics
listenhub music remix --audio demo.mp3 --lyrics "..." --prompt "rework as city pop" --json
# Pure instrumental
listenhub music instrumental --prompt "electronic track for a game intro" --json
# Soundtrack for a video
listenhub music soundtrack --video clip.mp4 --prompt "tense, suspenseful score" --json
# Analyze audio (sync)
listenhub music recognize --audio song.mp3 --json
listenhub music describe --audio song.mp3 --json
listenhub music stem --audio track.mp3 --model audio-separation-2 --json
# Task management
listenhub music list --json
listenhub music get <taskId> --json
Global flags apply: --json / -j for machine output, --no-wait to skip polling, --timeout <s> to bound it.
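The global flags combine with any of the commands above. The following sketch shows two combinations. It uses a hypothetical run wrapper with a DRY_RUN switch so that the commands are printed rather than executed; unset DRY_RUN to actually invoke the CLI. The 900-second timeout is an arbitrary example value.

```shell
# run: hypothetical wrapper. With DRY_RUN set, it prints the command line
# instead of executing it, so the sketch works without the CLI installed.
DRY_RUN=1

run() {
  if [ -n "$DRY_RUN" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

# Submit and return immediately instead of polling.
run listenhub music generate --prompt "upbeat summer pop about the sea" --no-wait --json

# Keep polling, but give up after 900 seconds.
run listenhub music instrumental --prompt "electronic track for a game intro" --timeout 900 --json
```

After a --no-wait submission, the task can be checked later with listenhub music get <taskId> --json.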
API Reference
See the Music API reference for endpoint paths, request parameters, response fields, and credit handling.