ListenHubSkills

Creator

Generate multi-platform content packages — articles, image cards, or narration scripts with illustrations and audio — from any topic, URL, or audio file.

Creator is a ListenHub Skill — an AI agent workflow with access to file system, web browsing, image generation, and other tools. Give it a topic, URL, or audio file, and it produces a complete content package — WeChat articles with illustrations, Xiaohongshu image cards, or narration scripts with optional TTS. All outputs land in a local folder: text, images, and metadata, ready to publish.

Quick Example

WeChat article from a URL:

Write a WeChat article based on this link https://mp.weixin.qq.com/s/xxx

Creator shows a summary and waits for your confirmation:

Platform:  WeChat
Source:    https://mp.weixin.qq.com/s/xxx (article extraction)
Preset:    Flat illustration
Output:    ./ai-trends-wechat/
APIs used: Content Extraction, Image Generation

Once you confirm, Creator runs the full pipeline. After a few minutes, you get:

ai-trends-wechat/
├── article.md          # Full article with image references
├── images/
│   ├── cover.jpg       # AI-generated cover
│   ├── section-1.jpg
│   └── section-2.jpg
└── meta.json           # Title, summary, tags

Xiaohongshu cards from a topic:

Create Xiaohongshu cards about must-have items for solo apartment living
solo-living-xiaohongshu/
├── cards/
│   ├── 01-cover.jpg
│   ├── 02-page.jpg
│   ├── ...
│   └── prompts.json
├── long-text.md        # Post text with hashtags
└── meta.json

Platforms

Creator writes a structured, long-form WeChat article with clear headings and concise paragraphs, and generates a cover image plus section illustrations to match.

Three visual presets are available for illustrations — Flat, Watercolor, and Photo-Realistic. Creator picks one based on your topic, or you can specify: "use the watercolor preset".

Creator produces Xiaohongshu content in two formats: image cards (5–8 designed pages with bold text and visuals) and a long-form post (hook-first, with hashtags). You get both by default, or can request just one.

Ten visual presets are available for cards, ranging from minimal to retro to pop. Creator auto-selects based on content, or you can specify: "use the Notion preset".

Creator writes a conversational, spoken-word script with natural pacing and clear structure. Optionally, it generates a TTS audio file using an AI voice.

Supported Inputs

InputExampleWhat Creator does
URL (article/page)https://mp.weixin.qq.com/s/xxxExtracts content via API, uses it as source material
URL (audio/video)A YouTube or Bilibili linkDownloads and transcribes locally, writes from the transcript
Local audio filemeeting.mp3Transcribes with local ASR, writes from the transcript
TextA pasted paragraph or documentUses the text as source material
Topic"AI in education"Generates content from scratch

Audio/video transcription runs locally via coli. Install with npm i -g @marswave/coli. No API key needed for transcription.

API Key

Image generation, content extraction from URLs, and TTS require a ListenHub API key. Creator checks at the confirmation step and walks you through setup if needed.

Text-only pipelines (e.g., topic → narration script without audio) work without an API key.

Style Learning

Learn from a reference: Share an article, post, or script you like — say "use this as a style reference". Creator extracts the writing style and applies it to future generations.

Set rules directly: After reviewing output, tell Creator what to adjust. It saves these as persistent style rules:

  • "remember: keep WeChat paragraphs short"
  • "narration scripts should be under 800 words"
  • "小红书少用 emoji"

Style rules are saved per platform in .listenhub/creator/styles/ under your current working directory and apply automatically to future generations. To reset: "reset WeChat style" or "重置公众号风格偏好".

Trigger

Type /creator to invoke directly, or describe what you want in natural language — Creator activates when it recognizes a content generation request (e.g., "write a WeChat article", "帮我写篇公众号").

API Reference

See the OpenAPI documentation for details on the underlying content extraction, image generation, and TTS endpoints.

On this page