jpskill.com
💬 コミュニケーション コミュニティ

ai-podcast-creation

AI音声合成と音楽生成を組み合わせ、複数話者の会話やBGM付きのポッドキャストを自動作成するSkill。

📜 元の英語説明(参考)

Create AI-powered podcasts with text-to-speech, music, and audio editing. Tools: Kokoro TTS, DIA TTS, Chatterbox, AI music generation, media merger. Capabilities: multi-voice conversations, background music, intro/outro, full episodes. Use for: podcast production, audiobooks, voice content, audio newsletters. Triggers: podcast, ai podcast, text to speech podcast, audio content, voice over, ai audiobook, multi voice, conversation ai, notebooklm alternative, audio generation, podcast automation, ai narrator, voice content, audio newsletter, podcast maker

🇯🇵 日本人クリエイター向け解説

一言でいうと

AI音声合成と音楽生成を組み合わせ、複数話者の会話やBGM付きのポッドキャストを自動作成するSkill。

※ jpskill.com 編集部が日本のビジネス現場向けに補足した解説です。Skill本体の挙動とは独立した参考情報です。

⚠️ ダウンロード・利用は自己責任でお願いします。当サイトは内容・動作・安全性について責任を負いません。

🎯 このSkillでできること

下記の説明文を読むと、このSkillがあなたに何をしてくれるかが分かります。Claudeにこの分野の依頼をすると、自動で発動します。

📦 インストール方法 (3ステップ)

  1. 1. 上の「ダウンロード」ボタンを押して .skill ファイルを取得
  2. 2. ファイル名の拡張子を .skill から .zip に変えて展開(macは自動展開可)
  3. 3. 展開してできたフォルダを、ホームフォルダの .claude/skills/ に置く
    • · macOS / Linux: ~/.claude/skills/
    • · Windows: %USERPROFILE%\.claude\skills\

Claude Code を再起動すれば完了。「このSkillを使って…」と話しかけなくても、関連する依頼で自動的に呼び出されます。

詳しい使い方ガイドを見る →
最終更新
2026-05-17
取得日時
2026-05-17
同梱ファイル
1

📖 Skill本文(日本語訳)

※ 原文(英語/中国語)を Gemini で日本語化したものです。Claude 自身は原文を読みます。誤訳がある場合は原文をご確認ください。

AIポッドキャスト作成

AI Podcast Creation

inference.sh CLI を介して、AIを活用したポッドキャストやオーディオコンテンツを作成できます。

クイックスタート

curl -fsSL https://cli.inference.sh | sh && infsh login

# ポッドキャストセグメントを生成
infsh app run infsh/kokoro-tts --input '{
  "text": "Welcome to the AI Frontiers podcast. Today we explore the latest developments in generative AI.",
  "voice": "am_michael"
}'

利用可能な音声

Kokoro TTS

Voice ID 説明 最適な用途
af_sarah アメリカ人女性、温かい ホスト、ナレーター
af_nicole アメリカ人女性、プロフェッショナル ニュース、ビジネス
am_michael アメリカ人男性、権威的 ドキュメンタリー、テクノロジー
am_adam アメリカ人男性、会話的 カジュアルなポッドキャスト
bf_emma イギリス人女性、洗練された オーディオブック
bm_george イギリス人男性、クラシック フォーマルなコンテンツ

DIA TTS (会話型)

Voice ID 説明 最適な用途
dia-conversational 自然な会話 対話、インタビュー

Chatterbox

Voice ID 説明 最適な用途
chatterbox-default 表現豊か カジュアル、エンターテイメント

ポッドキャストのワークフロー

シンプルなナレーション

# 単一音声のポッドキャストセグメント
infsh app run infsh/kokoro-tts --input '{
  "text": "Your podcast script here. Make it conversational and engaging. Add natural pauses with punctuation.",
  "voice": "am_michael"
}'

複数音声の会話

# ホストの紹介
infsh app run infsh/kokoro-tts --input '{
  "text": "Welcome back to Tech Talk. Today I have a special guest to discuss AI developments.",
  "voice": "am_michael"
}' > host_intro.json

# ゲストの返答
infsh app run infsh/kokoro-tts --input '{
  "text": "Thanks for having me. I am excited to share what we have been working on.",
  "voice": "af_sarah"
}' > guest_response.json

# 会話にマージ
infsh app run infsh/media-merger --input '{
  "audio_files": ["<host-url>", "<guest-url>"],
  "crossfade_ms": 500
}'

全エピソードのパイプライン

# 1. Claudeでスクリプトを生成
infsh app run openrouter/claude-sonnet-45 --input '{
  "prompt": "Write a 5-minute podcast script about the impact of AI on creative work. Format as a two-person dialogue between HOST and GUEST. Include natural conversation, questions, and insights."
}' > script.json

# 2. イントロ音楽を生成
infsh app run infsh/ai-music --input '{
  "prompt": "Podcast intro music, upbeat, modern, tech feel, 15 seconds"
}' > intro_music.json

# 3. ホストのセグメントを生成
infsh app run infsh/kokoro-tts --input '{
  "text": "<host-lines>",
  "voice": "am_michael"
}' > host.json

# 4. ゲストのセグメントを生成
infsh app run infsh/kokoro-tts --input '{
  "text": "<guest-lines>",
  "voice": "af_sarah"
}' > guest.json

# 5. アウトロ音楽を生成
infsh app run infsh/ai-music --input '{
  "prompt": "Podcast outro music, matching intro style, fade out, 10 seconds"
}' > outro_music.json

# 6. 全てをマージ
infsh app run infsh/media-merger --input '{
  "audio_files": [
    "<intro-music>",
    "<host>",
    "<guest>",
    "<outro-music>"
  ],
  "crossfade_ms": 1000
}'

NotebookLMスタイルのコンテンツ

ドキュメントからポッドキャストスタイルの議論を生成します。

# 1. 主要なポイントを抽出
infsh app run openrouter/claude-sonnet-45 --input '{
  "prompt": "Read this document and create a podcast script where two hosts discuss the key points in an engaging, conversational way. Include questions, insights, and natural dialogue.\n\nDocument:\n<your-document-content>"
}' > discussion_script.json

# 2. ホストAを生成
infsh app run infsh/kokoro-tts --input '{
  "text": "<host-a-lines>",
  "voice": "am_michael"
}' > host_a.json

# 3. ホストBを生成
infsh app run infsh/kokoro-tts --input '{
  "text": "<host-b-lines>",
  "voice": "af_sarah"
}' > host_b.json

# 4. 交互に配置してマージ
infsh app run infsh/media-merger --input '{
  "audio_files": ["<host-a-1>", "<host-b-1>", "<host-a-2>", "<host-b-2>"],
  "crossfade_ms": 300
}'

オーディオブックの章

# 長尺のナレーション
infsh app run infsh/kokoro-tts --input '{
  "text": "Chapter One. It was a dark and stormy night when the first AI achieved consciousness...",
  "voice": "bf_emma",
  "speed": 0.9
}'

オーディオの強化

BGMを追加

# 1. ポッドキャストの音声を生成
infsh app run infsh/kokoro-tts --input '{
  "text": "<podcast-script>",
  "voice": "am_michael"
}' > podcast.json

# 2. 環境音楽を生成
infsh app run infsh/ai-music --input '{
  "prompt": "Soft ambient background music for podcast, subtle, non-distracting, loopable"
}' > background.json

# 3. 背景音量を下げてミックス
infsh app run infsh/media-merger --input '{
  "audio_files": ["<podcast-url>"],
  "background_audio": "<background-url>",
  "background_volume": 0.15
}'

効果音を追加

# セグメント間のトランジションサウンド
infsh app run infsh/ai-music --input '{
  "prompt": "Short podcast transition sound, whoosh, 2 seconds"
}' > transition.json

スクリプト作成のヒント

Claudeへのプロンプト

infsh app run openrouter/claude-sonnet-45 --input '{
  "prompt": "Write a podcast script with these requirements:
  - Topic: [YOUR TOPIC]
  - Duration: 5 minutes (about 750 words)
  - Format: Two hosts (HOST_A and HOST_B)
  - Tone: Conversational, informative, engaging
  - Include: Hook intro, 3 main points, call to action
  - Mark speaker changes clearly

  Make it sound natural, not scripted. Add verbal fillers like \"you know\" and \"I mean\" occasionally."
}'

ポッドキャストのテンプレート

インタビュー形式

HOST: Introduction and welcome
GUEST: Thank you, happy to be here
HOST: First question about background
GUEST: Response with story
HOST: Follow-up question
GUEST: Deeper insight
... continue pattern ...
HOST: Closing question
GUEST: Fina
📜 原文 SKILL.md(Claudeが読む英語/中国語)を展開

AI Podcast Creation

AI Podcast Creation

Create AI-powered podcasts and audio content via inference.sh CLI.

Quick Start

curl -fsSL https://cli.inference.sh | sh && infsh login

# Generate podcast segment
infsh app run infsh/kokoro-tts --input '{
  "text": "Welcome to the AI Frontiers podcast. Today we explore the latest developments in generative AI.",
  "voice": "am_michael"
}'

Available Voices

Kokoro TTS

Voice ID Description Best For
af_sarah American female, warm Host, narrator
af_nicole American female, professional News, business
am_michael American male, authoritative Documentary, tech
am_adam American male, conversational Casual podcast
bf_emma British female, refined Audiobooks
bm_george British male, classic Formal content

DIA TTS (Conversational)

Voice ID Description Best For
dia-conversational Natural conversation Dialogue, interviews

Chatterbox

Voice ID Description Best For
chatterbox-default Expressive Casual, entertainment

Podcast Workflows

Simple Narration

# Single voice podcast segment
infsh app run infsh/kokoro-tts --input '{
  "text": "Your podcast script here. Make it conversational and engaging. Add natural pauses with punctuation.",
  "voice": "am_michael"
}'

Multi-Voice Conversation

# Host introduction
infsh app run infsh/kokoro-tts --input '{
  "text": "Welcome back to Tech Talk. Today I have a special guest to discuss AI developments.",
  "voice": "am_michael"
}' > host_intro.json

# Guest response
infsh app run infsh/kokoro-tts --input '{
  "text": "Thanks for having me. I am excited to share what we have been working on.",
  "voice": "af_sarah"
}' > guest_response.json

# Merge into conversation
infsh app run infsh/media-merger --input '{
  "audio_files": ["<host-url>", "<guest-url>"],
  "crossfade_ms": 500
}'

Full Episode Pipeline

# 1. Generate script with Claude
infsh app run openrouter/claude-sonnet-45 --input '{
  "prompt": "Write a 5-minute podcast script about the impact of AI on creative work. Format as a two-person dialogue between HOST and GUEST. Include natural conversation, questions, and insights."
}' > script.json

# 2. Generate intro music
infsh app run infsh/ai-music --input '{
  "prompt": "Podcast intro music, upbeat, modern, tech feel, 15 seconds"
}' > intro_music.json

# 3. Generate host segments
infsh app run infsh/kokoro-tts --input '{
  "text": "<host-lines>",
  "voice": "am_michael"
}' > host.json

# 4. Generate guest segments
infsh app run infsh/kokoro-tts --input '{
  "text": "<guest-lines>",
  "voice": "af_sarah"
}' > guest.json

# 5. Generate outro music
infsh app run infsh/ai-music --input '{
  "prompt": "Podcast outro music, matching intro style, fade out, 10 seconds"
}' > outro_music.json

# 6. Merge everything
infsh app run infsh/media-merger --input '{
  "audio_files": [
    "<intro-music>",
    "<host>",
    "<guest>",
    "<outro-music>"
  ],
  "crossfade_ms": 1000
}'

NotebookLM-Style Content

Generate podcast-style discussions from documents.

# 1. Extract key points
infsh app run openrouter/claude-sonnet-45 --input '{
  "prompt": "Read this document and create a podcast script where two hosts discuss the key points in an engaging, conversational way. Include questions, insights, and natural dialogue.\n\nDocument:\n<your-document-content>"
}' > discussion_script.json

# 2. Generate Host A
infsh app run infsh/kokoro-tts --input '{
  "text": "<host-a-lines>",
  "voice": "am_michael"
}' > host_a.json

# 3. Generate Host B
infsh app run infsh/kokoro-tts --input '{
  "text": "<host-b-lines>",
  "voice": "af_sarah"
}' > host_b.json

# 4. Interleave and merge
infsh app run infsh/media-merger --input '{
  "audio_files": ["<host-a-1>", "<host-b-1>", "<host-a-2>", "<host-b-2>"],
  "crossfade_ms": 300
}'

Audiobook Chapter

# Long-form narration
infsh app run infsh/kokoro-tts --input '{
  "text": "Chapter One. It was a dark and stormy night when the first AI achieved consciousness...",
  "voice": "bf_emma",
  "speed": 0.9
}'

Audio Enhancement

Add Background Music

# 1. Generate podcast audio
infsh app run infsh/kokoro-tts --input '{
  "text": "<podcast-script>",
  "voice": "am_michael"
}' > podcast.json

# 2. Generate ambient music
infsh app run infsh/ai-music --input '{
  "prompt": "Soft ambient background music for podcast, subtle, non-distracting, loopable"
}' > background.json

# 3. Mix with lower background volume
infsh app run infsh/media-merger --input '{
  "audio_files": ["<podcast-url>"],
  "background_audio": "<background-url>",
  "background_volume": 0.15
}'

Add Sound Effects

# Transition sounds between segments
infsh app run infsh/ai-music --input '{
  "prompt": "Short podcast transition sound, whoosh, 2 seconds"
}' > transition.json

Script Writing Tips

Prompt for Claude

infsh app run openrouter/claude-sonnet-45 --input '{
  "prompt": "Write a podcast script with these requirements:
  - Topic: [YOUR TOPIC]
  - Duration: 5 minutes (about 750 words)
  - Format: Two hosts (HOST_A and HOST_B)
  - Tone: Conversational, informative, engaging
  - Include: Hook intro, 3 main points, call to action
  - Mark speaker changes clearly

  Make it sound natural, not scripted. Add verbal fillers like \"you know\" and \"I mean\" occasionally."
}'

Podcast Templates

Interview Format

HOST: Introduction and welcome
GUEST: Thank you, happy to be here
HOST: First question about background
GUEST: Response with story
HOST: Follow-up question
GUEST: Deeper insight
... continue pattern ...
HOST: Closing question
GUEST: Final thoughts
HOST: Thank you and outro

Solo Episode

Introduction with hook
Topic overview
Point 1 with examples
Point 2 with examples
Point 3 with examples
Summary and takeaways
Call to action
Outro

News Roundup

Intro music
Welcome and date
Story 1: headline + details
Story 2: headline + details
Story 3: headline + details
Analysis/opinion segment
Outro

Best Practices

  1. Natural punctuation - Use commas and periods for pacing
  2. Short sentences - Easier to speak and listen
  3. Varied voices - Different speakers prevent monotony
  4. Background music - Subtle, at 10-15% volume
  5. Crossfades - Smooth transitions between segments
  6. Edit scripts - Remove filler before generating

Related Skills

# Text-to-speech models
npx skills add inferencesh/skills@text-to-speech

# AI music generation
npx skills add inferencesh/skills@ai-music-generation

# LLM for scripts
npx skills add inferencesh/skills@llm-models

# Content pipelines
npx skills add inferencesh/skills@ai-content-pipeline

# Full platform skill
npx skills add inferencesh/skills@inference-sh

Browse all apps: infsh app list --category audio