💼 ビジネスコミュニティ

elevenlabs-tts

ElevenLabsの高品質なAI音声合成技術を利用し、22種類以上の音声から選択、多言語に対応し、読み上げスタイルを調整して、ナレーションやオーディオブックなどを作成するSkill。

📜 元の英語説明(参考)

ElevenLabs text-to-speech with 22+ premium voices, multilingual support, and voice tuning via inference.sh CLI. Models: eleven_multilingual_v2 (highest quality), eleven_turbo_v2_5 (low latency), eleven_flash_v2_5 (ultra-fast). Capabilities: text-to-speech, voice selection, stability/style control, 32 languages. Use for: voiceovers, audiobooks, video narration, podcasts, accessibility, IVR. Triggers: elevenlabs, eleven labs, elevenlabs tts, premium tts, professional voice, ai voice, high quality tts, multilingual tts, eleven labs voice, voice generation, natural speech, realistic voice, voice over, speech synthesis

🇯🇵 日本人クリエイター向け解説

一言でいうと

※ jpskill.com 編集部が日本のビジネス現場向けに補足した解説です。Skill本体の挙動とは独立した参考情報です。

⚡ おすすめ: コマンド1行でインストール(60秒)

下記のコマンドをコピーしてターミナル(Mac/Linux)または PowerShell(Windows)に貼り付けてください。ダウンロード → 解凍 → 配置まで全自動。

🍎 Mac / 🐧 Linux

mkdir -p ~/.claude/skills && cd ~/.claude/skills && curl -L -o elevenlabs-tts.zip https://jpskill.com/download/19244.zip && unzip -o elevenlabs-tts.zip && rm elevenlabs-tts.zip

🪟 Windows (PowerShell)

$d = "$env:USERPROFILE\.claude\skills"; ni -Force -ItemType Directory $d | Out-Null; iwr https://jpskill.com/download/19244.zip -OutFile "$d\elevenlabs-tts.zip"; Expand-Archive "$d\elevenlabs-tts.zip" -DestinationPath $d -Force; ri "$d\elevenlabs-tts.zip"

完了後、Claude Code を再起動 → 普通に「動画プロンプト作って」のように話しかけるだけで自動発動します。

💾 手動でダウンロードしたい(コマンドが難しい人向け)

1. 下の青いボタンを押して elevenlabs-tts.zip をダウンロード
2. ZIPファイルをダブルクリックで解凍 → elevenlabs-tts フォルダができる
3. そのフォルダを C:\Users\あなたの名前\.claude\skills\(Win)または ~/.claude/skills/(Mac)へ移動
4. Claude Code を再起動

⬇ .zip でダウンロード(推奨) ⬇ .skill 形式(上級者用) 元のソース ↗

⚠️ ダウンロード・利用は自己責任でお願いします。当サイトは内容・動作・安全性について責任を負いません。

🎯 このSkillでできること

下記の説明文を読むと、このSkillがあなたに何をしてくれるかが分かります。Claudeにこの分野の依頼をすると、自動で発動します。

📦 インストール方法 (3ステップ)

1. 上の「ダウンロード」ボタンを押して .skill ファイルを取得
2. ファイル名の拡張子を .skill から .zip に変えて展開(macは自動展開可)
3. 展開してできたフォルダを、ホームフォルダの .claude/skills/ に置く
- · macOS / Linux: ~/.claude/skills/
- · Windows: %USERPROFILE%\.claude\skills\

Claude Code を再起動すれば完了。「このSkillを使って…」と話しかけなくても、関連する依頼で自動的に呼び出されます。

詳しい使い方ガイドを見る →

最終更新: 2026-05-18
取得日時: 2026-05-18
同梱ファイル: 1

📖 Skill本文(日本語訳)

※ 原文(英語/中国語)を Gemini で日本語化したものです。Claude 自身は原文を読みます。誤訳がある場合は原文をご確認ください。

ElevenLabs テキスト読み上げ

inference.sh CLI を介した、22 種類以上の音声によるプレミアムなテキスト読み上げです。

ElevenLabs TTS

クイックスタート

inference.sh CLI (infsh) が必要です。インストール手順

infsh login

# ElevenLabs で音声を生成
infsh app run elevenlabs/tts --input '{"text": "Hello, welcome to our product demo.", "voice": "aria"}'

利用可能なモデル

モデル	ID	最適な用途	レイテンシー
Multilingual v2	`eleven_multilingual_v2`	最高品質、32 言語	~250ms
Turbo v2.5	`eleven_turbo_v2_5`	速度と品質のバランス	~150ms
Flash v2.5	`eleven_flash_v2_5`	超低レイテンシー	~75ms

音声ライブラリ

女性の声

音声	スタイル
`aria`	アメリカ英語、会話調
`alice`	イギリス英語、自信に満ちた
`bella`	アメリカ英語、温かい
`jessica`	アメリカ英語、表現力豊かな
`laura`	アメリカ英語、プロフェッショナルな
`lily`	イギリス英語、優しい
`sarah`	アメリカ英語、フレンドリーな

男性の声

音声	スタイル
`george`	イギリス英語、威厳のある
`adam`	アメリカ英語、深みのある
`bill`	アメリカ英語、成熟した
`brian`	アメリカ英語、会話調
`callum`	環大西洋英語、力強い
`charlie`	オーストラリア英語、自然な
`chris`	アメリカ英語、カジュアルな
`daniel`	イギリス英語、威厳のある
`eric`	アメリカ英語、フレンドリーな
`harry`	アメリカ英語、若い
`liam`	アメリカ英語、明瞭な
`matilda`	アメリカ英語、温かい
`river`	アメリカ英語、自信に満ちた
`roger`	アメリカ英語、威厳のある
`will`	アメリカ英語、明るい

例

基本的な音声

infsh app run elevenlabs/tts --input '{"text": "Welcome to our quarterly earnings presentation.", "voice": "george"}'

モデルの選択

# 最高品質
infsh app run elevenlabs/tts --input '{
  "text": "This is our premium multilingual model with the best quality.",
  "voice": "aria",
  "model": "eleven_multilingual_v2"
}'

# リアルタイムアプリケーション向けの超高速モデル
infsh app run elevenlabs/tts --input '{
  "text": "Flash model for low-latency applications.",
  "voice": "brian",
  "model": "eleven_flash_v2_5"
}'

音声チューニング

infsh app run elevenlabs/tts --input '{
  "text": "Fine-tune the voice characteristics for your use case.",
  "voice": "bella",
  "stability": 0.3,
  "similarity_boost": 0.9,
  "style": 0.4
}'

パラメータ	範囲	効果
`stability`	0-1	高いほど一貫性が高く、低いほど表現豊かになります
`similarity_boost`	0-1	高いほど元の音声特性に近くなります
`style`	0-1	高いほどスタイルの誇張が大きくなります
`use_speaker_boost`	true/false	話者の明瞭度を高めます

出力形式

# 高品質 MP3
infsh app run elevenlabs/tts --input '{
  "text": "High quality audio output.",
  "voice": "daniel",
  "output_format": "mp3_44100_192"
}'

形式	説明
`mp3_44100_128`	44.1kHz、128kbps の MP3 (デフォルト)
`mp3_44100_192`	44.1kHz、192kbps の MP3
`pcm_16000`	16kHz の生 PCM
`pcm_22050`	22.05kHz の生 PCM
`pcm_24000`	24kHz の生 PCM
`pcm_44100`	44.1kHz の生 PCM

多言語対応

ElevenLabs は、英語、スペイン語、フランス語、ドイツ語、イタリア語、ポルトガル語、中国語、日本語、韓国語、アラビア語、ヒンディー語、ロシア語など、32 の言語をサポートしています。

# スペイン語
infsh app run elevenlabs/tts --input '{
  "text": "Hola, bienvenidos a nuestra presentación.",
  "voice": "aria",
  "model": "eleven_multilingual_v2"
}'

# フランス語
infsh app run elevenlabs/tts --input '{
  "text": "Bonjour, bienvenue à notre démonstration.",
  "voice": "alice",
  "model": "eleven_multilingual_v2"
}'

音声 + ビデオワークフロー

# 1. ナレーションを生成
infsh app run elevenlabs/tts --input '{
  "text": "Introducing the future of AI-powered content creation.",
  "voice": "george"
}' > voiceover.json

# 2. トーキングヘッドビデオを作成
infsh app run bytedance/omnihuman-1-5 --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "<audio-url-from-step-1>"
}'

ユースケース

ナレーション: 製品デモ、解説ビデオ、コマーシャル
オーディオブック: 一貫した音声による長編ナレーション
ポッドキャスト: 自然な話し方の AI ホスト
eラーニング: 多言語でのコースナレーション
アクセシビリティ: 高品質なスクリーンリーダーコンテンツ
IVR: プロフェッショナルな電話システムメッセージ
ビデオナレーション: ドキュメンタリーやソーシャルメディアコンテンツ

ElevenLabs Text-to-Speech

Premium text-to-speech with 22+ voices via inference.sh CLI.

ElevenLabs TTS

Quick Start

Requires inference.sh CLI (infsh). Install instructions

infsh login

# Generate speech with ElevenLabs
infsh app run elevenlabs/tts --input '{"text": "Hello, welcome to our product demo.", "voice": "aria"}'

Available Models

Model	ID	Best For	Latency
Multilingual v2	`eleven_multilingual_v2`	Highest quality, 32 languages	~250ms
Turbo v2.5	`eleven_turbo_v2_5`	Balance of speed & quality	~150ms
Flash v2.5	`eleven_flash_v2_5`	Ultra-low latency	~75ms

Voice Library

Female Voices

Voice	Style
`aria`	American, conversational
`alice`	British, confident
`bella`	American, warm
`jessica`	American, expressive
`laura`	American, professional
`lily`	British, soft
`sarah`	American, friendly

Male Voices

Voice	Style
`george`	British, authoritative
`adam`	American, deep
`bill`	American, mature
`brian`	American, conversational
`callum`	Transatlantic, intense
`charlie`	Australian, natural
`chris`	American, casual
`daniel`	British, commanding
`eric`	American, friendly
`harry`	American, young
`liam`	American, articulate
`matilda`	American, warm
`river`	American, confident
`roger`	American, authoritative
`will`	American, bright

Examples

Basic Speech

infsh app run elevenlabs/tts --input '{"text": "Welcome to our quarterly earnings presentation.", "voice": "george"}'

Choose a Model

# Highest quality
infsh app run elevenlabs/tts --input '{
  "text": "This is our premium multilingual model with the best quality.",
  "voice": "aria",
  "model": "eleven_multilingual_v2"
}'

# Ultra-fast for real-time applications
infsh app run elevenlabs/tts --input '{
  "text": "Flash model for low-latency applications.",
  "voice": "brian",
  "model": "eleven_flash_v2_5"
}'

Voice Tuning

infsh app run elevenlabs/tts --input '{
  "text": "Fine-tune the voice characteristics for your use case.",
  "voice": "bella",
  "stability": 0.3,
  "similarity_boost": 0.9,
  "style": 0.4
}'

Parameter	Range	Effect
`stability`	0-1	Higher = more consistent, lower = more expressive
`similarity_boost`	0-1	Higher = closer to original voice character
`style`	0-1	Higher = more style exaggeration
`use_speaker_boost`	true/false	Enhances speaker clarity

Output Formats

# High-quality MP3
infsh app run elevenlabs/tts --input '{
  "text": "High quality audio output.",
  "voice": "daniel",
  "output_format": "mp3_44100_192"
}'

Format	Description
`mp3_44100_128`	MP3 at 44.1kHz, 128kbps (default)
`mp3_44100_192`	MP3 at 44.1kHz, 192kbps
`pcm_16000`	Raw PCM at 16kHz
`pcm_22050`	Raw PCM at 22.05kHz
`pcm_24000`	Raw PCM at 24kHz
`pcm_44100`	Raw PCM at 44.1kHz

Multilingual

ElevenLabs supports 32 languages including English, Spanish, French, German, Italian, Portuguese, Chinese, Japanese, Korean, Arabic, Hindi, Russian, and more.

# Spanish
infsh app run elevenlabs/tts --input '{
  "text": "Hola, bienvenidos a nuestra presentación.",
  "voice": "aria",
  "model": "eleven_multilingual_v2"
}'

# French
infsh app run elevenlabs/tts --input '{
  "text": "Bonjour, bienvenue à notre démonstration.",
  "voice": "alice",
  "model": "eleven_multilingual_v2"
}'

Voice + Video Workflow

# 1. Generate voiceover
infsh app run elevenlabs/tts --input '{
  "text": "Introducing the future of AI-powered content creation.",
  "voice": "george"
}' > voiceover.json

# 2. Create talking head video
infsh app run bytedance/omnihuman-1-5 --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "<audio-url-from-step-1>"
}'

Use Cases

Voiceovers: Product demos, explainer videos, commercials
Audiobooks: Long-form narration with consistent voices
Podcasts: AI hosts with natural delivery
E-learning: Course narration in multiple languages
Accessibility: High-quality screen reader content
IVR: Professional phone system messages
Video Narration: Documentary and social media content

Related Skills

# ElevenLabs multi-speaker dialogue
npx skills add inference-sh/skills@elevenlabs-dialogue

# ElevenLabs voice changer
npx skills add inference-sh/skills@elevenlabs-voice-changer

# ElevenLabs sound effects
npx skills add inference-sh/skills@elevenlabs-sound-effects

# All TTS models (Kokoro, DIA, Chatterbox, and more)
npx skills add inference-sh/skills@text-to-speech

# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@infsh-cli

Browse all audio apps: infsh app list --category audio