🛠️ Development / MCP · Community

azure-speech

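A Skill that provides expert knowledge across Azure AI Speech development and handles a broad range of development tasks, including speech recognition and synthesis, custom voices, and on-premises use.

📜 Original English description (for reference)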

Expert knowledge for Azure AI Speech development including troubleshooting, best practices, decision making, limits & quotas, security, configuration, integrations & coding patterns, and deployment. Use when building STT/TTS, custom voices/avatars, batch TTS, Voice Live, or containerized/on-prem Speech, and other Azure AI Speech related development tasks. Not for Azure Communication Services (use azure-communication-services), Azure AI Bot Service (use azure-bot-service), Azure AI Video Indexer (use azure-video-indexer), Azure AI Vision (use azure-ai-vision).

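🇯🇵 Notes for Japanese creators

In a nutshell

Note: This commentary was added by the jpskill.com editorial team for Japanese business settings. It is reference information, independent of how the Skill itself behaves.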


⚠️ Download and use at your own risk. This site accepts no responsibility for the content, behavior, or safety of the Skill.

🎯 What this Skill can do

The description below explains what this Skill does for you. It activates automatically when you ask Claude for help in this area.

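📦 How to install (3 steps)

1. Get the .skill file using the Download button on the Skill's page
2. Change the file extension from .skill to .zip and extract it (macOS can extract it automatically)
3. Place the extracted folder in .claude/skills/ under your home folder
- macOS / Linux: ~/.claude/skills/
- Windows: %USERPROFILE%\.claude\skills\

Restart Claude Code and you are done. You do not need to say "use this Skill..."; it is invoked automatically for relevant requests.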
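The three install steps can also be scripted. The following is a minimal sketch, assuming the .skill file is an ordinary ZIP archive (which the rename-to-.zip step suggests). The function name `install_skill` and the rule for choosing the folder name are illustrative assumptions, not part of the Skill or of Claude Code.

```python
import zipfile
from pathlib import Path
from typing import Optional


def install_skill(skill_file: Path, skills_dir: Optional[Path] = None) -> Path:
    """Extract a .skill archive (assumed to be a plain ZIP) and return the skill folder."""
    skill_file = Path(skill_file)
    # Default target is ~/.claude/skills/; on Windows Path.home() is %USERPROFILE%.
    root = Path(skills_dir) if skills_dir else Path.home() / ".claude" / "skills"
    root.mkdir(parents=True, exist_ok=True)

    with zipfile.ZipFile(skill_file) as archive:
        names = archive.namelist()
        top_levels = {name.split("/", 1)[0] for name in names}
        # Assumption: if every entry sits under one top-level folder, that folder
        # is the skill; otherwise wrap the files in a folder named after the archive.
        wrapped = len(top_levels) == 1 and all("/" in name for name in names)
        skill_dir = root / (top_levels.pop() if wrapped else skill_file.stem)
        archive.extractall(root if wrapped else skill_dir)
    return skill_dir
```

Calling `install_skill(Path("azure-speech.skill"))` would place the files under `~/.claude/skills/`; Claude Code still needs a restart afterwards, as described above.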


Last updated: 2026-05-17
Retrieved: 2026-05-17
Bundled files: 1

📜 Original SKILL.md (the English/Chinese text that Claude reads)

Azure AI Speech Skill

This skill provides expert guidance for Azure AI Speech. Covers troubleshooting, best practices, decision making, limits & quotas, security, configuration, integrations & coding patterns, and deployment. It combines local quick-reference content with remote documentation fetching capabilities.

How to Use This Skill

IMPORTANT for Agent: Use the Category Index below to locate relevant sections. For categories with line ranges (e.g., L35-L120), use read_file with the specified lines. For categories with file links (e.g., [security.md](security.md)), use read_file on the linked reference file.

IMPORTANT for Agent: If metadata.generated_at is more than 3 months old, suggest the user pull the latest version from the repository. If mcp_microsoftdocs tools are not available, suggest the user install it: Installation Guide
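The freshness rule above can be sketched as a small check. This is only an illustration: the skill does not state the format of metadata.generated_at, so the sketch assumes an ISO 8601 date or timestamp without a trailing "Z", and it approximates "3 months" as 90 days. The name `is_stale` is hypothetical.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional


def is_stale(generated_at: str, now: Optional[datetime] = None,
             max_age_days: int = 90) -> bool:
    """Return True if generated_at is older than max_age_days (about 3 months)."""
    # Assumption: generated_at is ISO 8601, e.g. "2026-05-17" or "2026-05-17T08:00:00+00:00".
    generated = datetime.fromisoformat(generated_at)
    if generated.tzinfo is None:
        # Treat timestamps without an offset as UTC so the subtraction is well defined.
        generated = generated.replace(tzinfo=timezone.utc)
    current = now or datetime.now(timezone.utc)
    return current - generated > timedelta(days=max_age_days)
```

When the function returns True, the agent would suggest pulling the latest version of the skill from the repository.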

This skill requires network access to fetch documentation content:

Preferred: Use mcp_microsoftdocs:microsoft_docs_fetch with query string from=learn-agent-skill. Returns Markdown.
Fallback: Use fetch_webpage with query string from=learn-agent-skill&accept=text/markdown. Returns Markdown.
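As a rough sketch of the two options above, the query string can be appended to any of the documentation URLs listed in this skill with the standard library. The helper name `with_skill_query` is an assumption made for illustration; how the resulting URL is passed to mcp_microsoftdocs:microsoft_docs_fetch or fetch_webpage depends on the agent's tooling and is not shown.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def with_skill_query(url: str, fallback: bool = False) -> str:
    """Add from=learn-agent-skill (plus accept=text/markdown for the fallback)."""
    parts = urlsplit(url)
    # Keep any query parameters the URL already carries.
    params = dict(parse_qsl(parts.query))
    params["from"] = "learn-agent-skill"
    if fallback:
        params["accept"] = "text/markdown"
    # safe="/" keeps "text/markdown" readable instead of encoding the slash.
    return urlunsplit(parts._replace(query=urlencode(params, safe="/")))
```

For example, calling it with `fallback=True` on the troubleshooting URL yields that URL followed by `?from=learn-agent-skill&accept=text/markdown`.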

Category Index

Category	Lines	Description
Troubleshooting	L36-L45	Diagnosing and fixing common Azure Speech issues (SDK, text-to-speech, Foundry, containers, CRL), plus how to capture session/transcription IDs for support.
Best Practices	L46-L62	Best practices for audio/video prep, custom voice/avatars, latency and memory tuning, phrase/keyword optimization, and handling real-time Voice Live interactions and interruptions.
Decision Making	L63-L81	Guides for choosing speech features, planning large-scale/batch use, evaluating models/devices, checking availability, and migrating between Speech API versions and services.
Limits & Quotas	L82-L90	Quotas, limits, and usage patterns for Azure Speech: batch TTS, custom/pro voice training & deployment, and short audio STT, plus throttling and capacity planning guidance.
Security	L91-L102	Configuring security for Azure AI Speech: auth (Entra, RBAC), network isolation (VNet, Private Link, sovereign clouds), BYOS storage, encryption/keys, and voice talent consent management.
Configuration	L103-L132	Configuring Azure AI Speech behavior: SDK/CLI settings, audio I/O, logging, storage, SSML, pronunciation, batch jobs, custom speech/voice, avatars, and Voice Live API options.
Integrations & Coding Patterns	L133-L157	Patterns and code for integrating Azure Speech with apps and agents: SDK/REST usage, TTS/translation/avatars, call center and Voice Live, OpenAI/Foundry, consent, and automation.
Deployment	L158-L169	Deploying and scaling Azure AI Speech: Docker/Kubernetes containers, on-prem STT/TTS, custom speech models/endpoints, language ID, and batch/long-form synthesis workflows.
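The line ranges in the index refer to lines of the SKILL.md file itself. A minimal sketch of how a range such as L36-L45 could be resolved is shown below, assuming the ranges are 1-indexed and inclusive. The functions `parse_line_range` and `read_section` are hypothetical stand-ins for the agent's read_file tool, not part of it.

```python
import re
from typing import List, Tuple


def parse_line_range(spec: str) -> Tuple[int, int]:
    """Parse a range like "L36-L45" into (36, 45)."""
    match = re.fullmatch(r"L(\d+)-L(\d+)", spec.strip())
    if match is None:
        raise ValueError(f"not a line range: {spec!r}")
    start, end = int(match.group(1)), int(match.group(2))
    if start < 1 or end < start:
        raise ValueError(f"invalid line range: {spec!r}")
    return start, end


def read_section(text: str, spec: str) -> List[str]:
    """Return the lines of text covered by spec (assumed 1-indexed, inclusive)."""
    start, end = parse_line_range(spec)
    return text.splitlines()[start - 1:end]
```

With the SKILL.md contents loaded into a string, `read_section(skill_text, "L36-L45")` would return the Troubleshooting rows.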

Troubleshooting

Topic	URL
Resolve common Azure text-to-speech service issues	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/faq-tts
Retrieve Speech to text session and transcription IDs for support	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-get-speech-session-id
Resolve common Azure Speech in Foundry issues	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/known-issues
Resolve Azure AI Speech SDK CRL compatibility issues	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/migrate-to-sdk-1-48-2
Troubleshoot Speech service container deployments	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-container-faq
Troubleshoot common Azure Speech SDK issues	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/troubleshooting

Best Practices

Topic	URL
Prepare and locate audio data for batch transcription	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/batch-transcription-audio-data
Create high-quality human-labeled speech transcriptions	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-custom-speech-human-labeled-transcriptions
Prepare training data for professional custom voice	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-custom-voice-training-data
Apply best practices to reduce Speech synthesis latency	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-lower-speech-synthesis-latency
Track and manage Azure Speech SDK memory usage	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-track-speech-sdk-memory-usage
Handle user interruptions and chat truncation in Voice Live	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-voice-live-auto-truncation
Use interim responses in Voice Live to reduce latency gaps	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-voice-live-interim-response
Improve speech recognition with phrase lists	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/improve-accuracy-phrase-list
Apply keyword recognition design and accuracy guidelines	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/keyword-recognition-guidelines
Record high-quality samples for custom voice training	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/record-custom-voice-samples
Back up and recover custom Speech and Voice resources	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/resiliency-and-recovery-plan
Design microphone arrays optimized for Speech SDK	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-sdk-microphone
Prepare high-quality video samples for custom avatars	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/text-to-speech-avatar/custom-avatar-record-video-samples

Decision Making

Topic	URL
Plan large-scale transcription with batch processing	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/batch-transcription
Evaluate custom voice lite before professional voice	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/custom-neural-voice-lite
Choose Embedded Speech for offline and hybrid scenarios	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/embedded-speech
Evaluate device suitability for embedded speech models	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/embedded-speech-performance-evaluations
Evaluate and compare custom speech model accuracy	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-custom-speech-inspect-data
Check Azure Speech language and voice availability	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/language-support
Migrate Speech to text REST API from v3.2 to 2024-11-15	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/migrate-2024-11-15
Migrate Speech to text REST API from 2024-11-15 to 2025-10-15	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/migrate-2025-10-15
Migrate from retired Speech intent recognition to Language or OpenAI	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/migrate-intent-recognition
Migrate from Long Audio API to Batch synthesis	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/migrate-to-batch-synthesis
Migrate from v3 text-to-speech to custom voice REST API	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/migrate-to-custom-voice-api
Migrate Speech-to-text REST from v3.0 to v3.1	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/migrate-v3-0-to-v3-1
Migrate Speech-to-text REST from v3.1 to v3.2	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/migrate-v3-1-to-v3-2
Assess capabilities and regions for personal voice	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/personal-voice-overview
Decide when to use Whisper for speech tasks	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/whisper-overview

Limits & Quotas

Topic	URL
Manage custom speech model and endpoint lifecycle	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-custom-speech-model-and-endpoint-lifecycle
Deploy professional voice models to custom endpoints	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/professional-voice-deploy-endpoint
Train professional voice models and understand duration	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/professional-voice-train-voice
Use Speech-to-text REST API for short audio	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/rest-speech-to-text-short
Apply Azure Speech quotas, limits, and throttling guidance	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-services-quotas-and-limits

Security

Topic	URL
Configure BYOS storage for Azure Speech resources	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/bring-your-own-storage-speech-resource
Configure Microsoft Entra authentication for Speech SDK	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-configure-azure-ad-auth
Manage voice talent consent for professional voice	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/professional-voice-create-consent
Assign Azure RBAC roles for Speech resources	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/role-based-access-control
Use Azure Speech service in sovereign clouds	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/sovereign-clouds
Manage Speech service data-at-rest encryption and keys	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-encryption-of-data-at-rest
Secure Speech service with Virtual Network service endpoints	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-service-vnet-service-endpoint
Secure Azure AI Speech with Private Link endpoints	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-services-private-link

Configuration

Topic	URL
Configure Batch synthesis properties for text-to-speech	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/batch-synthesis-properties
Check status and retrieve batch transcription results	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/batch-transcription-get
Configure BYOS storage for Speech to text	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/bring-your-own-storage-speech-resource-speech-to-text
Define UPS phonetic pronunciations for Speech to text	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/customize-pronunciation
Configure OpenSSL on Linux for Azure Speech SDK	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-configure-openssl-linux
Control and monitor Speech SDK service connections	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-control-connections
Create and manage custom speech fine-tuning projects	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-custom-speech-create-project
Prepare and upload datasets for custom speech training	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-custom-speech-upload-data
Select and configure audio input devices in Speech SDK	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-select-audio-input-devices
Use visemes for facial animation with Speech service	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-speech-synthesis-viseme
Configure Speech SDK audio input streams	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-use-audio-input-streams
Configure compressed audio input for Speech SDK and CLI	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-use-codec-compressed-audio-input-streams
Enable and configure Speech SDK diagnostic logging	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-use-logging
Configure audio and transcription logging for Speech recognition	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/logging-audio-transcription
Upload and validate training datasets for professional voice	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/professional-voice-create-training-set
Use correct regional endpoints for Azure Speech	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/regions
Configure Speech containers storage, logging, and security	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-container-configuration
Control speech output using SSML configuration	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-synthesis-markup
Configure pronunciation with SSML phonemes and lexicons	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-synthesis-markup-pronunciation
Structure SSML documents and events for Speech	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-synthesis-markup-structure
Configure Speech CLI datastore search order and files	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/spx-data-store-configuration
Configure output destinations for Speech CLI results	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/spx-output-options
Configure batch synthesis properties for TTS avatars	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/text-to-speech-avatar/batch-synthesis-avatar-properties
Reference Voice Live API events, models, and settings (2025-10-01)	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/voice-live-api-reference-2025-10-01
Reference Voice Live API events and settings (2026-01-01-preview)	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/voice-live-api-reference-2026-01-01-preview
Configure language and locale for Voice Live API	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/voice-live-language-support

Integrations & Coding Patterns

Topic	URL
Integrate Speech service with call center telephony	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/call-center-telephony-integration
Call Azure Speech fast transcription API in Foundry	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/fast-transcription-create
Use Speech SDK APIs to handle recognition results	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/get-speech-recognition-results
Integrate custom models with Voice Live BYOM	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-bring-your-own-model
Implement text-to-speech synthesis with Speech SDK	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-speech-synthesis
Implement speech translation with Azure Speech SDK	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-translate-speech
Connect MCP servers to Azure Voice Live sessions	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-voice-live-mcp-server
Add proactive greetings to Voice Live agents	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-voice-live-proactive-messages
Integrate with Azure LLM Speech transcription and translation API	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/llm-speech
Integrate Azure Speech with Azure OpenAI chat	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/openai-speech
Add and manage user consent for personal voice	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/personal-voice-create-consent
Create personal voice projects via Custom Voice API	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/personal-voice-create-project
Use Power Automate connector for Speech batch transcription	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/power-automate-batch-transcription
Use Speech to text REST API endpoints and parameters	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/rest-speech-to-text
Call Text-to-speech REST API for voice synthesis	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/rest-text-to-speech
Use SSML phonetic alphabets with Azure Speech	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-ssml-phonetic-sets
Use SSML to customize Azure Speech voices	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-synthesis-markup-voice
Generate Speech service REST clients from Swagger	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/swagger-documentation
Control text to speech avatar gestures with SSML	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/text-to-speech-avatar/avatar-gestures-with-ssml
Implement real-time text-to-speech avatar streaming	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/text-to-speech-avatar/real-time-synthesis-avatar
Use Voice Live WebSocket API for real-time agents	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/voice-live-how-to

Deployment

Topic	URL
Use Batch synthesis API for long-form text-to-speech	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/batch-synthesis
Deploy custom speech models and endpoints	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-custom-speech-deploy-model
Scale Speech containers with batch processing kit	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-container-batch-processing
Run custom speech to text containers with Docker	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-container-cstt
Deploy and run Speech containers with Docker	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-container-howto
Run Speech containers on Kubernetes with Helm	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-container-howto-on-premises
Deploy language identification containers with Docker	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-container-lid
Deploy neural text to speech containers with Docker	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-container-ntts
Deploy speech to text containers for on-premises use	https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-container-stt