🛠️ 開発・MCP コミュニティ

visual-asset-workflow

クリエイティブ・ブリーフの手法を用いて、Geminiで章のイラストや図表などの教育用ビジュアルを生成し、特徴的な教材作成を支援するSkill。

📜 元の英語説明(参考)

Generate distinctive educational visuals using creative brief methodology. Use when creating chapter illustrations, diagrams, or teaching visuals with Gemini.

🇯🇵 日本人クリエイター向け解説

一言でいうと

クリエイティブ・ブリーフの手法を用いて、Geminiで章のイラストや図表などの教育用ビジュアルを生成し、特徴的な教材作成を支援するSkill。

※ jpskill.com 編集部が日本のビジネス現場向けに補足した解説です。Skill本体の挙動とは独立した参考情報です。

⚡ おすすめ: コマンド1行でインストール(60秒)

下記のコマンドをコピーしてターミナル(Mac/Linux)または PowerShell(Windows)に貼り付けてください。ダウンロード → 解凍 → 配置まで全自動。

🍎 Mac / 🐧 Linux

mkdir -p ~/.claude/skills && cd ~/.claude/skills && curl -L -o visual-asset-workflow.zip https://jpskill.com/download/16881.zip && unzip -o visual-asset-workflow.zip && rm visual-asset-workflow.zip

🪟 Windows (PowerShell)

$d = "$env:USERPROFILE\.claude\skills"; ni -Force -ItemType Directory $d | Out-Null; iwr https://jpskill.com/download/16881.zip -OutFile "$d\visual-asset-workflow.zip"; Expand-Archive "$d\visual-asset-workflow.zip" -DestinationPath $d -Force; ri "$d\visual-asset-workflow.zip"

完了後、Claude Code を再起動 → 普通に「動画プロンプト作って」のように話しかけるだけで自動発動します。

💾 手動でダウンロードしたい(コマンドが難しい人向け)

1. 下の青いボタンを押して visual-asset-workflow.zip をダウンロード
2. ZIPファイルをダブルクリックで解凍 → visual-asset-workflow フォルダができる
3. そのフォルダを C:\Users\あなたの名前\.claude\skills\(Win)または ~/.claude/skills/(Mac)へ移動
4. Claude Code を再起動

⬇ .zip でダウンロード(推奨) ⬇ .skill 形式(上級者用) 元のソース ↗

⚠️ ダウンロード・利用は自己責任でお願いします。当サイトは内容・動作・安全性について責任を負いません。

🎯 このSkillでできること

下記の説明文を読むと、このSkillがあなたに何をしてくれるかが分かります。Claudeにこの分野の依頼をすると、自動で発動します。

📦 インストール方法 (3ステップ)

1. 上の「ダウンロード」ボタンを押して .skill ファイルを取得
2. ファイル名の拡張子を .skill から .zip に変えて展開(macは自動展開可)
3. 展開してできたフォルダを、ホームフォルダの .claude/skills/ に置く
- · macOS / Linux: ~/.claude/skills/
- · Windows: %USERPROFILE%\.claude\skills\

Claude Code を再起動すれば完了。「このSkillを使って…」と話しかけなくても、関連する依頼で自動的に呼び出されます。

詳しい使い方ガイドを見る →

最終更新: 2026-05-18
取得日時: 2026-05-18
同梱ファイル: 1

📖 Skill本文(日本語訳)

※ 原文(英語/中国語)を Gemini で日本語化したものです。Claude 自身は原文を読みます。誤訳がある場合は原文をご確認ください。

Visual Asset Workflow Skill

背景と問題点

教育的なビジュアル生成は、推論モードではなく予測モードを活性化する技術仕様（「44pt Roboto Bold、250pxのボックス」）を備えた、汎用的なインフォグラフィックに収束する傾向があります。これにより、特徴的で教育的に効果的なビジュアルではなく、平凡でPowerPointのデフォルトのような美学が生み出されます。

このスキルは、Gemini 3の推論能力を活性化するための、プロフェッショナルなクリエイティブ・ブリーフ手法を提供します。

中核原則

ストーリーは推論を活性化する - 物語的な意図は特徴的なビジュアルを生み出し、技術仕様は汎用的なビジュアルを生み出す
習熟度が複雑さを決定する - A2レベルの学生は5秒未満で理解する必要があり、C2レベルのプロフェッショナルは高密度な情報を扱える
前提知識がコンテンツを制限する - ビジュアルは、学生がまだ習得していない知識を前提とすることはできない
教育法が階層構造を駆動する - ビジュアルの重みは、恣意的な美学ではなく、重要性を教える

次元的なガイダンス

実行前の計画

避けるべきこと: 文脈なしにビジュアル分析に飛び込むこと 推奨すること: 戦略的計画フェーズ（Q0）

最初に読むべきもの:

apps/learn-app/docs/chapter-index.md → パート、習熟度（A2/B1/C2）、前提知識を抽出
apps/learn-app/docs/[part]/[chapter]/README.md → レッスンの構成を理解

作業前に矛盾を検出:

習熟度と複雑さのミスマッチ（A2の初心者向けの複雑なビジュアル）
前提知識の違反（学生がまだ学習していないPythonコード）
教育レイヤーの非整合性（レイヤー4のアプローチを使用するレイヤー1のコンテンツ）

戦略計画を出力し、承認を待ってから続行してください。

原則: 計画は無駄な作業を防ぐ（第9章の失敗：計画をスキップしたことによる5つの誤ったレッスン）

プロンプト構造：プロフェッショナルなクリエイティブ・ブリーフ

避けるべきこと: 技術仕様

❌ "Title: 44pt Roboto Bold at (50, 20)"
❌ "Box: 250px × 90px, #aaaaaa, 8px corners"
❌ "Shadow: 4px offset, 8px blur"

推奨すること: ストーリー + 意図 + メタファー

✅ The Story: [視覚化されるものの1〜2文の物語]
✅ Emotional Intent: [指数関数的な成長、驚くべき大きさ]のように感じるはず
✅ Visual Metaphor: [複利のような乗算カスケード]
✅ Key Insight: [学生が把握しなければならない1つのこと]
✅ Color Semantics: Blue (#2563eb) = Authority（ガバナンスの概念を教える）
✅ Typography Hierarchy: Largest = Key insight（恣意的なサイズ設定ではない）
✅ Pedagogical Reasoning: これらの選択が教えることに役立つ理由

原則: クリエイティブ・ブリーフは推論モードを活性化し、仕様は予測モードを活性化する

重要な理由: Gemini 3は意図を達成する方法について推論する → 汎用的なものではなく、特徴的なビジュアル

トークン節約戦略

いつ: 複数のビジュアル（>8）を使用するバッチモードまたは継続セッション

推論の活性化を維持しながら、凝縮を適用:

常に保持:

Story（1〜2文の物語）
Emotional Intent（どのように感じるべきか）
Visual Metaphor（普遍的な概念）
Key Insight（学生が把握しなければならない1つのこと）
16進数コードを使用した色の意味 (#2563eb)
Pedagogical reasoning（これらの選択の理由）

凝縮:

長い例 → 短いラベル
詳細な説明 → 箇条書き
繰り返されるパターン → コンパクトな表記

決して削除しない:

物語の要素
意図の記述
推論の説明

例:

FULL: "Top Layer shows the Coordinator at center top with label..."
CONDENSED: "Top Layer - Coordinator: Center top: 'Orchestrator'..."

目標: 60〜70％のトークン削減、100％の推論活性化を維持

原則: 推論のトリガーの排除ではなく、圧縮による効率化

習熟度と複雑さの整合

避けるべきこと: すべての人に合う単一の複雑さ

推奨すること: 習熟度によって制限された制約

A2 Beginner（交渉の余地のない制限）：

最大5〜7個の要素
5秒未満で理解
静的のみ（インタラクティブなし）
最大2×2グリッド
明確な階層（最大=最も重要）

B1 Intermediate:

最大7〜10個の要素
10秒未満で理解
インタラクティブTier 1 OK（タップして表示）
最大3×3グリッド

C2 Professional:

人工的な制限なし
高密度なインフォグラフィックOK
完全なインタラクティブアーキテクチャ
生産上の複雑さ

原則: A2の学生を圧倒する=学習の失敗、C2に対する人工的な単純さ=見下している

前提知識の検証ゲート

避けるべきこと: 学生が知識を持っていると仮定すること

推奨すること: 章の前提知識に対して検証すること

検出:

パート番号を確認：パート1-2 =プログラミングなし、パート3 = markdown/プロンプト、パート4+ = Python
chapter-index.mdから前提知識リストを確認

違反の例:

❌ 第9章のPythonコード（パート3 - 学生はまだ学習していない）
❌ パート2のGitコマンド（学生はCLIをまだ学習していない）

例外: メタレベルの教育はOK

✅ Pythonコードブロックを表示して「markdownコードブロック構文」を教える（Pythonではなくmarkdownを教える）

原則: ビジュアルは未知の知識を必要とすることはできない

憲法との整合性

避けるべきこと: 教育目的のない装飾的なビジュアル

推奨すること: すべてのビジュアルが特定の学習目標に役立つ

原則3（事実の正確さ）：

すべての統計、日付、技術仕様を確認
事実の主張についてGoogle検索の根拠付けを有効にする
データの出典を引用する

原則7（最小限のコンテンツ）：

「変化のためにビジュアルを追加しましょう」を拒否
すべての要素が何かを教える必要がある
教えない装飾を削除する

原則: ビジュアルの決定はプロジェクトの憲法と一致する

教育レイヤーの整合性

避けるべきこと: レイヤーのミスマッチ

推奨すること: ビジュアルアプローチは、章の教育レイヤーと一致する

L1（手動の基礎）：

ステップバイステップの図
具体的な例
明確なラベル付け（語彙の構築）

L2（AIコラボレーション）：

ビフォー/アフターの比較
イテレーションフロー
3つの役割フレームワークは非表示（役割ラベルなし）

L3（インテリジェンスデザイン）：

アーキテクチャ図
再利用可能なパターン病

(原文はここで切り詰められています)

📜 原文 SKILL.md(Claudeが読む英語/中国語)を展開

Visual Asset Workflow Skill

Context & Problem

Educational visual generation converges toward generic infographics with technical specifications ("44pt Roboto Bold, 250px box") that activate prediction mode instead of reasoning mode. This produces bland, PowerPoint-default aesthetics instead of distinctive, pedagogically effective visuals.

This skill provides professional creative brief methodology to activate Gemini 3's reasoning capabilities.

Core Principles

Story activates reasoning - Narrative intent produces distinctive visuals; technical specs produce generic ones
Proficiency dictates complexity - A2 students need <5 sec grasp; C2 professionals handle dense information
Prerequisites gate content - Visuals cannot assume knowledge students don't have yet
Pedagogy drives hierarchy - Visual weight teaches importance, not arbitrary aesthetics

Dimensional Guidance

Planning Before Execution

Avoid: Jumping into visual analysis without context Prefer: Strategic planning phase (Q0)

Read FIRST:

apps/learn-app/docs/chapter-index.md → Extract part, proficiency (A2/B1/C2), prerequisites
apps/learn-app/docs/[part]/[chapter]/README.md → Understand lesson structure

Detect conflicts BEFORE work:

Proficiency-complexity mismatch (complex visual for A2 beginners)
Prerequisite violations (Python code when students haven't learned it)
Pedagogical layer incoherence (Layer 1 content using Layer 4 approaches)

Output strategic plan, WAIT for approval before proceeding.

Principle: Plan prevents wasted work (Chapter 9 failure: 5 wrong lessons from skipping planning)

Prompt Structure: Professional Creative Briefs

Avoid: Technical specifications

❌ "Title: 44pt Roboto Bold at (50, 20)"
❌ "Box: 250px × 90px, #aaaaaa, 8px corners"
❌ "Shadow: 4px offset, 8px blur"

Prefer: Story + Intent + Metaphor

✅ The Story: [1-2 sentence narrative of what's visualized]
✅ Emotional Intent: Should feel [exponential growth, surprising magnitude]
✅ Visual Metaphor: [Multiplication cascade - like compound interest]
✅ Key Insight: [ONE thing students must grasp]
✅ Color Semantics: Blue (#2563eb) = Authority (teaches governance concept)
✅ Typography Hierarchy: Largest = Key insight (not arbitrary sizing)
✅ Pedagogical Reasoning: Why these choices serve teaching

Principle: Creative briefs activate reasoning mode; specifications activate prediction mode

Why it matters: Gemini 3 reasons about HOW to achieve intent → Distinctive visuals instead of generic

Token Conservation Strategy

When: Batch mode with >8 visuals OR continuation session

Apply condensation while preserving reasoning activation:

ALWAYS KEEP:

Story (1-2 sentence narrative)
Emotional Intent (what it should FEEL like)
Visual Metaphor (universal concept)
Key Insight (ONE thing students must grasp)
Color semantics with hex codes (#2563eb)
Pedagogical reasoning (why these choices)

CONDENSE:

Long examples → Short labels
Verbose descriptions → Bullet points
Repeated patterns → Compact notation

NEVER REMOVE:

Narrative elements
Intent statements
Reasoning explanations

Example:

FULL: "Top Layer shows the Coordinator at center top with label..."
CONDENSED: "Top Layer - Coordinator: Center top: 'Orchestrator'..."

Target: 60-70% token reduction, 100% reasoning activation preserved

Principle: Efficiency through compression, not through elimination of reasoning triggers

Proficiency-Complexity Alignment

Avoid: One-size-fits-all complexity

Prefer: Proficiency-gated constraints

A2 Beginner (Non-negotiable limits):

Max 5-7 elements
<5 second grasp
Static only (no interactive)
Max 2×2 grids
Clear hierarchy (largest = most important)

B1 Intermediate:

Max 7-10 elements
<10 second grasp
Interactive Tier 1 OK (tap-to-reveal)
Max 3×3 grids

C2 Professional:

No artificial limits
Dense infographics OK
Full interactive architecture
Production complexity

Principle: Overwhelming A2 students = learning failure; artificial simplicity for C2 = patronizing

Prerequisite Validation Gate

Avoid: Assuming knowledge students don't have

Prefer: Validate against chapter prerequisites

Detection:

Check Part number: Part 1-2 = no programming, Part 3 = markdown/prompts, Part 4+ = Python
Check prerequisite list from chapter-index.md

Example Violations:

❌ Python code in Chapter 9 (Part 3 - students haven't learned it)
❌ Git commands in Part 2 (students haven't learned CLI)

Exception: Meta-level teaching OK

✅ Teaching "markdown code block syntax" by showing Python code block (teaches markdown, not Python)

Principle: Visual cannot require unknown knowledge

Constitutional Alignment

Avoid: Decorative visuals without pedagogical purpose

Prefer: Every visual serves specific learning objective

Principle 3 (Factual Accuracy):

Verify all statistics, dates, technical specs
Enable Google Search grounding for factual claims
Cite sources for data

Principle 7 (Minimal Content):

Reject "let's add a visual for variety"
Every element must teach something
Remove non-teaching decoration

Principle: Visual decisions align with project constitution

Pedagogical Layer Coherence

Avoid: Layer mismatch

Prefer: Visual approach matches chapter's pedagogical layer

L1 (Manual Foundation):

Step-by-step diagrams
Concrete examples
Clear labeling (building vocabulary)

L2 (AI Collaboration):

Before/after comparisons
Iteration flows
Three Roles Framework INVISIBLE (no role labels)

L3 (Intelligence Design):

Architecture diagrams
Reusable pattern illustrations

L4 (Spec-Driven):

Specification → implementation flow
Component composition diagrams

Principle: Visual design reinforces pedagogical approach

Duplicate Prevention Protocol

Avoid: Generating different prompts that produce the same visual

Prevent BEFORE generation:

Review existing visuals in chapter:
- List all *.png files in target chapter directory
- Read corresponding *.prompt.md files
- Identify visual patterns already used
Validate prompt distinctiveness:
- Does this prompt's intent differ clearly from existing prompts?
- Example conflicts to detect:
  - ❌ Timeline + Graph → Both might render as timeline
  - ❌ Architecture + Workflow → Both might render as hierarchy
  - ❌ Same metaphor, different names → Same visual result
Differentiation strategy:
- Make visual type explicit in story ("GRAPH showing exponential growth" not just "showing growth")
- Use distinct metaphors (cascade vs tree vs timeline vs curve)
- Specify unique structural elements (2D axes vs linear flow vs hierarchical pyramid)

Detect AFTER generation (in image-generator):

Visual comparison with existing chapter images
Prompt alignment check (does output match brief intent?)

Principle: Prevention cheaper than rework

Anti-Patterns

Never:

Generate visuals without reading chapter-index.md first (skipping context)
Use pixel specifications, font sizes, coordinates in prompts (kills reasoning)
Assume knowledge not in prerequisites (prerequisite violation)
Create decorative visuals without learning objective (Principle 7 violation)
Apply same complexity to A2 and C2 students (proficiency mismatch)
Create prompts without checking for duplicate visual patterns (causes rework)

Even if it seems reasonable:

Don't use Python examples in Part 3 (students don't know Python yet)
Don't create complex multi-step visuals for A2 (cognitive overload)
Don't specify "44pt Roboto Bold" (removes Gemini's judgment)
Don't skip distinctiveness validation "because they have different filenames" (names differ, visuals might not)

Creative Variance

You tend to default to comparison diagrams even with story-driven prompts. Vary visual types:

Timeline progressions (evolution over time)
Multiplication cascades (compound growth visualization)
Hierarchical authority flows (governance models)
Transformation sequences (before → after → impact)
Conceptual metaphors (abstract → concrete mapping)

Match visual type to story, not habit.

Post-Generation Reflection

After batch completion, analyze systematically (Q8):

Success patterns:

Quality gate performance (which caught most issues?)
Average iterations (efficiency indicator)
Time vs estimate (planning accuracy)

Failure analysis:

Deferred visuals root causes (layout? spelling? concept mismatch?)
Guardrail gaps (what principle would have prevented this?)
Planning effectiveness (conflicts caught early vs missed?)

Continuous improvement:

Pattern-based updates (not one-off fixes)
New guardrails from learnings
Prompt template refinements

Document in: history/visual-assets/reflections/chapter-{NN}-reflection.md

Principle: Systematic reflection → Improved future performance

Success Indicators

You'll know this skill is working when:

✅ Zero pixel specifications in prompts (creative briefs only)
✅ Strategic plan created before visual analysis (Q0 complete)
✅ Proficiency conflicts detected early (A2 limits enforced)
✅ Prerequisite violations prevented (no unknown concepts)
✅ Story/Intent/Metaphor in every prompt (reasoning activated)
✅ Token conservation applied in batch mode (60-70% reduction)
✅ Duplicate prevention validation passed (zero duplicate visuals)
✅ Visuals feel distinctive and compelling (not generic PowerPoint)
✅ Reflection document created after batch (systematic learning)

Result: Professional-quality visuals that teach effectively, generated efficiently through planning, with zero duplicates requiring rework.