
nano-banana-2

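A Skill for generating images with Google's Nano Banana 2 model on RunComfy. It bundles prompting techniques for higher-quality output and covers in-image text rendering, stable composition, optional web-grounded context, resolution-based pricing, and the adjustable safety setting.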

📜 Original English description (reference)

Generate images with Google Nano Banana 2 (Gemini-family flash-tier text-to-image) on RunComfy — bundled with the model's documented prompting patterns so the skill gets sharper output than naive prompting against the same model. Documents Nano Banana 2's strengths (rapid iteration, in-image typography rendering, predictable framing, optional web-grounded context), the resolution-tier pricing, the safety-tolerance dial, and when to route to Nano Banana Pro / GPT Image 2 / Flux 2 / Seedream instead. Calls `runcomfy run google/nano-banana-2/text-to-image` through the local RunComfy CLI. Triggers on "nano banana", "nano-banana-2", "nano banana 2", "google image gen", "gemini image", or any explicit ask to generate with this model.

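🇯🇵 Notes for Japanese creators

In a nutshell

Note: this commentary was added by the jpskill.com editorial team for Japanese business settings. It is reference information, independent of how the Skill itself behaves.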

⚡ Recommended: install with a single command (60 seconds)

Copy the command below and paste it into a terminal (Mac/Linux) or PowerShell (Windows). Download, extraction, and placement are fully automatic.

🍎 Mac / 🐧 Linux

mkdir -p ~/.claude/skills && cd ~/.claude/skills && curl -L -o nano-banana-2.zip https://jpskill.com/download/10372.zip && unzip -o nano-banana-2.zip && rm nano-banana-2.zip

🪟 Windows (PowerShell)

$d = "$env:USERPROFILE\.claude\skills"; ni -Force -ItemType Directory $d | Out-Null; iwr https://jpskill.com/download/10372.zip -OutFile "$d\nano-banana-2.zip"; Expand-Archive "$d\nano-banana-2.zip" -DestinationPath $d -Force; ri "$d\nano-banana-2.zip"

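When it finishes, restart Claude Code. The Skill then activates automatically when you ask in plain language, for example "generate an image with Nano Banana 2".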

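💾 Manual download (if the commands are difficult)

1. Use the .zip download button on this page to download nano-banana-2.zip
2. Double-click the ZIP file to extract it; this creates a nano-banana-2 folder
3. Move that folder to C:\Users\<your name>\.claude\skills\ (Windows) or ~/.claude/skills/ (Mac)
4. Restart Claude Code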


⚠️ Download and use at your own risk. This site accepts no responsibility for the content, behavior, or safety of the Skill.

🎯 What this Skill can do

The description on this page explains what the Skill does for you. It activates automatically when you ask Claude for something in this area.

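📦 How to install (3 steps)

1. Use the "Download" button on this page to get the .skill file
2. Change the file extension from .skill to .zip and extract it (macOS can extract it automatically)
3. Put the extracted folder in .claude/skills/ under your home folder
- macOS / Linux: ~/.claude/skills/
- Windows: %USERPROFILE%\.claude\skills\

Restart Claude Code to finish. You do not need to say "use this Skill..."; it is invoked automatically for related requests.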


Last updated: 2026-05-18
Retrieved: 2026-05-18
Bundled files: 1


Nano Banana 2 — Pro Pack on RunComfy

runcomfy.com · Model page · GitHub

Google Nano Banana 2 — the flash-tier text-to-image model in the Gemini family — hosted on the RunComfy Model API. Optimized for ideation, social-thumbnail batches, and rapid drafts with strong in-image typography.

npx skills add agentspace-so/runcomfy-skills --skill nano-banana-2 -g

When to pick this model (vs siblings)

Nano Banana 2 is the flash-tier of the Google image-gen line. Pick it when iteration speed and predictable framing matter more than maximum detail.

You want	Use
Rapid drafts, social thumbnails, batch variants	Nano Banana 2
In-image typography with predictable rendering	Nano Banana 2
Web-grounded image (current events / real entities)	Nano Banana 2 + `enable_web_search`
Image edit (preserve subject, swap background)	Nano Banana Edit (sibling skill)
Heavy stylization, painterly look	Flux 2
Maximum prompt adherence + multilingual text	GPT Image 2
2K–4K hero shots, max realism	Seedream 5
Hyperrealistic portrait	Nano Banana Pro

If the user said "Nano Banana 2" / "nano-banana-2" / "Gemini image" explicitly, route here regardless. If they said "Nano Banana" without specifying 2 vs Pro, default to Pro for portraits and 2 for everything else.

Prerequisites

RunComfy CLI — npm i -g @runcomfy/cli
RunComfy account — runcomfy login opens a browser device-code flow.
CI / containers — set RUNCOMFY_TOKEN=<token> instead of runcomfy login.

Endpoints + input schema

`google/nano-banana-2/text-to-image`

Field	Type	Required	Default	Notes
`prompt`	string	yes	—	Subject-first description.
`num_images`	int	no	1	1–4. Use 4 for ideation rounds.
`seed`	int	no	0	Reuse for reproducibility.
`aspect_ratio`	enum	no	`auto`	`auto`, `21:9`, `16:9`, `3:2`, `4:3`, `5:4`, `1:1`, `4:5`, `3:4`, `2:3`, `9:16`.
`resolution`	enum	no	`1K`	`0.5K` (drafts), `1K` (default), `2K` (final), `4K` (max).
`output_format`	enum	no	`png`	`png`, `jpeg`, `webp`.
`safety_tolerance`	int	no	4	1 (strict) – 6 (permissive).
`limit_generations`	bool	no	true	Limit each prompt round to one generation.
`enable_web_search`	bool	no	false	Adds web grounding (extra cost + latency).
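The table above can be turned into a small pre-flight check so that out-of-range values are caught locally instead of coming back as a 422. This is a minimal sketch, not part of the RunComfy CLI: the function name `validate_input` and the message wording are hypothetical, and only the allowed values and defaults are taken from the table.

```python
# Allowed values copied from the input schema table.
ASPECT_RATIOS = ("auto", "21:9", "16:9", "3:2", "4:3", "5:4",
                 "1:1", "4:5", "3:4", "2:3", "9:16")
RESOLUTIONS = ("0.5K", "1K", "2K", "4K")
OUTPUT_FORMATS = ("png", "jpeg", "webp")


def _is_int(value) -> bool:
    # bool is a subclass of int in Python, so exclude it explicitly.
    return isinstance(value, int) and not isinstance(value, bool)


def validate_input(payload: dict) -> list:
    """Return a list of problems; an empty list means the payload looks valid.

    Hypothetical helper: it mirrors the documented schema only and does not
    call the API.
    """
    problems = []
    prompt = payload.get("prompt")
    if not isinstance(prompt, str) or not prompt.strip():
        problems.append("prompt: required, non-empty string")
    num_images = payload.get("num_images", 1)
    if not _is_int(num_images) or not 1 <= num_images <= 4:
        problems.append("num_images: integer from 1 to 4")
    if payload.get("aspect_ratio", "auto") not in ASPECT_RATIOS:
        problems.append("aspect_ratio: not one of the 11 supported values")
    if payload.get("resolution", "1K") not in RESOLUTIONS:
        problems.append("resolution: not one of the 4 tiers")
    if payload.get("output_format", "png") not in OUTPUT_FORMATS:
        problems.append("output_format: png, jpeg, or webp")
    tolerance = payload.get("safety_tolerance", 4)
    if not _is_int(tolerance) or not 1 <= tolerance <= 6:
        problems.append("safety_tolerance: integer from 1 to 6")
    return problems
```

Calling `validate_input(payload)` before invoking the CLI and refusing to run when the list is non-empty keeps schema mistakes out of the request.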

For image edit (preserve subject + apply changes), see the sibling nano-banana-edit skill.

How to invoke

Default draft (1K, auto aspect ratio, png):

runcomfy run google/nano-banana-2/text-to-image \
  --input '{"prompt": "<user prompt>"}' \
  --output-dir <absolute/path>

Vertical 4-up batch for ideation:

runcomfy run google/nano-banana-2/text-to-image \
  --input '{
    "prompt": "<user prompt>",
    "num_images": 4,
    "aspect_ratio": "9:16",
    "resolution": "0.5K"
  }' \
  --output-dir <absolute/path>

Final at 2K with seed lock:

runcomfy run google/nano-banana-2/text-to-image \
  --input '{
    "prompt": "<user prompt>",
    "resolution": "2K",
    "aspect_ratio": "16:9",
    "seed": 42
  }' \
  --output-dir <absolute/path>

Web-grounded (current event / real entity):

runcomfy run google/nano-banana-2/text-to-image \
  --input '{
    "prompt": "<prompt referencing a real-world event from this week>",
    "enable_web_search": true
  }' \
  --output-dir <absolute/path>
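When the CLI is driven from a script rather than typed by hand, the same invocations can be assembled as an argument list, which avoids shell-quoting problems with the JSON body. The sketch below is an assumption-laden illustration: `build_command` is a hypothetical helper, and it uses only the `run` subcommand and the `--input` / `--output-dir` flags shown above.

```python
import json
import os

ENDPOINT = "google/nano-banana-2/text-to-image"


def build_command(payload: dict, output_dir: str) -> list:
    """Build the argv list for one `runcomfy run` call (hypothetical helper).

    The examples above use an absolute output path, so a relative path is
    rejected here; that strictness is this sketch's choice.
    """
    if not os.path.isabs(output_dir):
        raise ValueError("output_dir must be an absolute path")
    return [
        "runcomfy", "run", ENDPOINT,
        # ensure_ascii=False keeps non-ASCII prompt text readable in the JSON.
        "--input", json.dumps(payload, ensure_ascii=False),
        "--output-dir", output_dir,
    ]


# Example argv for the vertical 4-up ideation batch.
batch_cmd = build_command(
    {"prompt": "<user prompt>", "num_images": 4,
     "aspect_ratio": "9:16", "resolution": "0.5K"},
    "/tmp/nano-banana-out",
)
# To execute it: subprocess.run(batch_cmd).returncode
```

Passing the list to `subprocess.run` without `shell=True` means the prompt never goes through shell expansion on the caller's side either.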

Prompting — what actually works

Subject-first declarative grammar. "A cinematic close-up portrait of an American woman standing under neon lights in rainy Tokyo, shallow depth of field, reflective wet streets, ultra-detailed, realistic skin texture" — primary subject, then action, environment, style, camera. Front-load subject; trail with directives.

Exact text quoting for in-image typography. "The label reads 'AURA' in clean bold sans-serif, centered, white on black" — quote the literal characters. Specify placement and font style. Don't say "with the brand name on it" and hope.

Consistent seeds for refinement. Lock seed when iterating a single prompt across small variants — keeps composition stable.

Web-grounding, sparingly. Turn on enable_web_search only when the prompt names current events / real entities. Adds latency + cost; off by default.

Don't conflict styles. "minimalist + ornate + retro + cyberpunk" cancels. Pick 1–2 anchors.
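The prompting rules above (subject first, quoted literal text, at most two style anchors) can be sketched as a small prompt builder. This is illustrative only: `build_prompt` and its parameters are hypothetical names, and the exact phrasing it emits is one reasonable reading of the guidance, not a format the model requires.

```python
def build_prompt(subject, action="", environment="", style=(), camera=(),
                 text=None, text_style=""):
    """Compose a subject-first prompt: subject, action, environment, style, camera.

    Hypothetical helper. `text` is the literal in-image text and is always
    wrapped in double quotes; `text_style` carries placement and font notes.
    """
    if len(style) > 2:
        raise ValueError("pick 1-2 style anchors; conflicting styles cancel out")
    # Subject, action, and environment form one leading clause.
    lead = " ".join(part for part in (subject, action, environment) if part)
    parts = [lead, *style, *camera]
    if text is not None:
        quoted = f'the text "{text}"'
        if text_style:
            quoted += f" in {text_style}"
        parts.append(quoted)
    return ", ".join(parts)


portrait = build_prompt(
    "A cinematic close-up portrait of an American woman",
    action="standing under neon lights",
    environment="in rainy Tokyo",
    style=["ultra-detailed"],
    camera=["shallow depth of field"],
)
```

Here `portrait` starts with the subject clause and trails with the style and camera directives, matching the ordering recommended above.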

Anti-patterns:

Trying to verbally describe a stable subject identity — use the edit endpoint with image refs instead.
Asking for resolutions outside the 4 tiers → 422.
Aspect ratios outside the 11 supported values → 422.
Non-quoted in-image text → unpredictable rendering.

Where it shines

Use case	Why Nano Banana 2
Marketing draft thumbnails (batch of 4)	Fast iteration at 0.5K, then promote winner to 2K
Social-platform-native	Wide aspect ratio support including 9:16, 4:5, 21:9
In-image typography for posters / cards	Predictable text rendering when characters are quoted
Web-grounded current-event imagery	`enable_web_search` integrates fresh info
Reproducible variant testing	Reusing a seed keeps framing consistent across variants
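The "draft at 0.5K, then promote the winner to 2K" workflow from the table can be sketched as two payloads. The helper names are hypothetical. The source does not say how a winning draft's seed is obtained, nor that the same seed reproduces a composition at a higher resolution, so the seed is simply passed in by the caller.

```python
def draft_payload(prompt: str, aspect_ratio: str = "9:16") -> dict:
    """Payload for a cheap ideation batch: 4 images at the 0.5K tier."""
    return {
        "prompt": prompt,
        "num_images": 4,
        "aspect_ratio": aspect_ratio,
        "resolution": "0.5K",
    }


def final_payload(draft: dict, seed: int) -> dict:
    """Payload for the promoted final: same prompt and framing, one 2K image.

    Assumption: the caller supplies the seed to lock; this sketch does not
    know how (or whether) a draft's seed can be recovered.
    """
    final = dict(draft)  # copy so the draft payload stays untouched
    final.update({"num_images": 1, "resolution": "2K", "seed": seed})
    return final


draft = draft_payload("<user prompt>")
final = final_payload(draft, seed=42)
```

Keeping the prompt and aspect ratio identical between the two payloads means only the resolution, image count, and seed change between the draft and the final.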

Sample prompts (verified to produce strong results)

Cinematic portrait (page example):

A cinematic close-up portrait of an American woman standing under neon
lights in rainy Tokyo, shallow depth of field, reflective wet streets,
ultra-detailed, realistic skin texture

Brand-asset card with quoted text:

A minimalist 16:9 product card: a matte black ceramic mug centered on a
soft warm-grey paper background, rim highlight from upper-left, the
headline "Brewed Quietly" in clean bold sans-serif top-right, balanced
negative space below, e-commerce ready, clean studio lighting

Vertical platform-native:

A 9:16 vertical hero for a wellness brand: a single ceramic teacup on a
linen runner, soft morning side-light, the words "Slow Down" in
hand-drawn serif large at the top, gentle steam rising, neutral color
palette, uncluttered

Limitations

Still images only. No video on this endpoint.
Max 4 outputs per request.
Web search adds latency + cost — only enable on demand.
2K / 4K cost more — default to 1K unless user asked for higher.
For image edit, use the /edit endpoint — not this one.

Exit codes

code	meaning
0	success
64	bad CLI args
65	bad input JSON / schema mismatch
69	upstream 5xx
75	retryable: timeout / 429
77	not signed in or token rejected

Full reference: docs.runcomfy.com/cli/troubleshooting.
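A caller can use the exit codes above to decide whether to retry. The sketch below retries only on 75, the one code the table marks as retryable. The function `run_with_retry`, its backoff schedule, and the injectable `run` and `sleep` callables are assumptions of this example, not CLI behavior.

```python
import time
from typing import Callable

# Meanings copied from the exit-code table.
EXIT_CODES = {
    0: "success",
    64: "bad CLI args",
    65: "bad input JSON / schema mismatch",
    69: "upstream 5xx",
    75: "retryable: timeout / 429",
    77: "not signed in or token rejected",
}
RETRYABLE = 75


def run_with_retry(run: Callable[[], int], max_attempts: int = 3,
                   base_delay: float = 1.0,
                   sleep: Callable[[float], None] = time.sleep) -> int:
    """Call `run` until it returns a non-retryable exit code or attempts run out.

    `run` is any callable returning the CLI's exit code, for example a
    wrapper around subprocess.run(...).returncode. Delays double each time.
    """
    code = run()
    for attempt in range(1, max_attempts):
        if code != RETRYABLE:
            break
        sleep(base_delay * 2 ** (attempt - 1))
        code = run()
    return code
```

Codes 64, 65, and 77 point at the caller's arguments, input, or login state, so repeating the same call is unlikely to help. Whether 69 is worth retrying later is not stated in the table, so this sketch leaves that decision to the caller.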

How it works

The skill invokes runcomfy run google/nano-banana-2/text-to-image with a JSON body matching the schema. The CLI POSTs to https://model-api.runcomfy.net/v1/models/google/nano-banana-2/text-to-image, polls the request, fetches the result, and downloads any output URL under *.runcomfy.net or *.runcomfy.com into --output-dir. Ctrl-C cancels the remote request before exit.

Security & Privacy

Token storage: runcomfy login writes the API token to ~/.config/runcomfy/token.json with mode 0600 (owner-only read/write). Set RUNCOMFY_TOKEN env var to bypass the file entirely in CI / containers.
Input boundary: the user prompt is passed as a JSON string to the CLI via --input. The CLI does NOT shell-expand the prompt; it transmits the JSON body directly to the Model API over HTTPS. No shell injection surface from prompt content.
Third-party content: image / mask / video URLs you pass are fetched by the RunComfy model server, not by the CLI on your machine. Treat external URLs as untrusted; image-based prompt injection is a known risk for any image-edit / video-edit model.
Outbound endpoints: only model-api.runcomfy.net (request submission) and *.runcomfy.net / *.runcomfy.com (download whitelist for generated outputs). No telemetry, no callbacks.
Generated-file size cap: the CLI aborts any single download > 2 GiB to prevent disk-fill from a malicious or runaway model output.
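The download whitelist and size cap described above can be illustrated with a short check. This is a sketch of the idea, not the CLI's actual implementation: the function names are hypothetical, and requiring the `https` scheme is an added assumption of this example.

```python
from urllib.parse import urlparse

# Host suffixes from the documented whitelist (*.runcomfy.net / *.runcomfy.com).
ALLOWED_SUFFIXES = (".runcomfy.net", ".runcomfy.com")
MAX_DOWNLOAD_BYTES = 2 * 1024 ** 3  # 2 GiB


def is_allowed_download_url(url: str) -> bool:
    """True if the URL is https and its host is a subdomain on the whitelist."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    # The leading dot in each suffix stops look-alikes such as evilruncomfy.net.
    return parsed.scheme == "https" and host.endswith(ALLOWED_SUFFIXES)


def within_size_cap(num_bytes: int) -> bool:
    """True if a download of this size stays at or under the 2 GiB cap."""
    return num_bytes <= MAX_DOWNLOAD_BYTES
```

Matching on the parsed hostname rather than on the raw URL string matters here, because a substring test would also accept a URL such as `https://runcomfy.net.example.org/`.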