🎨 画像AI コミュニティ

nano-banana-edit

RunComfy上のGoogle Nano Banana 2を使って、画像の背景変更や部分的な修正、複数画像の同時編集など、高度な画像編集を、対象物の特徴を維持しつつ、空間的な指示に基づいて実行するSkill。

📜 元の英語説明(参考)

Edit images with Google Nano Banana 2 (image-to-image edit endpoint) on RunComfy. Documents Nano Banana Edit's strengths (preserve subject identity, swap background, localize edits with spatial language, multi-image batch edits up to 20 inputs), the schema, and when to route to GPT Image 2 edit / Flux Kontext / Nano Banana 2 t2i instead. Calls `runcomfy run google/nano-banana-2/edit` through the local RunComfy CLI. Triggers on "nano banana edit", "edit with nano banana", "image edit nano banana", or any explicit ask to edit with this model.

🇯🇵 日本人クリエイター向け解説

一言でいうと

※ jpskill.com 編集部が日本のビジネス現場向けに補足した解説です。Skill本体の挙動とは独立した参考情報です。

⚡ おすすめ: コマンド1行でインストール(60秒)

下記のコマンドをコピーしてターミナル(Mac/Linux)または PowerShell(Windows)に貼り付けてください。ダウンロード → 解凍 → 配置まで全自動。

🍎 Mac / 🐧 Linux

mkdir -p ~/.claude/skills && cd ~/.claude/skills && curl -L -o nano-banana-edit.zip https://jpskill.com/download/10373.zip && unzip -o nano-banana-edit.zip && rm nano-banana-edit.zip

🪟 Windows (PowerShell)

$d = "$env:USERPROFILE\.claude\skills"; ni -Force -ItemType Directory $d | Out-Null; iwr https://jpskill.com/download/10373.zip -OutFile "$d\nano-banana-edit.zip"; Expand-Archive "$d\nano-banana-edit.zip" -DestinationPath $d -Force; ri "$d\nano-banana-edit.zip"

完了後、Claude Code を再起動 → 普通に「動画プロンプト作って」のように話しかけるだけで自動発動します。

💾 手動でダウンロードしたい(コマンドが難しい人向け)

1. 下の青いボタンを押して nano-banana-edit.zip をダウンロード
2. ZIPファイルをダブルクリックで解凍 → nano-banana-edit フォルダができる
3. そのフォルダを C:\Users\あなたの名前\.claude\skills\(Win)または ~/.claude/skills/(Mac)へ移動
4. Claude Code を再起動

⬇ .zip でダウンロード(推奨) ⬇ .skill 形式(上級者用) 元のソース ↗

⚠️ ダウンロード・利用は自己責任でお願いします。当サイトは内容・動作・安全性について責任を負いません。

🎯 このSkillでできること

下記の説明文を読むと、このSkillがあなたに何をしてくれるかが分かります。Claudeにこの分野の依頼をすると、自動で発動します。

📦 インストール方法 (3ステップ)

1. 上の「ダウンロード」ボタンを押して .skill ファイルを取得
2. ファイル名の拡張子を .skill から .zip に変えて展開(macは自動展開可)
3. 展開してできたフォルダを、ホームフォルダの .claude/skills/ に置く
- · macOS / Linux: ~/.claude/skills/
- · Windows: %USERPROFILE%\.claude\skills\

Claude Code を再起動すれば完了。「このSkillを使って…」と話しかけなくても、関連する依頼で自動的に呼び出されます。

詳しい使い方ガイドを見る →

最終更新: 2026-05-18
取得日時: 2026-05-18
同梱ファイル: 1

📖 Skill本文(日本語訳)

※ 原文(英語/中国語)を Gemini で日本語化したものです。Claude 自身は原文を読みます。誤訳がある場合は原文をご確認ください。

Nano Banana Edit — RunComfy の Pro Pack

runcomfy.com · Edit endpoint · GitHub

Google Nano Banana 2 Edit — Geminiファミリーのフラッシュ層画像モデルの画像から画像への編集エンドポイント — RunComfy Model API でホストされています。バッチ編集とマルチリファレンスバリエーションのために、1回の呼び出しあたり最大20個の入力画像が可能です。

npx skills add agentspace-so/runcomfy-skills --skill nano-banana-edit -g

このモデルを選ぶべき時 (兄弟モデルとの比較)

必要なこと	使用するモデル
被写体のアイデンティティを保持し、背景や服装を入れ替えたい	Nano Banana Edit
1つのバッチで最大20枚の画像を整合性をもって編集したい	Nano Banana Edit
空間的な言語で「Xのみ」に編集をローカライズしたい	Nano Banana Edit
画像内の多言語テキスト（標識、ラベル）を編集したい	GPT Image 2 edit
単一の参照 + 正確なローカル編集（「彼女は今Xを持っている」）	Flux Kontext
スクラッチから新しい画像を生成したい	Nano Banana 2 t2i (兄弟スキル)

ユーザーが「nano banana edit」/「edit with nano banana」と明示的に言った場合は、必ずここにルーティングしてください。

前提条件

RunComfy CLI — npm i -g @runcomfy/cli
RunComfy アカウント — runcomfy login はブラウザのデバイスコードフローを開きます。
CI / コンテナ — runcomfy login の代わりに RUNCOMFY_TOKEN=<token> を設定します。

エンドポイント + 入力スキーマ

`google/nano-banana-2/edit`

フィールド	タイプ	必須	デフォルト	注
`prompt`	string	yes	—	編集指示。保持を最初に、変更を最後に記述します。
`image_urls`	array	yes	—	1–20 個の公開されている HTTPS URL。
`number_of_images`	int	no	1	1回の呼び出しあたり 1–4 個の出力。
`seed`	int	no	—	再現性。
`aspect_ratio`	enum	no	`auto`	`auto` (入力を追従) または固定比率 — バッチの一貫性のためにロックします。
`resolution`	enum	no	`1K`	`0.5K` / `1K` / `2K` / `4K`。
`output_format`	enum	no	`png`	`png` / `jpeg` / `webp`。
`safety_tolerance`	int	no	4	1 (厳格) – 6 (寛容)。
`limit_generations`	bool	no	—	trueの場合、各ラウンドを1つの出力に制限します。
`enable_web_search`	bool	no	false	Webグラウンディング（追加コスト/レイテンシ）。

呼び出し方法

単一画像の背景入れ替え、アイデンティティ保持:

runcomfy run google/nano-banana-2/edit \
  --input '{
    "prompt": "被写体のアイデンティティ、ポーズ、服装は変更せずに維持してください。背景を雨のネオンサイバーパンクの街に変えてください。",
    "image_urls": ["https://.../portrait.jpg"]
  }' \
  --output-dir <absolute/path>

ロックされたフレーミングによるバッチ編集:

runcomfy run google/nano-banana-2/edit \
  --input '{
    "prompt": "右下のウォーターマークを、クリーンな白いサンセリフ体のテキスト「AURA」に置き換えてください。その他はすべて入力とまったく同じにしてください。",
    "image_urls": ["https://.../sku-1.jpg", "https://.../sku-2.jpg", "https://.../sku-3.jpg"],
    "aspect_ratio": "1:1",
    "resolution": "1K"
  }' \
  --output-dir <absolute/path>

ターゲットを絞った空間編集（「左側のオブジェクトのみ」）:

runcomfy run google/nano-banana-2/edit \
  --input '{
    "prompt": "左端のオブジェクトのみを削除してください。右側の2つのオブジェクト、テーブル、照明は変更せずに維持してください。",
    "image_urls": ["https://.../still-life.jpg"]
  }' \
  --output-dir <absolute/path>

プロンプト — 実際に効果があること

保持を最初に、変更を最後に記述します。 常に "[アイデンティティ / ポーズ / 服装 / ブランド / フレーミング] を変更せずに維持してください。" で始めてください。次に、1つの明確な文で変更を記述します。モデルは最初に述べられたことを尊重します。後続の保持目標は無視されます。

空間的な言語でローカライズします。 「背景のみ」、「左側のオブジェクト」、「右上隅」、「見出しの上」— 具体的な空間範囲は尊重されます。「もっとXにする」は曖昧で、ずれていきます。

バッチの一貫性 — シリーズを編集する場合は、aspect_ratio と resolution をロックします。各出力がリミックスではなく兄弟として認識されるように、バッチ全体で同じプロンプト文法を使用します。

小さく反復します。 1回のパス編集でずれる場合は、2つに分割します。パス1は背景のみを変更し、パス2は被写体の服装を入れ替えます。よりクリーンな編集、同じ総コスト（同様の解像度を想定）。

マルチ画像バリエーション — 最大20個の入力を渡して、一貫性のあるバッチを取得します。SKUギャラリー、A/Bテスト、キャラクターシートのバリエーションに役立ちます。

アンチパターン:

長い複合指示（「AとBとCとDを変更する」）— スコープを追加するごとにずれが増加します。
受け身で書かれた編集指示（「背景を変更する必要がある」）— 命令形にしてください。
保持目標の欠落 — モデルは顔/ブランドを微妙に書き換えます。
入力と一致しないアスペクト比 — クロップまたはストレッチが発生します。

強みを発揮する場所

ユースケース	Nano Banana Edit を選ぶ理由
SKUギャラリー — さまざまな背景に同じ製品	20個のバッチ、アイデンティティ保持、フレーミングロック
インフルエンサー/スポークスパーソンの背景入れ替え	編集全体で強力なアイデンティティ保持
ローカライズされたオブジェクトの削除/追加	空間言語が尊重される
広告クリエイティブのA/Bバリアント	シードロック + 複数の `number_of_images`
ブランドアセットの再ローカライズ	テキスト/パレットの入れ替えによる同じ構成

サンプルプロンプト (強力な結果を生み出すことが確認されています)

背景入れ替え (ページ例):

被写体のアイデンティティは変更せずに維持してください。背景を雨の
ネオンサイバーパンクの街に変えてください。

ターゲットを絞ったテキストの置換:

ボトル、ラベル、照明は入力とまったく同じに維持してください。
ラベルのブランドテキストのみを「ALPHA」から「AURA」に置き換えてください。
同じフォントウェイト、中央揃え、白地に黒。

マルチ画像バッチの一貫性:


各入力画像について: 被写体のポーズとアイデンティティは変更せずに維持してください。
背景を柔らかく暖かいグレーのスタジオスイープに変えてください。w

(原文はここで切り詰められています)

📜 原文 SKILL.md(Claudeが読む英語/中国語)を展開

Nano Banana Edit — Pro Pack on RunComfy

runcomfy.com · Edit endpoint · GitHub

Google Nano Banana 2 Edit — the image-to-image edit endpoint of the Gemini-family flash-tier image model — hosted on the RunComfy Model API. Up to 20 input images per call for batch edits and multi-reference variation.

npx skills add agentspace-so/runcomfy-skills --skill nano-banana-edit -g

When to pick this model (vs siblings)

You want	Use
Preserve subject identity, swap background or clothing	Nano Banana Edit
Edit up to 20 images consistently in one batch	Nano Banana Edit
Localize edit to "X only" with spatial language	Nano Banana Edit
Edit multilingual text inside the image (signs, labels)	GPT Image 2 edit
Single ref + precise local edit ("she's now holding X")	Flux Kontext
Generate a new image from scratch	Nano Banana 2 t2i (sibling skill)

If the user said "nano banana edit" / "edit with nano banana" explicitly, route here regardless.

Prerequisites

RunComfy CLI — npm i -g @runcomfy/cli
RunComfy account — runcomfy login opens a browser device-code flow.
CI / containers — set RUNCOMFY_TOKEN=<token> instead of runcomfy login.

Endpoints + input schema

`google/nano-banana-2/edit`

Field	Type	Required	Default	Notes
`prompt`	string	yes	—	Edit instruction. Lead with preservation, end with the change.
`image_urls`	array	yes	—	1–20 publicly-fetchable HTTPS URLs.
`number_of_images`	int	no	1	1–4 outputs per call.
`seed`	int	no	—	Reproducibility.
`aspect_ratio`	enum	no	`auto`	`auto` (follows input) or fixed ratios — lock for batch consistency.
`resolution`	enum	no	`1K`	`0.5K` / `1K` / `2K` / `4K`.
`output_format`	enum	no	`png`	`png` / `jpeg` / `webp`.
`safety_tolerance`	int	no	4	1 (strict) – 6 (permissive).
`limit_generations`	bool	no	—	If true, restricts each round to one output.
`enable_web_search`	bool	no	false	Web grounding (extra cost / latency).

How to invoke

Single-image background swap, identity preserved:

runcomfy run google/nano-banana-2/edit \
  --input '{
    "prompt": "Keep the subject identity, pose, and clothing unchanged. Convert the background into a rainy neon cyberpunk street.",
    "image_urls": ["https://.../portrait.jpg"]
  }' \
  --output-dir <absolute/path>

Batch edit with locked framing:

runcomfy run google/nano-banana-2/edit \
  --input '{
    "prompt": "Replace the watermark in the bottom-right with the text \"AURA\" in clean white sans-serif. Keep everything else exactly as in the input.",
    "image_urls": ["https://.../sku-1.jpg", "https://.../sku-2.jpg", "https://.../sku-3.jpg"],
    "aspect_ratio": "1:1",
    "resolution": "1K"
  }' \
  --output-dir <absolute/path>

Targeted spatial edit ("left object only"):

runcomfy run google/nano-banana-2/edit \
  --input '{
    "prompt": "Remove the leftmost object only. Keep the right two objects, the table, and the lighting unchanged.",
    "image_urls": ["https://.../still-life.jpg"]
  }' \
  --output-dir <absolute/path>

Prompting — what actually works

Preservation first, change last. Always lead with "Keep [identity / pose / clothing / brand / framing] unchanged." Then state the change in one clean sentence. Models honor what's stated up front; tail-end preservations get ignored.

Localize with spatial language. "background only", "the left object", "the upper-right corner", "above the headline" — concrete spatial scopes are honored. "make it more X" is vague and drifts.

Batch consistency — when editing a series, lock aspect_ratio and resolution. Use the same prompt grammar across the batch so each output reads as a sibling, not a remix.

Iterate small. If a one-pass edit drifts, split into two: pass 1 changes background only, pass 2 swaps the subject's outfit. Cleaner edits, same total cost (assuming similar resolution).

Multi-image variation — pass up to 20 inputs to get a coherent batch. Useful for SKU galleries, A/B testing, character sheet variations.

Anti-patterns:

Long compound instructions ("change A and B and C and D") — drift increases per added scope.
Edit instructions written in passive voice ("the background should be changed") — be imperative.
Missing preservation goals — model will subtly rewrite the face / brand.
Aspect ratios that don't match input — causes crops or stretches.

Where it shines

Use case	Why Nano Banana Edit
SKU gallery — same product on different backgrounds	Batch of 20, identity-preserved, framing locked
Influencer / spokesperson background swaps	Strong identity preservation across edits
Localized object removal / addition	Spatial language honored
A/B variants for ad creative	Seed lock + multiple `number_of_images`
Brand-asset relocalization	Same composition with text / palette swap

Sample prompts (verified to produce strong results)

Background swap (page example):

Keep the subject identity unchanged. Convert the background into a rainy
neon cyberpunk street.

Targeted text replacement:

Keep the bottle, label, and lighting exactly as in the input.
Replace only the brand text on the label from "ALPHA" to "AURA",
same font weight, centered, white on black.

Multi-image batch consistency:

For each input image: keep the subject's pose and identity unchanged.
Convert the background to a soft warm-grey studio sweep with subtle
floor shadow. Center the subject at the same fraction of frame as the
input.

Limitations

1–20 input images per call — the first is treated as primary; the rest provide auxiliary cues.
1–4 outputs per call.
Long compound prompts drift — split into multiple passes.
Web search adds latency + cost — only enable on demand.
For multilingual in-image text edits, GPT Image 2 edit wins.

Exit codes

code	meaning
0	success
64	bad CLI args
65	bad input JSON / schema mismatch
69	upstream 5xx
75	retryable: timeout / 429
77	not signed in or token rejected

Full reference: docs.runcomfy.com/cli/troubleshooting.

How it works

The skill invokes runcomfy run google/nano-banana-2/edit with a JSON body matching the schema. The CLI POSTs to https://model-api.runcomfy.net/v1/models/google/nano-banana-2/edit, polls the request, fetches the result, and downloads any .runcomfy.net/.runcomfy.com URL into --output-dir. Ctrl-C cancels the remote request before exit.

Security & Privacy

Token storage: runcomfy login writes the API token to ~/.config/runcomfy/token.json with mode 0600 (owner-only read/write). Set RUNCOMFY_TOKEN env var to bypass the file entirely in CI / containers.
Input boundary: the user prompt is passed as a JSON string to the CLI via --input. The CLI does NOT shell-expand the prompt; it transmits the JSON body directly to the Model API over HTTPS. No shell injection surface from prompt content.
Third-party content: image / mask / video URLs you pass are fetched by the RunComfy model server, not by the CLI on your machine. Treat external URLs as untrusted; image-based prompt injection is a known risk for any image-edit / video-edit model.
Outbound endpoints: only model-api.runcomfy.net (request submission) and *.runcomfy.net / *.runcomfy.com (download whitelist for generated outputs). No telemetry, no callbacks.
Generated-file size cap: the CLI aborts any single download > 2 GiB to prevent disk-fill from a malicious or runaway model output.