📦 その他コミュニティ

boltzgen

BoltzGen拡散モデルを用いて、タンパク質の側鎖まで考慮した精密な設計や、低分子との結合部位設計を、YAML設定に基づいて実行できる、高精度なタンパク質設計を支援するSkill。

📜 元の英語説明(参考)

All-atom protein design using BoltzGen diffusion model. Use this skill when: (1) Need side-chain aware design from the start, (2) Designing around small molecules or ligands, (3) Want all-atom diffusion (not just backbone), (4) Require precise binding geometries, (5) Using YAML-based configuration. For backbone-only generation, use rfdiffusion. For sequence-only design, use proteinmpnn. For structure validation, use boltz.

🇯🇵 日本人クリエイター向け解説

一言でいうと

※ jpskill.com 編集部が日本のビジネス現場向けに補足した解説です。Skill本体の挙動とは独立した参考情報です。

⚡ おすすめ: コマンド1行でインストール(60秒)

下記のコマンドをコピーしてターミナル(Mac/Linux)または PowerShell(Windows)に貼り付けてください。ダウンロード → 解凍 → 配置まで全自動。

🍎 Mac / 🐧 Linux

mkdir -p ~/.claude/skills && cd ~/.claude/skills && curl -L -o boltzgen.zip https://jpskill.com/download/9542.zip && unzip -o boltzgen.zip && rm boltzgen.zip

🪟 Windows (PowerShell)

$d = "$env:USERPROFILE\.claude\skills"; ni -Force -ItemType Directory $d | Out-Null; iwr https://jpskill.com/download/9542.zip -OutFile "$d\boltzgen.zip"; Expand-Archive "$d\boltzgen.zip" -DestinationPath $d -Force; ri "$d\boltzgen.zip"

完了後、Claude Code を再起動 → 普通に「動画プロンプト作って」のように話しかけるだけで自動発動します。

💾 手動でダウンロードしたい(コマンドが難しい人向け)

1. 下の青いボタンを押して boltzgen.zip をダウンロード
2. ZIPファイルをダブルクリックで解凍 → boltzgen フォルダができる
3. そのフォルダを C:\Users\あなたの名前\.claude\skills\(Win)または ~/.claude/skills/(Mac)へ移動
4. Claude Code を再起動

⬇ .zip でダウンロード(推奨) ⬇ .skill 形式(上級者用) 元のソース ↗

⚠️ ダウンロード・利用は自己責任でお願いします。当サイトは内容・動作・安全性について責任を負いません。

🎯 このSkillでできること

下記の説明文を読むと、このSkillがあなたに何をしてくれるかが分かります。Claudeにこの分野の依頼をすると、自動で発動します。

📦 インストール方法 (3ステップ)

1. 上の「ダウンロード」ボタンを押して .skill ファイルを取得
2. ファイル名の拡張子を .skill から .zip に変えて展開(macは自動展開可)
3. 展開してできたフォルダを、ホームフォルダの .claude/skills/ に置く
- · macOS / Linux: ~/.claude/skills/
- · Windows: %USERPROFILE%\.claude\skills\

Claude Code を再起動すれば完了。「このSkillを使って…」と話しかけなくても、関連する依頼で自動的に呼び出されます。

詳しい使い方ガイドを見る →

最終更新: 2026-05-18
取得日時: 2026-05-18
同梱ファイル: 1

📖 Skill本文(日本語訳)

※ 原文(英語/中国語)を Gemini で日本語化したものです。Claude 自身は原文を読みます。誤訳がある場合は原文をご確認ください。

BoltzGen 全原子設計

前提条件

要件	最小	推奨
Python	3.10+	3.11
CUDA	12.0+	12.1+
GPU VRAM	24GB	48GB (L40S)
RAM	32GB	64GB

実行方法

初回ですか？ Modalとbiomodalsのセットアップについては、インストールガイドをご覧ください。

オプション1: Modal (推奨)

# biomodals をクローン
git clone https://github.com/hgbrian/biomodals && cd biomodals

# BoltzGen を実行 (YAML 設定ファイルが必要)
modal run modal_boltzgen.py \
  --input-yaml binder_config.yaml \
  --protocol protein-anything \
  --num-designs 50

# カスタム GPU を使用する場合
GPU=L40S modal run modal_boltzgen.py \
  --input-yaml binder_config.yaml \
  --protocol protein-anything \
  --num-designs 100

GPU: L40S (48GB) 推奨 | タイムアウト: デフォルト 120分

利用可能なプロトコル: protein-anything, peptide-anything, protein-small_molecule, nanobody-anything, antibody-anything

オプション2: ローカルインストール

git clone https://github.com/HannesStark/boltzgen.git
cd boltzgen
pip install -e .

python sample.py config=config.yaml

オプション3: Python API

from boltzgen import BoltzGen

model = BoltzGen.load_pretrained()
designs = model.sample(
    target_pdb="target.pdb",
    num_samples=50,
    binder_length=80
)

GPU: L40S (48GB) | 時間: デザインあたり ~30-60秒

主要なパラメータ (CLI)

パラメータ	デフォルト	説明
`--input-yaml`	必須	YAML 設計仕様へのパス
`--protocol`	`protein-anything`	設計プロトコル
`--num-designs`	10	生成する設計の数
`--steps`	all	実行するパイプラインステップ (例: `design inverse_folding`)

YAML 設定

BoltzGen は、設計されたタンパク質とターゲット構造をエンティティとして指定する エンティティベースの YAML 形式 を使用します。

重要な注意点:

残基インデックスは、作成者の残基番号ではなく、label_seq_id (1 から始まる) を使用します
ファイルパスは YAML ファイルの場所からの相対パスです
ターゲットファイルは CIF 形式である必要があります (PDB も動作しますが、CIF が推奨されます)
実行する前に boltzgen check config.yaml を実行して、仕様を確認してください

基本的なバインダー設定

entities:
  # 設計されたタンパク質 (可変長 80-140 残基)
  - protein:
      id: B
      sequence: 80..140

  # 構造ファイルからのターゲット
  - file:
      path: target.cif
      include:
        - chain:
            id: A
      # 結合部位の残基を指定 (オプションですが推奨)
      binding_types:
        - chain:
            id: A
            binding: 45,67,89

特定の結合部位を持つバインダー

entities:
  - protein:
      id: G
      sequence: 60..100

  - file:
      path: 5cqg.cif
      include:
        - chain:
            id: A
      binding_types:
        - chain:
            id: A
            binding: 343,344,251
      structure_groups: "all"

ペプチド設計 (環状)

entities:
  - protein:
      id: S
      sequence: 10..14C6C3  # ジスルフィド結合のためのシステイン付き

  - file:
      path: target.cif
      include:
        - chain:
            id: A

constraints:
  - bond:
      atom1: [S, 11, SG]
      atom2: [S, 18, SG]  # ジスルフィド結合

設計プロトコル

プロトコル	ユースケース
`protein-anything`	タンパク質またはペプチドに結合するタンパク質を設計する
`peptide-anything`	タンパク質に結合する環状ペプチドを設計する
`protein-small_molecule`	低分子に結合するタンパク質を設計する
`nanobody-anything`	ナノボディ CDR を設計する
`antibody-anything`	抗体 CDR を設計する

出力形式

output/
├── sample_0/
│   ├── design.cif         # 全原子構造 (CIF 形式)
│   ├── metrics.json       # 信頼性スコア
│   └── sequence.fasta     # 配列
├── sample_1/
│   └── ...
└── summary.csv

注意: BoltzGen は CIF 形式で出力します。必要に応じて PDB に変換してください:

from Bio.PDB import MMCIFParser, PDBIO
parser = MMCIFParser()
structure = parser.get_structure("design", "design.cif")
io = PDBIO()
io.set_structure(structure)
io.save("design.pdb")

出力例

成功した実行

$ modal run modal_boltzgen.py --input-yaml binder.yaml --protocol protein-anything --num-designs 10
Running: boltzgen run binder.yaml --output /tmp/out --protocol protein-anything --num_designs 10
[INFO] Loading BoltzGen model...
[INFO] Generating designs...
[INFO] Running inverse folding...
[INFO] Running structure prediction...
[INFO] Filtering and ranking...
[INFO] Pipeline complete

Results saved to: ./out/boltzgen/2501161234/

出力ディレクトリ構造:

out/boltzgen/2501161234/
├── intermediate_designs/           # 生の拡散出力
│   ├── design_0.cif
│   └── design_0.npz
├── intermediate_designs_inverse_folded/
│   ├── refold_cif/                 # リフォールドされた複合体
│   └── aggregate_metrics_analyze.csv
└── final_ranked_designs/
    ├── final_10_designs/           # 上位デザイン
    └── results_overview.pdf        # サマリープロット

良好な出力の例:

リフォールディング RMSD < 2.0A (デザインが予測どおりにフォールドされる)
ipTM > 0.5 (信頼できるインターフェース)
すべてのデザインがエラーなしでパイプラインを完了する

決定木

BoltzGen を使用すべきか？
│
├─ どのようなタイプの設計か？
│  ├─ 全原子の精度が必要 → BoltzGen ✓
│  ├─ リガンド結合ポケット → BoltzGen ✓
│  └─ 標準的なミニタンパク質 → RFdiffusion (より高速)
│
├─ 何が最も重要か？
│  ├─ 側鎖パッキング → BoltzGen ✓
│  ├─ 速度 / 多様性 → RFdiffusion
│  ├─ 最高の成功率 → BindCraft
│  └─ AF2 最適化 → ColabDesign
│
└─ 計算リソースは？
   ├─ L40S/A100 (48GB+) を持っている → BoltzGen ✓
   └─ A10G (24GB) しか持っていない → RFdiffusion を検討

標準的なパフォーマンス

キャンペーンサイズ	時間 (L40S)	コスト (Modal)	注
50 デザイン	30-45 分	~$8	簡単な探索
100 デザイン	1-1.5 時間	~$15	標準的なキャンペーン
500 デザイン	5-8 時間	~$70	大規模なキャンペーン

デザインあたり:

(原文がここで切り詰められています)

📜 原文 SKILL.md(Claudeが読む英語/中国語)を展開

BoltzGen All-Atom Design

Prerequisites

Requirement	Minimum	Recommended
Python	3.10+	3.11
CUDA	12.0+	12.1+
GPU VRAM	24GB	48GB (L40S)
RAM	32GB	64GB

How to run

First time? See Installation Guide to set up Modal and biomodals.

Option 1: Modal (recommended)

# Clone biomodals
git clone https://github.com/hgbrian/biomodals && cd biomodals

# Run BoltzGen (requires YAML config file)
modal run modal_boltzgen.py \
  --input-yaml binder_config.yaml \
  --protocol protein-anything \
  --num-designs 50

# With custom GPU
GPU=L40S modal run modal_boltzgen.py \
  --input-yaml binder_config.yaml \
  --protocol protein-anything \
  --num-designs 100

GPU: L40S (48GB) recommended | Timeout: 120min default

Available protocols: protein-anything, peptide-anything, protein-small_molecule, nanobody-anything, antibody-anything

Option 2: Local installation

git clone https://github.com/HannesStark/boltzgen.git
cd boltzgen
pip install -e .

python sample.py config=config.yaml

Option 3: Python API

from boltzgen import BoltzGen

model = BoltzGen.load_pretrained()
designs = model.sample(
    target_pdb="target.pdb",
    num_samples=50,
    binder_length=80
)

GPU: L40S (48GB) | Time: ~30-60s per design

Key parameters (CLI)

Parameter	Default	Description
`--input-yaml`	required	Path to YAML design specification
`--protocol`	`protein-anything`	Design protocol
`--num-designs`	10	Number of designs to generate
`--steps`	all	Pipeline steps to run (e.g., `design inverse_folding`)

YAML configuration

BoltzGen uses an entity-based YAML format where you specify designed proteins and target structures as entities.

Important notes:

Residue indices use label_seq_id (1-indexed), not author residue numbers
File paths are relative to the YAML file location
Target files should be in CIF format (PDB also works but CIF preferred)
Run boltzgen check config.yaml to verify your specification before running

Basic Binder Config

entities:
  # Designed protein (variable length 80-140 residues)
  - protein:
      id: B
      sequence: 80..140

  # Target from structure file
  - file:
      path: target.cif
      include:
        - chain:
            id: A
      # Specify binding site residues (optional but recommended)
      binding_types:
        - chain:
            id: A
            binding: 45,67,89

Binder with Specific Binding Site

entities:
  - protein:
      id: G
      sequence: 60..100

  - file:
      path: 5cqg.cif
      include:
        - chain:
            id: A
      binding_types:
        - chain:
            id: A
            binding: 343,344,251
      structure_groups: "all"

Peptide Design (Cyclic)

entities:
  - protein:
      id: S
      sequence: 10..14C6C3  # With cysteines for disulfide

  - file:
      path: target.cif
      include:
        - chain:
            id: A

constraints:
  - bond:
      atom1: [S, 11, SG]
      atom2: [S, 18, SG]  # Disulfide bond

Design protocols

Protocol	Use Case
`protein-anything`	Design proteins to bind proteins or peptides
`peptide-anything`	Design cyclic peptides to bind proteins
`protein-small_molecule`	Design proteins to bind small molecules
`nanobody-anything`	Design nanobody CDRs
`antibody-anything`	Design antibody CDRs

Output format

output/
├── sample_0/
│   ├── design.cif         # All-atom structure (CIF format)
│   ├── metrics.json       # Confidence scores
│   └── sequence.fasta     # Sequence
├── sample_1/
│   └── ...
└── summary.csv

Note: BoltzGen outputs CIF format. Convert to PDB if needed:

from Bio.PDB import MMCIFParser, PDBIO
parser = MMCIFParser()
structure = parser.get_structure("design", "design.cif")
io = PDBIO()
io.set_structure(structure)
io.save("design.pdb")

Sample output

Successful run

$ modal run modal_boltzgen.py --input-yaml binder.yaml --protocol protein-anything --num-designs 10
Running: boltzgen run binder.yaml --output /tmp/out --protocol protein-anything --num_designs 10
[INFO] Loading BoltzGen model...
[INFO] Generating designs...
[INFO] Running inverse folding...
[INFO] Running structure prediction...
[INFO] Filtering and ranking...
[INFO] Pipeline complete

Results saved to: ./out/boltzgen/2501161234/

Output directory structure:

out/boltzgen/2501161234/
├── intermediate_designs/           # Raw diffusion outputs
│   ├── design_0.cif
│   └── design_0.npz
├── intermediate_designs_inverse_folded/
│   ├── refold_cif/                 # Refolded complexes
│   └── aggregate_metrics_analyze.csv
└── final_ranked_designs/
    ├── final_10_designs/           # Top designs
    └── results_overview.pdf        # Summary plots

What good output looks like:

Refolding RMSD < 2.0A (design folds as predicted)
ipTM > 0.5 (confident interface)
All designs complete pipeline without errors

Decision tree

Should I use BoltzGen?
│
├─ What type of design?
│  ├─ All-atom precision needed → BoltzGen ✓
│  ├─ Ligand binding pocket → BoltzGen ✓
│  └─ Standard miniprotein → RFdiffusion (faster)
│
├─ What matters most?
│  ├─ Side-chain packing → BoltzGen ✓
│  ├─ Speed / diversity → RFdiffusion
│  ├─ Highest success rate → BindCraft
│  └─ AF2 optimization → ColabDesign
│
└─ Compute resources?
   ├─ Have L40S/A100 (48GB+) → BoltzGen ✓
   └─ Only A10G (24GB) → Consider RFdiffusion

Typical performance

Campaign Size	Time (L40S)	Cost (Modal)	Notes
50 designs	30-45 min	~$8	Quick exploration
100 designs	1-1.5h	~$15	Standard campaign
500 designs	5-8h	~$70	Large campaign

Per-design: ~30-60s for typical binder.

Verify

find output -name "*.cif" | wc -l  # Should match num_samples

Troubleshooting

Verify config first: Always run boltzgen check config.yaml before running the full pipeline Slow generation: Use fewer designs for initial testing, then scale up OOM errors: Use A100-80GB or reduce --num-designs Wrong binding site: Residue indices use label_seq_id (1-indexed), check in Molstar viewer

Error interpretation

Error	Cause	Fix
`RuntimeError: CUDA out of memory`	Large design or long protein	Use A100-80GB or reduce designs
`FileNotFoundError: *.cif`	Target file not found	File paths are relative to YAML location
`ValueError: invalid chain`	Chain not in target	Verify chain IDs with Molstar or PyMOL
`modal: command not found`	Modal CLI not installed	Run `pip install modal && modal setup`

Next: Validate with boltz or chai → protein-qc for filtering.