🛠️ 開発・MCP コミュニティ

protein-design-workflow

タンパク質設計プロジェクトの開始から完了まで、必要なステップ、ツール選定、計算資源の見積もりなどを包括的にガイドし、効率的な設計を支援するSkill。

📜 元の英語説明(参考)

End-to-end guidance for protein design pipelines. Use this skill when: (1) Starting a new protein design project, (2) Need step-by-step workflow guidance, (3) Understanding the full design pipeline, (4) Planning compute resources and timelines, (5) Integrating multiple design tools. For tool selection, use binder-design. For QC thresholds, use protein-qc.

🇯🇵 日本人クリエイター向け解説

一言でいうと

※ jpskill.com 編集部が日本のビジネス現場向けに補足した解説です。Skill本体の挙動とは独立した参考情報です。

⚡ おすすめ: コマンド1行でインストール(60秒)

下記のコマンドをコピーしてターミナル(Mac/Linux)または PowerShell(Windows)に貼り付けてください。ダウンロード → 解凍 → 配置まで全自動。

🍎 Mac / 🐧 Linux

mkdir -p ~/.claude/skills && cd ~/.claude/skills && curl -L -o protein-design-workflow.zip https://jpskill.com/download/9551.zip && unzip -o protein-design-workflow.zip && rm protein-design-workflow.zip

🪟 Windows (PowerShell)

$d = "$env:USERPROFILE\.claude\skills"; ni -Force -ItemType Directory $d | Out-Null; iwr https://jpskill.com/download/9551.zip -OutFile "$d\protein-design-workflow.zip"; Expand-Archive "$d\protein-design-workflow.zip" -DestinationPath $d -Force; ri "$d\protein-design-workflow.zip"

完了後、Claude Code を再起動 → 普通に「動画プロンプト作って」のように話しかけるだけで自動発動します。

💾 手動でダウンロードしたい(コマンドが難しい人向け)

1. 下の青いボタンを押して protein-design-workflow.zip をダウンロード
2. ZIPファイルをダブルクリックで解凍 → protein-design-workflow フォルダができる
3. そのフォルダを C:\Users\あなたの名前\.claude\skills\(Win)または ~/.claude/skills/(Mac)へ移動
4. Claude Code を再起動

⬇ .zip でダウンロード(推奨) ⬇ .skill 形式(上級者用) 元のソース ↗

⚠️ ダウンロード・利用は自己責任でお願いします。当サイトは内容・動作・安全性について責任を負いません。

🎯 このSkillでできること

下記の説明文を読むと、このSkillがあなたに何をしてくれるかが分かります。Claudeにこの分野の依頼をすると、自動で発動します。

📦 インストール方法 (3ステップ)

1. 上の「ダウンロード」ボタンを押して .skill ファイルを取得
2. ファイル名の拡張子を .skill から .zip に変えて展開(macは自動展開可)
3. 展開してできたフォルダを、ホームフォルダの .claude/skills/ に置く
- · macOS / Linux: ~/.claude/skills/
- · Windows: %USERPROFILE%\.claude\skills\

Claude Code を再起動すれば完了。「このSkillを使って…」と話しかけなくても、関連する依頼で自動的に呼び出されます。

詳しい使い方ガイドを見る →

最終更新: 2026-05-18
取得日時: 2026-05-18
同梱ファイル: 1

📖 Skill本文(日本語訳)

※ 原文(英語/中国語)を Gemini で日本語化したものです。Claude 自身は原文を読みます。誤訳がある場合は原文をご確認ください。

タンパク質設計ワークフローガイド

標準的な結合タンパク質設計パイプライン

概要

ターゲット準備 --> バックボーン生成 --> 配列設計
         |                     |                     |
         v                     v                     v
    (pdb skill)          (rfdiffusion)         (proteinmpnn)
                               |                     |
                               v                     v
                        構造検証 --> フィルタリング
                               |                     |
                               v                     v
                         (alphafold/chai)      (protein-qc)

フェーズ 1: ターゲット準備

1.1 ターゲット構造の取得

# PDB からダウンロード
curl -o target.pdb "https://files.rcsb.org/download/XXXX.pdb"

1.2 クリーニングと準備

# ターゲット鎖の抽出
# 必要に応じて水分子、リガンドの除去
# 結合領域 + 10A バッファーにトリミング

1.3 ホットスポットの選択

露出した残基を 3-6 個選択
電荷を持つ/芳香族 (K, R, E, D, W, Y, F) を優先
表面へのアクセス性を確認
残基番号を確認

出力: target_prepared.pdb、ホットスポットリスト

フェーズ 2: バックボーン生成

オプション A: RFdiffusion (多様な探索)

modal run modal_rfdiffusion.py \
  --pdb target_prepared.pdb \
  --contigs "A1-150/0 70-100" \
  --hotspot "A45,A67,A89" \
  --num-designs 500

オプション B: BindCraft (エンドツーエンド)

modal run modal_bindcraft.py \
  --target-pdb target_prepared.pdb \
  --hotspots "A45,A67,A89" \
  --num-designs 100

出力: 100-500 個のバックボーン PDB

フェーズ 3: 配列設計

RFdiffusion バックボーンの場合

for backbone in backbones/*.pdb; do
  modal run modal_proteinmpnn.py \
    --pdb-path "$backbone" \
    --num-seq-per-target 8 \
    --sampling-temp 0.1
done

出力: バックボーンあたり 8 配列 (合計 800-4000)

フェーズ 4: 構造検証

複合体の予測

# 結合タンパク質 + ターゲットを含む FASTA を準備
# 多量体の場合の binder:target 形式

modal run modal_colabfold.py \
  --input-faa all_sequences.fasta \
  --out-dir predictions/

出力: pLDDT、ipTM、PAE を含む AF2 予測

フェーズ 5: フィルタリングと選択

標準的な閾値の適用

import pandas as pd

# メトリクスのロード
designs = pd.read_csv('all_metrics.csv')

# フィルタリング
filtered = designs[
    (designs['pLDDT'] > 0.85) &
    (designs['ipTM'] > 0.50) &
    (designs['PAE_interface'] < 10) &
    (designs['scRMSD'] < 2.0) &
    (designs['esm2_pll'] > 0.0)
]

# 複合スコアによるランキング
filtered['score'] = (
    0.3 * filtered['pLDDT'] +
    0.3 * filtered['ipTM'] +
    0.2 * (1 - filtered['PAE_interface'] / 20) +
    0.2 * filtered['esm2_pll']
)

top_designs = filtered.nlargest(50, 'score')

出力: 50-200 個のフィルタリングされた候補

リソース計画

計算要件

ステージ	GPU	時間 (100 デザイン)
RFdiffusion	A10G	30 分
ProteinMPNN	T4	15 分
ColabFold	A100	4-8 時間
フィルタリング	CPU	15 分

全体的なタイムライン

小規模キャンペーン (100 デザイン): 8-12 時間
中規模キャンペーン (500 デザイン): 24-48 時間
大規模キャンペーン (1000+ デザイン): 2-5 日

品質チェックポイント

バックボーン生成後

[ ] 多様なバックボーンの目視検査
[ ] 二次構造の存在
[ ] ターゲットとの衝突がないこと

配列設計後

[ ] ほとんどの配列で ESM2 PLL > 0.0
[ ] 不要なシステインがないこと (意図的な場合を除く)
[ ] 妥当な配列多様性

検証後

[ ] pLDDT > 0.85
[ ] ipTM > 0.50
[ ] PAE_interface < 10
[ ] 自己整合性 RMSD < 2.0 A

最終選択

[ ] 多様な配列 (必要に応じてクラスタリング)
[ ] 製造可能性 (問題のあるモチーフがないこと)
[ ] 妥当な分子量

よくある問題

問題	解決策
低い ipTM	ホットスポットの確認、デザイン数の増加
低い多様性	より高い温度、より多くのバックボーン
高い scRMSD	バックボーンが異常である可能性
低い pLDDT	デザイン品質の確認

高度なワークフロー

複数ツールの組み合わせ

最初のバックボーンに RFdiffusion
洗練に ColabDesign
多様化に ProteinMPNN
最終検証に AF2

反復的な洗練

最初のキャンペーンの実行
失敗の分析
ホットスポット/パラメーターの調整
知見に基づいて繰り返す

📜 原文 SKILL.md(Claudeが読む英語/中国語)を展開

Protein Design Workflow Guide

Standard binder design pipeline

Overview

Target Preparation --> Backbone Generation --> Sequence Design
         |                     |                     |
         v                     v                     v
    (pdb skill)          (rfdiffusion)         (proteinmpnn)
                               |                     |
                               v                     v
                        Structure Validation --> Filtering
                               |                     |
                               v                     v
                         (alphafold/chai)      (protein-qc)

Phase 1: Target preparation

1.1 Obtain target structure

# Download from PDB
curl -o target.pdb "https://files.rcsb.org/download/XXXX.pdb"

1.2 Clean and prepare

# Extract target chain
# Remove waters, ligands if needed
# Trim to binding region + 10A buffer

1.3 Select hotspots

Choose 3-6 exposed residues
Prefer charged/aromatic (K, R, E, D, W, Y, F)
Check surface accessibility
Verify residue numbering

Output: target_prepared.pdb, hotspot list

Phase 2: Backbone generation

Option A: RFdiffusion (diverse exploration)

modal run modal_rfdiffusion.py \
  --pdb target_prepared.pdb \
  --contigs "A1-150/0 70-100" \
  --hotspot "A45,A67,A89" \
  --num-designs 500

Option B: BindCraft (end-to-end)

modal run modal_bindcraft.py \
  --target-pdb target_prepared.pdb \
  --hotspots "A45,A67,A89" \
  --num-designs 100

Output: 100-500 backbone PDBs

Phase 3: Sequence design

For RFdiffusion backbones

for backbone in backbones/*.pdb; do
  modal run modal_proteinmpnn.py \
    --pdb-path "$backbone" \
    --num-seq-per-target 8 \
    --sampling-temp 0.1
done

Output: 8 sequences per backbone (800-4000 total)

Phase 4: Structure validation

Predict complexes

# Prepare FASTA with binder + target
# binder:target format for multimer

modal run modal_colabfold.py \
  --input-faa all_sequences.fasta \
  --out-dir predictions/

Output: AF2 predictions with pLDDT, ipTM, PAE

Phase 5: Filtering and selection

Apply standard thresholds

import pandas as pd

# Load metrics
designs = pd.read_csv('all_metrics.csv')

# Filter
filtered = designs[
    (designs['pLDDT'] > 0.85) &
    (designs['ipTM'] > 0.50) &
    (designs['PAE_interface'] < 10) &
    (designs['scRMSD'] < 2.0) &
    (designs['esm2_pll'] > 0.0)
]

# Rank by composite score
filtered['score'] = (
    0.3 * filtered['pLDDT'] +
    0.3 * filtered['ipTM'] +
    0.2 * (1 - filtered['PAE_interface'] / 20) +
    0.2 * filtered['esm2_pll']
)

top_designs = filtered.nlargest(50, 'score')

Output: 50-200 filtered candidates

Resource planning

Compute requirements

Stage	GPU	Time (100 designs)
RFdiffusion	A10G	30 min
ProteinMPNN	T4	15 min
ColabFold	A100	4-8 hours
Filtering	CPU	15 min

Total timeline

Small campaign (100 designs): 8-12 hours
Medium campaign (500 designs): 24-48 hours
Large campaign (1000+ designs): 2-5 days

Quality checkpoints

After backbone generation

[ ] Visual inspection of diverse backbones
[ ] Secondary structure present
[ ] No clashes with target

After sequence design

[ ] ESM2 PLL > 0.0 for most sequences
[ ] No unwanted cysteines (unless intentional)
[ ] Reasonable sequence diversity

After validation

[ ] pLDDT > 0.85
[ ] ipTM > 0.50
[ ] PAE_interface < 10
[ ] Self-consistency RMSD < 2.0 A

Final selection

[ ] Diverse sequences (cluster if needed)
[ ] Manufacturable (no problematic motifs)
[ ] Reasonable molecular weight

Common issues

Problem	Solution
Low ipTM	Check hotspots, increase designs
Poor diversity	Higher temperature, more backbones
High scRMSD	Backbone may be unusual
Low pLDDT	Check design quality