📦 その他コミュニティ

signal-classification

XGBoostやLightGBMといった機械学習モデルを用いて、過去のデータに基づいた取引シグナルを分類し、将来の価格変動を予測、さらにSHAP値で重要な特徴を分析し、最適な閾値を設定することで、より精度の高い投資判断を支援するSkill。

📜 元の英語説明(参考)

ML trading signal classifiers using XGBoost and LightGBM with walk-forward validation, SHAP feature importance, and threshold optimization

🇯🇵 日本人クリエイター向け解説

一言でいうと

※ jpskill.com 編集部が日本のビジネス現場向けに補足した解説です。Skill本体の挙動とは独立した参考情報です。

⚡ おすすめ: コマンド1行でインストール(60秒)

下記のコマンドをコピーしてターミナル(Mac/Linux)または PowerShell(Windows)に貼り付けてください。ダウンロード → 解凍 → 配置まで全自動。

🍎 Mac / 🐧 Linux

mkdir -p ~/.claude/skills && cd ~/.claude/skills && curl -L -o signal-classification.zip https://jpskill.com/download/10435.zip && unzip -o signal-classification.zip && rm signal-classification.zip

🪟 Windows (PowerShell)

$d = "$env:USERPROFILE\.claude\skills"; ni -Force -ItemType Directory $d | Out-Null; iwr https://jpskill.com/download/10435.zip -OutFile "$d\signal-classification.zip"; Expand-Archive "$d\signal-classification.zip" -DestinationPath $d -Force; ri "$d\signal-classification.zip"

完了後、Claude Code を再起動 → 普通に「動画プロンプト作って」のように話しかけるだけで自動発動します。

💾 手動でダウンロードしたい(コマンドが難しい人向け)

1. 下の青いボタンを押して signal-classification.zip をダウンロード
2. ZIPファイルをダブルクリックで解凍 → signal-classification フォルダができる
3. そのフォルダを C:\Users\あなたの名前\.claude\skills\(Win)または ~/.claude/skills/(Mac)へ移動
4. Claude Code を再起動

⬇ .zip でダウンロード(推奨) ⬇ .skill 形式(上級者用) 元のソース ↗

⚠️ ダウンロード・利用は自己責任でお願いします。当サイトは内容・動作・安全性について責任を負いません。

🎯 このSkillでできること

下記の説明文を読むと、このSkillがあなたに何をしてくれるかが分かります。Claudeにこの分野の依頼をすると、自動で発動します。

📦 インストール方法 (3ステップ)

1. 上の「ダウンロード」ボタンを押して .skill ファイルを取得
2. ファイル名の拡張子を .skill から .zip に変えて展開(macは自動展開可)
3. 展開してできたフォルダを、ホームフォルダの .claude/skills/ に置く
- · macOS / Linux: ~/.claude/skills/
- · Windows: %USERPROFILE%\.claude\skills\

Claude Code を再起動すれば完了。「このSkillを使って…」と話しかけなくても、関連する依頼で自動的に呼び出されます。

詳しい使い方ガイドを見る →

最終更新: 2026-05-18
取得日時: 2026-05-18
同梱ファイル: 1

📖 Skill本文(日本語訳)

※ 原文(英語/中国語)を Gemini で日本語化したものです。Claude 自身は原文を読みます。誤訳がある場合は原文をご確認ください。

シグナル分類

教師あり機械学習分類器を使用して、資産価格が将来の期間にわたって上昇するか下降するかを予測します。このスキルは、ラベル作成、モデル学習、ウォークフォワード検証、特徴量の重要度分析、および取引アプリケーション向けの閾値最適化という、完全なパイプラインを網羅しています。

なぜツリーベースモデルが取引MLを支配するのか

XGBoostとLightGBMが定量取引MLの主力であるのには、正当な理由があります。

非線形な関係: 金融の特徴量は複雑な非線形な方法で相互作用し、ツリーはそれを自然に捉えます。
特徴量のスケールに対するロバスト性: 入力を正規化または標準化する必要はありません。ツリーはランク順で分割します。
組み込みの特徴量の重要度: 個別の分析なしに、どの特徴量が予測を促進するかを理解できます。
高速な学習と推論: 数千のサンプルで数秒で学習し、マイクロ秒で予測します。
欠損値の処理: 補完ハックなしでNaNをネイティブにサポートします。
組み込みの正則化: max_depth、min_child_weight、subsampleはすべて過学習を防ぎます。

線形モデルと深層学習にも活躍の場はありますが、10万未満のサンプルを持つ表形式の取引特徴量の場合、勾配ブースティングツリーは一貫して代替手段よりも優れています。

分類タイプ

二値分類

最も単純で一般的な設定。将来のリターンが閾値を超えるかどうかを予測します。

上昇シグナル: 将来のリターン > +1%
下降シグナル: 将来のリターン < -1%
ニュートラル（除外）: -1%から+1% — よりクリーンなラベルを作成するために、これらを学習から削除します。

import numpy as np

def create_binary_labels(
    prices: np.ndarray, horizon: int = 24, threshold: float = 0.01
) -> np.ndarray:
    """Create binary labels from forward returns.

    Args:
        prices: Array of prices.
        horizon: Forward return lookback in bars.
        threshold: Minimum return magnitude for a label.

    Returns:
        Array of labels: 1 (up), 0 (down), NaN (neutral).
    """
    fwd_returns = np.roll(prices, -horizon) / prices - 1
    fwd_returns[-horizon:] = np.nan
    labels = np.where(fwd_returns > threshold, 1,
             np.where(fwd_returns < -threshold, 0, np.nan))
    return labels

多クラス分類

より細かいシグナル粒度のための3つのクラス：

Class	Condition	Typical threshold
Strong Up	fwd_return > +2%	High confidence long
Mild Up	+0.5% to +2%	Moderate confidence
Down	fwd_return < -0.5%	Avoid / short

多クラスは、クラスごとのサンプルサイズを削減します。大規模なデータセット（クラスごとに1000以上のサンプル）でのみ使用してください。

確率キャリブレーション

XGBoost/LightGBMからの生のモデル確率は、適切にキャリブレーションされていません。予測された0.7の確率は、70％の正解率を意味するわけではありません。これを修正するには、キャリブレーションを使用します。

from sklearn.calibration import CalibratedClassifierCV

calibrated = CalibratedClassifierCV(base_model, cv=5, method="isotonic")
calibrated.fit(X_train, y_train)
probs = calibrated.predict_proba(X_test)[:, 1]

等張キャリブレーションは、ツリーモデルの場合、Plattスケーリングよりも優れています。

ウォークフォワード検証

これは、取引MLにおいて最も重要な概念です。 標準的な交差検証はデータをランダムにシャッフルするため、先読みバイアスが発生します。ウォークフォワード検証は、時間の順序を尊重します。

仕組み

Window 1: [===TRAIN===][GAP][=TEST=]
Window 2:    [===TRAIN===][GAP][=TEST=]
Window 3:       [===TRAIN===][GAP][=TEST=]
Window 4:          [===TRAIN===][GAP][=TEST=]

各ウィンドウ：

過去N本のバーで学習します
将来のリターン期間に等しいギャップ（エンバーゴ）をスキップします
次のM本のバーで予測します
アウトオブサンプル予測を記録します
前方にスライドして繰り返します

一般的なパラメータ

Parameter	Value	Rationale
Train window	30 days (720 hourly bars)	学習するのに十分なデータ、関連性を保つために十分に最近
Test window	7 days (168 hourly bars)	統計的有意性のために十分な予測
Step size	1 day (24 bars)	より多くのデータポイントのためにテストウィンドウをオーバーラップ
Gap (embargo)	Same as forward horizon	ラベルリークを防ぎます

ウォークフォワードの実装

from typing import Iterator

def walk_forward_splits(
    n_samples: int,
    train_size: int = 720,
    test_size: int = 168,
    step_size: int = 24,
    gap: int = 24,
) -> Iterator[tuple[np.ndarray, np.ndarray]]:
    """Generate walk-forward train/test index splits.

    Args:
        n_samples: Total number of samples.
        train_size: Number of training samples per window.
        test_size: Number of test samples per window.
        step_size: Step between successive windows.
        gap: Gap between train end and test start.

    Yields:
        Tuples of (train_indices, test_indices).
    """
    start = 0
    while start + train_size + gap + test_size <= n_samples:
        train_idx = np.arange(start, start + train_size)
        test_start = start + train_size + gap
        test_idx = np.arange(test_start, test_start + test_size)
        yield train_idx, test_idx
        start += step_size

パージされたCV、CPCV、および評価指標については、references/validation_methods.mdを参照してください。

モデル学習パイプライン

完全なパイプラインの概要

特徴量エンジニアリング — テクニカル指標、オンチェーンメトリクス、ボリューム特徴量を計算します（feature-engineeringスキルを参照）。
ラベル作成 — 閾値を使用した将来のリターン、ニュートラルゾーンを削除します。
ウォークフォワード分割 — ギャップのある時間順の学習/テストウィンドウ。
モデル学習 — 各学習ウィンドウでXGBoostまたはLightGBMを使用します。
テストでの予測 — アウトオブサンプル確率予測を生成します。
予測の集約 — すべてのアウトオブサンプル結果を連結します。
評価 — 精度、適合率、再現率、F1、AUC、プロフィットファクター。

簡単な学習例

from xgboost import XGBClassifier

model = XGBClassifier(
    n_estimators=200,
    max_depth=4,
    learning_rate=0.05,
    subsample=0.8,
    colsample_bytree=0.8,
    eval_metric="loglo

📜 原文 SKILL.md(Claudeが読む英語/中国語)を展開

Signal Classification

Predict whether an asset's price will move up or down over a forward horizon using supervised machine learning classifiers. This skill covers the full pipeline: label creation, model training, walk-forward validation, feature importance analysis, and threshold optimization for trading applications.

Why Tree-Based Models Dominate Trading ML

XGBoost and LightGBM are the workhorses of quantitative trading ML for good reason:

Non-linear relationships: Financial features interact in complex, non-linear ways that trees capture naturally
Robust to feature scale: No need to normalize or standardize inputs — trees split on rank order
Built-in feature importance: Understand which features drive predictions without separate analysis
Fast training and inference: Train on thousands of samples in seconds, predict in microseconds
Handle missing values: Native support for NaN without imputation hacks
Regularization built in: max_depth, min_child_weight, subsample all prevent overfitting

Linear models and deep learning have their place, but for tabular trading features with fewer than 100k samples, gradient-boosted trees consistently outperform alternatives.

Classification Types

Binary Classification

The simplest and most common setup. Predict whether forward returns exceed a threshold:

Up signal: forward return > +1%
Down signal: forward return < -1%
Neutral (excluded): -1% to +1% — drop these from training to create cleaner labels

import numpy as np

def create_binary_labels(
    prices: np.ndarray, horizon: int = 24, threshold: float = 0.01
) -> np.ndarray:
    """Create binary labels from forward returns.

    Args:
        prices: Array of prices.
        horizon: Forward return lookback in bars.
        threshold: Minimum return magnitude for a label.

    Returns:
        Array of labels: 1 (up), 0 (down), NaN (neutral).
    """
    fwd_returns = np.roll(prices, -horizon) / prices - 1
    fwd_returns[-horizon:] = np.nan
    labels = np.where(fwd_returns > threshold, 1,
             np.where(fwd_returns < -threshold, 0, np.nan))
    return labels

Multi-Class Classification

Three classes for finer signal granularity:

Class	Condition	Typical threshold
Strong Up	fwd_return > +2%	High confidence long
Mild Up	+0.5% to +2%	Moderate confidence
Down	fwd_return < -0.5%	Avoid / short

Multi-class reduces per-class sample size. Use only with large datasets (1000+ samples per class).

Probability Calibration

Raw model probabilities from XGBoost/LightGBM are not well-calibrated. A predicted 0.7 probability does not mean 70% chance of being correct. Use calibration to fix this:

from sklearn.calibration import CalibratedClassifierCV

calibrated = CalibratedClassifierCV(base_model, cv=5, method="isotonic")
calibrated.fit(X_train, y_train)
probs = calibrated.predict_proba(X_test)[:, 1]

Isotonic calibration works better than Platt scaling for tree models.

Walk-Forward Validation

This is the single most important concept in trading ML. Standard cross-validation randomly shuffles data, which creates lookahead bias. Walk-forward validation respects time ordering.

How It Works

Window 1: [===TRAIN===][GAP][=TEST=]
Window 2:    [===TRAIN===][GAP][=TEST=]
Window 3:       [===TRAIN===][GAP][=TEST=]
Window 4:          [===TRAIN===][GAP][=TEST=]

Each window:

Train on past N bars
Skip a gap (embargo) equal to the forward return horizon
Predict on next M bars
Record out-of-sample predictions
Slide forward and repeat

Typical Parameters

Parameter	Value	Rationale
Train window	30 days (720 hourly bars)	Enough data to learn, recent enough to be relevant
Test window	7 days (168 hourly bars)	Enough predictions for statistical significance
Step size	1 day (24 bars)	Overlap test windows for more data points
Gap (embargo)	Same as forward horizon	Prevents label leakage

Walk-Forward Implementation

from typing import Iterator

def walk_forward_splits(
    n_samples: int,
    train_size: int = 720,
    test_size: int = 168,
    step_size: int = 24,
    gap: int = 24,
) -> Iterator[tuple[np.ndarray, np.ndarray]]:
    """Generate walk-forward train/test index splits.

    Args:
        n_samples: Total number of samples.
        train_size: Number of training samples per window.
        test_size: Number of test samples per window.
        step_size: Step between successive windows.
        gap: Gap between train end and test start.

    Yields:
        Tuples of (train_indices, test_indices).
    """
    start = 0
    while start + train_size + gap + test_size <= n_samples:
        train_idx = np.arange(start, start + train_size)
        test_start = start + train_size + gap
        test_idx = np.arange(test_start, test_start + test_size)
        yield train_idx, test_idx
        start += step_size

See references/validation_methods.md for purged CV, CPCV, and evaluation metrics.

Model Training Pipeline

Full Pipeline Overview

Feature engineering — compute technical indicators, on-chain metrics, volume features (see feature-engineering skill)
Label creation — forward returns with threshold, drop neutral zone
Walk-forward split — time-ordered train/test windows with gap
Train model — XGBoost or LightGBM on each training window
Predict on test — generate out-of-sample probability predictions
Aggregate predictions — concatenate all out-of-sample results
Evaluate — accuracy, precision, recall, F1, AUC, profit factor

Quick Training Example

from xgboost import XGBClassifier

model = XGBClassifier(
    n_estimators=200,
    max_depth=4,
    learning_rate=0.05,
    subsample=0.8,
    colsample_bytree=0.8,
    eval_metric="logloss",
    use_label_encoder=False,
    random_state=42,
)

model.fit(
    X_train, y_train,
    eval_set=[(X_val, y_val)],
    verbose=False,
)

probabilities = model.predict_proba(X_test)[:, 1]

See references/model_guide.md for parameter recommendations and tuning.

SHAP Feature Importance

SHAP (SHapley Additive exPlanations) provides the gold standard for understanding model predictions.

Global Feature Importance

Which features matter most across all predictions:

import shap

explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X_test)

# Summary plot (top 15 features)
shap.summary_plot(shap_values, X_test, max_display=15)

Local Explanations

Why a specific prediction was made:

# Explain a single prediction
shap.force_plot(explainer.expected_value, shap_values[0], X_test.iloc[0])

Temporal Feature Importance

Track how feature importance drifts over walk-forward windows. If a feature's importance drops significantly, the market regime may have shifted.

Threshold Optimization

The default 0.5 probability threshold is almost never optimal for trading.

Why Not 0.5?

Class imbalance: if 60% of labels are "up", a 0.5 threshold is too aggressive
Trading costs: marginal signals (0.51 probability) rarely cover transaction costs
Asymmetric payoffs: precision matters more than recall for trading

Optimize for Profit Factor

def optimize_threshold(
    probabilities: np.ndarray,
    returns: np.ndarray,
    thresholds: np.ndarray | None = None,
) -> tuple[float, float]:
    """Find threshold that maximizes profit factor.

    Args:
        probabilities: Model predicted probabilities.
        returns: Actual forward returns.
        thresholds: Thresholds to search over.

    Returns:
        Tuple of (best_threshold, best_profit_factor).
    """
    if thresholds is None:
        thresholds = np.arange(0.50, 0.85, 0.01)
    best_threshold, best_pf = 0.5, 0.0
    for t in thresholds:
        signals = probabilities >= t
        if signals.sum() < 10:
            continue
        signal_returns = returns[signals]
        wins = signal_returns[signal_returns > 0].sum()
        losses = abs(signal_returns[signal_returns < 0].sum())
        pf = wins / losses if losses > 0 else 0.0
        if pf > best_pf:
            best_pf = pf
            best_threshold = t
    return best_threshold, best_pf

Typical finding: optimal threshold is 0.60-0.75 for crypto trading signals.

Crypto-Specific Considerations

Short Training Windows

Crypto market regimes change fast. A model trained on 6 months of data may perform worse than one trained on 30 days. Use shorter training windows and retrain frequently.

Class Imbalance

Most time periods are "flat" (returns within the neutral zone). Strategies to handle this:

Drop neutral zone: only train on clear up/down labels
Undersample majority class: scale_pos_weight in XGBoost
SMOTE: synthetic minority oversampling (use cautiously — can introduce lookahead)
Adjust threshold: raise the probability threshold to compensate

Transaction Costs

A model with 55% accuracy sounds good, but after 0.5% round-trip costs (slippage + fees), many signals become unprofitable. Always evaluate signals net of costs:

net_return = gross_return - 0.005  # 50 bps round-trip

Feature Decay

Features lose predictive power over time as more participants discover and trade on them. Monitor rolling performance and retrain when metrics degrade.

Integration with Other Skills

Skill	Integration
`feature-engineering`	Compute input features for the classifier
`vectorbt`	Backtest trading strategies from ML signals
`regime-detection`	Train separate models per regime, or use regime as a feature
`position-sizing`	Size positions based on classifier confidence
`risk-management`	Apply portfolio-level risk limits to ML-generated signals

Files

References

references/model_guide.md — XGBoost and LightGBM parameter guide, tuning, and ensembling
references/validation_methods.md — Walk-forward, purged CV, CPCV, and evaluation metrics

Scripts

scripts/train_classifier.py — Train a signal classifier with walk-forward validation and feature importance
scripts/walk_forward_backtest.py — Backtest ML signals vs buy-and-hold with walk-forward validation

Dependencies

# Core (required)
uv pip install pandas numpy scikit-learn

# Optional (recommended)
uv pip install xgboost lightgbm shap

Key Takeaways

Walk-forward validation is non-negotiable — random CV will give you wildly inflated results
Optimize threshold for profit factor, not accuracy — a high-precision, low-recall model beats a high-accuracy one
Short training windows for crypto — 30 days beats 6 months in most regimes
Monitor feature decay — retrain when rolling metrics drop below baseline
Always evaluate net of costs — a 55% accurate model may be unprofitable after fees
SHAP over raw feature importance — SHAP gives consistent, theoretically grounded explanations