jpskill.com
🛠️ 開発・MCP コミュニティ

validate-data

QA an analysis before sharing -- methodology, accuracy, and bias checks. Use when reviewing an analysis before a stakeholder presentation, spot-checking calculations and aggregation logic, verifying a SQL query's results look right, or assessing whether conclusions are actually supported by the data.

⚡ おすすめ: コマンド1行でインストール(60秒)

下記のコマンドをコピーしてターミナル(Mac/Linux)または PowerShell(Windows)に貼り付けてください。 ダウンロード → 解凍 → 配置まで全自動。

🍎 Mac / 🐧 Linux
mkdir -p ~/.claude/skills && cd ~/.claude/skills && curl -L -o validate-data.zip https://jpskill.com/download/22590.zip && unzip -o validate-data.zip && rm validate-data.zip
🪟 Windows (PowerShell)
$d = "$env:USERPROFILE\.claude\skills"; ni -Force -ItemType Directory $d | Out-Null; iwr https://jpskill.com/download/22590.zip -OutFile "$d\validate-data.zip"; Expand-Archive "$d\validate-data.zip" -DestinationPath $d -Force; ri "$d\validate-data.zip"

完了後、Claude Code を再起動 → 普通に「動画プロンプト作って」のように話しかけるだけで自動発動します。

💾 手動でダウンロードしたい(コマンドが難しい人向け)
  1. 1. 下の青いボタンを押して validate-data.zip をダウンロード
  2. 2. ZIPファイルをダブルクリックで解凍 → validate-data フォルダができる
  3. 3. そのフォルダを C:\Users\あなたの名前\.claude\skills\(Win)または ~/.claude/skills/(Mac)へ移動
  4. 4. Claude Code を再起動

⚠️ ダウンロード・利用は自己責任でお願いします。当サイトは内容・動作・安全性について責任を負いません。

🎯 このSkillでできること

下記の説明文を読むと、このSkillがあなたに何をしてくれるかが分かります。Claudeにこの分野の依頼をすると、自動で発動します。

📦 インストール方法 (3ステップ)

  1. 1. 上の「ダウンロード」ボタンを押して .skill ファイルを取得
  2. 2. ファイル名の拡張子を .skill から .zip に変えて展開(macは自動展開可)
  3. 3. 展開してできたフォルダを、ホームフォルダの .claude/skills/ に置く
    • · macOS / Linux: ~/.claude/skills/
    • · Windows: %USERPROFILE%\.claude\skills\

Claude Code を再起動すれば完了。「このSkillを使って…」と話しかけなくても、関連する依頼で自動的に呼び出されます。

詳しい使い方ガイドを見る →
最終更新
2026-05-18
取得日時
2026-05-18
同梱ファイル
1

📖 Skill本文(日本語訳)

※ 原文(英語/中国語)を Gemini で日本語化したものです。Claude 自身は原文を読みます。誤訳がある場合は原文をご確認ください。

[Skill 名] validate-data

/validate-data - 共有前の分析の検証

見慣れないプレースホルダーが表示された場合や、どのツールが接続されているかを確認する必要がある場合は、CONNECTORS.md を参照してください。

ステークホルダーと共有する前に、分析の正確性、方法論、潜在的なバイアスを確認します。信頼性評価と改善提案を生成します。

使用法

/validate-data <analysis to review>

分析は以下のいずれかです。

  • 会話内のドキュメントまたはレポート
  • ファイル(Markdown、ノートブック、スプレッドシート)
  • SQLクエリとその結果
  • チャートとその基礎となるデータ
  • 方法論と調査結果の説明

ワークフロー

1. 方法論と仮定のレビュー

以下を検討します。

  • 質問の枠組み: 分析は正しい質問に答えていますか?質問は異なる解釈ができますか?
  • データ選択: 適切なテーブル/データセットが使用されていますか?時間範囲は適切ですか?
  • 母集団の定義: 分析の母集団は正しく定義されていますか?意図しない除外はありませんか?
  • メトリックの定義: メトリックは明確かつ一貫して定義されていますか?ステークホルダーの理解と一致していますか?
  • ベースラインと比較: 比較は公平ですか?期間、コホートサイズ、コンテキストは比較可能ですか?

2. 納品前QAチェックリストの実行

以下のチェックリスト(データ品質、計算、妥当性、プレゼンテーションのチェック)を実行します。

3. 一般的な分析の落とし穴の確認

以下の詳細な落とし穴カタログ(結合爆発、生存者バイアス、不完全な期間比較、分母の変動、平均の平均、タイムゾーンの不一致、選択バイアス)に対して体系的にレビューします。

4. 計算と集計の検証

可能な場合は、以下をスポットチェックします。

  • いくつかの主要な数値を個別に再計算する
  • 小計が合計に合計されることを確認する
  • 期待される場所でパーセンテージが100%(またはそれに近い値)に合計されることを確認する
  • YoY/MoM比較が正しい基準期間を使用していることを確認する
  • すべてのメトリックにフィルターが一貫して適用されていることを検証する

以下の結果の健全性チェック手法(規模チェック、相互検証、危険信号検出)を適用します。

5. 可視化の評価

分析にチャートが含まれる場合:

  • 軸は適切な値(棒グラフの場合はゼロ)から始まっていますか?
  • 比較チャート間でスケールは一貫していますか?
  • チャートのタイトルは表示されている内容を正確に説明していますか?
  • 可視化は素早い読者を誤解させる可能性がありますか?
  • 認識を歪める切り詰められた軸、不整合な間隔、または3D効果はありませんか?

6. ナラティブと結論の評価

以下を確認します。

  • 結論は示されたデータによって裏付けられているか
  • 代替の説明が認められているか
  • 不確実性が適切に伝えられているか
  • 提言が調査結果から論理的に導き出されているか
  • 信頼度が証拠の強さと一致しているか

7. 改善提案

具体的で実行可能な提案を提供します。

  • 結論を強化する追加の分析
  • 留意すべき注意点または制限
  • 主要なポイントのためのより良い可視化またはフレーミング
  • ステークホルダーが求めるであろう不足しているコンテキスト

8. 信頼性評価の生成

分析を3段階のスケールで評価します。

共有準備完了 -- 分析は方法論的に健全で、計算は検証済み、注意点は記載済みです。軽微な改善提案はありますが、共有を妨げるものはありません。

注意点を記載して共有 -- 分析は概ね正しいですが、ステークホルダーに伝える必要がある特定の制限または仮定があります。必要な注意点をリストアップしてください。

改訂が必要 -- 共有する前に対応すべき特定の誤り、方法論的な問題、または不足している分析が見つかりました。必要な変更を優先順位を付けてリストアップしてください。

出力形式

## Validation Report

### Overall Assessment: [Ready to share | Share with caveats | Needs revision]

### Methodology Review
[Findings about approach, data selection, definitions]

### Issues Found
1. [Severity: High/Medium/Low] [Issue description and impact]
2. ...

### Calculation Spot-Checks
- [Metric]: [Verified / Discrepancy found]
- ...

### Visualization Review
[Any issues with charts or visual presentation]

### Suggested Improvements
1. [Improvement and why it matters]
2. ...

### Required Caveats for Stakeholders
- [Caveat that must be communicated]
- ...

納品前QAチェックリスト

ステークホルダーと分析を共有する前に、このチェックリストを実行してください。

データ品質チェック

  • [ ] ソース検証: どのテーブル/データソースが使用されたかを確認しました。この質問に対して適切なものですか?
  • [ ] 鮮度: データは分析にとって十分に最新です。「時点」の日付を記載しました。
  • [ ] 完全性: 時系列に予期せぬギャップや欠落したセグメントはありません。
  • [ ] Null処理: 主要な列のNull率を確認しました。Nullは適切に処理されています(除外、補完、またはフラグ付け)。
  • [ ] 重複排除: 不適切な結合や重複するソースレコードによる二重カウントがないことを確認しました。
  • [ ] フィルター検証: すべてのWHERE句とフィルターが正しいです。意図しない除外はありません。

計算チェック

  • [ ] 集計ロジック: GROUP BYには、集計されていないすべての列が含まれています。集計レベルは分析の粒度と一致しています。
  • [ ] 分母の正確性: レートとパーセンテージの計算は正しい分母を使用しています。分母はゼロではありません。
  • [ ] 日付の整合性: 比較は同じ期間の長さを使用しています。部分的な期間は除外されているか、記載されています。
  • [ ] 結合の正確性: JOINタイプは適切です(INNER vs LEFT)。多対多の結合によってカウントが膨らんでいません。
  • [ ] メトリックの定義: メトリックはステークホルダーが定義する方法と一致しています。逸脱がある場合は記載されています。
  • [ ] 小計の合計: 期待される場所で部分が全体に合計されます。合計されない場合は、その理由を説明します(例:重複)。

妥当性チェック

  • [ ] 規模: 数値は妥当な範囲内です。収益はマイナスではありません。パーセンテージは0〜100%の間です。
  • [ ] トレンドの連続性: 時系列に説明のつかない急上昇や急落はありません。
  • [ ] 相互参照: 主要な数値は他の既知のソース(ダッシュボード、以前のレポート、財務)と一致しています。
📜 原文 SKILL.md(Claudeが読む英語/中国語)を展開

/validate-data - Validate Analysis Before Sharing

If you see unfamiliar placeholders or need to check which tools are connected, see CONNECTORS.md.

Review an analysis for accuracy, methodology, and potential biases before sharing with stakeholders. Generates a confidence assessment and improvement suggestions.

Usage

/validate-data <analysis to review>

The analysis can be:

  • A document or report in the conversation
  • A file (markdown, notebook, spreadsheet)
  • SQL queries and their results
  • Charts and their underlying data
  • A description of methodology and findings

Workflow

1. Review Methodology and Assumptions

Examine:

  • Question framing: Is the analysis answering the right question? Could the question be interpreted differently?
  • Data selection: Are the right tables/datasets being used? Is the time range appropriate?
  • Population definition: Is the analysis population correctly defined? Are there unintended exclusions?
  • Metric definitions: Are metrics defined clearly and consistently? Do they match how stakeholders understand them?
  • Baseline and comparison: Is the comparison fair? Are time periods, cohort sizes, and contexts comparable?

2. Run the Pre-Delivery QA Checklist

Work through the checklist below — data quality, calculation, reasonableness, and presentation checks.

3. Check for Common Analytical Pitfalls

Systematically review against the detailed pitfall catalog below (join explosion, survivorship bias, incomplete period comparison, denominator shifting, average of averages, timezone mismatches, selection bias).

4. Verify Calculations and Aggregations

Where possible, spot-check:

  • Recalculate a few key numbers independently
  • Verify that subtotals sum to totals
  • Check that percentages sum to 100% (or close to it) where expected
  • Confirm that YoY/MoM comparisons use the correct base periods
  • Validate that filters are applied consistently across all metrics

Apply the result sanity-checking techniques below (magnitude checks, cross-validation, red-flag detection).

5. Assess Visualizations

If the analysis includes charts:

  • Do axes start at appropriate values (zero for bar charts)?
  • Are scales consistent across comparison charts?
  • Do chart titles accurately describe what's shown?
  • Could the visualization mislead a quick reader?
  • Are there truncated axes, inconsistent intervals, or 3D effects that distort perception?

6. Evaluate Narrative and Conclusions

Review whether:

  • Conclusions are supported by the data shown
  • Alternative explanations are acknowledged
  • Uncertainty is communicated appropriately
  • Recommendations follow logically from findings
  • The level of confidence matches the strength of evidence

7. Suggest Improvements

Provide specific, actionable suggestions:

  • Additional analyses that would strengthen the conclusions
  • Caveats or limitations that should be noted
  • Better visualizations or framings for key points
  • Missing context that stakeholders would want

8. Generate Confidence Assessment

Rate the analysis on a 3-level scale:

Ready to share -- Analysis is methodologically sound, calculations verified, caveats noted. Minor suggestions for improvement but nothing blocking.

Share with noted caveats -- Analysis is largely correct but has specific limitations or assumptions that must be communicated to stakeholders. List the required caveats.

Needs revision -- Found specific errors, methodological issues, or missing analyses that should be addressed before sharing. List the required changes with priority order.

Output Format

## Validation Report

### Overall Assessment: [Ready to share | Share with caveats | Needs revision]

### Methodology Review
[Findings about approach, data selection, definitions]

### Issues Found
1. [Severity: High/Medium/Low] [Issue description and impact]
2. ...

### Calculation Spot-Checks
- [Metric]: [Verified / Discrepancy found]
- ...

### Visualization Review
[Any issues with charts or visual presentation]

### Suggested Improvements
1. [Improvement and why it matters]
2. ...

### Required Caveats for Stakeholders
- [Caveat that must be communicated]
- ...

Pre-Delivery QA Checklist

Run through this checklist before sharing any analysis with stakeholders.

Data Quality Checks

  • [ ] Source verification: Confirmed which tables/data sources were used. Are they the right ones for this question?
  • [ ] Freshness: Data is current enough for the analysis. Noted the "as of" date.
  • [ ] Completeness: No unexpected gaps in time series or missing segments.
  • [ ] Null handling: Checked null rates in key columns. Nulls are handled appropriately (excluded, imputed, or flagged).
  • [ ] Deduplication: Confirmed no double-counting from bad joins or duplicate source records.
  • [ ] Filter verification: All WHERE clauses and filters are correct. No unintended exclusions.

Calculation Checks

  • [ ] Aggregation logic: GROUP BY includes all non-aggregated columns. Aggregation level matches the analysis grain.
  • [ ] Denominator correctness: Rate and percentage calculations use the right denominator. Denominators are non-zero.
  • [ ] Date alignment: Comparisons use the same time period length. Partial periods are excluded or noted.
  • [ ] Join correctness: JOIN types are appropriate (INNER vs LEFT). Many-to-many joins haven't inflated counts.
  • [ ] Metric definitions: Metrics match how stakeholders define them. Any deviations are noted.
  • [ ] Subtotals sum: Parts add up to the whole where expected. If they don't, explain why (e.g., overlap).

Reasonableness Checks

  • [ ] Magnitude: Numbers are in a plausible range. Revenue isn't negative. Percentages are between 0-100%.
  • [ ] Trend continuity: No unexplained jumps or drops in time series.
  • [ ] Cross-reference: Key numbers match other known sources (dashboards, previous reports, finance data).
  • [ ] Order of magnitude: Total revenue is in the right ballpark. User counts match known figures.
  • [ ] Edge cases: What happens at the boundaries? Empty segments, zero-activity periods, new entities.

Presentation Checks

  • [ ] Chart accuracy: Bar charts start at zero. Axes are labeled. Scales are consistent across panels.
  • [ ] Number formatting: Appropriate precision. Consistent currency/percentage formatting. Thousands separators where needed.
  • [ ] Title clarity: Titles state the insight, not just the metric. Date ranges are specified.
  • [ ] Caveat transparency: Known limitations and assumptions are stated explicitly.
  • [ ] Reproducibility: Someone else could recreate this analysis from the documentation provided.

Common Data Analysis Pitfalls

Join Explosion

The problem: A many-to-many join silently multiplies rows, inflating counts and sums.

How to detect:

-- Check row count before and after join
SELECT COUNT(*) FROM table_a;  -- 1,000
SELECT COUNT(*) FROM table_a a JOIN table_b b ON a.id = b.a_id;  -- 3,500 (uh oh)

How to prevent:

  • Always check row counts after joins
  • If counts increase, investigate the join relationship (is it really 1:1 or 1:many?)
  • Use COUNT(DISTINCT a.id) instead of COUNT(*) when counting entities through joins

Survivorship Bias

The problem: Analyzing only entities that exist today, ignoring those that were deleted, churned, or failed.

Examples:

  • Analyzing user behavior of "current users" misses churned users
  • Looking at "companies using our product" ignores those who evaluated and left
  • Studying properties of "successful" outcomes without "unsuccessful" ones

How to prevent: Ask "who is NOT in this dataset?" before drawing conclusions.

Incomplete Period Comparison

The problem: Comparing a partial period to a full period.

Examples:

  • "January revenue is $500K vs. December's $800K" -- but January isn't over yet
  • "This week's signups are down" -- checked on Wednesday, comparing to a full prior week

How to prevent: Always filter to complete periods, or compare same-day-of-month / same-number-of-days.

Denominator Shifting

The problem: The denominator changes between periods, making rates incomparable.

Examples:

  • Conversion rate improves because you changed how you count "eligible" users
  • Churn rate changes because the definition of "active" was updated

How to prevent: Use consistent definitions across all compared periods. Note any definition changes.

Average of Averages

The problem: Averaging pre-computed averages gives wrong results when group sizes differ.

Example:

  • Group A: 100 users, average revenue $50
  • Group B: 10 users, average revenue $200
  • Wrong: Average of averages = ($50 + $200) / 2 = $125
  • Right: Weighted average = (100$50 + 10$200) / 110 = $63.64

How to prevent: Always aggregate from raw data. Never average pre-aggregated averages.

Timezone Mismatches

The problem: Different data sources use different timezones, causing misalignment.

Examples:

  • Event timestamps in UTC vs. user-facing dates in local time
  • Daily rollups that use different cutoff times

How to prevent: Standardize all timestamps to a single timezone (UTC recommended) before analysis. Document the timezone used.

Selection Bias in Segmentation

The problem: Segments are defined by the outcome you're measuring, creating circular logic.

Examples:

  • "Users who completed onboarding have higher retention" -- obviously, they self-selected
  • "Power users generate more revenue" -- they became power users BY generating revenue

How to prevent: Define segments based on pre-treatment characteristics, not outcomes.

Other Statistical Traps

  • Simpson's paradox: Trend reverses when data is aggregated vs. segmented
  • Correlation presented as causation without supporting evidence
  • Small sample sizes leading to unreliable conclusions
  • Outliers disproportionately affecting averages (should medians be used instead?)
  • Multiple testing / cherry-picking significant results
  • Look-ahead bias: Using future information to explain past events
  • Cherry-picked time ranges that favor a particular narrative

Result Sanity Checking

Magnitude Checks

For any key number in your analysis, verify it passes the "smell test":

Metric Type Sanity Check
User counts Does this match known MAU/DAU figures?
Revenue Is this in the right order of magnitude vs. known ARR?
Conversion rates Is this between 0% and 100%? Does it match dashboard figures?
Growth rates Is 50%+ MoM growth realistic, or is there a data issue?
Averages Is the average reasonable given what you know about the distribution?
Percentages Do segment percentages sum to ~100%?

Cross-Validation Techniques

  1. Calculate the same metric two different ways and verify they match
  2. Spot-check individual records -- pick a few specific entities and trace their data manually
  3. Compare to known benchmarks -- match against published dashboards, finance reports, or prior analyses
  4. Reverse engineer -- if total revenue is X, does per-user revenue times user count approximately equal X?
  5. Boundary checks -- what happens when you filter to a single day, a single user, or a single category? Are those micro-results sensible?

Red Flags That Warrant Investigation

  • Any metric that changed by more than 50% period-over-period without an obvious cause
  • Counts or sums that are exact round numbers (suggests a filter or default value issue)
  • Rates exactly at 0% or 100% (may indicate incomplete data)
  • Results that perfectly confirm the hypothesis (reality is usually messier)
  • Identical values across time periods or segments (suggests the query is ignoring a dimension)

Documentation Standards for Reproducibility

Analysis Documentation Template

Every non-trivial analysis should include:

## Analysis: [Title]

### Question
[The specific question being answered]

### Data Sources
- Table: [schema.table_name] (as of [date])
- Table: [schema.other_table] (as of [date])
- File: [filename] (source: [where it came from])

### Definitions
- [Metric A]: [Exactly how it's calculated]
- [Segment X]: [Exactly how membership is determined]
- [Time period]: [Start date] to [end date], [timezone]

### Methodology
1. [Step 1 of the analysis approach]
2. [Step 2]
3. [Step 3]

### Assumptions and Limitations
- [Assumption 1 and why it's reasonable]
- [Limitation 1 and its potential impact on conclusions]

### Key Findings
1. [Finding 1 with supporting evidence]
2. [Finding 2 with supporting evidence]

### SQL Queries
[All queries used, with comments]

### Caveats
- [Things the reader should know before acting on this]

Code Documentation

For any code (SQL, Python) that may be reused:

"""
Analysis: Monthly Cohort Retention
Author: [Name]
Date: [Date]
Data Source: events table, users table
Last Validated: [Date] -- results matched dashboard within 2%

Purpose:
    Calculate monthly user retention cohorts based on first activity date.

Assumptions:
    - "Active" means at least one event in the month
    - Excludes test/internal accounts (user_type != 'internal')
    - Uses UTC dates throughout

Output:
    Cohort retention matrix with cohort_month rows and months_since_signup columns.
    Values are retention rates (0-100%).
"""

Version Control for Analyses

  • Save queries and code in version control (git) or a shared docs system
  • Note the date of the data snapshot used
  • If an analysis is re-run with updated data, document what changed and why
  • Link to prior versions of recurring analyses for trend comparison

Examples

/validate-data Review this quarterly revenue analysis before I send it to the exec team: [analysis]
/validate-data Check my churn analysis -- I'm comparing Q4 churn rates to Q3 but Q4 has a shorter measurement window
/validate-data Here's a SQL query and its results for our conversion funnel. Does the logic look right? [query + results]

Tips

  • Run /validate-data before any high-stakes presentation or decision
  • Even quick analyses benefit from a sanity check -- it takes a minute and can save your credibility
  • If the validation finds issues, fix them and re-validate
  • Share the validation output alongside your analysis to build stakeholder confidence