🛠️ 開発・MCP コミュニティ

content-evaluation-framework

書籍の章や教材などの質を、技術的な正確さ、教育効果、文章の質など6つのカテゴリーで評価し、段階的な評価と重み付けされたスコアリングによって改善点を見つけ出すSkill。

📜 元の英語説明(参考)

This skill should be used when evaluating the quality of book chapters, lessons, or educational content. It provides a systematic 6-category rubric with weighted scoring (Technical Accuracy 30%, Pedagogical Effectiveness 25%, Writing Quality 20%, Structure & Organization 15%, AI-First Teaching 10%, Constitution Compliance Pass/Fail) and multi-tier assessment (Excellent/Good/Needs Work/Insufficient). Use this during iterative drafting, after content completion, on-demand review requests, or before validation phases.

🇯🇵 日本人クリエイター向け解説

一言でいうと

※ jpskill.com 編集部が日本のビジネス現場向けに補足した解説です。Skill本体の挙動とは独立した参考情報です。

⚡ おすすめ: コマンド1行でインストール(60秒)

下記のコマンドをコピーしてターミナル(Mac/Linux)または PowerShell(Windows)に貼り付けてください。ダウンロード → 解凍 → 配置まで全自動。

🍎 Mac / 🐧 Linux

mkdir -p ~/.claude/skills && cd ~/.claude/skills && curl -L -o content-evaluation-framework.zip https://jpskill.com/download/16858.zip && unzip -o content-evaluation-framework.zip && rm content-evaluation-framework.zip

🪟 Windows (PowerShell)

$d = "$env:USERPROFILE\.claude\skills"; ni -Force -ItemType Directory $d | Out-Null; iwr https://jpskill.com/download/16858.zip -OutFile "$d\content-evaluation-framework.zip"; Expand-Archive "$d\content-evaluation-framework.zip" -DestinationPath $d -Force; ri "$d\content-evaluation-framework.zip"

完了後、Claude Code を再起動 → 普通に「動画プロンプト作って」のように話しかけるだけで自動発動します。

💾 手動でダウンロードしたい(コマンドが難しい人向け)

1. 下の青いボタンを押して content-evaluation-framework.zip をダウンロード
2. ZIPファイルをダブルクリックで解凍 → content-evaluation-framework フォルダができる
3. そのフォルダを C:\Users\あなたの名前\.claude\skills\(Win)または ~/.claude/skills/(Mac)へ移動
4. Claude Code を再起動

⬇ .zip でダウンロード(推奨) ⬇ .skill 形式(上級者用) 元のソース ↗

⚠️ ダウンロード・利用は自己責任でお願いします。当サイトは内容・動作・安全性について責任を負いません。

🎯 このSkillでできること

下記の説明文を読むと、このSkillがあなたに何をしてくれるかが分かります。Claudeにこの分野の依頼をすると、自動で発動します。

📦 インストール方法 (3ステップ)

1. 上の「ダウンロード」ボタンを押して .skill ファイルを取得
2. ファイル名の拡張子を .skill から .zip に変えて展開(macは自動展開可)
3. 展開してできたフォルダを、ホームフォルダの .claude/skills/ に置く
- · macOS / Linux: ~/.claude/skills/
- · Windows: %USERPROFILE%\.claude\skills\

Claude Code を再起動すれば完了。「このSkillを使って…」と話しかけなくても、関連する依頼で自動的に呼び出されます。

詳しい使い方ガイドを見る →

最終更新: 2026-05-18
取得日時: 2026-05-18
同梱ファイル: 4

📖 Skill本文(日本語訳)

※ 原文(英語/中国語)を Gemini で日本語化したものです。Claude 自身は原文を読みます。誤訳がある場合は原文をご確認ください。

コンテンツ評価フレームワーク

このスキルは、教育的な書籍の章やレッスンを定量化可能な品質基準で評価するための、包括的かつ体系的なルーブリックを提供します。

Constitution Alignment: v4.0.1 を重視:

Principle 1: Specification Primacy ("Specs Are the New Syntax")
Section IIa: Panaversity 4-Layer Teaching Method
Section IIb: AI Three Roles Framework (双方向の共同学習)
8 Foundational Principles: Factual Accuracy、Coherent Structure、Progressive Complexity を含む
Nine Pillars (Section I): AI CLI、Markdown、MCP、AI-First IDEs、Cross-Platform、TDD、SDD、Composable Skills、Cloud-Native

目的

以下の点を保証するために、6つの重み付けされたカテゴリにわたって教育コンテンツを評価します。

技術的な正確性とコード品質
効果的な教育設計と学習成果
対象読者にとって明確でアクセスしやすい文章
適切な構造と構成
AI 拡張学習の原則 (AI から生成するのではなく、AI と共に学習する)
Constitution への準拠と標準の遵守

このスキルを使用するタイミング

この評価フレームワークを複数のチェックポイントで呼び出します。

反復的なドラフト作成中 - 問題を早期に発見するためのプロセス中間の品質チェック
レッスン/章の完了後 - 次のコンテンツユニットに進む前の包括的な評価
オンデマンドレビューリクエスト - ユーザーが品質評価を明示的に要求した場合
検証フェーズの前 - 最終承認のための SDD の Validate フェーズのワークフローの一部

評価方法

スコアリングシステム

多段階評価:

Excellent (90-100%) - 標準を超え、模範的な品質
Good (75-89%) - すべての標準を満たし、わずかな改善が可能
Needs Work (50-74%) - 一部の標準を満たしていますが、大幅な修正が必要です
Insufficient (<50%) - 最小限の標準を満たしておらず、大幅な手直しが必要です

重み付けされたカテゴリ

評価では、次の重みを持つ 6 つのカテゴリを使用します。

カテゴリ	重み	注力分野
Technical Accuracy	30%	コードの正確性、型ヒント、説明、例が述べられているように機能する
Pedagogical Effectiveness	25%	Show-then-explain パターン、段階的な複雑さ、質の高い演習
Writing Quality	20%	可読性 (Flesch-Kincaid 8-10)、ボイス、明瞭さ、学年レベルへの適合性
Structure & Organization	15%	学習目標の達成、論理的な流れ、適切な長さ、トランジション
AI-First Teaching	10%	共同学習パートナーシップの実証、Three Roles Framework の表示、Nine Pillars の整合、Specs-As-Syntax の強調
Constitution Compliance	Pass/Fail	Nine Pillars の整合 (ゲート) を含む、交渉の余地のないすべての憲法上の要件に合格する必要があります

合計重み付けスコアの計算:

Final Score = (Technical × 0.30) + (Pedagogical × 0.25) + (Writing × 0.20) +
              (Structure × 0.15) + (AI-First × 0.10)

Constitution Compliance: "Pass" ステータスを達成する必要があります。"Fail" の場合、重み付けされたスコアに関係なく、コンテンツは続行できません。

評価の実施方法

ステップ 1: コンテキストの準備

評価の前に、以下を収集します。

評価対象のコンテンツ (lesson.md、chapter.md、またはセクションファイル)
specs/<feature>/ からの関連する spec、plan、および tasks ファイル
Constitution ファイル (.specify/memory/constitution.md)
コンテンツユニットの学習目標と成功基準
使用される出力スタイルテンプレート (.claude/output-styles/lesson.md など)

ステップ 2: 詳細なルーブリックのロード

各カテゴリの詳細な階層基準を読みます。

Read: references/rubric-details.md

このファイルには、6 つのカテゴリそれぞれについて、Excellent/Good/Needs Work/Insufficient を定義する特定の基準が含まれています。

ステップ 3: 最初に Constitution Compliance を評価する

Constitution compliance は ゲート です。コンテンツが憲法上の要件を満たしていない場合、続行できません。

Constitution チェックリストを使用します。

Read: references/constitution-checklist.md

交渉の余地のないすべての原則と要件を評価します。特定の違反を指摘して、Pass または Fail としてマークします。

Constitution Compliance = Fail の場合: 評価を停止し、直ちに違反を報告します。続行する前にコンテンツを修正する必要があります。

Constitution Compliance = Pass の場合: 重み付けされたカテゴリの評価に進みます。

ステップ 4: 各重み付けされたカテゴリをスコアリングする

5 つの重み付けされたカテゴリ (Technical Accuracy、Pedagogical Effectiveness、Writing Quality、Structure & Organization、AI-First Teaching) のそれぞれについて:

そのカテゴリの rubric-details.md から 特定の基準を確認 します
各階層の基準に対して コンテンツを評価 します
階層 (Excellent/Good/Needs Work/Insufficient) をスコア範囲で 割り当て ます
具体的な証拠を記録 します - 例を引用し、行番号をメモし、特定の箇所を引用します
改善の推奨事項を提供する します - 具体的で実行可能なフィードバック

ステップ 5: 重み付けされたスコアを計算する

重み付けされた式を適用します。

Final Score = (Technical × 0.30) + (Pedagogical × 0.25) + (Writing × 0.20) +
              (Structure × 0.15) + (AI-First × 0.10)

階層スコアを数値に変換します。

Excellent: 95%
Good: 82%
Needs Work: 62%
Insufficient: 40%

(または、正当な理由がある場合は、階層範囲内の特定の数値スコアを使用します)

ステップ 6: 評価レポートを生成する

構造化された評価テンプレートを使用します。

Read: references/evaluation-template.md

すべてのセクションを完了します。

Executive Summary - 全体的なスコア、階層、合格/不合格ステータス
Category Scores - 各カテゴリのスコア、階層、および重み付けの貢献度を示す表
Detailed Findings - 各カテゴリの証拠に基づいた評価
Strengths - コンテンツが優れている点 (具体的な例)
Areas for Improvement - 推奨事項を含む問題の優先順位付けされたリスト
Constitution Compliance Status - 特定の原則チェックによる合格/不合格
Actionable Next Steps - コンテンツを改善するための具体的なタスク

ステップ 7: 結果を伝達する

評価レポートを以下と共に提示します。

**Cl

📜 原文 SKILL.md(Claudeが読む英語/中国語)を展開

Content Evaluation Framework

This skill provides a comprehensive, systematic rubric for evaluating educational book chapters and lessons with quantifiable quality standards.

Constitution Alignment: v4.0.1 emphasizing:

Principle 1: Specification Primacy ("Specs Are the New Syntax")
Section IIa: Panaversity 4-Layer Teaching Method
Section IIb: AI Three Roles Framework (bidirectional co-learning)
8 Foundational Principles: Including Factual Accuracy, Coherent Structure, Progressive Complexity
Nine Pillars (Section I): AI CLI, Markdown, MCP, AI-First IDEs, Cross-Platform, TDD, SDD, Composable Skills, Cloud-Native

Purpose

Evaluate educational content across 6 weighted categories to ensure:

Technical correctness and code quality
Effective pedagogical design and learning outcomes
Clear, accessible writing for target audience
Proper structure and organization
AI-augmented learning principles (learning WITH AI, not generating FROM AI)
Constitution compliance and standards adherence

When to Use This Skill

Invoke this evaluation framework at multiple checkpoints:

During Iterative Drafting - Mid-process quality checks to catch issues early
After Lesson/Chapter Completion - Comprehensive evaluation before moving to next content unit
On-Demand Review Requests - When user explicitly asks for quality assessment
Before Validation Phase - Part of the SDD Validate phase workflow for final sign-off

Evaluation Methodology

Scoring System

Multi-Tier Assessment:

Excellent (90-100%) - Exceeds standards, exemplary quality
Good (75-89%) - Meets all standards with minor improvements possible
Needs Work (50-74%) - Meets some standards but requires significant revision
Insufficient (<50%) - Does not meet minimum standards, requires major rework

Weighted Categories

The evaluation uses 6 categories with the following weights:

Category	Weight	Focus Area
Technical Accuracy	30%	Code correctness, type hints, explanations, examples work as stated
Pedagogical Effectiveness	25%	Show-then-explain pattern, progressive complexity, quality exercises
Writing Quality	20%	Readability (Flesch-Kincaid 8-10), voice, clarity, grade-level appropriateness
Structure & Organization	15%	Learning objectives met, logical flow, appropriate length, transitions
AI-First Teaching	10%	Co-learning partnership demonstrated, Three Roles Framework shown, Nine Pillars aligned, Specs-As-Syntax emphasized
Constitution Compliance	Pass/Fail	Must pass all non-negotiable constitutional requirements including Nine Pillars alignment (gate)

Total Weighted Score Calculation:

Final Score = (Technical × 0.30) + (Pedagogical × 0.25) + (Writing × 0.20) +
              (Structure × 0.15) + (AI-First × 0.10)

Constitution Compliance: Must achieve "Pass" status. If "Fail," content cannot proceed regardless of weighted score.

How to Conduct an Evaluation

Step 1: Prepare Context

Before evaluation, gather:

Content being evaluated (lesson.md, chapter.md, or section file)
Relevant spec, plan, and tasks files from specs/<feature>/
Constitution file (.specify/memory/constitution.md)
Learning objectives and success criteria for the content unit
Output style template used (.claude/output-styles/lesson.md or similar)

Step 2: Load Detailed Rubric

Read the detailed tier criteria for each category:

Read: references/rubric-details.md

This file contains specific criteria defining Excellent/Good/Needs Work/Insufficient for each of the 6 categories.

Step 3: Evaluate Constitution Compliance First

Constitution compliance is a gate - if content fails constitutional requirements, it cannot proceed.

Use the constitution checklist:

Read: references/constitution-checklist.md

Assess all non-negotiable principles and requirements. Mark as Pass or Fail with specific violations noted.

If Constitution Compliance = Fail: Stop evaluation and report violations immediately. Content must be revised before proceeding.

If Constitution Compliance = Pass: Continue to weighted category evaluation.

Step 4: Score Each Weighted Category

For each of the 5 weighted categories (Technical Accuracy, Pedagogical Effectiveness, Writing Quality, Structure & Organization, AI-First Teaching):

Review specific criteria from rubric-details.md for that category
Assess content against criteria for each tier
Assign tier (Excellent/Good/Needs Work/Insufficient) with score range
Record specific evidence - Quote examples, note line numbers, cite specific passages
Provide improvement recommendations - Concrete, actionable feedback

Step 5: Calculate Weighted Score

Apply the weighted formula:

Final Score = (Technical × 0.30) + (Pedagogical × 0.25) + (Writing × 0.20) +
              (Structure × 0.15) + (AI-First × 0.10)

Convert tier scores to numeric values:

Excellent: 95%
Good: 82%
Needs Work: 62%
Insufficient: 40%

(Or use specific numeric score within tier range if warranted)

Step 6: Generate Evaluation Report

Use the structured evaluation template:

Read: references/evaluation-template.md

Complete all sections:

Executive Summary - Overall score, tier, pass/fail status
Category Scores - Table showing each category score, tier, and weight contribution
Detailed Findings - Evidence-based assessment for each category
Strengths - What the content does well (specific examples)
Areas for Improvement - Prioritized list of issues with recommendations
Constitution Compliance Status - Pass/Fail with specific principle checks
Actionable Next Steps - Concrete tasks to improve content

Step 7: Communicate Results

Present evaluation report with:

Clear verdict - Pass/Fail and overall quality tier
Evidence-based feedback - Specific quotes and line numbers
Prioritized improvements - Most critical issues first
Encouragement - Acknowledge strengths and effort

Evaluation Best Practices

Be Objective and Evidence-Based

Quote specific passages from content being evaluated
Reference line numbers or section headers
Compare against objective rubric criteria, not subjective preference
Use concrete metrics where possible (word count, readability scores, etc.)

Focus on Standards, Not Perfection

Content rated "Good" (75-89%) is publication-ready with minor polish
Content rated "Excellent" (90-100%) exceeds standards but is not required
Focus improvements on moving "Needs Work" → "Good" before "Good" → "Excellent"

Provide Actionable Feedback

Don't just say "improve clarity" - specify which sentences are unclear and suggest rewrites
Don't just say "add examples" - suggest specific example types that would help
Prioritize recommendations: critical (blocking issues) → important → nice-to-have

Respect the Learning Journey

Recognize iterative improvement - drafts evolve through multiple passes
Celebrate progress and strengths
Frame criticism constructively as opportunities for growth
Remember: the goal is helping create excellent educational content, not gatekeeping

Quality Gates and Thresholds

Minimum Acceptance Threshold

Constitution Compliance: MUST be Pass (gate)
Overall Weighted Score: MUST be ≥ 75% (Good or better)
No category below 50%: Each individual category must achieve at least "Needs Work" tier

Recommended for Publication

Constitution Compliance: Pass
Overall Weighted Score: ≥ 82% (Good tier)
Technical Accuracy: ≥ 75% (Good tier) - Critical for credibility
Pedagogical Effectiveness: ≥ 75% (Good tier) - Critical for learning outcomes

Exemplary Content (Optional)

Overall Weighted Score: ≥ 90% (Excellent tier)
At least 3 categories at Excellent tier
No categories below Good tier

Common Evaluation Scenarios

Scenario 1: Mid-Draft Check (Iterative)

Context: Writer requests feedback on partial draft Approach:

Focus on foundational issues (structure, learning objectives, concept scaffolding)
Flag critical issues early (technical errors, constitution violations)
Provide guidance for remaining sections
Don't expect polish - prioritize content completeness and correctness

Scenario 2: Completion Review

Context: Writer believes content is complete and ready for validation Approach:

Conduct full evaluation across all 6 categories
Calculate final weighted score
Check all quality gates and thresholds
Provide comprehensive report with prioritized improvements
Determine if content meets publication standards

Scenario 3: Pre-Validation Quality Gate

Context: Content enters SDD Validate phase Approach:

Verify constitution compliance (gate)
Confirm minimum acceptance threshold (≥75%)
Validate all category scores meet minimums
Generate pass/fail recommendation with evidence
If fails gate: return to implementation with specific revision tasks

Scenario 4: On-Demand Spot Check

Context: User asks "How's this looking?" for specific section Approach:

Evaluate relevant categories for that section (may not be all 6)
Provide quick feedback on specific concerns
Highlight any critical issues
Suggest improvements without full formal report
Use judgment on depth based on context

Resources and References

This skill includes detailed reference materials:

references/rubric-details.md - Comprehensive tier criteria for all 6 categories with specific indicators
references/constitution-checklist.md - Pass/Fail checklist for constitutional compliance evaluation
references/evaluation-template.md - Structured template for consistent evaluation reports

Load these references as needed during evaluation to ensure consistency and thoroughness.

Example Evaluation Flow

User Request: "Please evaluate this lesson draft: apps/learn-app/docs/chapter-3/lesson-2.md"

Evaluation Process:

Read content: apps/learn-app/docs/chapter-3/lesson-2.md
Load context: spec, plan, constitution, learning objectives
Check constitution compliance: references/constitution-checklist.md
- Result: Pass (all non-negotiables met)
Load detailed rubric: references/rubric-details.md
Evaluate each category:
- Technical Accuracy: Good (80%) - Code works, minor type hint gaps
- Pedagogical Effectiveness: Excellent (92%) - Strong scaffolding, great exercises
- Writing Quality: Good (78%) - Clear writing, minor readability improvements
- Structure & Organization: Good (85%) - Good flow, all LOs met
- AI-First Teaching: Needs Work (65%) - AI exercises present but weak guidance
Calculate weighted score:
- (80×0.30) + (92×0.25) + (78×0.20) + (85×0.15) + (65×0.10) = 81.55%
- Final Tier: Good (81.55%)
Load template: references/evaluation-template.md
Generate report with findings, strengths, improvements, next steps
Communicate verdict: "Good (81.55%) - Ready for publication with minor improvements to AI-First Teaching section"

Use this skill to maintain consistent, objective, evidence-based quality standards for all educational content.

同梱ファイル

※ ZIPに含まれるファイル一覧。`SKILL.md` 本体に加え、参考資料・サンプル・スクリプトが入っている場合があります。

📄 SKILL.md (12,161 bytes)
📎 references/constitution-checklist.md (19,047 bytes)
📎 references/evaluation-template.md (14,308 bytes)
📎 references/rubric-details.md (26,184 bytes)