💼 Business Community 🟡 Takes some getting used to 👤 Executives, business leads, marketing

💼 Cost Trend

cost-trend

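Reads the recorded run results and surfaces drift in win rate, latency, escalation rate, and LLM-baseline cost over time.

⚡ ⏱ Proposal draft: 2 days → half a day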


📜 Original English description (for reference)

Read every docs/benchmarks/runs/*.json and surface drift in win rate, latency, escalation rate, and LLM-baseline cost over time

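🇯🇵 Explanation for Japanese creators

In a nutshell

It reads the recorded run results and shows how win rate, latency, escalation rate, and LLM-baseline cost drift over time.

Note: This explanation was added by the jpskill.com editorial team for Japanese business settings. It is reference information, independent of the Skill's actual behavior.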

⬇ Download this Skill (.skill) · View the original source ↗

⚠️ Download and use at your own risk. This site accepts no responsibility for the content, behavior, or safety of the Skill.

🎯 What this Skill can do

The description below explains what this Skill does for you. Ask Claude for something in this area and the Skill is triggered automatically.

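📦 Installation (3 steps)

1. Press the "Download" button above to get the .skill file
2. Change the file extension from .skill to .zip and extract it (macOS can extract it automatically)
3. Place the extracted folder in .claude/skills/ under your home folder
- macOS / Linux: ~/.claude/skills/
- Windows: %USERPROFILE%\.claude\skills\

Restart Claude Code and you are done. You do not need to say "use this Skill to..."; it is invoked automatically for relevant requests.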


Last updated: 2026-05-17
Retrieved: 2026-05-17
Bundled files: 1

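💬 Just say this: sample prompts

› Use Cost Trend to analyze my business and propose three improvements
› Use Cost Trend to prepare materials for next week's meeting
› Use Cost Trend to sort out the current issues and turn them into an action plan

Paste any of these into Claude Code and the Skill is triggered automatically.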

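📖 The original SKILL.md that Claude reads (expanded)

This text is the original (English or Chinese) written for the AI (Claude) to read. Japanese translations are being added over time.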

Cost Trend

The smoke gate is binary (winRate ≥ 0.80 → pass/fail). The corpus benchmarks captured over time form a curve, and curves catch regressions the gate misses: a win rate slowly slipping from 100% to 85% is "still passing" by smoke, but it is a real degradation.

This skill reads every persisted run in docs/benchmarks/runs/*.json and reports first→last deltas plus a per-run series, flagging regressions in win rate or latency.
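The difference between the binary gate and the trend view can be sketched as follows. This is a minimal illustration, not the skill's actual code: the helper names (smokeGate, firstToLastDelta) are hypothetical, the win rates are made-up numbers, and only the 0.80 threshold comes from the text above.

```javascript
// Hypothetical helper names; only the 0.80 threshold is taken from the text.
const SMOKE_THRESHOLD = 0.80;

// Binary gate: answers only "is this single run at or above the threshold?"
function smokeGate(winRate) {
  return winRate >= SMOKE_THRESHOLD;
}

// Trend view: compares the last run of a series with the first.
function firstToLastDelta(series) {
  return series[series.length - 1] - series[0];
}

// Illustrative win rates, oldest first (made-up, not real benchmark data).
const winRates = [1.0, 0.95, 0.9, 0.85];

const allPass = winRates.every(smokeGate);
const delta = firstToLastDelta(winRates);

console.log(`every run passes the gate: ${allPass}`);
console.log(`first→last win-rate delta: ${delta.toFixed(2)}`);
```

Every run in this series clears the gate, yet the first→last delta is clearly negative, which is the kind of slow slide the trend report is meant to expose.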

When to use

- Before a release: check that the speedup hasn't drifted.
- After expanding the corpus: verify that runs on the new corpus still hit the same win rate that older runs hit on the corpus version they reflected.
- After upgrading agent-booster: surface latency / strategy changes.

Steps

1. Run the trend script from the project root:
```
node plugins/ruflo-cost-tracker/scripts/trend.mjs
```
Optional env:
- TREND_FORMAT=json: emit JSON instead of markdown
- TREND_LIMIT=10: consider only the most recent N runs (N = 10 here)
2. Inspect the drift summary: first vs last on win rate, avg latency, p99, escalation rate, speedup vs Gemini.
3. Inspect the per-run series: one row per run, including Sonnet 4.6 + Opus 4.7 baseline latencies if those were enabled (BENCH_ANTHROPIC=1 at run time).
4. Regression flags: the script emits "> ⚠ Regression" callouts when:
- Win rate dropped between first and last run
- Avg latency rose ≥ 1.5× from first run
Cross-references

- cost-benchmark: the producer of the run JSONs this skill consumes
- bench/booster-corpus.json: the corpus version is recorded in each run, so trends across corpus versions remain interpretable
- docs/benchmarks/runs/latest.json: the most recent run; smoke step 23 gates on winRate ≥ 0.80 from this file
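One possible way to use the recorded corpus version when reading a trend is to compare runs only within the same version, so a corpus expansion is not mistaken for a regression. The sketch below assumes hypothetical field names (corpusVersion, winRate) and made-up data; it is not how the skill itself is known to group runs.

```javascript
// Assumed field names: corpusVersion and winRate are placeholders for
// whatever the run JSON actually records.
function winRateDeltaByCorpus(runs) {
  // Group runs by corpus version, preserving oldest-first order.
  const groups = new Map();
  for (const run of runs) {
    if (!groups.has(run.corpusVersion)) groups.set(run.corpusVersion, []);
    groups.get(run.corpusVersion).push(run);
  }
  // First→last win-rate delta inside each version.
  const deltas = {};
  for (const [version, group] of groups) {
    deltas[version] = group[group.length - 1].winRate - group[0].winRate;
  }
  return deltas;
}

// Made-up sample data, oldest first.
const mixedRuns = [
  { corpusVersion: "v1", winRate: 1.0 },
  { corpusVersion: "v1", winRate: 1.0 },
  { corpusVersion: "v2", winRate: 0.9 },
  { corpusVersion: "v2", winRate: 0.85 },
];

console.log(winRateDeltaByCorpus(mixedRuns));
```

In this sample the drop from 1.0 to 0.9 coincides with the corpus change, while the drift inside "v2" is a smaller decline that is attributable to the runs themselves.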