scanlume-ocr-api
Use when calling the Scanlume OCR API for screenshots, JPG, PNG, or image-based tables, especially when a task needs base64 data URLs, mode selection between simple and formatted OCR, or table-aware structured output. Tambem use quando for necessario chamar a API OCR do https://www.scanlume.com/ para screenshots, JPG, PNG ou tabelas em imagem.
下記のコマンドをコピーしてターミナル(Mac/Linux)または PowerShell(Windows)に貼り付けてください。 ダウンロード → 解凍 → 配置まで全自動。
mkdir -p ~/.claude/skills && cd ~/.claude/skills && curl -L -o scanlume-ocr-api.zip https://jpskill.com/download/18259.zip && unzip -o scanlume-ocr-api.zip && rm scanlume-ocr-api.zip
$d = "$env:USERPROFILE\.claude\skills"; ni -Force -ItemType Directory $d | Out-Null; iwr https://jpskill.com/download/18259.zip -OutFile "$d\scanlume-ocr-api.zip"; Expand-Archive "$d\scanlume-ocr-api.zip" -DestinationPath $d -Force; ri "$d\scanlume-ocr-api.zip"
完了後、Claude Code を再起動 → 普通に「動画プロンプト作って」のように話しかけるだけで自動発動します。
💾 手動でダウンロードしたい(コマンドが難しい人向け)
- 1. 下の青いボタンを押して
scanlume-ocr-api.zipをダウンロード - 2. ZIPファイルをダブルクリックで解凍 →
scanlume-ocr-apiフォルダができる - 3. そのフォルダを
C:\Users\あなたの名前\.claude\skills\(Win)または~/.claude/skills/(Mac)へ移動 - 4. Claude Code を再起動
⚠️ ダウンロード・利用は自己責任でお願いします。当サイトは内容・動作・安全性について責任を負いません。
🎯 このSkillでできること
下記の説明文を読むと、このSkillがあなたに何をしてくれるかが分かります。Claudeにこの分野の依頼をすると、自動で発動します。
📦 インストール方法 (3ステップ)
- 1. 上の「ダウンロード」ボタンを押して .skill ファイルを取得
- 2. ファイル名の拡張子を .skill から .zip に変えて展開(macは自動展開可)
- 3. 展開してできたフォルダを、ホームフォルダの
.claude/skills/に置く- · macOS / Linux:
~/.claude/skills/ - · Windows:
%USERPROFILE%\.claude\skills\
- · macOS / Linux:
Claude Code を再起動すれば完了。「このSkillを使って…」と話しかけなくても、関連する依頼で自動的に呼び出されます。
詳しい使い方ガイドを見る →- 最終更新
- 2026-05-18
- 取得日時
- 2026-05-18
- 同梱ファイル
- 4
📖 Claude が読む原文 SKILL.md(中身を展開)
この本文は AI(Claude)が読むための原文(英語または中国語)です。日本語訳は順次追加中。
Scanlume OCR API
Use this skill when the task is specifically about calling the public OCR API behind https://www.scanlume.com/, not when the user only wants the website UI.
Use este skill quando a tarefa for especificamente chamar a API publica de OCR do https://www.scanlume.com/, e nao quando o usuario so quiser usar a interface do site.
English
Workflow
- Confirm the input is an image, not a PDF.
- Read
references/api-contract.mdbefore building the request. - Choose
simpleonly for raw text speed and lower cost. - Choose
formattedfor headings, multi-block layouts, Markdown, HTML, and tables. - If the user gives a local file path, prefer
scripts/scanlume_ocr.pyto build the data URL and call the API. - Read
references/output-shapes.mdbefore consumingformattedresponses, especially table blocks. - State clearly when a request is blocked by public API limits, such as PDF OCR beta access.
Quick Rules
- Public image OCR endpoint:
POST /v1/api/ocr - Auth:
Authorization: Bearer <SCANLUME_API_KEY> - Content type:
application/json - Payload keys:
modeandbase64 base64must be a full data URL such asdata:image/png;base64,...- Do not claim multipart upload support
- Do not claim remote file URL support
- Do not claim public PDF OCR API availability
Mode Selection
-
Use
simplefor:- quick raw text extraction
- lower cost image OCR
- tasks that only need plain text
-
Use
formattedfor:- screenshots with multiple text blocks
- image-based tables
- output needed in Markdown or HTML
- tasks that benefit from
blocksortableSummary
Helpers
- Read
references/api-contract.mdbefore first use. - Read
references/output-shapes.mdbefore parsing formatted OCR results. - Use
python scripts/scanlume_ocr.py <path> --mode formatted --output mdfor a local table image. - Use
python scripts/scanlume_ocr.py <path> --mode simple --output txtfor plain text extraction.
Constraints
- The public v1 API currently covers image OCR only.
- The website supports PDF OCR, but the public PDF API route is still beta-gated.
simplecosts 1 credit per image.formattedcosts 2 credits per image.- Favor precise claims over marketing claims. If the API cannot do something publicly today, say so.
Portugues (Brasil)
Fluxo
- Confirme que a entrada e uma imagem, nao um PDF.
- Leia
references/api-contract.mdantes de montar a requisicao. - Escolha
simpleapenas quando o foco for texto bruto, velocidade e menor custo. - Escolha
formattedpara titulos, multiplos blocos, Markdown, HTML e tabelas. - Se o usuario fornecer um caminho local, prefira
scripts/scanlume_ocr.pypara gerar a data URL e chamar a API. - Leia
references/output-shapes.mdantes de consumir respostasformatted, principalmente em blocos de tabela. - Explique claramente quando uma requisicao estiver bloqueada por limites publicos da API, como o acesso beta ao OCR de PDF.
Regras Rapidas
- Endpoint publico de OCR de imagem:
POST /v1/api/ocr - Auth:
Authorization: Bearer <SCANLUME_API_KEY> - Tipo de conteudo:
application/json - Chaves do payload:
modeebase64 base64precisa ser uma data URL completa comodata:image/png;base64,...- Nao afirme suporte a multipart upload
- Nao afirme suporte a URL remota de arquivo
- Nao afirme disponibilidade publica da API de PDF
Escolha de Modo
-
Use
simplepara:- extracao rapida de texto bruto
- OCR de imagem com menor custo
- tarefas que so precisam de texto puro
-
Use
formattedpara:- screenshots com multiplos blocos de texto
- tabelas em imagem
- saida em Markdown ou HTML
- tarefas que se beneficiam de
blocksoutableSummary
Helpers
- Leia
references/api-contract.mdantes do primeiro uso. - Leia
references/output-shapes.mdantes de processar respostas formatadas. - Use
python scripts/scanlume_ocr.py <path> --mode formatted --output mdpara uma imagem local com tabela. - Use
python scripts/scanlume_ocr.py <path> --mode simple --output txtpara extracao simples de texto.
Restricoes
- A API publica v1 atualmente cobre apenas OCR de imagem.
- O site https://www.scanlume.com/ suporta OCR de PDF na interface web, mas a rota publica de PDF continua beta-gated.
simplecusta 1 credito por imagem.formattedcusta 2 creditos por imagem.- Prefira afirmacoes precisas a afirmacoes promocionais. Se a API publica ainda nao faz algo hoje, diga isso.
同梱ファイル
※ ZIPに含まれるファイル一覧。`SKILL.md` 本体に加え、参考資料・サンプル・スクリプトが入っている場合があります。
- 📄 SKILL.md (4,973 bytes)
- 📎 references/api-contract.md (3,310 bytes)
- 📎 references/output-shapes.md (3,732 bytes)
- 📎 scripts/scanlume_ocr.py (7,020 bytes)