🛠️ 開発・MCP コミュニティ 🔴 エンジニア向け 👤 エンジニア・AI開発者

🛠️ Gemini Browser

Gemini Browser

Google Geminiにブラウザ経由で質??

⚡ ⏱ コードレビュー 1時間 → 10分

📺 まず動画で見る(YouTube)

▶ 【衝撃】最強のAIエージェント「Claude Code」の最新機能・使い方・プログラミングをAIで効率化する超実践術を解説! ↗

※ jpskill.com 編集部が参考用に選んだ動画です。動画の内容と Skill の挙動は厳密には一致しないことがあります。

📜 元の英語説明(参考)

Query Google Gemini via browser automation using OpenClaw's Browser Relay. Use when you need to ask Gemini questions and get AI responses. Requires OpenClaw with Browser Relay Chrome extension configured.

🇯🇵 日本人クリエイター向け解説

一言でいうと

Google Geminiにブラウザ経由で質??

※ jpskill.com 編集部が日本のビジネス現場向けに補足した解説です。Skill本体の挙動とは独立した参考情報です。

⚡ おすすめ: コマンド1行でインストール(60秒)

下記のコマンドをコピーしてターミナル(Mac/Linux)または PowerShell(Windows)に貼り付けてください。ダウンロード → 解凍 → 配置まで全自動。

🍎 Mac / 🐧 Linux

mkdir -p ~/.claude/skills && cd ~/.claude/skills && curl -L -o gemini-browser.zip https://jpskill.com/download/4852.zip && unzip -o gemini-browser.zip && rm gemini-browser.zip

🪟 Windows (PowerShell)

$d = "$env:USERPROFILE\.claude\skills"; ni -Force -ItemType Directory $d | Out-Null; iwr https://jpskill.com/download/4852.zip -OutFile "$d\gemini-browser.zip"; Expand-Archive "$d\gemini-browser.zip" -DestinationPath $d -Force; ri "$d\gemini-browser.zip"

完了後、Claude Code を再起動 → 普通に「動画プロンプト作って」のように話しかけるだけで自動発動します。

💾 手動でダウンロードしたい(コマンドが難しい人向け)

1. 下の青いボタンを押して gemini-browser.zip をダウンロード
2. ZIPファイルをダブルクリックで解凍 → gemini-browser フォルダができる
3. そのフォルダを C:\Users\あなたの名前\.claude\skills\(Win)または ~/.claude/skills/(Mac)へ移動
4. Claude Code を再起動

⬇ .zip でダウンロード(推奨) ⬇ .skill 形式(上級者用) 元のソース ↗

⚠️ ダウンロード・利用は自己責任でお願いします。当サイトは内容・動作・安全性について責任を負いません。

🎯 このSkillでできること

下記の説明文を読むと、このSkillがあなたに何をしてくれるかが分かります。Claudeにこの分野の依頼をすると、自動で発動します。

📦 インストール方法 (3ステップ)

1. 上の「ダウンロード」ボタンを押して .skill ファイルを取得
2. ファイル名の拡張子を .skill から .zip に変えて展開(macは自動展開可)
3. 展開してできたフォルダを、ホームフォルダの .claude/skills/ に置く
- · macOS / Linux: ~/.claude/skills/
- · Windows: %USERPROFILE%\.claude\skills\

Claude Code を再起動すれば完了。「このSkillを使って…」と話しかけなくても、関連する依頼で自動的に呼び出されます。

詳しい使い方ガイドを見る →

最終更新: 2026-05-17
取得日時: 2026-05-17
同梱ファイル: 2

💬 こう話しかけるだけ — サンプルプロンプト

› Gemini Browser を使って、最小構成のサンプルコードを示して
› Gemini Browser の主な使い方と注意点を教えて
› Gemini Browser を既存プロジェクトに組み込む方法を教えて

これをClaude Code に貼るだけで、このSkillが自動発動します。

📖 Claude が読む原文 SKILL.md(中身を展開)

この本文は AI(Claude)が読むための原文(英語または中国語)です。日本語訳は順次追加中。

Gemini Browser Skill

Query Google Gemini (gemini.google.com) via OpenClaw Browser Relay and extract responses.

⚠️ Security Notice: This skill operates on your real Chrome browser with your logged-in Google session via CDP (Chrome DevTools Protocol). The agent will have access to anything visible in the attached tab. Only attach tabs you explicitly intend for the agent to control. See Security Considerations.

Prerequisites

OpenClaw installed and running (this skill uses OpenClaw's browser command)
OpenClaw Browser Relay Chrome extension installed and configured
- Extension binds to loopback 127.0.0.1:18792 by default
- Gateway auth token must be configured in extension options
Google account logged in within Chrome (Gemini requires authentication)
Use profile=chrome to relay through your existing Chrome (not the isolated profile=openclaw-managed)

Quick Start

# 1. Open Gemini in Chrome
open -a "Google Chrome" "https://gemini.google.com"

# 2. Manually click the Browser Relay extension icon on the Gemini tab to attach
#    (the badge will show "ON" when attached)

# 3. Verify relay is connected
browser action=status profile=chrome
# Should show cdpReady: true

# 4. List tabs
browser action=tabs profile=chrome
# Note the targetId for the Gemini tab

Input Method

Gemini uses a Quill rich-text editor (contenteditable div), not a standard <textarea>. You must inject text via JavaScript:

browser action=act profile=chrome targetId=<id> request={
  "kind": "evaluate",
  "fn": "(() => { const editor = document.querySelector('div.ql-editor[contenteditable=\"true\"]'); if (!editor) return 'editor not found'; editor.focus(); editor.innerHTML = '<p>YOUR_QUERY_HERE</p>'; editor.dispatchEvent(new Event('input', { bubbles: true })); return 'ok'; })()"
}

Then submit:

browser action=act profile=chrome targetId=<id> request={"kind":"press","key":"Enter"}

Complete Workflow

1. Prepare

Open Gemini in Chrome and manually attach the Browser Relay extension to the tab.

open -a "Google Chrome" "https://gemini.google.com"
# Then click the Browser Relay extension icon on the Gemini tab

2. Get Tab ID

browser action=tabs profile=chrome

Find the Gemini tab entry and note its targetId.

3. Input Query

browser action=act profile=chrome targetId=<id> request={
  "kind": "evaluate",
  "fn": "(() => { const editor = document.querySelector('div.ql-editor[contenteditable=\"true\"]'); if (!editor) return 'editor not found'; editor.focus(); editor.innerHTML = '<p>What is quantum computing?</p>'; editor.dispatchEvent(new Event('input', { bubbles: true })); return 'ok'; })()"
}

4. Submit

browser action=act profile=chrome targetId=<id> request={"kind":"press","key":"Enter"}

5. Wait for Response

Gemini may take 10–60 seconds. Poll for completion by checking if the stop button has disappeared:

browser action=act profile=chrome targetId=<id> request={
  "kind": "evaluate",
  "fn": "(() => { const stop = document.querySelector('button[aria-label*=\"Stop\"]'); return stop ? 'generating' : 'done'; })()"
}

6. Extract Response

Option A — Clipboard (recommended, preserves Markdown formatting):

# Take a snapshot and find the Copy button
browser action=snapshot profile=chrome targetId=<id>

# Click the Copy button by its ref from the snapshot
browser action=act profile=chrome targetId=<id> request={"kind":"click","ref":"<copy_button_ref>"}

# Read from clipboard
pbpaste

Option B — DOM extraction (fallback):

browser action=act profile=chrome targetId=<id> request={
  "kind": "evaluate",
  "fn": "(() => { const msgs = document.querySelectorAll('.model-response-text'); if (msgs.length === 0) return 'no response found'; return msgs[msgs.length - 1].innerText; })()"
}

New Chat

For unrelated queries, start a fresh chat to avoid context pollution:

browser action=navigate profile=chrome targetId=<id> targetUrl="https://gemini.google.com"

Response Completion Signals

The response is complete when:

The stop button disappears
A copy button appears below the response
Suggested follow-up chips appear

Security Considerations

⚠️ Important: Understand these risks before using this skill.

Session access: profile=chrome uses your real Chrome with all logged-in sessions. The agent can see and interact with anything in the attached tab, including your Google account context.
JavaScript evaluation: The evaluate action runs arbitrary JavaScript in the page context. This skill limits it to DOM manipulation for the input field, but the mechanism itself is powerful.
Manual attachment required: The Browser Relay extension must be manually clicked by you to attach — the agent cannot auto-attach to arbitrary tabs. Only attach the specific Gemini tab.
Loopback only: The relay binds to 127.0.0.1 and requires an auth token, preventing remote access.
Recommendation: Use a separate Chrome profile dedicated to AI automation, logged into a non-primary Google account, to limit exposure.

Troubleshooting

Problem	Solution
`cdpReady: false`	Click the Browser Relay extension icon on the Gemini tab to re-attach
Tab not found	Run `browser action=tabs profile=chrome` to refresh tab list
Editor not found	Page may not be fully loaded; wait and retry. Gemini may have changed DOM — check for `div.ql-editor`
Copy button not found	Response may still be generating; poll stop button status first
Login wall	Ensure Chrome is logged into a Google account
Context overflow	Navigate to `gemini.google.com` for a fresh chat

同梱ファイル

※ ZIPに含まれるファイル一覧。`SKILL.md` 本体に加え、参考資料・サンプル・スクリプトが入っている場合があります。

📄 SKILL.md (6,108 bytes)
📎 README.md (2,919 bytes)