技能详情(站内镜像,无评论)
许可证:MIT-0
MIT-0 ·免费使用、修改和重新分发。无需归因。
版本:v1.2.0
统计:⭐ 0 · 290 · 2 current installs · 2 all-time installs
⭐ 0
安装量(当前) 2
🛡 VirusTotal :良性 · OpenClaw :可疑
Package:abczsl520/browser-use-pro
安全扫描(ClawHub)
- VirusTotal :良性
- OpenClaw :可疑
OpenClaw 评估
The skill's instructions mostly match a browser-automation purpose, but there are important inconsistencies and security-relevant choices (undeclared API key usage, browser profile access, and remote-debugging instructions) that the user should understand before installing.
目的
Name/description (browser automation for complex flows) aligns with the instructions and listed Python packages (browser-use, playwright, langchain-openai). However the registry metadata states 'Required env vars: none' while SKILL.md clearly requires an LLM API key (api_key in the example) and runtime configuration — this mismatch is unexplained.
说明范围
The SKILL.md instructs creating a virtualenv, pip-installing packages, running Playwright, writing/running Python scripts that drive real browsers, and optionally launching Chrome with --remote-debugging-port. Those steps can access local browser profiles, cookies, and pages; the skill also recommends sending screenshots and page content to an external LLM. Although the doc describes 'sensitive_data' placeholders, the instructions still place …
安装机制
This is instruction-only (no install spec). The suggested install flow uses pip and Playwright (standard package sources). That is lower-risk than arbitrary downloads, but the user should vet the PyPI packages (and the claimed 'browser-use' project) before pip installing into a host environment.
证书
Metadata declares no required env vars but SKILL.md demonstrates and implies an LLM API key (api_key) and use of user_data_dir for browser profiles. Requiring an API key for an LLM is expected for this functionality, but its absence from declared requirements is an incoherence. Also, connecting to a browser debug port and using an existing profile gives the agent access to cookies, sessions, and stored secrets — a high-scope capability that us…
持久
The skill does not request 'always: true'. It suggests creating a per-user virtualenv (~ /browser-use-env) and a profile dir (~/.browser-use/task-profile), which is standard for a local tool but creates persistent artifacts in the user's home. It does not modify other skills or system-wide settings as documented.
安装(复制给龙虾 AI)
将下方整段复制到龙虾中文库对话中,由龙虾按 SKILL.md 完成安装。
请把本段交给龙虾中文库(龙虾 AI)执行:为本机安装 OpenClaw 技能「Browser Use Pro」。简介:AI-powered browser automation for complex multi-step web workflows. Uses Browse…。
请 fetch 以下地址读取 SKILL.md 并按文档完成安装:https://raw.githubusercontent.com/openclaw/skills/refs/heads/main/skills/abczsl520/browser-use-pro/SKILL.md
(来源:yingzhi8.cn 技能库)
SKILL.md
---
name: browser-use
description: "AI-powered browser automation for complex multi-step web workflows. Uses Browser-Use framework when OpenClaw's built-in browser tool can't handle login flows, anti-bot sites, or 5+ step sequences."
---
# Browser-Use — AI Browser Automation
## Security & Privacy
- **No credential logging**: Passwords are handled via Browser-Use's `sensitive_data` parameter — the LLM never sees real credentials, only placeholder tokens.
- **User-initiated Chrome connection**: CDP mode (connecting to real Chrome) is opt-in and requires the user to manually launch Chrome with debug flag. The skill never silently connects to running browsers.
- **All packages are open-source**: Dependencies are `browser-use` (38k+ ⭐ on GitHub), `playwright` (by Microsoft), and `langchain-openai` — all widely audited open-source tools.
- **Local execution only**: Scripts run locally on the user's machine. No data is sent to any server except the configured LLM API for step-by-step reasoning.
- **Domain restriction available**: Use `allowed_domains` parameter to restrict which websites the agent can visit.
- **No telemetry**: This skill does not collect, store, or transmit any usage data.
## When to Use Browser-Use vs Built-in Tool
| Scenario | Built-in tool | Browser-Use |
|----------|:-:|:-:|
| Screenshot / click one button | ✅ Free & fast | ❌ Overkill |
| 5+ step workflow (login→navigate→fill→submit) | ❌ Breaks easily | ✅ |
| Anti-bot sites (real Chrome needed) | ❌ | ✅ |
| Batch repetitive operations | ❌ | ✅ |
**Cost**: Browser-Use calls an external LLM per step (costs money + slower). Use built-in tool for simple actions.
## Execution Flow
### 1. Check Environment
```bash
test -d ~/browser-use-env && echo "Installed" || echo "Need install"
```
### 2. First-Time Setup (once only)
```bash
python3 -m venv ~/browser-use-env
source ~/browser-use-env/bin/activate
pip install browser-use playwright langchain-openai
playwright install chromium
```
### 3. Choose Mode
- **Mode A — Built-in Chromium**: For simple automation or when detection doesn't matter. Runs immediately.
- **Mode B — Real Chrome CDP**: For anti-bot sites or when user's login session is needed. Requires user action.
Mode B setup — prompt user:
> Please quit Chrome completely (Mac: Cmd+Q), then tell me "done"
After user confirms:
```bash
/Applications/Google Chrome.app/Contents/MacOS/Google Chrome --remote-debugging-port=9222 &
```
Verify: `curl -s http://127.0.0.1:9222/json/version`
### 4. Write Script and Run
Write script to user's workspace, then:
```bash
source ~/browser-use-env/bin/activate
python3 script_path.py
```
### 5. Report Results
Return results to user. On failure, follow the troubleshooting tree below.
## Script Template
```python
import asyncio
from browser_use import Agent, ChatOpenAI, Browser
async def main():
# LLM — any OpenAI-compatible API
llm = ChatOpenAI(
model="gpt-4o-mini",
api_key="<YOUR_API_KEY>", # From env var or user config
base_url="https://api.openai.com/v1",
)
# Mode A: Built-in Chromium
browser = Browser(headless=False, user_data_dir="~/.browser-use/task-profile")
# Mode B: Real Chrome (user must launch with --remote-debugging-port=9222)
# browser = Browser(cdp_url="http://127.0.0.1:9222")
agent = Agent(
task="Detailed step-by-step task description (see guide below)",
llm=llm, browser=browser,
use_vision=True, max_steps=25,
)
result = await agent.run()
print(result)
asyncio.run(main())
```
## Task Writing Guide
### ✅ Good: Specific steps
```python
task = """
1. Open https://www.reddit.com/login
2. Enter username: x_user
3. Enter password: x_pass
4. Click login button
5. If CAPTCHA appears, wait 30s for user to complete
6. Navigate to https://www.reddit.com/r/xxx/submit
7. Enter title: xxx
8. Enter body: xxx
9. Click submit
"""
```
### ❌ Bad: Vague
```python
task = "Post something on Reddit"
```
### Tips
- **Keyboard fallback**: Add "If button can't be clicked, use Tab+Enter"
- **Error recovery**: Add "If page fails to load, refresh and retry"
- **Sensitive data**: Use placeholders + `sensitive_data` parameter
## Credential Security
```python
agent = Agent(
task="Login with x_user and x_pass",
sensitive_data={"x_user": "real@email.com", "x_pass": "S3cret!"},
use_vision=False, # Disable screenshots when handling passwords
llm=llm, browser=browser,
)
```
## Key Parameters
| Parameter | Purpose | Recommended |
|-----------|---------|-------------|
| `use_vision` | AI sees screenshots | True normally, False with passwords |
| `max_steps` | Max actions | 20-30 |
| `max_failures` | Max retries | 3 (default) |
| `flash_mode` | Skip reasoning | True for simple tasks |
| `extend_system_message` | Custom instructions | Add specific guidance |
| `allowed_domains` | Restrict URLs | Use for security |
| `fallback_llm` | Backup LLM | When primary is unstable |
## Troubleshooting
```
Detected as automation?
└→ Switch to Mode B (real Chrome)
CAPTCHA / human verification?
└→ Prompt user to complete manually, add wait time in task
LLM timeout?
└→ Set fallback_llm or use faster model
Action succeeded but no effect (e.g. post not published)?
└→ 1. Check if platform anti-spam blocked it (common with new accounts)
2. Add explicit confirmation steps to task
Website UI changed, can't find elements?
└→ Browser-Use auto-adapts, but add fallback paths in task
```
## LLM Compatibility
| LLM | Works | Notes |
|-----|:---:|-------|
| GPT-4o / 4o-mini | ✅ | Best choice, recommended |
| Claude | ✅ | Works well |
| Gemini | ❌ | Structured output incompatible |