openclaw 网盘下载
OpenClaw

技能详情(站内镜像,无评论)

首页 > 技能库 > document-parser

Extract structured data from PDFs, images, and Word files with layout analysis, table recognition, OCR, seal detection, and directory extraction.

媒体与内容

作者:token-ai @ankylala

许可证:MIT-0

MIT-0 ·免费使用、修改和重新分发。无需归因。

版本:v1.0.1

统计:⭐ 0 · 462 · 5 current installs · 6 all-time installs

0

安装量(当前) 6

🛡 VirusTotal :可疑 · OpenClaw :可疑

Package:ankylala/document-parser

安全扫描(ClawHub)

  • VirusTotal :可疑
  • OpenClaw :可疑

OpenClaw 评估

The skill implements a document parsing client that behaves as advertised, but by default it uploads user files to an unknown IP-based API endpoint (47.111.146.164) without requiring credentials — which is disproportionate and risky unless you deliberately configure a trusted server.

目的

The code and documentation consistently implement a remote-document-parser client (PDF/image/Word parsing, OCR, table and seal detection). Using a remote API for heavy tasks like OCR/layout analysis is reasonable, so the capability aligns with the name/description. However, the packaged default base_url is an IP address (47.111.146.164) embedded in examples and defaults, which is unexpected for a generic skill and should be justified by the au…

说明范围

Runtime instructions and the CLI cause the skill to read local files and POST their binary contents to a remote HTTP endpoint. The SKILL.md and config examples explicitly point to the same unknown IP. The skill will attempt uploads even without an API key (it logs a warning but proceeds), so users could inadvertently exfiltrate sensitive documents simply by running the default parse command.

安装机制

This is instruction-only plus a Python script; there is no download-from-URL or post-install arbitrary code fetch. Dependencies are standard (requests, python-docx, Pillow) and listed in requirements.txt. No high-risk install behavior was found.

证书

The skill does not require environment variables, but supports optional DOCUMENT_PARSER_API_KEY and DOCUMENT_PARSER_BASE_URL. The problem is not many credentials requested, but that the default configuration/README/config.example hardcodes an explicit IP-based endpoint. Sensitive files are sent to that endpoint by default, and the API key is optional — meaning data can be uploaded unauthenticated. That is disproportionate for a drop-in skill w…

持久

The package does not request always:true, does not modify other skills or system-wide settings, and only writes output files derived from user input to the current working directory. It does read a local config.json if present (expected). No elevated persistence or privilege escalation behavior observed.

clawhub.yaml:35

Install source points to URL shortener or raw IP.

config.example.json:2

Install source points to URL shortener or raw IP.

安装(复制给龙虾 AI)

将下方整段复制到龙虾中文库对话中,由龙虾按 SKILL.md 完成安装。

请把本段交给龙虾中文库(龙虾 AI)执行:为本机安装 OpenClaw 技能「document-parser」。简介:Extract structured data from PDFs, images, and Word files with layout analysis,…。
请 fetch 以下地址读取 SKILL.md 并按文档完成安装:https://raw.githubusercontent.com/openclaw/skills/refs/heads/main/skills/ankylala/document-parser/SKILL.md
(来源:yingzhi8.cn 技能库)

SKILL.md

打开原始 SKILL.md(GitHub raw)

# document-parser

高精度文档解析技能,从 PDF、图片、Word 文档中提取结构化数据。

## 用途
- 解析 PDF、图片 (JPG/PNG)、Word 文档
- 版面分析与结构提取
- 表格识别(输出 HTML/Markdown)
- OCR 文字识别
- 印章检测
- 目录提取

## 命令

### 解析文档
```
document-parser parse <文件路径> [选项]
```

示例:
```
document-parser parse C:docsreport.pdf
document-parser parse C:docsscan.jpg --layout --table
document-parser parse C:docscontract.docx --output markdown
```

### 查询任务状态
```
document-parser status <任务 ID>
```

## 参数说明

| 参数 | 说明 | 示例 |
|------|------|------|
| 文件路径 | PDF/图片/Word 文件路径 | `C:docsreport.pdf` |
| --layout | 启用版面分析 | `--layout` |
| --table | 启用表格识别 | `--table` |
| --seal | 启用印章检测 | `--seal` |
| --output | 输出格式 (json/markdown/both) | `--output markdown` |
| --pages | 页码范围 | `--pages 1-5,8,10-12` |

## 配置

### 方式一:环境变量
```
DOCUMENT_PARSER_API_KEY=your_api_key
DOCUMENT_PARSER_BASE_URL=http://47.111.146.164:8088/taidp/v1/idp/general_parse
```

### 方式二:配置文件
在技能目录创建 `config.json`:
```json
{
  "api_key": "your_api_key",
  "base_url": "http://47.111.146.164:8088/taidp/v1/idp/general_parse"
}
```

## 输出格式

返回结构化 JSON 包含:
- **pages**: 解析后的页面数组
- **elements**: 版面元素(文本、表格、图片等)
- **markdown**: Markdown 格式文本
- **data**: 数据统计摘要

## 依赖
- requests
- python-docx (Word 支持)
- Pillow (图片处理)

## 错误码

| 错误码 | 消息 | 说明 |
|--------|------|------|
| 10000 | Success | 识别成功 |
| 10001 | Missing parameter | 参数缺失 |
| 10002 | Invalid parameter | 非法参数 |
| 10003 | Invalid file | 文件格式非法 |
| 10004 | Failed to recognize | 识别失败 |
| 10005 | Internal error | 内部错误 |