Pdf Figure Extractor — 技能 — openclaw中文资讯站

技能详情（站内镜像，无评论）

从PDF论文中精确提取Figure图片，自动分析PDF结构、定位caption位置、裁剪干净图形，并验证图片质量。支持学术新闻稿、论文写作等场景的自动化图片处理。

媒体与内容

许可证：MIT-0

MIT-0 ·免费使用、修改和重新分发。无需归因。

版本：v1.0.0

统计：⭐ 0 · 41 · 0 current installs · 0 all-time installs

⭐ 0

安装量（当前） 0

🛡 VirusTotal ：良性 · OpenClaw ：良性

Package：438061781/pdf-figure-extractor

安全扫描（ClawHub）

VirusTotal ：良性
OpenClaw ：良性

OpenClaw 评估

The skill's instructions and requirements are coherent with its stated purpose (extracting figures from PDFs); it is an instruction-only recipe that does not request extra credentials or unusual installs, though it omits declaring a needed Python dependency (PyMuPDF/fitz).

目的

The name/description (PDF figure extraction) matches the SKILL.md: it contains concrete Python examples that analyze PDF text blocks, locate captions, compute crop rectangles, render, and save images. All requested operations are consistent with extracting figures.

说明范围

Instructions operate only on a PDF input (pdf_path, page_num) and explain how to find captions and crop images. They do not instruct reading unrelated filesystem paths or environment secrets. Note: the instructions rely on heuristics and hardcoded coordinate templates which may need tuning for diverse PDFs.

安装机制

This is an instruction-only skill with no install spec. The example code uses the 'fitz' module (PyMuPDF) but the skill does not declare that dependency or advise installing it. That omission reduces completeness/coherence but is not malicious.

证书

No environment variables, credentials, or config paths are requested. The skill only requires access to PDFs provided by the user, which is proportionate to its purpose.

持久

always is false and there is no indication the skill requests persistent privileges or modifies system/agent-wide settings. Autonomous invocation is allowed (platform default) but not excessive here.

综合结论

This skill is an instruction-only recipe for extracting figures from PDFs and appears to do what it claims. Before using it: (1) ensure the agent/runtime has Python and the PyMuPDF package (import fitz) installed — the SKILL.md uses fitz but does not list this dependency; (2) run it on non-sensitive sample PDFs first to tune crop coordinates and verify captions are removed; (3) process private or confidential PDFs in a trusted, isolated enviro…

安装（复制给龙虾 AI）

将下方整段复制到龙虾中文库对话中，由龙虾按 SKILL.md 完成安装。

请把本段交给龙虾中文库（龙虾 AI）执行：为本机安装 OpenClaw 技能「Pdf Figure Extractor」。简介：从PDF论文中精确提取Figure图片，自动分析PDF结构、定位caption位置、裁剪干净图形，并验证图片质量。支持学术新闻稿、论文写作等场景的自动化图片处…。
请 fetch 以下地址读取 SKILL.md 并按文档完成安装：https://raw.githubusercontent.com/openclaw/skills/refs/heads/main/skills/438061781/pdf-figure-extractor/SKILL.md
（来源：yingzhi8.cn 技能库）

SKILL.md

打开原始 SKILL.md（GitHub raw）

---
name: pdf-figure-extractor
description: 从PDF论文中精确提取Figure图片，自动分析PDF结构、定位caption位置、裁剪干净图形，并验证图片质量。支持学术新闻稿、论文写作等场景的自动化图片处理。
---

# PDF Figure提取技能

## 使用场景

- 从学术论文PDF提取Figure插入Word文档
- 需要干净、无caption、无正文的纯图形图片
- 批量提取多个Figure

## 标准工作流程

### 步骤1: 分析PDF结构
```python
import fitz

doc = fitz.open(pdf_path)
page = doc[page_num]

# 获取所有文本块
blocks = page.get_text("blocks")
for block in blocks:
    x0, y0, x1, y1, text, block_no, block_type = block
    if "Fig." in text or "Figure" in text:
        print(f"Figure相关: y={y0:.0f}-{y1:.0f}, {text[:50]}...")
```

### 步骤2: 定位Caption位置
```python
# 搜索Fig. X的精确位置
text_instances = page.search_for(f"Fig. {fig_num}")
for inst in text_instances:
    print(f"Fig.{fig_num}位置: y={inst.y0:.0f}-{inst.y1:.0f}")
```

### 步骤3: 确定裁剪区域
根据caption位置判断图形区域：

| Caption位置 | 图形区域 |
|------------|---------|
| y=400 (页面中部) | y=100-395 (caption上方) |
| y=666 (页面底部) | y=350-660 (caption上方) |
| y=326 (页面底部) | y=100-320 (caption上方) |

### 步骤4: 精确裁剪
```python
rect = fitz.Rect(50, y_start, page.rect.width - 50, y_end)
pix = page.get_pixmap(matrix=fitz.Matrix(2, 2), clip=rect)
pix.save(f"fig{fig_num}.png")
```

### 步骤5: 验证图片质量
检查清单：
- [ ] 包含所有子图(a,b,c,d...)
- [ ] 没有混入"Fig. X"开头的caption文字
- [ ] 没有混入正文段落
- [ ] 坐标轴和标签完整

## 常见PDF布局模板

### Nature/Science论文
- Fig.1: 通常caption在底部，图形y=350-660
- Fig.2+: caption位置不固定，需要先分析

### 会议论文
- 单栏布局: caption通常在图形下方
- 双栏布局: caption可能在图形上方或下方

## 错误处理

### 问题: 图片混入正文
**原因**: 裁剪范围太大
**解决**: 缩小y_end，确保在caption之前结束

### 问题: 子图缺失
**原因**: 裁剪范围太小
**解决**: 扩大y_start/y_end，包含完整图形

### 问题: caption未去除
**原因**: 裁剪范围包含了caption区域
**解决**: 根据caption的y坐标精确调整裁剪边界

## 最佳实践

1. **永远不要**凭感觉估计坐标
2. **始终先**分析PDF文本块结构
3. **高分辨率渲染**: 使用`matrix=fitz.Matrix(2, 2)`
4. **验证每张图片**: 确保干净无杂质
5. **记录坐标**: 为常见PDF类型建立坐标模板

## 触发关键词

"提取PDF图片", "从PDF提取Figure", "PDF图片裁剪", "学术论文图片提取"