openclaw 网盘下载
OpenClaw

技能详情(站内镜像,无评论)

首页 > 技能库 > LiteParse

Parse, extract text from, and screenshot PDF and document files locally using the LiteParse CLI (`lit`). Use when asked to extract text from a PDF, parse a W...

媒体与内容

许可证:MIT-0

MIT-0 ·免费使用、修改和重新分发。无需归因。

版本:v1.0.0

统计:⭐ 0 · 20 · 0 current installs · 0 all-time installs

0

安装量(当前) 0

🛡 VirusTotal :良性 · OpenClaw :良性

Package:alfred-intel-handler-source/liteparse

安全扫描(ClawHub)

  • VirusTotal :良性
  • OpenClaw :良性

OpenClaw 评估

The skill's instructions, requirements, and behavior are internally consistent with a local document parsing CLI, but there are a few minor mismatches and operational caveats you should be aware of before installing.

综合结论

This skill appears to do what it says: run a local CLI to extract text/screenshots from documents. Before installing: (1) confirm the npm package identity and publisher (search the npm registry and repository) because the registry metadata here lacks a homepage; (2) be aware that the first install/run will fetch packages and Tesseract language data over the network (so it’s not strictly offline until that completes); (3) npm global installs ma…

安装(复制给龙虾 AI)

将下方整段复制到龙虾中文库对话中,由龙虾按 SKILL.md 完成安装。

请把本段交给龙虾中文库(龙虾 AI)执行:为本机安装 OpenClaw 技能「LiteParse」。简介:Parse, extract text from, and screenshot PDF and document files locally using t…。
请 fetch 以下地址读取 SKILL.md 并按文档完成安装:https://raw.githubusercontent.com/openclaw/skills/refs/heads/main/skills/alfred-intel-handler-source/liteparse/SKILL.md
(来源:yingzhi8.cn 技能库)

SKILL.md

打开原始 SKILL.md(GitHub raw)

---
name: liteparse
description: Parse, extract text from, and screenshot PDF and document files locally using the LiteParse CLI (`lit`). Use when asked to extract text from a PDF, parse a Word/Excel/PowerPoint file, batch-process a folder of documents, or generate page screenshots for LLM vision workflows. Runs entirely offline — no cloud, no API key. Supports PDF, DOCX, XLSX, PPTX, images (jpg/png/webp), and more. Triggers on phrases like "extract text from this PDF", "parse this document", "get the text out of", "screenshot this PDF page", or any request to read/extract content from a file.
---

# LiteParse

Local document parser built on PDF.js + Tesseract.js. Zero cloud dependencies.

**Binary:** `lit` (installed globally via npm)
**Docs:** https://developers.llamaindex.ai/liteparse/

## Quick Reference

```bash
# Parse a PDF to text (stdout)
lit parse document.pdf

# Parse to file
lit parse document.pdf -o output.txt

# Parse to JSON (includes bounding boxes)
lit parse document.pdf --format json -o output.json

# Specific pages only
lit parse document.pdf --target-pages "1-5,10,15-20"

# No OCR (faster, text-layer PDFs only)
lit parse document.pdf --no-ocr

# Batch parse a directory
lit batch-parse ./input-dir ./output-dir

# Screenshot pages (for vision model input)
lit screenshot document.pdf -o ./screenshots
lit screenshot document.pdf --target-pages "1,3,5" --dpi 300 -o ./screenshots
```

## Output Formats

| Format | Use case |
|--------|----------|
| `text` (default) | Plain text extraction, feeding into prompts |
| `json` | Structured output with bounding boxes, useful for layout-aware tasks |

## OCR Behavior

- OCR is **on by default** via Tesseract.js (downloads ~10MB English data on first run)
- First run will be slow; subsequent runs use cached data
- `--no-ocr` for pure text-layer PDFs (faster, no network needed)
- For multi-language: `--ocr-language fra+eng`

## Supported File Types

Works natively: **PDF**

Requires **LibreOffice** (`brew install --cask libreoffice`): .docx, .doc, .xlsx, .xls, .pptx, .ppt, .odt, .csv

Requires **ImageMagick** (`brew install imagemagick`): .jpg, .png, .gif, .bmp, .tiff, .webp

## Installation Notes

- Installed via npm: `npm install -g @llamaindex/liteparse`
- Brew formula exists (`brew tap run-llama/liteparse`) but requires current macOS CLT — use npm as primary install path on this machine
- Binary path: `/opt/homebrew/bin/lit`

## Workflow Tips

- For **VA forms, job description PDFs, military docs**: `lit parse file.pdf -o /tmp/output.txt` then read into context
- For **scanned PDFs** (no text layer): OCR is required; complex layouts may degrade — consider LlamaParse cloud for critical docs
- For **vision model workflows**: use `lit screenshot` to generate page images, then pass to `image` tool or similar
- For **batch jobs**: use `lit batch-parse` — it reuses the PDF engine across files for efficiency

## Limitations

- Complex tables, multi-column layouts, and scanned government forms may produce imperfect output
- LlamaParse (cloud) handles the hard cases: https://cloud.llamaindex.ai
- Max recommended DPI for screenshots: 300 (higher = slower, larger files)

## Reference

See `references/output-examples.md` for sample JSON/text output structure.