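Multimedia To Obsidian - Skill - OpenClaw Chinese News Site

Skill details (on-site mirror, no comments)

Imports any multimedia document into an Obsidian knowledge base. Supports PPT, PDF, DOCX, images, and other formats: it extracts every page or image, uses a multimodal model to interpret the content, and saves the generated text description to Obsidian (OB). Use cases: (1) organizing training courseware, (2) migrating notes to OB, (3) turning image material into structured knowledge.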

Media & Content

License: MIT-0

MIT-0 · Free to use, modify, and redistribute. No attribution required.

Version: v1.0.0

Stats: ⭐ 0 · 195 · 1 current install · 1 all-time install

🛡 VirusTotal: Benign · OpenClaw: Suspicious

Package: aidescend/multimedia-to-obsidian

Security scan (ClawHub)

VirusTotal: Benign
OpenClaw: Suspicious

OpenClaw assessment

The skill mostly does what it claims (extract images and send them to multimodal models), but the package metadata omits required binaries and environment variables and it will upload raw image data to an external host (a default host that is not a well-known provider), which is unexpected and privacy-sensitive.

Purpose

The skill's description matches the code: it extracts pages/images and sends them to multimodal models, then writes Markdown into an Obsidian repo. However the registry metadata lists no required binaries or env vars while the SKILL.md and the script require pandoc, poppler (pdftoppm), python-pptx/pillow, and also call 'soffice' — this mismatch is incoherent and could surprise users. The network/API access (MINIMAX/OPENAI/ANTHROPIC) is appropr…

Instruction scope

Runtime instructions are within the task scope (install tools, set one of several model API keys, run the script). Important privacy-sensitive behavior: the script encodes image bytes as base64 and transmits them to third-party endpoints (MiniMax/OpenAI/Anthropic). That is expected for model calls but should be explicitly highlighted because sensitive documents will be uploaded. The SKILL.md omits that the script invokes 'soffice' and 'rm -rf'…

Install mechanism

There is no automated install spec (lowest risk). The SKILL.md instructs manual installs via brew and pip (pandoc, poppler, python-pptx, pillow) which are reasonable. The script additionally calls 'soffice' (LibreOffice) and 'pdftoppm' but 'soffice' is not listed in prerequisites — this omission is inconsistent and may lead to runtime failures. No downloads from unknown URLs are performed by an installer.

Credentials

The registry metadata declared no required environment variables, but both SKILL.md and the script require at least one API key (MINIMAX_API_KEY or OPENAI_API_KEY or ANTHROPIC_API_KEY). Requiring model provider API keys is proportionate to the task, but the default MINIMAX_API_HOST (https://api.minimaxi.com) is not a known major provider — sending base64 images to an unfamiliar host is a privacy/exfiltration risk. The number of env vars reques…

Persistence

The skill is not always-enabled and is user-invocable. It writes Markdown files into the user-specified Obsidian directory and removes temporary files; it does not request system-wide configuration or modify other skills. No elevated persistence is requested.

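Install (copy to Lobster AI)

Copy the whole block below into a Lobster Chinese Library (龙虾中文库) chat, and Lobster will complete the installation by following SKILL.md.

Please hand this paragraph to Lobster Chinese Library (Lobster AI) to execute: install the OpenClaw skill "Multimedia To Obsidian" on this machine. Summary: Imports any multimedia document into an Obsidian knowledge base. Supports PPT, PDF, DOCX, images, and other formats, extracts every page or image, uses a multimodal model to interpret the content, and after generating a text description….
Please fetch the following address, read SKILL.md, and complete the installation as documented: https://raw.githubusercontent.com/openclaw/skills/refs/heads/main/skills/aidescend/multimedia-to-obsidian/SKILL.md
(Source: yingzhi8.cn skill library)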

SKILL.md

---
name: multimedia-to-obsidian
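description: Imports any multimedia document into an Obsidian knowledge base. Supports PPT, PDF, DOCX, images, and other formats. Extracts every page or image, uses a multimodal model to interpret the content, and stores the generated text description in Obsidian (OB). Suitable for (1) organizing training courseware, (2) migrating notes to OB, (3) turning image material into structured knowledge.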
---

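# Multimedia Import to Obsidian

Imports PPT, PDF, DOCX, image, and other multimedia documents into Obsidian, generating a text description for every page or image.

## Features

- Supports multiple formats: PPT, PDF, DOCX, and images (PNG, JPG, etc.)
- Extracts media files automatically
- Calls a multimodal model to interpret the content
- Files the results into a category folder in the OB vault

## Usage

### Prerequisites

1. Install dependencies: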
```bash
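brew install pandoc poppler
pip install python-pptx pillow
# The OpenClaw assessment notes that the script also calls `soffice` (LibreOffice),
# which the two commands above do not install, e.g.: brew install --cask libreoffice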
```

2. Set environment variables (at least one API key):
```bash
# MiniMax
export MINIMAX_API_KEY="your-key"
export MINIMAX_API_HOST="https://api.minimaxi.com"

# OpenAI
export OPENAI_API_KEY="your-key"

# Anthropic
export ANTHROPIC_API_KEY="your-key"
```
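
Both prerequisites can be checked before a run. The following is a minimal sketch, not part of the skill: the helper names `missing_binaries` and `configured_providers` are hypothetical, and the binary list is an assumption taken from the tools the OpenClaw assessment says the script invokes (`pandoc`, `pdftoppm`, `soffice`).

```python
import os
import shutil

# Binaries named in the OpenClaw assessment; whether the script needs all of
# them for every input format is an assumption.
REQUIRED_BINARIES = ("pandoc", "pdftoppm", "soffice")

# Environment variables documented in this SKILL.md, keyed by --model value.
API_KEY_VARS = {
    "minimax": "MINIMAX_API_KEY",
    "openai": "OPENAI_API_KEY",
    "anthropic": "ANTHROPIC_API_KEY",
}


def missing_binaries(names=REQUIRED_BINARIES):
    """Return the binaries from `names` that are not found on PATH."""
    return [name for name in names if shutil.which(name) is None]


def configured_providers(env=None):
    """Return the providers whose API key variable is set and non-empty."""
    env = os.environ if env is None else env
    return [provider for provider, var in API_KEY_VARS.items() if env.get(var)]


if __name__ == "__main__":
    print("missing binaries:", missing_binaries() or "none")
    print("configured providers:", configured_providers() or "none")
```

An empty list from `configured_providers()` means none of the three documented keys is set, so no model call could succeed.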

### Running the script

```bash
python3 <skill>/scripts/media_to_obsidian.py <source-file-or-dir> <output-OB-dir> [options]
```

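**Arguments:**
- Source file/directory: a single file, or a directory containing multimedia files
- Output directory: path to the Obsidian vault
- Options:
  - `--format ppt|pdf|docx|image|all` : input format, default `all`
  - `--model minimax|openai|anthropic` : model provider, default `minimax`
  - `--category <name>` : category directory for the output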
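
The documented command line can be mirrored with `argparse`. This is only a sketch of the interface as described above, not the script's actual parser: `build_parser` is a hypothetical name, and the `None` default for `--category` is an assumption because no default is documented.

```python
import argparse


def build_parser():
    """Build a parser matching the options documented above (a sketch)."""
    parser = argparse.ArgumentParser(prog="media_to_obsidian.py")
    parser.add_argument("source", help="a single file, or a directory of media files")
    parser.add_argument("output", help="path to the Obsidian vault")
    parser.add_argument("--format", choices=["ppt", "pdf", "docx", "image", "all"], default="all")
    parser.add_argument("--model", choices=["minimax", "openai", "anthropic"], default="minimax")
    # Assumption: no default is documented for --category, so None is used here.
    parser.add_argument("--category", default=None)
    return parser


args = build_parser().parse_args(
    ["/path/to/images/", "/path/to/Obsidian", "--format", "image", "--category", "素材"]
)
print(args.format, args.model, args.category)  # prints: image minimax 素材
```

Using `choices` makes an unsupported `--format` or `--model` value fail at parse time instead of partway through an import.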

### Examples

```bash
# Import a PPT courseware deck into the 培训 ("Training") category
python3 ~/.openclaw/workspace/skills/multimedia-to-obsidian/scripts/media_to_obsidian.py \
  /path/to/培训课件.pptx \
  /path/to/Obsidian \
  --format ppt --category 培训

# Import images into OB under the 素材 ("Assets") category
python3 ~/.openclaw/workspace/skills/multimedia-to-obsidian/scripts/media_to_obsidian.py \
  /path/to/images/ \
  /path/to/Obsidian \
  --format image --category 素材

# Batch-import every document in a directory
python3 ~/.openclaw/workspace/skills/multimedia-to-obsidian/scripts/media_to_obsidian.py \
  /path/to/documents/ \
  /path/to/Obsidian
```

## Output structure

```
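Obsidian-vault/
├── 培训/                  ("Training" category)
│   └── 培训课件.md        (with a description of each page image)
├── 素材/                  ("Assets" category)
│   ├── image1.md
│   └── image2.md
└── 导入/                  ("Import")
    └── document-name.md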
```
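
The layout above suggests one note per source file, named after the file and placed in the category folder. A minimal sketch of that mapping follows. `note_path` is a hypothetical helper, and treating `导入` as the fallback when no `--category` is given is an assumption read off the tree, not confirmed by the source.

```python
from pathlib import Path

# Assumption: "导入" ("Import") is the fallback category, inferred from the tree above.
DEFAULT_CATEGORY = "导入"


def note_path(vault, source, category=None):
    """Return <vault>/<category>/<source stem>.md for a given source file."""
    return Path(vault) / (category or DEFAULT_CATEGORY) / (Path(source).stem + ".md")


print(note_path("/path/to/Obsidian", "/path/to/培训课件.pptx", "培训"))
```

Deriving the note name from `Path(source).stem` keeps the original file name and drops only the extension.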

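## Workflow

1. **Extract media**: render each page of the PPT/PDF/DOCX as an image
2. **Interpret content**: call a multimodal model on each image
3. **Generate descriptions**: write the image descriptions into Markdown
4. **File by category**: save the note into OB under the specified category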
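
Steps 2 and 3 can be sketched as follows, assuming the page images from step 1 already exist on disk. This is an illustration rather than the skill's implementation: `encode_image` and `build_note` are hypothetical names, the "Page N" heading layout is an assumption, and `describe` is a stand-in callable for the real model request, which (per the OpenClaw assessment) sends the base64-encoded image bytes to the chosen provider.

```python
import base64
from pathlib import Path


def encode_image(path):
    """Read an image file and return its bytes as a base64 ASCII string."""
    return base64.b64encode(Path(path).read_bytes()).decode("ascii")


def build_note(title, image_paths, describe):
    """Assemble one Markdown note with a section per page image.

    `describe` is a hypothetical callable (base64 string -> description text)
    standing in for the real multimodal model request.
    """
    lines = [f"# {title}", ""]
    for number, path in enumerate(image_paths, start=1):
        lines += [f"## Page {number}", "", describe(encode_image(path)), ""]
    return "\n".join(lines)
```

Passing the model call in as a parameter lets the note-building logic be exercised with a stub such as `lambda b64: "placeholder"`, so no document leaves the machine during a dry run.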