Audio Summary — 技能 — openclaw中文资讯站

技能详情（站内镜像，无评论）

Automatically extracts audio from video, transcribes it using qwen3-asr-flash, and generates segmented text summaries saved alongside the original file.

媒体与内容

许可证：MIT-0

MIT-0 ·免费使用、修改和重新分发。无需归因。

版本：v1.0.0

统计：⭐ 0 · 180 · 3 current installs · 3 all-time installs

⭐ 0

安装量（当前） 3

🛡 VirusTotal ：可疑 · OpenClaw ：可疑

Package：alanoo7/audio-summary

安全扫描（ClawHub）

VirusTotal ：可疑
OpenClaw ：可疑

OpenClaw 评估

The skill's functionality matches its description, but it embeds an API key and sends full audio to a non-standard third‑party endpoint (not declared), which is disproportionate and raises privacy/exfiltration concerns.

目的

The code does what the name/description claim: it uses ffmpeg to extract/compress audio and calls a qwen3-asr-flash ASR model via the OpenAI Python client. Declared dependencies in SKILL.md (ffmpeg, openai SDK) match the implementation. However, the skill does not declare any required environment variables or primary credential in the registry metadata, yet the script contains a hard-coded API key and a custom base_url — an inconsistency betwe…

说明范围

Runtime instructions and the script convert entire audio files to a Base64 data URI and send that data to a remote model endpoint. The SKILL.md references the '百炼 API KEY' but does not disclose the actual network endpoint used by the code (the code targets dashscope.aliyuncs.com). Sending full audio data to an undeclared third‑party endpoint is a privacy/exfiltration risk. The instructions also recommend running the exact included script path, which will use the embedded key by default.

安装机制

There is no install spec (instruction-only with a single Python script). That lowers supply-chain risk because nothing will be automatically downloaded or extracted during install.

证书

The skill requires an API credential to call the ASR model, but instead of declaring a required env var or asking the user to supply a key, the script hard-codes an API key string and a non-standard base_url. The registry metadata declared no required credentials; embedding a key in the code is disproportionate and insecure. The endpoint in code (dashscope.aliyuncs.com) is not the public qwen/openai domain and is not explained in SKILL.md.

持久

The skill is not always-enabled and does not request elevated platform privileges or modify other skills/config. It runs only when invoked and does not persist configuration beyond writing its own summary output file in the same directory as the input.

安装（复制给龙虾 AI）

将下方整段复制到龙虾中文库对话中，由龙虾按 SKILL.md 完成安装。

请把本段交给龙虾中文库（龙虾 AI）执行：为本机安装 OpenClaw 技能「Audio Summary」。简介：Automatically extracts audio from video, transcribes it using qwen3-asr-flash, …。
请 fetch 以下地址读取 SKILL.md 并按文档完成安装：https://raw.githubusercontent.com/openclaw/skills/refs/heads/main/skills/alanoo7/audio-summary/SKILL.md
（来源：yingzhi8.cn 技能库）

SKILL.md

打开原始 SKILL.md（GitHub raw）

# audio-summary Skill

音频/视频转文本总结助手。

## 功能

1.  **自动音频提取**：使用 `ffmpeg` 从 MP4 等视频文件中提取 16k mono 压缩音频，以适配大模型体积限制。
2.  **转录转总结**：基于百炼 `qwen3-asr-flash` 模型，自动将音频转换为文字并生成内容分段总结。
3.  **大文件支持**：通过 48k 压缩，支持最长约 5-8 分钟的视频单次直接转录。

## 依赖

-   `ffmpeg` (已安装在系统路径)
-   `openai` Python SDK (已安装)
-   百炼 API KEY (已在脚本中配置为 `sk-76735...`)

## 使用方法

### 从命令行运行

```powershell
# 对指定视频进行提取和总结
python .openclaw/workspace/skills/audio-summary/audio_summary_skill.py "C:PathToYourVideo.mp4"
```

### 文件位置
- 提取出的总结文本将自动保存在视频同级目录下，并命名为 `视频名_summary.txt`。

## 注意事项
- 目前单次 Base64 转录限制为 6MB，对于超过 10 分钟的长视频，建议先手动切分或进一步降低码率。
- API 费用按 `qwen3-asr-flash` 模型计费。