Voice Recognition — 技能 — openclaw中文资讯站

技能详情（站内镜像，无评论）

Local speech-to-text with OpenAI Whisper CLI. Supports Chinese, English, 100+ languages with translation and summarization.

媒体与内容

许可证：MIT-0

MIT-0 ·免费使用、修改和重新分发。无需归因。

版本：v1.0.0

统计：⭐ 2 · 1.5k · 6 current installs · 7 all-time installs

⭐ 2

安装量（当前） 7

🛡 VirusTotal ：良性 · OpenClaw ：良性

Package：voice-recognition

安全扫描（ClawHub）

VirusTotal ：良性
OpenClaw ：良性

OpenClaw 评估

The skill is internally coherent: it wraps the local OpenAI Whisper CLI with a small Python helper and does not request credentials or external endpoints, though the SKILL.md contains user-specific hard-coded paths and the tool will download model data to the user's cache on first run.

目的

The name/description (local Whisper-based speech-to-text) match the included Python script and the SKILL.md. The README asks you to install openai-whisper via Homebrew and use Python 3.10+, which is appropriate. Minor oddity: usage examples in SKILL.md hard-code an absolute path (/Users/liyi/.openclaw/workspace/...) pointing to a specific user's workspace — this is inconsistent with distributing the script and should be updated to relative or …

说明范围

Runtime instructions simply run the included Python script which calls the external 'whisper' CLI (no shell=True). The script reads an audio file, writes a .txt transcript beside that file, and can generate a simple local summary. It does not read unrelated system files or environment variables, nor does it post data to remote endpoints. Note: first run will download model weights to ~/.cache/whisper (network and disk usage).

安装机制

There is no install spec (instruction-only skill). The SKILL.md recommends 'brew install openai-whisper' which is a reasonable, low-risk installation path for the Whisper CLI.

证书

The skill requests no environment variables, no credentials, and no config paths. The behavior (invoking a local 'whisper' binary) is proportionate to the stated function. Reminder: because it calls an external binary by name, it depends on the 'whisper' in PATH being the expected implementation.

持久

The skill does not request permanent/always inclusion, does not modify other skills, and contains no code that attempts to change system-wide agent settings. It only suggests an optional shell alias for convenience (user action).

综合结论

This skill appears to do what it says: a small Python wrapper that invokes the local OpenAI Whisper CLI and writes transcripts locally. Before installing/use: (1) install openai-whisper from a trusted source (Homebrew tap) so the 'whisper' binary on your PATH is legitimate; (2) be aware the first run will download model weights to ~/.cache/whisper (large download and disk usage); (3) update the SKILL.md usage examples to point to the script lo…

安装（复制给龙虾 AI）

将下方整段复制到龙虾中文库对话中，由龙虾按 SKILL.md 完成安装。

请把本段交给龙虾中文库（龙虾 AI）执行：为本机安装 OpenClaw 技能「Voice Recognition」。简介：Local speech-to-text with OpenAI Whisper CLI. Supports Chinese, English, 100+ l…。
请 fetch 以下地址读取 SKILL.md 并按文档完成安装：https://raw.githubusercontent.com/openclaw/skills/refs/heads/main/skills/gykdly/voice-recognition/SKILL.md
（来源：yingzhi8.cn 技能库）

SKILL.md

打开原始 SKILL.md（GitHub raw）

---
name: voice-recognition
description: Local speech-to-text with OpenAI Whisper CLI. Supports Chinese, English, 100+ languages with translation and summarization.
version: 1.0.0
---

# Voice Recognition (Whisper)

Local speech-to-text with OpenAI Whisper CLI.

## Features

- **Local processing** - No API key needed, free
- **Multi-language** - Chinese, English, 100+ languages
- **Translation** - Translate to English
- **Summarization** - Generate quick summary

## Usage

### Basic

```bash
# Chinese recognition
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a

# Force Chinese
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a --zh

# English recognition  
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a --en

# Translate to English
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a --translate

# With summary
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a --summarize
```

### Quick Command (add to ~/.zshrc)

```bash
alias voice="python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py"
```

Then use:

```bash
voice ~/Downloads/audio.m4a --zh
```

## Requirements

- OpenAI Whisper CLI: `brew install openai-whisper`
- Python 3.10+

## Files

- `scripts/voice识别_升级版.py` - Main script
- `scripts/voice_tool_README.md` - Documentation

## Supported Formats

- MP3, M4A, WAV, OGG, FLAC, WebM

## Language Support

100+ languages including:
- Chinese (zh)
- English (en)
- Japanese (ja)
- Korean (ko)
- And more...

## Notes

- Default model: `medium` (balance of speed and accuracy)
- First run downloads model to `~/.cache/whisper`
- Processing time varies by audio length and model size