Skill details (on-site mirror, no comments)
License: MIT-0
MIT-0 · Free to use, modify, and redistribute. No attribution required.
Version: v1.0.0
Stats: ⭐ 2 · 1.5k · 6 current installs · 7 all-time installs
Installs (current): 7
Package: voice-recognition
Security scan (ClawHub)
- VirusTotal: Benign
- OpenClaw: Benign
OpenClaw assessment
The skill is internally coherent: it wraps the local OpenAI Whisper CLI with a small Python helper and does not request credentials or external endpoints, though the SKILL.md contains user-specific hard-coded paths and the tool will download model data to the user's cache on first run.
Purpose
The name/description (local Whisper-based speech-to-text) match the included Python script and the SKILL.md. The README asks you to install openai-whisper via Homebrew and use Python 3.10+, which is appropriate. Minor oddity: usage examples in SKILL.md hard-code an absolute path (/Users/liyi/.openclaw/workspace/...) pointing to a specific user's workspace — this is inconsistent with distributing the script and should be updated to relative or …
Instruction scope
Runtime instructions simply run the included Python script which calls the external 'whisper' CLI (no shell=True). The script reads an audio file, writes a .txt transcript beside that file, and can generate a simple local summary. It does not read unrelated system files or environment variables, nor does it post data to remote endpoints. Note: first run will download model weights to ~/.cache/whisper (network and disk usage).
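The invocation pattern described above (an argument list passed to the `whisper` CLI without `shell=True`, with the transcript written beside the audio file) can be sketched roughly as follows. This is not the skill's actual script: the helper names `build_whisper_command` and `transcribe` are hypothetical, and the transcript file name assumes a recent openai-whisper release that names its output after the audio file's base name.

```python
import subprocess
from pathlib import Path


def build_whisper_command(audio_path, language=None, translate=False, model="medium"):
    """Build an argument list for the whisper CLI (hypothetical helper)."""
    audio = Path(audio_path)
    cmd = [
        "whisper", str(audio),
        "--model", model,
        "--output_format", "txt",
        # Write the transcript into the same directory as the audio file.
        "--output_dir", str(audio.parent),
    ]
    if language:
        cmd += ["--language", language]
    if translate:
        cmd += ["--task", "translate"]
    return cmd


def transcribe(audio_path, **options):
    """Run whisper and return the path where the .txt transcript is expected."""
    cmd = build_whisper_command(audio_path, **options)
    # A list of arguments with the default shell=False means the file name
    # is never interpreted by a shell.
    subprocess.run(cmd, check=True)
    # Assumption: whisper names the output <audio base name>.txt.
    return Path(audio_path).with_suffix(".txt")
```

Passing the arguments as a list keeps file names with spaces or shell metacharacters from being reinterpreted, which is the property the assessment highlights.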
Install mechanism
There is no install spec (instruction-only skill). The SKILL.md recommends 'brew install openai-whisper' which is a reasonable, low-risk installation path for the Whisper CLI.
Credentials
The skill requests no environment variables, no credentials, and no config paths. The behavior (invoking a local 'whisper' binary) is proportionate to the stated function. Reminder: because it calls an external binary by name, it depends on the 'whisper' in PATH being the expected implementation.
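Because the skill resolves `whisper` by name, it may be worth checking which binary the PATH lookup actually returns before running it. A minimal sketch using only the standard library (the function name `resolve_whisper` is hypothetical and not part of the skill):

```python
import shutil


def resolve_whisper(binary="whisper"):
    """Return the absolute path that a PATH lookup for `binary` resolves to."""
    path = shutil.which(binary)
    if path is None:
        raise FileNotFoundError(f"'{binary}' was not found on PATH")
    return path
```

Printing the returned path and comparing it against the location you expect (for example, your Homebrew prefix) is a quick way to confirm that the expected implementation will be invoked.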
Persistence
The skill does not request permanent/always inclusion, does not modify other skills, and contains no code that attempts to change system-wide agent settings. It only suggests an optional shell alias for convenience (user action).
Overall conclusion
This skill appears to do what it says: a small Python wrapper that invokes the local OpenAI Whisper CLI and writes transcripts locally. Before installing or using it: (1) install openai-whisper from a trusted source (Homebrew tap) so the 'whisper' binary on your PATH is legitimate; (2) be aware that the first run will download model weights to ~/.cache/whisper (a large download that also uses disk space); (3) update the SKILL.md usage examples to point to the script lo…
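Install (copy to Lobster AI)
Copy the entire passage below into a Lobster Chinese Library (龙虾中文库) conversation, and Lobster will complete the installation according to SKILL.md.
Hand this passage to Lobster Chinese Library (Lobster AI) to execute: install the OpenClaw skill "Voice Recognition" on this machine. Summary: Local speech-to-text with OpenAI Whisper CLI. Supports Chinese, English, 100+ l…
Fetch the following URL, read SKILL.md, and complete the installation as documented: https://raw.githubusercontent.com/openclaw/skills/refs/heads/main/skills/gykdly/voice-recognition/SKILL.md
(Source: yingzhi8.cn skill library)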
SKILL.md
---
name: voice-recognition
description: Local speech-to-text with OpenAI Whisper CLI. Supports Chinese, English, 100+ languages with translation and summarization.
version: 1.0.0
---
# Voice Recognition (Whisper)
Local speech-to-text with OpenAI Whisper CLI.
## Features
- **Local processing** - No API key needed, free
- **Multi-language** - Chinese, English, 100+ languages
- **Translation** - Translate to English
- **Summarization** - Generate quick summary
## Usage
### Basic
```bash
# Chinese recognition
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a
# Force Chinese
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a --zh
# English recognition
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a --en
# Translate to English
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a --translate
# With summary
python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py audio.m4a --summarize
```
### Quick Command (add to ~/.zshrc)
```bash
alias voice="python3 /Users/liyi/.openclaw/workspace/scripts/voice识别_升级版.py"
```
Then use:
```bash
voice ~/Downloads/audio.m4a --zh
```
## Requirements
- OpenAI Whisper CLI: `brew install openai-whisper`
- Python 3.10+
## Files
- `scripts/voice识别_升级版.py` - Main script
- `scripts/voice_tool_README.md` - Documentation
## Supported Formats
- MP3, M4A, WAV, OGG, FLAC, WebM
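
A caller might want to reject unsupported files before invoking Whisper at all. The sketch below checks the file extension against the formats listed above; the extension spellings and the helper name `is_supported_audio` are assumptions, and the skill's own script may validate input differently or not at all.

```python
from pathlib import Path

# Extensions assumed to correspond to the formats listed above.
SUPPORTED_EXTENSIONS = {".mp3", ".m4a", ".wav", ".ogg", ".flac", ".webm"}


def is_supported_audio(path):
    """Return True if the file extension matches a listed format (case-insensitive)."""
    return Path(path).suffix.lower() in SUPPORTED_EXTENSIONS
```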
## Language Support
100+ languages including:
- Chinese (zh)
- English (en)
- Japanese (ja)
- Korean (ko)
- And more...
## Notes
- Default model: `medium` (balance of speed and accuracy)
- First run downloads model to `~/.cache/whisper`
- Processing time varies by audio length and model size
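
To anticipate the first-run download, you could check whether the model weights are already cached. This sketch assumes the weights are stored as `<model>.pt` inside the cache directory named above; that file naming is an assumption about the Whisper package rather than something this document states, and both function names are hypothetical.

```python
from pathlib import Path


def cached_model_path(model="medium", cache_dir=None):
    """Return the path where the weights for `model` are assumed to be cached."""
    cache = Path(cache_dir) if cache_dir else Path.home() / ".cache" / "whisper"
    # Assumption: weights are saved as <model>.pt in the cache directory.
    return cache / f"{model}.pt"


def model_is_cached(model="medium", cache_dir=None):
    """Return True if the assumed weights file already exists on disk."""
    return cached_model_path(model, cache_dir).is_file()
```

If `model_is_cached()` returns `False`, expect the next transcription to spend time and disk space downloading the model before any audio is processed.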