openclaw 网盘下载
OpenClaw

技能详情(站内镜像,无评论)

首页 > 技能库 > Bilibili Transcript

Transcribe Bilibili videos to text with high accuracy using Whisper medium model. Use when the user provides a Bilibili video URL (BVxxxxx) and wants to: (1)...

通信与消息

许可证:MIT-0

MIT-0 ·免费使用、修改和重新分发。无需归因。

版本:v2.2.0

统计:⭐ 0 · 321 · 1 current installs · 1 all-time installs

0

安装量(当前) 1

🛡 VirusTotal :可疑 · OpenClaw :可疑

Package:54lynnn/bilibili-transcript

安全扫描(ClawHub)

  • VirusTotal :可疑
  • OpenClaw :可疑

OpenClaw 评估

The skill appears to implement Bilibili transcription as described, but there are multiple mismatches and privacy-sensitive behaviors (automatic browser cookie access, filesystem probing, and conflicting documentation) that you should understand before installing or running it.

目的

Name/description match the actual script: it downloads subtitles or audio from Bilibili and transcribes with Whisper. However registry metadata says no required binaries while SKILL.md/README clearly require yt-dlp, whisper, ffmpeg and optionally opencc; README also lists a different version (2.8.0) than registry (2.2.0). SKILL.md claims default output folder 'workspace/Bilibili transcript/' but the script defaults OUTPUT_DIR to /tmp — documen…

说明范围

The runtime script probes local environment: it lists /mnt/c/Users to detect a Windows user and reads browser profile directories (WSL Chromium path and Windows Edge user data) to pass to yt-dlp's --cookies-from-browser. That means the script will attempt to read browser cookies (auth tokens) for member-only content. This is privacy-sensitive but is coherent with the stated goal of accessing member-only AI subtitles; still the script automatic…

安装机制

No install spec is present (instruction-only plus a shell script). No remote downloads or archive extraction are performed by the skill itself. The script relies on external binaries (yt-dlp, whisper, etc.) already on the host.

证书

No environment variables or external credentials are declared, which is good, but the script accesses local browser cookie stores via yt-dlp --cookies-from-browser and probes /mnt/c/Users. Those actions grant access to authentication cookies and reveal local usernames/profile paths. This access is proportionate to the stated need (member-only subtitles) but is sensitive and not explicitly reflected in registry 'required config paths' metadata.

持久

The skill is not always-enabled and does not request elevated privileges or modify other skills or system-wide configuration. It writes transcript files to an output directory (default /tmp or user-specified).

安装(复制给龙虾 AI)

将下方整段复制到龙虾中文库对话中,由龙虾按 SKILL.md 完成安装。

请把本段交给龙虾中文库(龙虾 AI)执行:为本机安装 OpenClaw 技能「Bilibili Transcript」。简介:Transcribe Bilibili videos to text with high accuracy using Whisper medium mode…。
请 fetch 以下地址读取 SKILL.md 并按文档完成安装:https://raw.githubusercontent.com/openclaw/skills/refs/heads/main/skills/54lynnn/bilibili-transcript/SKILL.md
(来源:yingzhi8.cn 技能库)

SKILL.md

打开原始 SKILL.md(GitHub raw)

---
name: bilibili-transcript
version: "2.2.0"
description: "Transcribe Bilibili videos to text with high accuracy using Whisper medium model. Use when the user provides a Bilibili video URL (BVxxxxx) and wants to: (1) Extract the complete audio content as text with high accuracy, (2) Get a detailed summary of the video content, (3) Save the transcript as a formatted TXT file instead of posting long text to Discord. Automatically detects CC subtitles if available, otherwise uses Whisper medium model with GPU acceleration. Output saves to 'Bilibili transcript' folder by default, includes video metadata, summary section, and full transcript in Simplified Chinese."
---

# Bilibili Transcript v2.2

High-accuracy Bilibili video transcription with multi-language AI subtitle support.

## Overview

This skill provides a **complete transcription workflow** for Bilibili videos:

1. **Extract Video Metadata** - Title, author, publish date, duration
2. **Smart Subtitle Detection** - Priority: CC subtitles → AI subtitles (multi-language) → Whisper transcription
3. **Multi-language AI Subtitle Support** - Auto-detects: `ai-zh`, `ai-en`, `ai-ja`, `ai-es`, `ai-ar`, `ai-pt`, `ai-ko`, `ai-de`, `ai-fr`
4. **Browser Cookie Support** - WSL Chromium or Windows Edge for member-only videos
5. **Formatted Output** - Saves as structured TXT file with metadata + summary placeholder + full transcript
6. **Simplified Chinese** - Automatically converts Traditional to Simplified Chinese

## What's New in v2.2

- ✅ **Fixed cookie detection** - Now uses browser config directory instead of SQLite file (avoids encoding errors)
- ✅ **One-stop solution** - CC subtitles → AI subtitles → Whisper transcription, all in one script
- ✅ **Better WSL support** - Automatically detects WSL Chromium and Windows Edge cookies
- ✅ **Smart fallback** - Seamlessly switches between subtitle sources without user intervention

## What's New in v2.1

- ✅ **Improved cookie handling** - Fixed UTF-8 encoding issues with snap Chromium
- ✅ **Three-tier fallback** - CC subtitles → AI subtitles → Whisper transcription
- ✅ **Better error handling** - Gracefully degrades when cookie sources fail

## What's New in v2.0

- ✅ **Multi-language AI subtitles** - Supports 9 languages: Chinese, English, Japanese, Spanish, Arabic, Portuguese, Korean, German, French
- ✅ **WSL Chromium support** - Better cookie extraction than Windows Edge
- ✅ **Correct subtitle download** - Uses `--write-subs --write-auto-subs` combo
- ✅ **Language auto-detection** - Automatically finds available AI subtitle language

## AI Subtitle Language Codes

Bilibili uses `ai-` prefix for AI-generated subtitles:

| Code | Language | 语言 |
|------|----------|------|
| `ai-zh` | Chinese | 中文 |
| `ai-en` | English | 英文 |
| `ai-ja` | Japanese | 日文 |
| `ai-es` | Spanish | 西班牙文 |
| `ai-ar` | Arabic | 阿拉伯文 |
| `ai-pt` | Portuguese | 葡萄牙文 |
| `ai-ko` | Korean | 韩文 |
| `ai-de` | German | 德文 |
| `ai-fr` | French | 法文 |

## Requirements

### Hardware (Your Setup)
- **GPU**: NVIDIA RTX 4070 Super (12GB VRAM) - ✅ Perfect for medium model
- **WSL Memory**: 16GB (configured)
- **WSL CPU**: 6 cores (configured)

### Software
- `yt-dlp` - Video/audio download
- `ffmpeg` - Audio processing
- `whisper` - Speech-to-text (local, no API key)
- `opencc` - Traditional to Simplified Chinese conversion (optional)

### Browser (for AI subtitles)
- **WSL Chromium** (recommended) - Log in to Bilibili in WSL
- **Windows Edge** - Alternative option

## Workflow

### Step 1: Run Transcription Script

```bash
./scripts/bilibili_transcript.sh "https://www.bilibili.com/video/BVxxxxx"
```

**Priority order:**
1. **CC Subtitles** (manual) - Fastest, highest accuracy
2. **AI Subtitles** (auto-generated) - Fast, good accuracy, multi-language
3. **Whisper Transcription** - Slowest, ~95% accuracy, works for all videos

### Step 2: Generate Detailed Summary

After the script completes, read the generated TXT file and:
1. Read the full transcript (第二部分)
2. Generate a comprehensive summary (第一部分)
3. Save the updated file

### Step 3: Present to User

In Discord, post:
- **Brief summary** in message
- **Attach the TXT file** for full content

## Setup WSL Chromium Login

For best results with AI subtitles:

1. Start WSL Chromium:
   ```bash
   chromium-browser &
   ```

2. Navigate to bilibili.com

3. Log in with your Bilibili account

4. Run the transcription script

The script will automatically use Chromium's cookies to access member-only AI subtitles.

## Usage Examples

### Example 1: Basic Transcription (Default Output)
```bash
./scripts/bilibili_transcript.sh "https://www.bilibili.com/video/BV1Z1wJzgEAj/"
# Output: workspace/Bilibili transcript/[VideoTitle]_BVxxxxx_transcript.txt
```

### Example 2: Custom Output Directory
```bash
./scripts/bilibili_transcript.sh "https://www.bilibili.com/video/BV1Z1wJzgEAj/" ~/Documents
```

## Notes

### Model Selection
- **Your config**: RTX 4070 Super 12GB + 16GB RAM + 6 cores
- **Default**: `medium` model (~95% accuracy, balanced speed) ✅
- **Fallback**: If GPU unavailable, automatically uses CPU (slower)

### Accuracy Comparison
| Source | Accuracy | Speed | Best For |
|--------|----------|-------|----------|
| CC Subtitles | 100% | ⚡ Instant | All videos with manual subtitles |
| AI Subtitles (ai-zh) | ~90% | ⚡ Instant | Chinese videos |
| AI Subtitles (ai-en) | ~85% | ⚡ Instant | English videos |
| Whisper medium | ~95% | 🐢 Slow | No subtitle videos |

### Default Output Directory
- **Location**: `workspace/Bilibili transcript/`
- **Created automatically** on first run
- All transcript files organized in one place

### File Naming
Output files are named: `[VideoTitle]_[BVID]_transcript.txt`
- Special characters (including Chinese punctuation) are replaced with underscores
- Title truncated to 50 characters
- Example: `股票分红_是从左口袋掏右口袋吗_BV1ddzUYTE27_transcript.txt`

### Subtitle Priority
The script tries subtitles in this order:
1. Manual CC subtitles (zh-CN, zh-TW, en, ja, etc.)
2. AI subtitles (any available language: ai-zh, ai-en, ai-ja, etc.)
3. Whisper voice transcription (fallback)

This ensures fastest processing while maintaining high accuracy.