← Back to list

whisper
by trpc-group
trpc-agent-go is a powerful Go framework for building intelligent agent systems using large language models (LLMs) and tools.
⭐ 832🍴 80📅 Jan 23, 2026
SKILL.md
name: whisper description: Transcribe audio files to text using OpenAI Whisper
Whisper Audio Transcription Skill
Transcribe audio files to text using OpenAI Whisper.
Capabilities
- Transcribe audio files (MP3, WAV, M4A, FLAC, OGG, etc.) to text
- Support for 90+ languages with auto-detection
- Optional timestamp generation
- Multiple model sizes (tiny/base/small/medium/large)
- Output in plain text or JSON format
Usage
Basic Transcription
python3 scripts/transcribe.py <audio_file> <output_file>
With Options
# Specify model size (default: base)
python3 scripts/transcribe.py audio.mp3 transcript.txt --model medium
# Specify language (improves accuracy)
python3 scripts/transcribe.py audio.mp3 transcript.txt --language zh
# Include timestamps
python3 scripts/transcribe.py audio.mp3 transcript.txt --timestamps
# JSON output with metadata
python3 scripts/transcribe.py audio.mp3 output.json --format json
Parameters
audio_file(required): Path to input audio fileoutput_file(required): Path to output text/JSON file--model: Whisper model size (tiny/base/small/medium/large, default: base)--language: Language code (e.g., en, zh, es, fr, auto for detection)--timestamps: Include word-level timestamps in output--format: Output format (text/json, default: text)
Model Sizes
| Model | Parameters | Speed | Accuracy | Memory |
|---|---|---|---|---|
| tiny | 39M | ~32x | Good | ~1GB |
| base | 74M | ~16x | Better | ~1GB |
| small | 244M | ~6x | Great | ~2GB |
| medium | 769M | ~2x | Excellent | ~5GB |
| large | 1.5B | 1x | Best | ~10GB |
Supported Audio Formats
MP3, WAV, M4A, FLAC, OGG, AAC, WMA, and more (via FFmpeg)
Dependencies
- Python 3.8+
- openai-whisper
- ffmpeg
Installation
pip install openai-whisper
sudo apt-get install ffmpeg # Ubuntu/Debian
Score
Total Score
90/100
Based on repository quality metrics
✓SKILL.md
SKILL.mdファイルが含まれている
+20
✓LICENSE
ライセンスが設定されている
+10
✓説明文
100文字以上の説明がある
+10
✓人気
GitHub Stars 500以上
+10
✓最近の活動
1ヶ月以内に更新
+10
✓フォーク
10回以上フォークされている
+5
✓Issue管理
オープンIssueが50未満
+5
✓言語
プログラミング言語が設定されている
+5
✓タグ
1つ以上のタグが設定されている
+5
Reviews
💬
Reviews coming soon
