Back to list
trpc-group

whisper

by trpc-group

trpc-agent-go is a powerful Go framework for building intelligent agent systems using large language models (LLMs) and tools.

832🍴 80📅 Jan 23, 2026

SKILL.md


name: whisper description: Transcribe audio files to text using OpenAI Whisper

Whisper Audio Transcription Skill

Transcribe audio files to text using OpenAI Whisper.

Capabilities

  • Transcribe audio files (MP3, WAV, M4A, FLAC, OGG, etc.) to text
  • Support for 90+ languages with auto-detection
  • Optional timestamp generation
  • Multiple model sizes (tiny/base/small/medium/large)
  • Output in plain text or JSON format

Usage

Basic Transcription

python3 scripts/transcribe.py <audio_file> <output_file>

With Options

# Specify model size (default: base)
python3 scripts/transcribe.py audio.mp3 transcript.txt --model medium

# Specify language (improves accuracy)
python3 scripts/transcribe.py audio.mp3 transcript.txt --language zh

# Include timestamps
python3 scripts/transcribe.py audio.mp3 transcript.txt --timestamps

# JSON output with metadata
python3 scripts/transcribe.py audio.mp3 output.json --format json

Parameters

  • audio_file (required): Path to input audio file
  • output_file (required): Path to output text/JSON file
  • --model: Whisper model size (tiny/base/small/medium/large, default: base)
  • --language: Language code (e.g., en, zh, es, fr, auto for detection)
  • --timestamps: Include word-level timestamps in output
  • --format: Output format (text/json, default: text)

Model Sizes

ModelParametersSpeedAccuracyMemory
tiny39M~32xGood~1GB
base74M~16xBetter~1GB
small244M~6xGreat~2GB
medium769M~2xExcellent~5GB
large1.5B1xBest~10GB

Supported Audio Formats

MP3, WAV, M4A, FLAC, OGG, AAC, WMA, and more (via FFmpeg)

Dependencies

  • Python 3.8+
  • openai-whisper
  • ffmpeg

Installation

pip install openai-whisper
sudo apt-get install ffmpeg  # Ubuntu/Debian

Score

Total Score

90/100

Based on repository quality metrics

SKILL.md

SKILL.mdファイルが含まれている

+20
LICENSE

ライセンスが設定されている

+10
説明文

100文字以上の説明がある

+10
人気

GitHub Stars 500以上

+10
最近の活動

1ヶ月以内に更新

+10
フォーク

10回以上フォークされている

+5
Issue管理

オープンIssueが50未満

+5
言語

プログラミング言語が設定されている

+5
タグ

1つ以上のタグが設定されている

+5

Reviews

💬

Reviews coming soon