openai-whisper-api
À la uneTranscribe any audio file to text using OpenAI's Whisper model — fast, accurate, and scriptable via a simple CLI.
npx clawhub@latest install openai-whisper-apiPrérequis
The OpenAI Whisper API skill lets you transcribe audio files directly from the command line using OpenAI's industry-leading /v1/audio/transcriptions endpoint. Drop in any supported audio file and get back a clean text transcript in seconds — no complex setup required.Powered by whisper-1, the skill handles a wide range of audio formats and supports optional language hints, speaker prompts, and JSON output — making it easy to integrate transcription into automated workflows.
Comment ça fonctionne
Once installed, the skill exposes a shell script that wraps OpenAI's audio transcription API using curl. Here's the basic flow:Point the script at your audio file: transcribe.sh /path/to/audio.m4aThe script sends the file to OpenAI's /v1/audio/transcriptions endpoint using your API key.The transcript is saved alongside the input file as .txt by default, or to a custom path using --out.You can tune the request with flags like --language en to improve accuracy, --prompt to provide context (e.g. speaker names), or --json to receive the raw API JSON response.
Fonctionnalités clés
Simple CLI Interface — Transcribe audio with a single command; no boilerplate code needed.Language Hints — Pass --language to guide Whisper toward a specific spoken language for better accuracy.Context Prompting — Use --prompt to inject speaker names or domain vocabulary, improving transcript quality.Flexible Output — Save transcripts as plain text (default) or raw JSON with the --json flag.Custom Output Path — Specify any destination file with --out /path/to/output.txt.Configurable Model — Select the Whisper model via --model (defaults to whisper-1).
Prérequis
OpenAI API Key (required) — Authenticates requests to OpenAI's audio transcription service. Set the OPENAI_API_KEY environment variable, or configure it in ~/.openclaw/openclaw.json. Supports: OpenAI.
Cas d'utilisation
Meeting & Interview Transcription — Convert recorded meetings or interviews into searchable text transcripts automatically.Content Repurposing — Turn podcast episodes or voice memos into written content ready for editing or publishing.Automated Pipelines — Integrate the script into CI/CD or data pipelines to batch-transcribe audio files without manual effort.Accessibility — Generate text versions of audio content to improve accessibility for hearing-impaired audiences.
Comment installer
npx clawhub@latest install openai-whisper-apinpx clawhub@latest install openai-whisper-apiPrérequis
FAQ
Avis
0 avisSe connecter pour écrire un avis
Aucun avis pour l'instant. Soyez le premier à partager votre expérience !