
openai-whisper-api

Name: openai-whisper-api
Author: OpenClaw

Featured

Transcribe any audio file to text using OpenAI's Whisper model — fast, accurate, and scriptable via a simple CLI.

by OpenClaw, v1.0.0

Browser & Automation, Productivity, AI Powered, Automation, API, CLI, Developer Tool


npx clawhub@latest install openai-whisper-api

Current installs: 136
Version: v1.0.0
Requirements: curl, OpenAI API Key

The OpenAI Whisper API skill lets you transcribe audio files from the command line using OpenAI's /v1/audio/transcriptions endpoint. Point it at any supported audio file and get back a plain-text transcript, with no setup beyond curl and an API key.

The skill uses the whisper-1 model by default, handles the audio formats the endpoint accepts, and supports optional language hints, context prompts, and JSON output, which makes it easy to integrate transcription into automated workflows.

How it works

Once installed, the skill exposes a shell script that wraps OpenAI's audio transcription API using curl. The basic flow:

1. Point the script at your audio file: transcribe.sh /path/to/audio.m4a
2. The script sends the file to OpenAI's /v1/audio/transcriptions endpoint using your API key.
3. The transcript is saved alongside the input file as .txt by default, or to a custom path using --out.
4. You can tune the request with flags such as --language en to improve accuracy, --prompt to provide context (e.g. speaker names), or --json to receive the raw API JSON response.
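The flow above can be sketched as a single curl call. This is a rough illustration, not the skill's actual script: the function names default_out_path and transcribe_sketch are hypothetical, and the default output name (the input's extension replaced by .txt) is an assumption about what "saved alongside the input file as .txt" means. The endpoint URL and the file, model, and response_format form fields are those of OpenAI's transcription API.

```shell
#!/usr/bin/env bash
# Sketch only: approximates the request a wrapper like transcribe.sh might send.
# default_out_path and transcribe_sketch are hypothetical names.

# Derive "<dir>/<name>.txt" from the audio path (assumed default naming).
default_out_path() {
  local audio="$1" dir base
  dir="$(dirname "$audio")"
  base="$(basename "$audio")"
  printf '%s/%s.txt\n' "$dir" "${base%.*}"
}

# Upload the file and write the plain-text transcript to the output path.
transcribe_sketch() {
  local audio="$1"
  local out="${2:-}"
  [ -n "$out" ] || out="$(default_out_path "$audio")"

  curl -sS --fail https://api.openai.com/v1/audio/transcriptions \
    -H "Authorization: Bearer ${OPENAI_API_KEY}" \
    -F "file=@${audio}" \
    -F "model=whisper-1" \
    -F "response_format=text" \
    > "$out"
}
```

With OPENAI_API_KEY exported, calling transcribe_sketch /path/to/audio.m4a would write /path/to/audio.txt under these assumptions. A second argument plays the role of --out.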

Key features

- Simple CLI Interface: Transcribe audio with a single command; no boilerplate code needed.
- Language Hints: Pass --language to guide Whisper toward a specific spoken language for better accuracy.
- Context Prompting: Use --prompt to inject speaker names or domain vocabulary, improving transcript quality.
- Flexible Output: Save transcripts as plain text (default) or raw JSON with the --json flag.
- Custom Output Path: Specify any destination file with --out /path/to/output.txt.
- Configurable Model: Select the Whisper model via --model (defaults to whisper-1).
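To make the flags concrete, here is one way they could map onto the form fields of the transcription request. This is a sketch under assumptions: build_form_fields is a hypothetical helper, and the mapping of --json to response_format=json (with text as the default) is a guess at the script's behavior. The field names model, language, prompt, and response_format are the API's own.

```shell
#!/usr/bin/env bash
# Sketch only: build_form_fields is a hypothetical helper, not part of the skill.
# It prints one "name=value" line per form field the flags would produce.

build_form_fields() {
  local model="whisper-1" language="" prompt="" format="text"
  while [ "$#" -gt 0 ]; do
    case "$1" in
      --model|--language|--prompt|--out)
        [ "$#" -ge 2 ] || { echo "missing value for $1" >&2; return 2; }
        case "$1" in
          --model)    model="$2" ;;
          --language) language="$2" ;;
          --prompt)   prompt="$2" ;;
          --out)      : ;;   # output path is used locally, never sent to the API
        esac
        shift 2 ;;
      --json) format="json"; shift ;;   # assumed mapping to response_format=json
      *)      shift ;;                  # positional audio path is ignored here
    esac
  done
  printf 'model=%s\n' "$model"
  printf 'response_format=%s\n' "$format"
  if [ -n "$language" ]; then printf 'language=%s\n' "$language"; fi
  if [ -n "$prompt" ]; then printf 'prompt=%s\n' "$prompt"; fi
}
```

Each printed line would become one -F argument to curl. Optional fields are omitted when their flag is absent, so the API applies its own defaults.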

Requirements

OpenAI API Key (required): Authenticates requests to OpenAI's audio transcription service. Set the OPENAI_API_KEY environment variable, or configure the key in ~/.openclaw/openclaw.json. Supported provider: OpenAI.
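A wrapper script would typically fail early when the key is missing rather than send an unauthenticated request. The check below is a minimal sketch of that idea: require_api_key is a hypothetical name, and it only inspects the environment variable, because the layout of ~/.openclaw/openclaw.json is not described here.

```shell
#!/usr/bin/env bash
# Sketch only: require_api_key is a hypothetical helper.
# It checks the environment variable and does not read openclaw.json.

require_api_key() {
  if [ -z "${OPENAI_API_KEY:-}" ]; then
    echo "OPENAI_API_KEY is not set; export it or configure ~/.openclaw/openclaw.json" >&2
    return 1
  fi
}
```

Exporting the variable in your shell profile or CI secret store before invoking the script is enough to satisfy this check.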

Use cases

- Meeting & Interview Transcription: Convert recorded meetings or interviews into searchable text transcripts automatically.
- Content Repurposing: Turn podcast episodes or voice memos into written content ready for editing or publishing.
- Automated Pipelines: Integrate the script into CI/CD or data pipelines to batch-transcribe audio files without manual effort.
- Accessibility: Generate text versions of audio content to improve accessibility for hearing-impaired audiences.
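For the pipeline use case, a batch run might look like the loop below. It is a sketch under assumptions: batch_transcribe and the TRANSCRIBE_CMD override are hypothetical, the argument order "<audio file> --out <path>" is assumed, and only three of the extensions the API accepts are globbed.

```shell
#!/usr/bin/env bash
# Sketch only: batch_transcribe and TRANSCRIBE_CMD are hypothetical names.
# Assumes the script accepts "<audio file> --out <path>" in that order.

batch_transcribe() {
  local dir="$1" cmd="${TRANSCRIBE_CMD:-transcribe.sh}"
  local f out count=0
  for f in "$dir"/*.m4a "$dir"/*.mp3 "$dir"/*.wav; do
    [ -e "$f" ] || continue            # an unmatched glob stays literal; skip it
    out="${f%.*}.txt"
    [ -s "$out" ] && continue          # a non-empty transcript already exists
    if "$cmd" "$f" --out "$out" >&2; then
      count=$((count + 1))
    else
      echo "failed: $f" >&2
    fi
  done
  printf '%s\n' "$count"               # number of files transcribed in this run
}
```

Skipping files that already have a non-empty transcript keeps reruns cheap, which matters when each call is a billed API request.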

How to install

Run in your terminal:

npx clawhub@latest install openai-whisper-api

Alternatively, click the Install button at the top of the skill's page for one-click setup.
