OpenAI Whisper API

Skill verified and active

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

Purpose

Transcribe audio files with the OpenAI Whisper API through a convenient command-line interface.

Features

Audio transcription via OpenAI Whisper API
Customizable output format (text/JSON)
Support for specifying language and prompt hints
Configurable API base URL for proxies
Environment variable for API key management
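The features above can be sketched as a single request to the Audio Transcriptions endpoint. This is a minimal illustration using only the Python standard library, not the skill's own script. The helper names (`build_fields`, `encode_multipart`, `build_request`, `transcribe`) are hypothetical, and the environment variable names `OPENAI_API_KEY` and `OPENAI_BASE_URL` are assumptions about how the key and proxy URL might be supplied.

```python
import os
import urllib.request
import uuid
from pathlib import Path

# Default endpoint root; a proxy can be substituted via base_url.
DEFAULT_BASE_URL = "https://api.openai.com/v1"


def build_fields(model="whisper-1", response_format="text", language=None, prompt=None):
    """Collect form fields; optional hints are left out when unset."""
    fields = {"model": model, "response_format": response_format}
    if language:
        fields["language"] = language  # ISO-639-1 code such as "en"
    if prompt:
        fields["prompt"] = prompt  # free-text hint, e.g. expected names
    return fields


def encode_multipart(fields, filename, file_bytes):
    """Encode the fields and the audio file as multipart/form-data."""
    boundary = uuid.uuid4().hex
    parts = []
    for name, value in fields.items():
        parts.append(
            f'--{boundary}\r\nContent-Disposition: form-data; name="{name}"'
            f"\r\n\r\n{value}\r\n".encode()
        )
    file_header = (
        f'--{boundary}\r\nContent-Disposition: form-data; name="file"; '
        f'filename="{filename}"\r\nContent-Type: application/octet-stream\r\n\r\n'
    )
    parts.append(file_header.encode() + file_bytes + b"\r\n")
    parts.append(f"--{boundary}--\r\n".encode())
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def build_request(audio_path, api_key=None, base_url=None, **options):
    """Build (but do not send) the transcription request."""
    # Assumed variable names; adjust to whatever your setup uses.
    api_key = api_key or os.environ["OPENAI_API_KEY"]
    base_url = base_url or os.environ.get("OPENAI_BASE_URL", DEFAULT_BASE_URL)
    path = Path(audio_path)
    body, content_type = encode_multipart(
        build_fields(**options), path.name, path.read_bytes()
    )
    return urllib.request.Request(
        f"{base_url.rstrip('/')}/audio/transcriptions",
        data=body,
        headers={"Authorization": f"Bearer {api_key}", "Content-Type": content_type},
        method="POST",
    )


def transcribe(audio_path, **kwargs):
    """Send the request and return the raw response body as a string."""
    with urllib.request.urlopen(build_request(audio_path, **kwargs)) as resp:
        return resp.read().decode("utf-8")
```

Calling `transcribe("meeting.m4a", language="en", response_format="json")` would return the JSON body as a string, while the default `response_format="text"` returns the plain transcript. Keeping `build_request` separate from `transcribe` makes it possible to inspect the URL, headers, and form fields without making a network call.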

Use cases

Transcribing meeting recordings for notes
Converting spoken content from videos into text
Generating transcripts for podcasts or interviews
Processing voice commands or dictations

Non-goals

Real-time speech-to-text streaming
Speaker diarization or identification
On-device or offline audio transcription
Advanced audio editing or manipulation

Practical Utility

Unique selling proposition: The skill is a direct wrapper around the OpenAI API, with some convenience scripting. While it provides a usable interface, it doesn't offer significant custom logic beyond API interaction.

Installation

npx skills add steipete/clawdis

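This runs the Vercel skills CLI (skills.sh) via npx. It requires a local Node.js installation and at least one skills-compatible agent (Claude Code, Cursor, Codex, etc.), and it assumes the repository follows the agentskills.io format.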

Quality score

Verified

95/100

Analyzed 1 day ago

Trust signals

Last commit: 1 day ago

GitHub owner: steipete

Stars: 371.6k

Downloads: 4.6M

License: MIT

Website: openclaw.ai

Status

View source code

Similar extensions

Whisper

OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M params). Use for speech-to-text, podcast transcription, or multilingual audio processing. Best for robust, multilingual ASR.

Skill

Orchestra-Research

Speech Generation Skill

100

Use when the user asks for text-to-speech narration or voiceover, accessibility reads, audio prompts, or batch speech generation via the OpenAI Audio API; run the bundled CLI (`scripts/text_to_speech.py`) with built-in voices and require `OPENAI_API_KEY` for live calls. Custom voice creation is out of scope.

Skill

openai

YouTube Downloader

100

Download and process YouTube content for research. Use when: downloading competitor videos for analysis; extracting audio for podcasts; getting transcripts for content repurposing; archiving webinars; curating research content.

Skill

guia-matthieu

Sheet Music Publisher

Converts mastered audio to sheet music and creates printable songbooks. Use after mastering when the user wants sheet music or a songbook for their album.

Skill

bitwize-music-studio

Transcribe

Transcribe audio files to text with optional diarization and known-speaker hints. Use when a user asks to transcribe speech from audio/video, extract text from recordings, or label speakers in interviews or meetings.

Skill

openai

Whisper Transcription

Transcribe audio and video files to text using OpenAI Whisper. Use when: converting podcasts to blog posts; creating video subtitles; extracting quotes from interviews; repurposing video content to text; building searchable audio archives.

Skill

guia-matthieu