跳转到主要内容
此内容尚未提供您的语言版本,正在以英文显示。

Openai Whisper Api

技能 已验证 活跃

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

目的

To transcribe audio files accurately and efficiently using the OpenAI Whisper API, providing a convenient command-line interface.

功能

  • Audio transcription via OpenAI Whisper API
  • Customizable output format (text/JSON)
  • Support for specifying language and prompt hints
  • Configurable API base URL for proxies
  • Environment variable for API key management

使用场景

  • Transcribing meeting recordings for notes
  • Converting spoken content from videos into text
  • Generating transcripts for podcasts or interviews
  • Processing voice commands or dictations

非目标

  • Real-time speech-to-text streaming
  • Speaker diarization or identification
  • On-device or offline audio transcription
  • Advanced audio editing or manipulation

Practical Utility

  • info:Unique selling propositionThe skill is a direct wrapper around the OpenAI API, with some convenience scripting. While it provides a usable interface, it doesn't offer significant custom logic beyond API interaction.

安装

npx skills add steipete/clawdis

通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。

质量评分

已验证
95 /100
1 day ago 分析

信任信号

最近提交1 day ago
星标371.6k
许可证MIT
状态
查看源代码

类似扩展

Whisper

97

OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M params). Use for speech-to-text, podcast transcription, or multilingual audio processing. Best for robust, multilingual ASR.

技能
Orchestra-Research

Speech Generation Skill

100

Use when the user asks for text-to-speech narration or voiceover, accessibility reads, audio prompts, or batch speech generation via the OpenAI Audio API; run the bundled CLI (`scripts/text_to_speech.py`) with built-in voices and require `OPENAI_API_KEY` for live calls. Custom voice creation is out of scope.

技能
openai

YouTube Downloader

100

Download and process YouTube content for research. Use when: downloading competitor videos for analysis; extracting audio for podcasts; getting transcripts for content repurposing; archiving webinars; research content curation

技能
guia-matthieu

Sheet Music Publisher

99

Converts mastered audio to sheet music and creates printable songbooks. Use after mastering when the user wants sheet music or a songbook for their album.

技能
bitwize-music-studio

Transcribe

97

Transcribe audio files to text with optional diarization and known-speaker hints. Use when a user asks to transcribe speech from audio/video, extract text from recordings, or label speakers in interviews or meetings.

技能
openai

Whisper Transcription

95

Transcribe audio and video files to text using OpenAI Whisper. Use when: converting podcasts to blog posts; creating video subtitles; extracting quotes from interviews; repurposing video content to text; building searchable audio archives

技能
guia-matthieu