Zum Hauptinhalt springen
Dieser Inhalt ist noch nicht in Ihrer Sprache verfügbar und wird auf Englisch angezeigt.

Whisper Transcription

Skill Verifiziert Aktiv

Transcribe audio and video files to text using OpenAI Whisper. Use when: converting podcasts to blog posts; creating video subtitles; extracting quotes from interviews; repurposing video content to text; building searchable audio archives

Zweck

To accurately convert spoken word from audio and video files into searchable text formats using advanced AI, enabling content repurposing and archival.

Funktionen

  • Transcribe audio and video files
  • Batch processing of multiple files
  • Translate transcriptions to specified languages
  • Extract timestamps with text segments
  • Support for multiple output formats (txt, srt, vtt, json, tsv)

Anwendungsfälle

  • Convert podcasts to blog posts
  • Create video subtitles (SRT/VTT)
  • Extract quotes from interviews
  • Build searchable audio archives

Nicht-Ziele

  • Replacing professional audio engineering
  • Making subjective creative decisions
  • Directly accessing or editing audio files
  • Guaranteeing commercial success of content

Workflow

  1. Specify input file and desired command (transcribe, batch, translate, timestamps).
  2. Select model size, output format, and optionally language.
  3. Execute the command via Python script.
  4. Receive the transcribed text or formatted output file.

Voraussetzungen

  • Python 3
  • pip install openai-whisper torch ffmpeg-python click
  • ffmpeg installed on system

Code Execution

  • info:LoggingThe script provides informative output to stdout/stderr during execution, detailing model loading, transcription progress, and output file creation.

Installation

npx skills add guia-matthieu/clawfu-skills

Führt das Vercel skills CLI (skills.sh) via npx aus — benötigt Node.js lokal und mindestens einen installierten skills-kompatiblen Agent (Claude Code, Cursor, Codex, …). Setzt voraus, dass das Repo dem agentskills.io-Format folgt.

Qualitätspunktzahl

Verifiziert
95 /100
Analysiert about 22 hours ago

Vertrauenssignale

Letzter Commitabout 1 month ago
Sterne104
LizenzMIT
Status
Quellcode ansehen

Ähnliche Erweiterungen

YouTube Downloader

100

Download and process YouTube content for research. Use when: downloading competitor videos for analysis; extracting audio for podcasts; getting transcripts for content repurposing; archiving webinars; research content curation

Skill
guia-matthieu

Whisper

97

OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M params). Use for speech-to-text, podcast transcription, or multilingual audio processing. Best for robust, multilingual ASR.

Skill
Orchestra-Research

Openai Whisper Api

95

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

Skill
steipete

Speech Generation Skill

100

Use when the user asks for text-to-speech narration or voiceover, accessibility reads, audio prompts, or batch speech generation via the OpenAI Audio API; run the bundled CLI (`scripts/text_to_speech.py`) with built-in voices and require `OPENAI_API_KEY` for live calls. Custom voice creation is out of scope.

Skill
openai

Ffmpeg

99

Videobearbeitung und -verarbeitung mit FFmpeg. Verwenden Sie es für Formatkonvertierung, Größenänderung, Komprimierung, Audioextraktion und die Vorbereitung von Assets für Remotion. Trigger umfassen die Konvertierung von GIF in MP4, die Größenänderung von Videos, die Extraktion von Audio, die Komprimierung von Dateien oder jede Medien transformations aufgabe.

Skill
digitalsamba

Sheet Music Publisher

99

Converts mastered audio to sheet music and creates printable songbooks. Use after mastering when the user wants sheet music or a songbook for their album.

Skill
bitwize-music-studio