Video to Text (Groq)
Skill ActiveTranscribe video/audio URL or local file to text + SRT using yt-dlp + Groq Whisper API (free tier). Preferred default for v2 transcription. Returns raw text and segments; summary is caller's responsibility.
Transcribe video or audio content from various sources into text and SRT subtitles for further processing or analysis.
Features
- Transcribes video/audio URLs and local files.
- Generates raw text output.
- Generates SRT subtitle files.
- Uses yt-dlp for audio extraction.
- Leverages Groq Whisper API for transcription.
Use Cases
- Extracting text from video lectures or presentations.
- Generating subtitles for video content.
- Transcribing podcast audio for searchability.
- Converting audio recordings into searchable text documents.
Non-Goals
- Producing a summary of the transcribed content.
- Handling transcription for unsupported audio/video formats.
- Providing transcription without a valid Groq API key.
Workflow
- Extract audio from the provided URL or local file using yt-dlp.
- Prepare the extracted audio file (e.g., conversion if needed via ffmpeg).
- Send the audio file to the Groq Whisper API for transcription.
- Receive transcription results (raw text and SRT segments) from the API.
- Return structured JSON output containing text, SRT, and metadata.
Practices
- Transcription
- Content extraction
- API integration
Prerequisites
- GROQ_API_KEY environment variable set
- yt-dlp installed
- ffmpeg installed
Trust
- warning:Issues AttentionThere are 17 open issues and 3 closed issues in the last 90 days, indicating a low closure rate and potentially slow maintainer response.
Installation
npx skills add 0xmariowu/AutosearchRuns the Vercel skills CLI (skills.sh) via npx — needs Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repo follows the agentskills.io format.
Quality Score
Similar Extensions
YouTube Downloader
100Download and process YouTube content for research. Use when: downloading competitor videos for analysis; extracting audio for podcasts; getting transcripts for content repurposing; archiving webinars; research content curation
Transcribe Video
98Generate subtitles (SRT/VTT) and plain text transcripts from video or audio files using AWS Transcribe. Use when creating captions, extracting spoken content, generating transcripts for notes, or making video content searchable.
Ffmpeg
99Video and audio processing with FFmpeg. Use for format conversion, resizing, compression, audio extraction, and preparing assets for Remotion. Triggers include converting GIF to MP4, resizing video, extracting audio, compressing files, or any media transformation task.
Sheet Music Publisher
99Converts mastered audio to sheet music and creates printable songbooks. Use after mastering when the user wants sheet music or a songbook for their album.
Summarize
99Summarize or transcribe URLs, YouTube/videos, podcasts, articles, transcripts, PDFs, and local files.
Cli Anything Videocaptioner
99AI-powered video captioning — transcribe speech, optimize/translate subtitles, and burn them into video via the stable VideoCaptioner backend. Free ASR and translation included.