此内容尚未提供您的语言版本,正在以英文显示。

Video to Text (Groq)

技能活跃

Transcribe video/audio URL or local file to text + SRT using yt-dlp + Groq Whisper API (free tier). Preferred default for v2 transcription. Returns raw text and segments; summary is caller's responsibility.

目的

Transcribe video or audio content from various sources into text and SRT subtitles for further processing or analysis.

功能

Transcribes video/audio URLs and local files.
Generates raw text output.
Generates SRT subtitle files.
Uses yt-dlp for audio extraction.
Leverages Groq Whisper API for transcription.

使用场景

Extracting text from video lectures or presentations.
Generating subtitles for video content.
Transcribing podcast audio for searchability.
Converting audio recordings into searchable text documents.

非目标

Producing a summary of the transcribed content.
Handling transcription for unsupported audio/video formats.
Providing transcription without a valid Groq API key.

工作流

Extract audio from the provided URL or local file using yt-dlp.
Prepare the extracted audio file (e.g., conversion if needed via ffmpeg).
Send the audio file to the Groq Whisper API for transcription.
Receive transcription results (raw text and SRT segments) from the API.
Return structured JSON output containing text, SRT, and metadata.

实践

Transcription
Content extraction
API integration

先决条件

GROQ_API_KEY environment variable set
yt-dlp installed
ffmpeg installed

Trust

warning:Issues AttentionThere are 17 open issues and 3 closed issues in the last 90 days, indicating a low closure rate and potentially slow maintainer response.

安装

npx skills add 0xmariowu/Autosearch

通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。

质量评分

95 /100

1 day ago 分析

信任信号

最近提交3 days ago

GitHub 所有者 0xmariowu

星标18

许可证MIT

网站autosearch.dev

状态

查看源代码

类似扩展

YouTube Downloader

100

Download and process YouTube content for research. Use when: downloading competitor videos for analysis; extracting audio for podcasts; getting transcripts for content repurposing; archiving webinars; research content curation

技能

guia-matthieu

Transcribe Video

使用 AWS Transcribe 从视频或音频文件生成字幕（SRT/VTT）和纯文本记录。在创建字幕、提取口头内容、为笔记生成记录或使视频内容可搜索时使用。

技能

rameerez

Ffmpeg

使用 FFmpeg 进行视频和音频处理。用于格式转换、调整大小、压缩、音频提取以及为 Remotion 准备素材。触发器包括将 GIF 转换为 MP4、调整视频大小、提取音频、压缩文件或任何媒体转换任务。

技能

digitalsamba

Sheet Music Publisher

Converts mastered audio to sheet music and creates printable songbooks. Use after mastering when the user wants sheet music or a songbook for their album.

技能

bitwize-music-studio

Summarize

Summarize or transcribe URLs, YouTube/videos, podcasts, articles, transcripts, PDFs, and local files.

技能

steipete

Cli Anything Videocaptioner

AI-powered video captioning — transcribe speech, optimize/translate subtitles, and burn them into video via the stable VideoCaptioner backend. Free ASR and translation included.

技能

hkuds