Video Processor
Skill ZweryfikowanoDownload and process videos from YouTube and other platforms. Supports video download, audio extraction, format conversion (mp4, webm), and Whisper transcription. Use when user mentions YouTube download, video conversion, audio extraction, transcription, mp4, webm, ffmpeg, yt-dlp, or whisper transcription.
This skill utilizes yt-dlp, FFmpeg, and Whisper to provide a unified command-line interface for video downloading, audio extraction, format conversion (MP4, WebM), and speech-to-text transcription. It offers detailed options for model selection, language, and output formats, with clear prerequisites and examples.
Maintenance
- warning:Commit recencyThere are no recent commits on the default branch (pushedAt: n/a), indicating potential unmaintenance.
Code Execution
- warning:ValidationInput file paths are validated for existence and file type, but other parameters like URLs or format strings are not explicitly validated beyond type hints or choices.
- warning:Tool FallbackThe skill requires external tools like yt-dlp, FFmpeg, and Whisper, but lists them as required without providing built-in fallbacks if they are not installed.
Portability
- warning:Runtime stabilityThe script assumes external command-line tools (ffmpeg, yt-dlp, whisper) are installed and in the PATH. It checks for their existence but does not provide alternative fallbacks if they are missing, beyond an error message.
Instalacja
Najpierw dodaj marketplace
/plugin marketplace add iamzhihuix/happy-claude-skills/plugin install video-processor@happy-claude-skillsPodobne rozszerzenia
YouTube Clipper Skill
90YouTube 视频智能剪辑工具。下载视频和字幕,AI 分析生成精细章节(几分钟级别), 用户选择片段后自动剪辑、翻译字幕为中英双语、烧录字幕到视频,并生成总结文案。 使用场景:当用户需要剪辑 YouTube 视频、生成短视频片段、制作双语字幕版本时。 关键词:视频剪辑、YouTube、字幕翻译、双语字幕、视频下载、clip video
FFmpeg for Video Production
95Video and audio processing with FFmpeg. Use for format conversion, resizing, compression, audio extraction, and preparing assets for Remotion. Triggers include converting GIF to MP4, resizing video, extracting audio, compressing files, or any media transformation task.
AI Multimodal Processing Skill
95Multimodal AI processing via Google Gemini API (2M tokens context). Capabilities: audio (transcription, 9.5hr max, summarization, music analysis), images (captioning, OCR, object detection, segmentation, visual Q&A), video (scene detection, 6hr max, YouTube URLs, temporal analysis), documents (PDF extraction, tables, forms, charts), image generation (text-to-image, editing). Actions: transcribe, analyze, extract, caption, detect, segment, generate from media. Keywords: Gemini API, audio transcription, image captioning, OCR, object detection, video analysis, PDF extraction, text-to-image, multimodal, speech recognition, visual Q&A, scene detection, YouTube transcription, table extraction, form processing, image generation, Imagen. Use when: transcribing audio/video, analyzing images/screenshots, extracting data from PDFs, processing YouTube videos, generating images from text, implementing multimodal AI features.
YouTube Automation
93Automate YouTube content workflows including video management, analytics, scheduling, and channel optimization
Transcription Automation
92Automate audio/video transcription, meeting notes, subtitle generation, and content processing
Video Translation
45Translate and dub videos from one language to another, replacing the original audio with TTS while keeping the video intact.