이 콘텐츠는 아직 사용자의 언어로 제공되지 않아 영어로 표시됩니다.

Video Processor

Skill 확인됨

Download and process videos from YouTube and other platforms. Supports video download, audio extraction, format conversion (mp4, webm), and Whisper transcription. Use when user mentions YouTube download, video conversion, audio extraction, transcription, mp4, webm, ffmpeg, yt-dlp, or whisper transcription.

AI 요약

This skill utilizes yt-dlp, FFmpeg, and Whisper to provide a unified command-line interface for video downloading, audio extraction, format conversion (MP4, WebM), and speech-to-text transcription. It offers detailed options for model selection, language, and output formats, with clear prerequisites and examples.

Maintenance

warning:Commit recencyThere are no recent commits on the default branch (pushedAt: n/a), indicating potential unmaintenance.

Code Execution

warning:ValidationInput file paths are validated for existence and file type, but other parameters like URLs or format strings are not explicitly validated beyond type hints or choices.
warning:Tool FallbackThe skill requires external tools like yt-dlp, FFmpeg, and Whisper, but lists them as required without providing built-in fallbacks if they are not installed.

Portability

warning:Runtime stabilityThe script assumes external command-line tools (ffmpeg, yt-dlp, whisper) are installed and in the PATH. It checks for their existence but does not provide alternative fallbacks if they are missing, beyond an error message.

설치

먼저 마켓플레이스를 추가하세요

/plugin marketplace add iamzhihuix/happy-claude-skills

/plugin install video-processor@happy-claude-skills

21 days ago

iamzhihuix

285 stars

MIT

5 days ago에 업데이트됨

소스 코드 보기

유사한 확장

YouTube Clipper Skill

YouTube 视频智能剪辑工具。下载视频和字幕，AI 分析生成精细章节（几分钟级别），用户选择片段后自动剪辑、翻译字幕为中英双语、烧录字幕到视频，并生成总结文案。使用场景：当用户需要剪辑 YouTube 视频、生成短视频片段、制作双语字幕版本时。关键词：视频剪辑、YouTube、字幕翻译、双语字幕、视频下载、clip video

Skill

op7418

FFmpeg for Video Production

Video and audio processing with FFmpeg. Use for format conversion, resizing, compression, audio extraction, and preparing assets for Remotion. Triggers include converting GIF to MP4, resizing video, extracting audio, compressing files, or any media transformation task.

Skill

digitalsamba

AI Multimodal Processing Skill

Multimodal AI processing via Google Gemini API (2M tokens context). Capabilities: audio (transcription, 9.5hr max, summarization, music analysis), images (captioning, OCR, object detection, segmentation, visual Q&A), video (scene detection, 6hr max, YouTube URLs, temporal analysis), documents (PDF extraction, tables, forms, charts), image generation (text-to-image, editing). Actions: transcribe, analyze, extract, caption, detect, segment, generate from media. Keywords: Gemini API, audio transcription, image captioning, OCR, object detection, video analysis, PDF extraction, text-to-image, multimodal, speech recognition, visual Q&A, scene detection, YouTube transcription, table extraction, form processing, image generation, Imagen. Use when: transcribing audio/video, analyzing images/screenshots, extracting data from PDFs, processing YouTube videos, generating images from text, implementing multimodal AI features.

Skill

samhvw8

YouTube Automation

Automate YouTube content workflows including video management, analytics, scheduling, and channel optimization

Skill

claude-office-skills

Transcription Automation

Automate audio/video transcription, meeting notes, subtitle generation, and content processing

Skill

claude-office-skills

Video Translation

Translate and dub videos from one language to another, replacing the original audio with TTS while keeping the video intact.

Skill

noizai