此内容尚未提供您的语言版本,正在以英文显示。

Huggingface Vision Trainer

插件已验证活跃

Train and fine-tune object detection models (RTDETRv2, YOLOS, DETR and others) and image classification models (timm and transformers models — MobileNetV3, MobileViT, ResNet, ViT/DINOv3) using Transformers Trainer API on Hugging Face Jobs infrastructure or locally. Includes COCO dataset format support, Albumentations augmentation, mAP/mAR metrics, trackio tracking, hardware selection, and Hub persistence.

目的

To provide a seamless and powerful way for users to train and fine-tune computer vision models without managing local GPU infrastructure, leveraging Hugging Face's cloud capabilities.

功能

Train object detection models (RTDETRv2, YOLOS, DETR)
Train image classification models (timm, transformers)
Train SAM/SAM2 segmentation models
Support for COCO dataset format and Albumentations augmentation
Integration with Hugging Face Jobs for cloud GPU training
Automated dataset validation and Hub persistence

使用场景

Fine-tuning object detection models on custom datasets.
Training image classification models for specific tasks.
Experimenting with SAM/SAM2 models for segmentation on new data.
Leveraging cloud GPUs for computationally intensive vision model training.

非目标

Running training jobs on local hardware (though scripts can be run locally for inspection).
Providing a graphical user interface for model training.
Managing or providing datasets; users must supply their own datasets on the Hub.

安装

请先添加 Marketplace

/plugin marketplace add huggingface/skills

/plugin install huggingface-vision-trainer@huggingface-skills

质量评分

已验证

96 /100

1 day ago 分析

信任信号

最近提交2 days ago

GitHub 所有者 huggingface

星标10.5k

许可证Apache-2.0

网站huggingface.co

状态

查看源代码

类似扩展

Autoresearch Agent

100

Autonomous experiment loop that optimizes any file by a measurable metric. 5 slash commands, 8 evaluators, configurable loop intervals (10min to monthly).

插件

alirezarezvani

Train Sentence Transformers

Train or fine-tune sentence-transformers models across all three architectures: SentenceTransformer (bi-encoder embeddings), CrossEncoder (rerankers), and SparseEncoder (SPLADE). Covers loss selection, hard-negative mining, evaluators, distillation, LoRA, Matryoshka, and Hugging Face Hub publishing.

插件

huggingface

Transformers Js

Run state-of-the-art machine learning models directly in JavaScript/TypeScript for NLP, computer vision, audio processing, and multimodal tasks. Works in Node.js and browsers with WebGPU/WASM using Hugging Face models.

插件

huggingface

PM Market Research

Market research skills for PMs: user personas, market segmentation, sentiment analysis, and competitive analysis.

插件

phuryn