Huggingface Vision Trainer

Plugin · Verified · Active

Train and fine-tune object detection models (RTDETRv2, YOLOS, DETR, and others) and image classification models (timm and transformers models: MobileNetV3, MobileViT, ResNet, ViT/DINOv3) using the Transformers Trainer API, either on Hugging Face Jobs infrastructure or locally. Includes COCO dataset format support, Albumentations augmentation, mAP/mAR metrics, trackio tracking, hardware selection, and Hub persistence.

Purpose

To let users train and fine-tune computer vision models on Hugging Face's cloud infrastructure, without having to manage local GPU hardware.

Features

  • Train object detection models (RTDETRv2, YOLOS, DETR)
  • Train image classification models (timm, transformers)
  • Train SAM/SAM2 segmentation models
  • Support for COCO dataset format and Albumentations augmentation
  • Integration with Hugging Face Jobs for cloud GPU training
  • Automated dataset validation and Hub persistence
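To make the COCO format and dataset validation items concrete, here is a minimal sketch of the kind of check such validation might run. It relies only on the standard COCO convention that a bounding box is stored as `[x_min, y_min, width, height]` in pixels. The function names `coco_to_corners` and `validate_annotation` are hypothetical and are not taken from the plugin, whose actual validation logic is not described on this page.

```python
from typing import Dict, List, Tuple


def coco_to_corners(bbox: List[float]) -> Tuple[float, float, float, float]:
    """Convert a COCO box [x_min, y_min, width, height] to corner form
    (x_min, y_min, x_max, y_max)."""
    x_min, y_min, width, height = bbox
    return (x_min, y_min, x_min + width, y_min + height)


def validate_annotation(ann: Dict, img_w: int, img_h: int) -> List[str]:
    """Return a list of problems found in one COCO-style annotation.

    Hypothetical helper for illustration; an empty list means the box
    passed these basic checks.
    """
    bbox = ann.get("bbox")
    if bbox is None or len(bbox) != 4:
        return ["bbox must have four values"]

    problems: List[str] = []
    x_min, y_min, x_max, y_max = coco_to_corners(bbox)
    # Degenerate boxes have no area and usually indicate a labeling error.
    if bbox[2] <= 0 or bbox[3] <= 0:
        problems.append("non-positive width or height")
    # Boxes must lie inside the image bounds.
    if x_min < 0 or y_min < 0 or x_max > img_w or y_max > img_h:
        problems.append("box extends outside the image")
    return problems


if __name__ == "__main__":
    sample = {"bbox": [10, 20, 30, 40], "category_id": 1}
    print(coco_to_corners(sample["bbox"]))
    print(validate_annotation(sample, img_w=100, img_h=100))
```

Running checks like these before launching a cloud job can catch malformed annotations early, before any GPU time is spent.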

Use cases

  • Fine-tuning object detection models on custom datasets.
  • Training image classification models for specific tasks.
  • Experimenting with SAM/SAM2 models for segmentation on new data.
  • Leveraging cloud GPUs for computationally intensive vision model training.

Non-goals

  • Running training jobs on local hardware (though scripts can be run locally for inspection).
  • Providing a graphical user interface for model training.
  • Managing or providing datasets; users must supply their own datasets on the Hub.

Installation

Add the marketplace first, then install the plugin:

/plugin marketplace add huggingface/skills
/plugin install huggingface-vision-trainer@huggingface-skills

Quality score

Verified
96/100
Analyzed 1 day ago

Trust signals

Last commit: 2 days ago
Stars: 10.5k
License: Apache-2.0