跳转到主要内容
此内容尚未提供您的语言版本,正在以英文显示。

Huggingface Community Evals

插件 已验证 活跃

Add and manage evaluation results in Hugging Face model cards. Supports extracting eval tables from README content, importing scores from Artificial Analysis API, and running custom evaluations with vLLM/lighteval.

目的

To enable developers and researchers to run and manage AI model evaluations efficiently on their local hardware, facilitating model selection and comparison.

功能

  • Run local evaluations with inspect-ai
  • Run local evaluations with lighteval
  • Support for vLLM, Transformers, and accelerate backends
  • Guidance on task selection and hardware requirements
  • Troubleshooting for common evaluation issues

使用场景

  • Quickly test models from Hugging Face Hub locally
  • Compare model performance using standard benchmarks
  • Choose the best inference backend (vLLM, Transformers) for local GPU evaluations
  • Debug and troubleshoot evaluation setups before scaling to remote jobs

非目标

  • Orchestrating evaluations on Hugging Face Jobs
  • Directly editing Hugging Face model cards or publishing results
  • Automating community-evals workflows
  • Replacing remote Hugging Face compute infrastructure

安装

请先添加 Marketplace

/plugin marketplace add huggingface/skills
/plugin install huggingface-community-evals@huggingface-skills

质量评分

已验证
98 /100
1 day ago 分析

信任信号

最近提交2 days ago
星标10.5k
许可证Apache-2.0
状态
查看源代码