Skip to main content

Huggingface Community Evals

Plugin Verified Active

Add and manage evaluation results in Hugging Face model cards. Supports extracting eval tables from README content, importing scores from Artificial Analysis API, and running custom evaluations with vLLM/lighteval.

Purpose

To enable developers and researchers to run and manage AI model evaluations efficiently on their local hardware, facilitating model selection and comparison.

Features

  • Run local evaluations with inspect-ai
  • Run local evaluations with lighteval
  • Support for vLLM, Transformers, and accelerate backends
  • Guidance on task selection and hardware requirements
  • Troubleshooting for common evaluation issues

Use Cases

  • Quickly test models from Hugging Face Hub locally
  • Compare model performance using standard benchmarks
  • Choose the best inference backend (vLLM, Transformers) for local GPU evaluations
  • Debug and troubleshoot evaluation setups before scaling to remote jobs

Non-Goals

  • Orchestrating evaluations on Hugging Face Jobs
  • Directly editing Hugging Face model cards or publishing results
  • Automating community-evals workflows
  • Replacing remote Hugging Face compute infrastructure

Installation

First, add the marketplace

/plugin marketplace add huggingface/skills
/plugin install huggingface-community-evals@huggingface-skills

Quality Score

Verified
98 /100
Analyzed about 14 hours ago

Trust Signals

Last commit1 day ago
Stars10.5k
LicenseApache-2.0
Status
View Source

© 2025 SkillRepo · Find the right skill, skip the noise.