Huggingface Datasets
技能 已验证 活跃Use this skill for Hugging Face Dataset Viewer API workflows that fetch subset/split metadata, paginate rows, search text, apply filters, download parquet URLs, and read size or statistics.
Use this skill for Hugging Face Dataset Viewer API workflows that fetch subset/split metadata, paginate rows, search text, apply filters, download parquet URLs, and read size or statistics.
功能
- Fetch subset/split metadata
- Paginate rows with offset and length
- Search text within dataset rows
- Apply filters with predicate syntax
- Download parquet URLs
- Read dataset size and statistics
- Validate dataset availability
使用场景
- Exploring dataset contents programmatically
- Extracting specific subsets of data
- Searching for patterns within dataset text
- Automating data retrieval for ML tasks
非目标
- Creating or uploading datasets (use hf-cli)
- Running ML models
- Training or fine-tuning models
- Managing Hugging Face Hub resources beyond dataset viewing
安装
/plugin install skills@huggingface-skills质量评分
已验证类似扩展
Website Extraction Api
100Extract typed JSON from public website pages using a schema.
Extract Supplier Catalog From Website
100Extract SKUs, product names, unit prices, availability, and minimum order quantities from a supplier catalog page.
Hugging Science
98Use when the user is doing AI/ML work in a scientific domain — biology, chemistry, physics, astronomy, climate, genomics, materials science, medicine, ecology, energy, conservation, engineering, mathematics, scientific reasoning, drug discovery, protein design, weather modeling, theorem proving, single-cell, PDE solving, or anything similar. Hugging Science (huggingscience.co) is a curated catalog of scientific datasets, models, blog posts, and interactive Spaces; the `hugging-science` org on Hugging Face hosts community datasets, models, and demo Spaces. This skill helps you discover the right resource AND actually use it — loading datasets via `datasets`, running models via `transformers` or the HF Inference API, calling Spaces like BoltzGen via `gradio_client`, and citing blog posts for methodology. Trigger this skill whenever a user mentions a scientific ML task, asks for "a dataset/model for X" where X is a scientific topic, wants to fine-tune on scientific data, asks about protein / molecule / genome / climate / materials / astronomy / pathology / weather ML, or needs AI tools for research — even if they never say "Hugging Science" explicitly. The catalog is purpose-built for LLM agents (it ships an `llms-full.txt`); prefer it over generic web search for these tasks.
X Twitter Scraper
100当用户需要通过 Xquik 获取 X (Twitter) 数据或执行需要确认的 X 操作时使用:推文搜索、用户查找、关注者提取、媒体下载、监控、Webhook、MCP、SDK、发布、点赞、私信和个人资料更新。需要 Xquik API 密钥。切勿索要 X 登录凭据。
Slack
100Use the Slack tool to react, pin/unpin, send, edit, delete messages, or fetch Slack member info.
Github
100Use gh for GitHub issues, PR status, CI/logs, comments, reviews, releases, and API queries.