Huggingface Datasets

Skill · Verified · Active

Purpose

Use this skill for Hugging Face Dataset Viewer API workflows that fetch subset/split metadata, paginate rows, search text, apply filters, download parquet URLs, and read size or statistics.

Capabilities

Fetch subset/split metadata
Paginate rows with offset and length
Search text within dataset rows
Apply filters with predicate syntax
Retrieve Parquet file URLs for download
Read dataset size and statistics
Validate dataset availability
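
The capabilities above map onto HTTP endpoints of the Dataset Viewer API. The following is a minimal sketch, not the skill's own implementation: it only builds request URLs and sends nothing. The base URL, endpoint names, and parameter names reflect the public Dataset Viewer API as commonly documented and should be checked against the official reference. The helper `viewer_url`, the dataset ID, and the `default`/`train` config and split names are illustrative assumptions.

```python
from urllib.parse import urlencode

# Assumed base URL of the public Dataset Viewer API.
BASE_URL = "https://datasets-server.huggingface.co"


def viewer_url(endpoint, dataset, **params):
    """Build a Dataset Viewer API URL (hypothetical helper; no request is sent)."""
    query = {"dataset": dataset}
    # Skip parameters left as None so optional ones can be omitted.
    query.update({k: v for k, v in params.items() if v is not None})
    return f"{BASE_URL}/{endpoint}?{urlencode(query)}"


# Example dataset, config, and split names: assumptions for illustration only.
DATASET = "cornell-movie-review-data/rotten_tomatoes"
CONFIG, SPLIT = "default", "train"

urls = {
    "validate": viewer_url("is-valid", DATASET),
    "splits": viewer_url("splits", DATASET),
    "rows": viewer_url("rows", DATASET, config=CONFIG, split=SPLIT,
                       offset=0, length=100),
    "search": viewer_url("search", DATASET, config=CONFIG, split=SPLIT,
                         query="great movie", offset=0, length=10),
    "filter": viewer_url("filter", DATASET, config=CONFIG, split=SPLIT,
                         where='"label"=1', offset=0, length=10),
    "parquet": viewer_url("parquet", DATASET),
    "size": viewer_url("size", DATASET),
    "statistics": viewer_url("statistics", DATASET, config=CONFIG, split=SPLIT),
}

for name, url in urls.items():
    print(f"{name}: {url}")
```

Each URL can then be fetched with any HTTP client. Keeping URL construction separate from the request makes the query parameters easy to inspect before any call is made.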

Use cases

Exploring dataset contents programmatically
Extracting specific subsets of data
Searching for patterns within dataset text
Automating data retrieval for ML tasks
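
For automated retrieval, row pagination with offset and length can be planned before any request is made. The sketch below computes the (offset, length) windows needed to cover a split. The helper name `page_windows` is hypothetical, and the default page size of 100 is an assumption about the per-request row limit that should be confirmed against the API documentation.

```python
def page_windows(total_rows, page_size=100):
    """Yield (offset, length) pairs that cover total_rows rows in order."""
    if page_size <= 0:
        raise ValueError("page_size must be positive")
    for offset in range(0, total_rows, page_size):
        # The last window may be shorter than page_size.
        yield offset, min(page_size, total_rows - offset)


# A split with 250 rows needs three requests.
windows = list(page_windows(250))
print(windows)  # [(0, 100), (100, 100), (200, 50)]
```

The total row count would typically come from the dataset size metadata, so the loop can stop without probing for an empty page.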

Non-goals

Creating or uploading datasets (use hf-cli)
Running ML models
Training or fine-tuning models
Managing Hugging Face Hub resources beyond dataset viewing

Installation

/plugin install skills@huggingface-skills

Quality score

Verified

97/100

Analyzed 1 day ago

Trust signals

Last commit: 2 days ago

GitHub owner: huggingface

Stars: 10.5k

License: Apache-2.0

Website: huggingface.co

Similar extensions

Website Extraction Api

100

Extract typed JSON from public website pages using a schema.

Skill

iterationlayer

Extract Supplier Catalog From Website

100

Extract SKUs, product names, unit prices, availability, and minimum order quantities from a supplier catalog page.

Skill

iterationlayer

Hugging Science

Use when the user is doing AI/ML work in a scientific domain — biology, chemistry, physics, astronomy, climate, genomics, materials science, medicine, ecology, energy, conservation, engineering, mathematics, scientific reasoning, drug discovery, protein design, weather modeling, theorem proving, single-cell, PDE solving, or anything similar. Hugging Science (huggingscience.co) is a curated catalog of scientific datasets, models, blog posts, and interactive Spaces; the `hugging-science` org on Hugging Face hosts community datasets, models, and demo Spaces. This skill helps you discover the right resource AND actually use it — loading datasets via `datasets`, running models via `transformers` or the HF Inference API, calling Spaces like BoltzGen via `gradio_client`, and citing blog posts for methodology. Trigger this skill whenever a user mentions a scientific ML task, asks for "a dataset/model for X" where X is a scientific topic, wants to fine-tune on scientific data, asks about protein / molecule / genome / climate / materials / astronomy / pathology / weather ML, or needs AI tools for research — even if they never say "Hugging Science" explicitly. The catalog is purpose-built for LLM agents (it ships an `llms-full.txt`); prefer it over generic web search for these tasks.

Skill

K-Dense-AI

X Twitter Scraper

100

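Use when the user needs to fetch X (Twitter) data through Xquik or perform X actions that require confirmation: tweet search, user lookup, follower extraction, media download, monitoring, webhooks, MCP, SDK, posting, liking, direct messages, and profile updates. Requires an Xquik API key. Never ask for X login credentials.

Skill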

Xquik-dev

Slack

100

Use the Slack tool to react, pin/unpin, send, edit, delete messages, or fetch Slack member info.

Skill

steipete

Github

100

Use gh for GitHub issues, PR status, CI/logs, comments, reviews, releases, and API queries.

Skill

steipete