Ray Data
Skill · Verified · Active
Scalable data processing for ML workloads. Streaming execution across CPU/GPU; supports Parquet/CSV/JSON/images. Integrates with Ray Train, PyTorch, TensorFlow. Scales from a single machine to hundreds of nodes. Use for batch inference, data preprocessing, multi-modal data loading, or distributed ETL pipelines.
This skill enables efficient and scalable data processing for machine learning workloads, facilitating batch inference, data preprocessing, multi-modal data loading, and distributed ETL pipelines.
Features
- Scalable data processing for ML workloads
- Streaming execution across CPU/GPU
- Support for Parquet, CSV, JSON, and image formats
- Integration with Ray Train, PyTorch, and TensorFlow
- Scales from single machine to hundreds of nodes
Use Cases
- Processing large datasets (>100GB) for ML training
- Distributed data preprocessing across a cluster
- Building batch inference pipelines
- Loading multi-modal data (images, audio, video)
Non-Goals
- Processing small data (<1GB) on a single machine (use Pandas)
- SQL-like operations on tabular data (use Dask or Spark)
- Enterprise ETL and complex SQL queries (use Spark)
Trust
- Issues (attention): The repository shows 17 issues opened in the last 90 days and 4 closed, a closure rate below 50%, though the number of open issues is relatively low.
Compliance
- GDPR: The skill processes datasets that may contain personal data; while it does not submit this data to third parties, it does not include specific sanitization steps before potential LLM interaction.
Practical Utility
- Edge cases: The documentation covers common transformations and integrations, but failure modes and recovery steps for edge cases (e.g., malformed input, rate limits) are not explicitly documented.
Installation
npx skills add davila7/claude-code-templates
Runs the Vercel skills CLI (skills.sh) via npx; requires Node.js locally and at least one installed skills-compatible agent (Claude Code, Cursor, Codex, …). Assumes the repository follows the agentskills.io format.
Similar Extensions
Ray Data
95 · Scalable data processing for ML workloads. Streaming execution across CPU/GPU, supports Parquet/CSV/JSON/images. Integrates with Ray Train, PyTorch, TensorFlow. Scales from single machine to 100s of nodes. Use for batch inference, data preprocessing, multi-modal data loading, or distributed ETL pipelines.
TimesFM Forecasting
100 · Zero-shot time series forecasting with Google's TimesFM foundation model. Use for any univariate time series (sales, sensors, energy, vitals, weather) without training a custom model. Supports CSV/DataFrame/array inputs with point forecasts and prediction intervals. Includes a preflight system checker script to verify RAM/GPU before first use.
PyTDC (Therapeutics Data Commons)
99 · Therapeutics Data Commons. AI-ready drug discovery datasets (ADME, toxicity, DTI), benchmarks, scaffold splits, molecular oracles, for therapeutic ML and pharmacological prediction.
Polars
99 · Fast in-memory DataFrame library for datasets that fit in RAM. Use when pandas is too slow but data still fits in memory. Lazy evaluation, parallel execution, Apache Arrow backend. Best for 1-100GB datasets, ETL pipelines, faster pandas replacement. For larger-than-RAM data use dask or vaex.
Spark Engineer
99 · Use when writing Spark jobs, debugging performance issues, or configuring cluster settings for Apache Spark applications, distributed data processing pipelines, or big data workloads. Invoke to write DataFrame transformations, optimize Spark SQL queries, implement RDD pipelines, tune shuffle operations, configure executor memory, process .parquet files, handle data partitioning, or build structured streaming analytics.
Build Feature Store
99 · Build a feature store using Feast for centralized feature management, configure offline and online stores for batch and real-time serving, define feature views with transformations, and implement point-in-time correct joins for ML pipelines. Use when managing features for multiple ML models, ensuring training-serving consistency, serving low-latency features for real-time inference, reusing feature definitions across projects, or building a feature catalog for discovery and governance.