Zum Hauptinhalt springen
Dieser Inhalt ist noch nicht in Ihrer Sprache verfügbar und wird auf Englisch angezeigt.

Dask

Skill Verifiziert Aktiv

Distributed computing for larger-than-RAM pandas/NumPy workflows. Use when you need to scale existing pandas/NumPy code beyond memory or across clusters. Best for parallel file processing, distributed ML, integration with existing pandas code. For out-of-core analytics on single machine use vaex; for in-memory speed use polars.

Zweck

To allow users to scale their existing pandas and NumPy workflows beyond memory limits or across clusters using the Dask library.

Funktionen

  • Larger-than-RAM execution on single machines
  • Parallel processing across multiple cores
  • Distributed computation for terabyte-scale datasets
  • Familiar pandas/NumPy APIs for DataFrames and Arrays
  • Task-based parallelization with Futures

Anwendungsfälle

  • Process datasets that exceed available RAM
  • Scale pandas or NumPy operations to larger datasets
  • Parallelize computations for performance improvements
  • Process multiple files efficiently (CSVs, Parquet, JSON, text logs)

Nicht-Ziele

  • Out-of-core analytics on a single machine (use vaex)
  • In-memory speed optimization (use polars)

Installation

npx skills add K-Dense-AI/claude-scientific-skills

Führt das Vercel skills CLI (skills.sh) via npx aus — benötigt Node.js lokal und mindestens einen installierten skills-kompatiblen Agent (Claude Code, Cursor, Codex, …). Setzt voraus, dass das Repo dem agentskills.io-Format folgt.

Qualitätspunktzahl

Verifiziert
98 /100
Analysiert 1 day ago

Vertrauenssignale

Letzter Commit3 days ago
Sterne21k
LizenzMIT
Status
Quellcode ansehen

Ähnliche Erweiterungen

Dask Data Science

99

Part of the AlterLab Academic Skills suite. Distributed computing for larger-than-RAM pandas/NumPy workflows. Use when you need to scale existing pandas/NumPy code beyond memory or across clusters. Best for parallel file processing, distributed ML, integration with existing pandas code. For out-of-core analytics on single machine use vaex; for in-memory speed use polars.

Skill
AlterLab-IEU

Ray Data

95

Scalable data processing for ML workloads. Streaming execution across CPU/GPU, supports Parquet/CSV/JSON/images. Integrates with Ray Train, PyTorch, TensorFlow. Scales from single machine to 100s of nodes. Use for batch inference, data preprocessing, multi-modal data loading, or distributed ETL pipelines.

Skill
Orchestra-Research

Create Spatial Visualization

100

Create interactive maps, elevation profiles, and spatial visualizations from GPX tracks, waypoints, or route data using R (sf, leaflet, tmap) or Observable (D3, deck.gl). Covers data import, coordinate system handling, map styling, and export to HTML or image formats. Use when visualizing a planned or completed tour route on an interactive map, creating elevation profiles for hiking or cycling routes, overlaying waypoints and POIs on a basemap, or building a web-based trip dashboard.

Skill
pjt222

Embedding Strategies

100

Select and optimize embedding models for semantic search and RAG applications. Use when choosing embedding models, implementing chunking strategies, or optimizing embedding quality for specific domains.

Skill
wshobson

Aws Cdk Development

100

AWS Cloud Development Kit (CDK) Experte für den Aufbau von Cloud-Infrastruktur mit TypeScript/Python. Verwenden Sie dies beim Erstellen von CDK-Stacks, Definieren von CDK-Konstrukten, Implementieren von Infrastructure as Code oder wenn der Benutzer CDK, CloudFormation, IaC, cdk synth, cdk deploy erwähnt oder AWS-Infrastruktur programmatisch definieren möchte. Behandelt CDK-App-Struktur, Konstruktmuster, Stack-Komposition und Bereitstellungs-Workflows.

Skill
zxkane

Fit Drift Diffusion Model

100

Fit cognitive drift-diffusion models (Ratcliff DDM) to reaction time and accuracy data with parameter estimation (drift rate, boundary separation, non-decision time), model comparison, and parameter recovery validation. Use when modeling binary decision-making with reaction time data, estimating cognitive parameters from experimental data, comparing sequential sampling model variants, or decomposing speed-accuracy tradeoff effects into latent cognitive components.

Skill
pjt222