跳转到主要内容
此内容尚未提供您的语言版本,正在以英文显示。

Polars

技能 已验证 活跃

Fast in-memory DataFrame library for datasets that fit in RAM. Use when pandas is too slow but data still fits in memory. Lazy evaluation, parallel execution, Apache Arrow backend. Best for 1-100GB datasets, ETL pipelines, faster pandas replacement. For larger-than-RAM data use dask or vaex.

目的

To offer a high-performance, in-memory DataFrame library for datasets that fit in RAM, serving as a faster alternative to pandas for ETL pipelines and data processing tasks.

功能

  • In-memory DataFrame operations
  • Lazy evaluation for query optimization
  • Parallel execution for speed
  • Apache Arrow backend for efficiency
  • Faster alternative to pandas

使用场景

  • Replacing slow pandas operations on in-memory datasets
  • Optimizing ETL pipelines for 1-100GB datasets
  • Performing complex data transformations and analyses
  • Migrating from pandas to a more performant library

非目标

  • Handling datasets larger than available RAM
  • Replacing dask or vaex for out-of-memory computation
  • General-purpose data storage or database management

Compliance

  • info:GDPRThe skill operates on user-provided data; while Polars itself doesn't store data, personal data submitted to the LLM for processing could be subject to GDPR if not handled carefully by the user.

Execution

  • info:Pinned dependenciesDependencies are managed by uv, which typically pins versions, but explicit lockfiles for the skill itself are not detailed. The Polars library itself would have its own versioning.

安装

npx skills add K-Dense-AI/claude-scientific-skills

通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。

质量评分

已验证
99 /100
1 day ago 分析

信任信号

最近提交3 days ago
星标21k
许可证MIT
状态
查看源代码

类似扩展

Polars Bio

99

High-performance genomic interval operations and bioinformatics file I/O on Polars DataFrames. Overlap, nearest, merge, coverage, complement, subtract for BED/VCF/BAM/GFF intervals. Streaming, cloud-native, faster bioframe alternative.

技能
K-Dense-AI

Ray Data

95

Scalable data processing for ML workloads. Streaming execution across CPU/GPU, supports Parquet/CSV/JSON/images. Integrates with Ray Train, PyTorch, TensorFlow. Scales from single machine to 100s of nodes. Use for batch inference, data preprocessing, multi-modal data loading, or distributed ETL pipelines.

技能
Orchestra-Research

Chdb Datastore

95

与 ClickHouse 性能兼容的即插即用 pandas 替代品。使用 `import chdb.datastore as pd`(或 `from datastore import DataStore`)并编写标准的 pandas 代码 — API 相同,在大数据集上速度提升 10-100 倍。支持 16 种以上数据源(MySQL、PostgreSQL、S3、MongoDB、ClickHouse、Iceberg、Delta Lake 等)和 10 种以上文件格式(Parquet、CSV、JSON、Arrow、ORC 等)以及跨源连接。当用户希望使用 pandas 风格的语法分析数据、加速缓慢的 pandas 代码、将远程数据库或云存储作为 DataFrame 查询,或连接不同来源的数据时,请使用此技能 — 即使他们没有明确提及 chdb 或 DataStore。请勿用于原始 SQL 查询、ClickHouse 服务器管理或非 Python 语言。

技能
clickhouse

AlterLab Polars

78

Part of the AlterLab Academic Skills suite. Fast in-memory DataFrame library for datasets that fit in RAM. Use when pandas is too slow but data still fits in memory. Lazy evaluation, parallel execution, Apache Arrow backend. Best for 1-100GB datasets, ETL pipelines, faster pandas replacement. For larger-than-RAM data use dask or vaex.

技能
AlterLab-IEU

Create Spatial Visualization

100

Create interactive maps, elevation profiles, and spatial visualizations from GPX tracks, waypoints, or route data using R (sf, leaflet, tmap) or Observable (D3, deck.gl). Covers data import, coordinate system handling, map styling, and export to HTML or image formats. Use when visualizing a planned or completed tour route on an interactive map, creating elevation profiles for hiking or cycling routes, overlaying waypoints and POIs on a basemap, or building a web-based trip dashboard.

技能
pjt222

Performance Analysis

100

Comprehensive performance analysis, bottleneck detection, and optimization recommendations for Claude Flow swarms

技能
ruvnet