此内容尚未提供您的语言版本,正在以英文显示。

Polars

技能已验证活跃

Fast in-memory DataFrame library for datasets that fit in RAM. Use when pandas is too slow but data still fits in memory. Lazy evaluation, parallel execution, Apache Arrow backend. Best for 1-100GB datasets, ETL pipelines, faster pandas replacement. For larger-than-RAM data use dask or vaex.

目的

To offer a high-performance, in-memory DataFrame library for datasets that fit in RAM, serving as a faster alternative to pandas for ETL pipelines and data processing tasks.

功能

In-memory DataFrame operations
Lazy evaluation for query optimization
Parallel execution for speed
Apache Arrow backend for efficiency
Faster alternative to pandas

使用场景

Replacing slow pandas operations on in-memory datasets
Optimizing ETL pipelines for 1-100GB datasets
Performing complex data transformations and analyses
Migrating from pandas to a more performant library

非目标

Handling datasets larger than available RAM
Replacing dask or vaex for out-of-memory computation
General-purpose data storage or database management

Compliance

info:GDPRThe skill operates on user-provided data; while Polars itself doesn't store data, personal data submitted to the LLM for processing could be subject to GDPR if not handled carefully by the user.

Execution

info:Pinned dependenciesDependencies are managed by uv, which typically pins versions, but explicit lockfiles for the skill itself are not detailed. The Polars library itself would have its own versioning.

安装

npx skills add K-Dense-AI/claude-scientific-skills

通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。

质量评分

已验证

99 /100

1 day ago 分析

信任信号

最近提交3 days ago

GitHub 所有者 K-Dense-AI

星标21k

许可证MIT

网站k-dense.ai

状态

查看源代码

类似扩展

Polars Bio

High-performance genomic interval operations and bioinformatics file I/O on Polars DataFrames. Overlap, nearest, merge, coverage, complement, subtract for BED/VCF/BAM/GFF intervals. Streaming, cloud-native, faster bioframe alternative.

技能

K-Dense-AI

Ray Data

Scalable data processing for ML workloads. Streaming execution across CPU/GPU, supports Parquet/CSV/JSON/images. Integrates with Ray Train, PyTorch, TensorFlow. Scales from single machine to 100s of nodes. Use for batch inference, data preprocessing, multi-modal data loading, or distributed ETL pipelines.

技能

Orchestra-Research

Chdb Datastore

与 ClickHouse 性能兼容的即插即用 pandas 替代品。使用 `import chdb.datastore as pd`（或 `from datastore import DataStore`）并编写标准的 pandas 代码 — API 相同，在大数据集上速度提升 10-100 倍。支持 16 种以上数据源（MySQL、PostgreSQL、S3、MongoDB、ClickHouse、Iceberg、Delta Lake 等）和 10 种以上文件格式（Parquet、CSV、JSON、Arrow、ORC 等）以及跨源连接。当用户希望使用 pandas 风格的语法分析数据、加速缓慢的 pandas 代码、将远程数据库或云存储作为 DataFrame 查询，或连接不同来源的数据时，请使用此技能 — 即使他们没有明确提及 chdb 或 DataStore。请勿用于原始 SQL 查询、ClickHouse 服务器管理或非 Python 语言。

技能

clickhouse

AlterLab Polars

Part of the AlterLab Academic Skills suite. Fast in-memory DataFrame library for datasets that fit in RAM. Use when pandas is too slow but data still fits in memory. Lazy evaluation, parallel execution, Apache Arrow backend. Best for 1-100GB datasets, ETL pipelines, faster pandas replacement. For larger-than-RAM data use dask or vaex.

技能

AlterLab-IEU

Create Spatial Visualization

100

Create interactive maps, elevation profiles, and spatial visualizations from GPX tracks, waypoints, or route data using R (sf, leaflet, tmap) or Observable (D3, deck.gl). Covers data import, coordinate system handling, map styling, and export to HTML or image formats. Use when visualizing a planned or completed tour route on an interactive map, creating elevation profiles for hiking or cycling routes, overlaying waypoints and POIs on a basemap, or building a web-based trip dashboard.

技能

pjt222

Performance Analysis

100

Comprehensive performance analysis, bottleneck detection, and optimization recommendations for Claude Flow swarms

技能

ruvnet