Polars
技能 已验证 活跃Fast in-memory DataFrame library for datasets that fit in RAM. Use when pandas is too slow but data still fits in memory. Lazy evaluation, parallel execution, Apache Arrow backend. Best for 1-100GB datasets, ETL pipelines, faster pandas replacement. For larger-than-RAM data use dask or vaex.
To offer a high-performance, in-memory DataFrame library for datasets that fit in RAM, serving as a faster alternative to pandas for ETL pipelines and data processing tasks.
功能
- In-memory DataFrame operations
- Lazy evaluation for query optimization
- Parallel execution for speed
- Apache Arrow backend for efficiency
- Faster alternative to pandas
使用场景
- Replacing slow pandas operations on in-memory datasets
- Optimizing ETL pipelines for 1-100GB datasets
- Performing complex data transformations and analyses
- Migrating from pandas to a more performant library
非目标
- Handling datasets larger than available RAM
- Replacing dask or vaex for out-of-memory computation
- General-purpose data storage or database management
Compliance
- info:GDPRThe skill operates on user-provided data; while Polars itself doesn't store data, personal data submitted to the LLM for processing could be subject to GDPR if not handled carefully by the user.
Execution
- info:Pinned dependenciesDependencies are managed by uv, which typically pins versions, but explicit lockfiles for the skill itself are not detailed. The Polars library itself would have its own versioning.
安装
npx skills add K-Dense-AI/claude-scientific-skills通过 npx 运行 Vercel skills CLI(skills.sh)— 需要本地安装 Node.js,以及至少一个兼容 skills 的智能体(Claude Code、Cursor、Codex 等)。前提是仓库遵循 agentskills.io 格式。
质量评分
已验证类似扩展
Polars Bio
99High-performance genomic interval operations and bioinformatics file I/O on Polars DataFrames. Overlap, nearest, merge, coverage, complement, subtract for BED/VCF/BAM/GFF intervals. Streaming, cloud-native, faster bioframe alternative.
Ray Data
95Scalable data processing for ML workloads. Streaming execution across CPU/GPU, supports Parquet/CSV/JSON/images. Integrates with Ray Train, PyTorch, TensorFlow. Scales from single machine to 100s of nodes. Use for batch inference, data preprocessing, multi-modal data loading, or distributed ETL pipelines.
Chdb Datastore
95与 ClickHouse 性能兼容的即插即用 pandas 替代品。使用 `import chdb.datastore as pd`(或 `from datastore import DataStore`)并编写标准的 pandas 代码 — API 相同,在大数据集上速度提升 10-100 倍。支持 16 种以上数据源(MySQL、PostgreSQL、S3、MongoDB、ClickHouse、Iceberg、Delta Lake 等)和 10 种以上文件格式(Parquet、CSV、JSON、Arrow、ORC 等)以及跨源连接。当用户希望使用 pandas 风格的语法分析数据、加速缓慢的 pandas 代码、将远程数据库或云存储作为 DataFrame 查询,或连接不同来源的数据时,请使用此技能 — 即使他们没有明确提及 chdb 或 DataStore。请勿用于原始 SQL 查询、ClickHouse 服务器管理或非 Python 语言。
AlterLab Polars
78Part of the AlterLab Academic Skills suite. Fast in-memory DataFrame library for datasets that fit in RAM. Use when pandas is too slow but data still fits in memory. Lazy evaluation, parallel execution, Apache Arrow backend. Best for 1-100GB datasets, ETL pipelines, faster pandas replacement. For larger-than-RAM data use dask or vaex.
Create Spatial Visualization
100Create interactive maps, elevation profiles, and spatial visualizations from GPX tracks, waypoints, or route data using R (sf, leaflet, tmap) or Observable (D3, deck.gl). Covers data import, coordinate system handling, map styling, and export to HTML or image formats. Use when visualizing a planned or completed tour route on an interactive map, creating elevation profiles for hiking or cycling routes, overlaying waypoints and POIs on a basemap, or building a web-based trip dashboard.
Performance Analysis
100Comprehensive performance analysis, bottleneck detection, and optimization recommendations for Claude Flow swarms