Rust❤️Lance: modern columnar data format for ML
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming..
Lance: modern columnar data format for ML
Lance is a columnar data format that is easy and fast to version, query and train on. It’s designed to be used with images, videos, 3D point clouds, audio and of course tabular data. It supports any POSIX file systems, and cloud storage like AWS S3 and Google Cloud Storage. The key features of Lance include:
High-performance random access: 100x faster than Parquet.
Vector search: find nearest neighbors in under 1 millisecond and combine OLAP-queries with vector search.
Zero-copy, automatic versioning: manage versions of your data automatically, and reduce redundancy with zero-copy logic built-in.
Ecosystem integrations: Apache-Arrow, DuckDB and more on the way.
https://lancedb.github.io/lance/
Lance: modern columnar data format for ML — Lance documentation
Lance: modern columnar data format for ML Lance is a columnar data format that is easy and fast to version, query and train on. It’s designed to be used with images, videos, 3D point clouds, audio and of course tabular data. It supports any POSIX file sy
lancedb.github.io
https://lancedb.github.io/lance/
Lance: modern columnar data format for ML — Lance documentation
Lance: modern columnar data format for ML Lance is a columnar data format that is easy and fast to version, query and train on. It’s designed to be used with images, videos, 3D point clouds, audio and of course tabular data. It supports any POSIX file sy
lancedb.github.io