- **Modern columnar data format for ML. Convert from Parquet in 2-lines of code for 100x faster random access, a vector index, data versioning, and more.<br/>**
- **Compatible with pandas, DuckDB, Polars, and pyarrow with more integrations on the way.**
+ **Modern columnar data format for ML. Convert from Parquet in 2-lines of code for 100x faster random access, zero-cost schema evolution, rich secondary indices, versioning, and more.<br/>**
+ **Compatible with Pandas, DuckDB, Polars, Pyarrow, and Ray with more integrations on the way.**
> Preview releases are released more often than full releases and contain the
> latest features and bug fixes. They receive the same level of testing as full releases.
- > We guarantee they will remain published and available for download for at
+ > We guarantee they will remain published and available for download for at
> least 6 months. When you want to pin to a specific version, prefer a stable release.

**Converting to Lance**
@@ -186,8 +186,8 @@ Support both CPUs (``x86_64`` and ``arm``) and GPU (``Nvidia (cuda)`` and ``Appl

**Fast updates** (ROADMAP): Updates will be supported via write-ahead logs.

- **Rich secondary indices** (ROADMAP):
- - Inverted index for fuzzy search over many label / annotation fields.
+ **Rich secondary indices**: Support `BTree`, `Bitmap`, `Full text search`, `Label list`,
+ `NGrams`, and more.

## Benchmarks

@@ -253,11 +253,16 @@ A comparison of different data formats in each stage of ML development cycle.

Lance is currently used in production by:
* [LanceDB](https://github.com/lancedb/lancedb), a serverless, low-latency vector database for ML applications
+ * [LanceDB Enterprise](https://docs.lancedb.com/enterprise/introduction), hyperscale LanceDB with enterprise SLA.
+ * Leading multimodal Gen AI companies for training over petabyte-scale multimodal data.
* Self-driving car company for large-scale storage, retrieval and processing of multi-modal data.
* E-commerce company for billion-scale+ vector personalized search.
* and more.

- ## Presentations and Talks
+ ## Presentations, Blogs and Talks

+ * [Designing a Table Format for ML Workloads](https://blog.lancedb.com/designing-a-table-format-for-ml-workloads/), Feb 2025.
+ * [Transforming Multimodal Data Management with LanceDB, Ray Summit](https://www.youtube.com/watch?v=xmTFEzAh8ho), Oct 2024.
+ * [Lance v2: A columnar container format for modern data](https://blog.lancedb.com/lance-v2/), Apr 2024.
* [Lance Deep Dive](https://drive.google.com/file/d/1Orh9rK0Mpj9zN_gnQF1eJJFpAc6lStGm/view?usp=drive_link). July 2023.
* [Lance: A New Columnar Data Format](https://docs.google.com/presentation/d/1a4nAiQAkPDBtOfXFpPg7lbeDAxcNDVKgoUkw3cUs2rE/edit#slide=id.p), [Scipy 2022, Austin, TX](https://www.scipy2022.scipy.org/posters). July, 2022.