v0.19.1
What's Changed
Breaking Changes 🛠
- feat!: combine LanceFragment.merge_columns and LanceDataset.add_columns in one implementation by @westonpace in #3021
- feat!: support to customize tokenizer by @BubbleCal in #2992
New Features 🎉
- feat: add a blob encoding for large binary values by @westonpace in #2868
- feat: cuda acceleration for PQ builds/assignments by @jacketsj in #2946
- feat(rust): add checkout_latest to Dataset by @wjones127 in #2991
- feat: support storage options for spark read and write by @SaintBacchus in #2990
- feat: add some basic primitive for working with repdef data by @westonpace in #2998
- feat: one-pass IVF_PQ accelerated builds by @jacketsj in #3001
- feat: add bit-width, cardinality and data-size to datablock statistics by @broccoliSpicy in #2986
- feat: add table config by @dsgibbons in #2820
- feat: expose number of rows covered per delta by @chebbyChefNEQ in #2979
- feat: add the basic encode path for 2.1 by @westonpace in #3002
- feat: add a new BlobFile API that can be used to read blob data by @westonpace in #2983
- feat: make it possible to add columns from existing data by @westonpace in #3010
Bug Fixes 🐛
- fix: filter the vec index in function Index::scalar_index_info by @SaintBacchus in #3000
- fix: update cursor correctly in blob file API by @westonpace in #3018
- fix: create FTS on empty table/strings by @BubbleCal in #3024
Performance Improvements 🚀
- perf: reduce the required memory for indexing FTS by @BubbleCal in #2926
Other Changes
- refactor: convert scan args into struct by @westonpace in #2976
Full Changelog: v0.18.2...v0.19.1