v0.19.1

westonpace released this 21 Oct 21:58

· 278 commits to main since this release

c152d36

What's Changed

Breaking Changes 🛠

feat!: combine LanceFragment.merge_columns and LanceDataset.add_columns in one implementation by @westonpace in #3021
feat!: support to customize tokenizer by @BubbleCal in #2992

New Features 🎉

feat: add a blob encoding for large binary values by @westonpace in #2868
feat: cuda acceleration for PQ builds/assignments by @jacketsj in #2946
feat(rust): add checkout_latest to Dataset by @wjones127 in #2991
feat: support storage options for spark read and write by @SaintBacchus in #2990
feat: add some basic primitive for working with repdef data by @westonpace in #2998
feat: one-pass IVF_PQ accelerated builds by @jacketsj in #3001
feat: add bit-width, cardinality and data-size to datablock statistics by @broccoliSpicy in #2986
feat: add table config by @dsgibbons in #2820
feat: expose number of rows covered per delta by @chebbyChefNEQ in #2979
feat: add the basic encode path for 2.1 by @westonpace in #3002
feat: add a new BlobFile API that can be used to read blob data by @westonpace in #2983
feat: make it possible to add columns from existing data by @westonpace in #3010

Bug Fixes 🐛

fix: filter the vec index in function Index::scalar_index_info by @SaintBacchus in #3000
fix: update cursor correctly in blob file API by @westonpace in #3018
fix: create FTS on empty table/strings by @BubbleCal in #3024

Performance Improvements 🚀

perf: reduce the required memory for indexing FTS by @BubbleCal in #2926

Other Changes

refactor: convert scan args into struct by @westonpace in #2976

Full Changelog: v0.18.2...v0.19.1

Contributors

westonpace, jacketsj, and 6 other contributors

Assets 2