
perf: concurrent loading FTS index files #2787

Merged 1 commit into main on Sep 9, 2024

Conversation

@BubbleCal (Contributor) commented Aug 26, 2024

This gets a ~30% improvement from concurrent loading, which helps reduce the cold-start latency of full-text search.

Signed-off-by: BubbleCal <bubble-cal@outlook.com>
@BubbleCal BubbleCal changed the title perf: concurrent load FTS index files perf: concurrent loading FTS index files Aug 26, 2024
@codecov-commenter
Codecov Report

Attention: Patch coverage is 67.85714% with 9 lines in your changes missing coverage. Please review.

Project coverage is 79.28%. Comparing base (144d207) to head (7ced7dc).

Files | Patch % | Lines
rust/lance-index/src/scalar/inverted/index.rs | 67.85% | 0 Missing and 9 partials ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #2787      +/-   ##
==========================================
- Coverage   79.28%   79.28%   -0.01%     
==========================================
  Files         227      227              
  Lines       68269    68290      +21     
  Branches    68269    68290      +21     
==========================================
+ Hits        54126    54142      +16     
  Misses      11019    11019              
- Partials     3124     3129       +5     
Flag Coverage Δ
unittests 79.28% <67.85%> (-0.01%) ⬇️


@BubbleCal BubbleCal marked this pull request as ready for review August 26, 2024 13:12
@wjones127 (Contributor) left a comment

I think you could simplify this a bit. If you found there was some speedup from doing the spawn, consider checking whether there is CPU-bound work you could instead submit to the CPU threadpool as a task. Check out Weston's PR documenting our threadpools here: https://github.com/lancedb/lance/pull/2773/files

Comment on lines +232 to +255
let tokens_fut = tokio::spawn({
    let store = store.clone();
    async move {
        let token_reader = store.open_index_file(TOKENS_FILE).await?;
        let tokens = TokenSet::load(token_reader).await?;
        Result::Ok(tokens)
    }
});
let invert_list_fut = tokio::spawn({
    let store = store.clone();
    async move {
        let invert_list_reader = store.open_index_file(INVERT_LIST_FILE).await?;
        let invert_list = InvertedListReader::new(invert_list_reader)?;
        Result::Ok(Arc::new(invert_list))
    }
});
let docs_fut = tokio::spawn({
    let store = store.clone();
    async move {
        let docs_reader = store.open_index_file(DOCS_FILE).await?;
        let docs = DocSet::load(docs_reader).await?;
        Result::Ok(docs)
    }
});
Contributor

It seems like we shouldn't need to spawn these.

Suggested change
let tokens_fut = tokio::spawn({
    let store = store.clone();
    async move {
        let token_reader = store.open_index_file(TOKENS_FILE).await?;
        let tokens = TokenSet::load(token_reader).await?;
        Result::Ok(tokens)
    }
});
let invert_list_fut = tokio::spawn({
    let store = store.clone();
    async move {
        let invert_list_reader = store.open_index_file(INVERT_LIST_FILE).await?;
        let invert_list = InvertedListReader::new(invert_list_reader)?;
        Result::Ok(Arc::new(invert_list))
    }
});
let docs_fut = tokio::spawn({
    let store = store.clone();
    async move {
        let docs_reader = store.open_index_file(DOCS_FILE).await?;
        let docs = DocSet::load(docs_reader).await?;
        Result::Ok(docs)
    }
});
let tokens_fut = store.open_index_file(TOKENS_FILE)
    .and_then(|token_reader| TokenSet::load(token_reader));
let invert_list_fut = store.open_index_file(INVERT_LIST_FILE)
    .and_then(|invert_list_reader| InvertedListReader::new(invert_list_reader))
    .map_ok(Arc::new);
let docs_fut = store.open_index_file(DOCS_FILE)
    .and_then(|docs_reader| DocSet::load(docs_reader));

Comment on lines +257 to +259
let tokens = tokens_fut.await??;
let inverted_list = invert_list_fut.await??;
let docs = docs_fut.await??;
Contributor

You can await multiple futures at the same time with try_join!():

Suggested change
let tokens = tokens_fut.await??;
let inverted_list = invert_list_fut.await??;
let docs = docs_fut.await??;
let (tokens, inverted_list, docs) = try_join!(tokens_fut, invert_list_fut, docs_fut)?;

This has the upside that it will fail on the first failure of any of the three.

@BubbleCal (Contributor, Author) commented Aug 27, 2024

Yes, your solution is what I tried first, but I ran into the "FnOnce is not general enough" error, so I decided to use spawn instead.

It's a good idea to split the IO and CPU operations; right now the load methods mix them.

Contributor

Okay this is fine then.

@BubbleCal BubbleCal mentioned this pull request Aug 27, 2024
@BubbleCal (Contributor, Author) commented Aug 27, 2024

I did try submitting the IO and CPU operations to different runtimes (tokio::spawn / spawn_cpu), but it yielded no perf improvement (no obvious difference from serial execution).
This PR really benefits from parallelism (20% faster), though; my guess is that spawn_cpu executes the CPU operations on a different thread, which leads to cache misses.

@wjones127 any recommendations?

@wjones127 (Contributor) commented
> This PR really benefits from parallelism (20% faster), though; my guess is that spawn_cpu executes the CPU operations on a different thread, which leads to cache misses.
>
> @wjones127 any recommendations?

That might be a good reason to use block_in_place: https://docs.rs/tokio/latest/tokio/task/fn.block_in_place.html

@BubbleCal BubbleCal merged commit 6016917 into lancedb:main Sep 9, 2024
28 checks passed