-
-
Notifications
You must be signed in to change notification settings - Fork 699
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Allocate docIds in a way that is monotonic with some fast field #29
Comments
I want to work on this (as soon as the other PR is closed, of course), could you give me some pointer for where to start? |
@fulmicoton can you explain a little how this would differ from the top docs by field collector? https://docs.rs/tantivy/0.8.1/tantivy/collector/struct.TopDocsByField.html |
@hwchen Sorry my previous message was misleading. I removed it. Right now documents get an internal document id. This document id is currenlty defined by the order in which document are added in the segment. There are a bunch of optimization that can be done if someone choose a better order for these. For instance, ordering by decreasing page rank makes it possible to get a nice SERP while scoring only the first Q docs. This is a very interesting feature for many business, but let's not consider it until somebody actually has the need and want to use tantivy. |
Correct . Thank you @shikhar |
Sorting documents by a given field can open the door to various optimization
The text was updated successfully, but these errors were encountered: