-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug]: incomplete query result, missing id #34820
Comments
I got the data. I need to check whether I can upload it. I dropped the collection re-ingested the data and this time it works. The only difference I see is 2 segments. So it's definitely not data. We should investigate that what can cause such kind of issues? Please, let me know if you need more details to debug it. We are seeing this issue here and there in our prod environment. |
/assign @bigsheeper /unassign |
@yanliang567 I have looked into the issue that you have shared. I also had a word with @xiaofan-luan in a different forum, where he talked about a known issue in 2.4.x, which is "If there are two or more duplicate row(key & value) within same segment, this could happen. But, in this case, I looked into the data, there is no duplicate rows, also I were not able to see "592". When I looked into code, it's getting all the data data in Result, but the specified key was missing when it fetched field data for primary key. |
Hi @prrs , you might try using milvus v2.4.6, as similar issues have been resolved in this version. |
@bigsheeper At this point of time we can't move away from 2.3.x, because of below reasons.
|
This is actually a complicated fix so it's hard to backport to 2.3. The reason is of this issue is that there are duplicated PK result in one segment. We don't find a easy way to to backport to 2.3, but maybe you can write a small tool to find duplicate pk and fix that by delete the old duplicted data. (Trigger compaction can work as well.) |
@bigsheeper we need to a tool find duplicate pks in same segment |
I don't think this is happening because of duplicate pk in 2.3.11, because of below reasons:
Also, for the collection in production, the issue had happened had no duplicate row, this I verified by looking into source. I didn't had a way to dump the rows from Milvus, it's a dual write system so we know what exactly went into Milvus. As I mentioned above, the issue got mitigated when I dropped the partition and inserted the same data. Now, my question is how to debug it further to identify the issue to find a mitigation and long term solution with 2.3.x? |
@prrs From your description, it doesn't seem to be caused by duplicate primary keys. Is the "incomplete query result" error occasional? and how often it occurs. |
@bigsheeper it's occasional. All of these time it happened in prod, we haven't plugged in the back up tool in prod so not able to get dump. We are prioritising this to better debug the issue. |
There is another fix in " Restore the MVCC functionality", but it should be there on any version after 2.3.5 we need a segment with that error to reproduce. |
Hi @prrs ~ This issue has been fixed, you can update to the latest version |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
Is there an existing issue for this?
Environment
Current Behavior
For one of the collection it is very consistent. I am uploading the log trace.
incomplete_query.csv
Expected Behavior
It should fetch the result.
Steps To Reproduce
Milvus Log
incomplete_query.csv
Anything else?
Discord thread: https://discord.com/channels/1160323594396635310/1257950915269230634/1257950915269230634
The text was updated successfully, but these errors were encountered: