-
Notifications
You must be signed in to change notification settings - Fork 153
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug] Blocks read inconsistent: expected xxx blocks, actual xxx blocks #124
Comments
Which version did you use? Do you set As long as the read client gets the metadata from the 2 of servers, it can check the integrity of data from any one of server. |
Yes
But this step seems execute before |
internal version 0.5.0-snapshot |
The metadata is acquired in advance, but data integrity check is executed when all blocks have been fetched. |
I know, but the application will fail |
I feel a little unreasonable about this implement. Should we read next shuffle server when the data isn't complete? |
I am trying to do this, and i think it needs to be fixed with #108 together |
I would be happy to review this PR, and you should avoid to fetch redundancy blocks from the another server (because the spark has consumed this blocks). |
In my opinion, memory data should also have data skip ability, and our read memory process should be optimized. |
Get |
This will change server's memory storage to add "index" like hdfs |
This problem will should discuss in another issue, we also should have a simple design doc. |
### What changes were proposed in this pull request? Add fallback mechanism for blocks read inconsistent ### Why are the changes needed? When the data in this first server is damaged, application will fail. #124 #129 ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Already added
closed by #276 |
spark.rss.data.replica.write=2
andspark.rss.data.replica=3
,Data integrity cannot be guaranteed in any one shuffle server. right?org.apache.uniffle.storage.handler.impl.LocalFileQuorumClientReadHandler#readShuffleData
, it just read from one shuffle serverThe text was updated successfully, but these errors were encountered: