Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[ISSUE-392] Fix the bug in the shuffle data cleanup checker that caus…
…es false reports of disk corruption (#393) ### What changes were proposed in this pull request? [ISSUE-392] Fix the bug in the shuffle data cleanup checker that causes false reports of disk corruption #393 ### Why are the changes needed? Fix the bug in the shuffle data checker that causes false reports of disk corruption during cleanup. ``` [INFO] 2022-12-07 16:27:51,411 leakShuffleDataChecker ShuffleTaskManager checkLeakShuffleData - Start check leak shuffle data [INFO] 2022-12-07 16:27:51,416 leakShuffleDataChecker LocalFileDeleteHandler delete - Delete shuffle data for appId[check] with /data1/uniffle/data/check cost 0 ms [INFO] 2022-12-07 16:27:51,420 leakShuffleDataChecker LocalFileDeleteHandler delete - Delete shuffle data for appId[check] with /data2/uniffle/data/check cost 0 ms [INFO] 2022-12-07 16:27:51,420 leakShuffleDataChecker LocalFileDeleteHandler delete - Delete shuffle data for appId[check] with /data3/uniffle/data/check cost 0 ms [INFO] 2022-12-07 16:27:51,420 leakShuffleDataChecker LocalFileDeleteHandler delete - Delete shuffle data for appId[check] with /data4/uniffle/data/check cost 0 ms [INFO] 2022-12-07 16:27:51,420 leakShuffleDataChecker ShuffleTaskManager checkLeakShuffleData - Finish check leak shuffle data [ERROR] 2022-12-07 16:27:51,685 HealthCheckService LocalStorageChecker checkStorageReadAndWrite - Storage read and write error java.io.FileNotFoundException: /data4/uniffle/data/check/test (No such file or directory) at java.io.FileInputStream.open0(Native Method) at java.io.FileInputStream.open(FileInputStream.java:195) at java.io.FileInputStream.<init>(FileInputStream.java:138) at org.apache.uniffle.server.LocalStorageChecker$StorageInfo.checkStorageReadAndWrite(LocalStorageChecker.java:180) at org.apache.uniffle.server.LocalStorageChecker.checkIsHealthy(LocalStorageChecker.java:73) at org.apache.uniffle.server.HealthCheck.check(HealthCheck.java:84) at org.apache.uniffle.server.HealthCheck.lambda$new$0(HealthCheck.java:70) at java.lang.Thread.run(Thread.java:745) [INFO] 2022-12-07 16:27:51,685 HealthCheckService LocalStorageChecker checkIsHealthy - shuffle server become unhealthy ``` ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? 1. UTs
- Loading branch information