-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] ERROR: retrieval failed: Retrieve: Retrieval Error: Unexpected deal response status: DealStatusErrored (miner rejected) #6297
Comments
@mgoelzer Would you be able to share the client & miner logs for this deal ? |
@aarshkshah1992 I just repeated the test exactly as described above and captured the daemon log only during the test period: test-repeat.log. This was done with Lotus @ PR #6149. Unf I cannot get the miner logs. This is just a random miner on the network, not one that I'm running. |
Here's another example of this bug with a different miner. Command Console Output
Logs Lotus Version
|
Relevant log lines. Looks like the channel was rejected by the miner:
On retrieval, this usually indicates an error on the miner when accepting the retrieval channel. The error probably comes from here: https://github.com/filecoin-project/go-data-transfer/blob/77b948c4eb91aac1638cdd2759d8aff39216ffe2/impl/events.go#L355 Unfortunately without the miner logs, we don't have any insight into what happened, since the error itself is not serialised across the wire. |
TL;DR: client created the payment channel (because retrieval price was non-zero), waited for it to be confirmed on chain. Miner then requested payment, client sent payment voucher. When the data channel was opened, the miner rejected it and the client failed. |
One note: to my knowledge all of @mgoelzer 's miners are running markets 1.3.0, which includes filecoin-project/go-fil-markets#540 -- a big retrieval change. I assume that most nodes on the internet are running 1.9.0 which includes markets 1.2.5 Possible hypothesis: we are looking at a networking protocol problem between 1.3.0 and 1.2.5? I'm not sure but we should likely do some compatibility testing and I think this is a signal we need to hold on tagging Lotus for markets 1.3.0, especially if Lotus 1.10 is not a mandatory upgrade |
Compatibility testing between 1.9.0 (go-fil-markets 1.2.5) and master (go-fil-markets 1.3.0) was green on testground using the e2e deals test plan. However, this test plan uses 8MiB sectors and performs a 5MiB deal. We might be seeing something else here. We continue investigating. |
Note: Need to confirm that this is indeed a Markets v1.3.0/data-transfer v1.5.0 regression by seeing repeated success running a Lotus v1.9.0 client against a Lotus v1.9.0 miner. |
This is a compatibility issue between Markets 1.3.0 clients and <1.3.0 miners:
The line in the payment channel code that generates that error when voucher amount is below a certain expectation is at: Line 374 in 5445b05
|
Describe the bug
A retrieval can't get past 816.1 MiB, despite repeated attempts. Provider is responsive. Client can complete other retrievals. Reproducible.
I ran this command
lotus client retrieve --miner f019002 bafykbzacea6lb336ja5htts4c2cetbl5qeswllzfl72ynvcya65jpluhqxgpg /dev/null
. The retrieval was working correctly for a while, but then froze at 816.1 MiB. It stayed in this state of ~15 minutes. Full output:Version (run
lotus version
):Occurs in 1.8.0 and a separate build off of PR #6149.
To Reproduce
Run the command above from
dealbot-mainnet
. (I can provide host, ip and credentials out of band.)Expected behavior
I expect the transfer to complete successfully.
Logs
I have not been able to correlate this with any additional information in the logs.
EDIT: Added <details><pre>...</pre></details> around output to improve readability.
The text was updated successfully, but these errors were encountered: