`messageQueue` not showing Error when `success` is no #478

albertov19 · 2023-07-21T13:51:31Z

There are some XCM messages that are failing to execute on the relay chain. We think we know where the issues lies but it is somewhat hard to debug this now that we can't see the execution error of the XCM messages.

Any idea how we can see this now?

Thanks

bkchr · 2023-07-21T14:04:37Z

CC @KiChjang

ggwpez · 2023-07-21T14:16:49Z

You mean on Polkadot or Kusama? What kind of messages are failing? Can you please post the links to the events and the messages themselves?
The MQ pallet got deployed to Polkadot with the .43 upgrade and was longer deployed on Kusama. But there were no error reports on Kusama so far...

ggwpez · 2023-07-21T14:41:31Z

Okay I assume you talk about this block https://polkadot.subscan.io/block/16477742?tab=event.
So far there were 9 failed MQ dispatches on Polkadot and 42 on Kusama, whereas there were 375 successful ones on Polkadot and 3446 (updated) on Kusama. It looks like that we need to specifically investigate the failing messages.

The failed events in JSON format: failed DOT.json.txt failed KSM.json.txt

albertov19 · 2023-07-21T15:12:23Z

@ggwpez I think more importantly, to be able to see the errors in Polkadot.js Apps would help debugging

We know why it failed.

Thanks

ggwpez · 2023-07-21T16:46:17Z

The MQ pallet has no introspection into the error type of the underlying implementation of ProcessMessage since it just returns a success bool (as long as there was no gross error with the processing itself) from here.

You can see the error in the batch ItemFailed event as BadOrigin. The message processor is responsible to emit any events that are needed to be known to the outside world in such a case.

I would like to close this since it is the wrong abstraction to solve your issue.

xlc · 2023-07-21T20:51:47Z

you can replay block with chopsticks and enable runtime logging to see bit more details about the failing reason.

if that’s not enough, you can add more loggings and override wasm to see additions logs

girazoki · 2023-07-24T11:20:51Z

@ggwpez this behavior changed with respect to the eventts that the ump queue was before throwing. An example of this is block 16483792.

Here we had a Transact instruction that was not specifying enough weight to execute the dispatchable. Before we would have the ump queue throwing a MaxWeightInvalid outcome, now we only see success:false. See block 16250668 as an example

We did not find a way to retrieve the execution error anywhere right now other than using chopsticks with additional logs like @xlc suggested.

girazoki · 2023-07-24T11:22:19Z

Debugging XCM failures is already not a straight-forward thing, so all facilities are welcome

albertov19 · 2023-07-24T11:22:47Z

Without needing to use chopsticks, we need to be able to understand why the message fails to easily debug these things. Why remove the error? Now it becomes 100x harder to debug. To drive XCM usability we need to make it easier and ensure it is a "welcoming" experience...

ggwpez · 2023-07-24T11:35:10Z

There is still an overweight error that the message processor can return: Overweight(Weight). the standard message processor should also return that.

I dont know how exactly Transact handles this, but at least the overweight error should still be there.

ggwpez · 2023-07-24T11:37:24Z

Without needing to use chopsticks, we need to be able to understand why the message fails to easily debug these things. Why remove the error? Now it becomes 100x harder to debug. To drive XCM usability we need to make it easier and ensure it is a "welcoming" experience...

We are not trying to remove errors or make debugging harder. Aggregating all possible errors from any possible implementation into one huge enum just does not sound like a good idea to me either. The MQ pallet is very abstract and generic. It should not need to know about any downstream error sources besides the ones defined in its trait: ProcessMessageError.

girazoki · 2023-07-24T11:53:23Z

Ah I see, I understand the problem better now. What about implementing the ProcessMessage trait to pallet-xcm? that way pallet-xcm could deposit an event showing the xcm-execution error before returning true or false

albertov19 · 2023-07-24T12:30:54Z

Without needing to use chopsticks, we need to be able to understand why the message fails to easily debug these things. Why remove the error? Now it becomes 100x harder to debug. To drive XCM usability we need to make it easier and ensure it is a "welcoming" experience...

We are not trying to remove errors or make debugging harder. Aggregating all possible errors from any possible implementation into one huge enum just does not sound like a good idea to me either. The MQ pallet is very abstract and generic. It should not need to know about any downstream error sources besides the ones defined in its trait: ProcessMessageError.

I understand but as users of XCM, we create tooling and debugging mechanisms expecting certain behaviors. If something is changed upstream, there should be a description of how to act or work around this new behavior.

Do you know a way I can get the error for a given message? In another scenario I'm testing I get Unsupported so that is pretty useless when trying to debug what is going on

albertov19 · 2023-07-25T09:21:39Z

@ggwpez another issue that is not straightforward is how to weight a message for example. I would like to weight the following XCM messages that will be executed in the relay chain:

ump: {
      "2004": [
        "0x03140004000000000700e40b540213000000000700e40b540200060002286bee02000400183c0135080000140d0102040001010070617261d4070000000000000000000000000000000000000000000000000000",
        "0x03140004000000000700e40b540213000000000700e40b540200060002286bee02000400383c0035080000e803000000900100140d0102040001010070617261d4070000000000000000000000000000000000000000000000000000"
      ]
    }

) * compute required storage keys in the message-lane pallet * Update modules/message-lane/src/lib.rs Co-authored-by: Hernando Castano <HCastano@users.noreply.github.com> Co-authored-by: Hernando Castano <HCastano@users.noreply.github.com>

ggwpez added T1-runtime labels Jul 21, 2023

juangirini transferred this issue from paritytech/polkadot Aug 24, 2023

the-right-joyce added I10-unconfirmed Issue might be valid, but it's not yet known. T1-FRAME This PR/Issue is related to core FRAME, the framework. and removed J2-unconfirmed labels Aug 25, 2023

franciscoaguirre added the T6-XCM This PR/Issue is related to XCM. label Mar 25, 2024

bkontur mentioned this issue Apr 3, 2024

[xcm] Investigate the possibility of ProcessXcmMessage / XcmExecutor emitting an event on failure. #2780

Open

This was referenced Jun 5, 2024

Update polkadot-sdk from v1.7.0 to v1.11.0 moondance-labs/tanssi#573

Closed

Update polkadot-sdk from v1.10.0 to v1.11.0 moondance-labs/tanssi#577

Closed

acatangiu mentioned this issue Oct 18, 2024

[XCM] Observability & Debuggability #6119

Open

15 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`messageQueue` not showing Error when `success` is no #478

`messageQueue` not showing Error when `success` is no #478

albertov19 commented Jul 21, 2023

bkchr commented Jul 21, 2023

ggwpez commented Jul 21, 2023 •

edited

Loading

ggwpez commented Jul 21, 2023 •

edited

Loading

albertov19 commented Jul 21, 2023

ggwpez commented Jul 21, 2023 •

edited

Loading

xlc commented Jul 21, 2023

girazoki commented Jul 24, 2023

girazoki commented Jul 24, 2023 •

edited

Loading

albertov19 commented Jul 24, 2023

ggwpez commented Jul 24, 2023

ggwpez commented Jul 24, 2023

girazoki commented Jul 24, 2023

albertov19 commented Jul 24, 2023

albertov19 commented Jul 25, 2023

messageQueue not showing Error when success is no #478

messageQueue not showing Error when success is no #478

Comments

albertov19 commented Jul 21, 2023

bkchr commented Jul 21, 2023

ggwpez commented Jul 21, 2023 • edited Loading

ggwpez commented Jul 21, 2023 • edited Loading

albertov19 commented Jul 21, 2023

ggwpez commented Jul 21, 2023 • edited Loading

xlc commented Jul 21, 2023

girazoki commented Jul 24, 2023

girazoki commented Jul 24, 2023 • edited Loading

albertov19 commented Jul 24, 2023

ggwpez commented Jul 24, 2023

ggwpez commented Jul 24, 2023

girazoki commented Jul 24, 2023

albertov19 commented Jul 24, 2023

albertov19 commented Jul 25, 2023

`messageQueue` not showing Error when `success` is no #478

`messageQueue` not showing Error when `success` is no #478

ggwpez commented Jul 21, 2023 •

edited

Loading

ggwpez commented Jul 21, 2023 •

edited

Loading

ggwpez commented Jul 21, 2023 •

edited

Loading

girazoki commented Jul 24, 2023 •

edited

Loading