[Feature]: modular testutil.Network #18145

tac0turtle · 2023-10-17T12:09:35Z

Summary

testutil.Network is used to spin up n validators using CometBFT. The Cosmos SDK and its users rely on this package to test their applications.

We have seen more testing frameworks come up for different needs, different levels of testing, and different consensus engines:

Starship
CometMock
Rollkit

We should turn testutil.Network into an interface that can be used against many different environments.

Problem Definition

Streamline testing with different environments without needing to rewrite tests

Proposed Feature

Modify testutil.Network into an interface that testing framework authors can implement so that e2e tests can be run against many different environments.
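
As a rough illustration of the proposal, here is a minimal Go sketch of what such an interface could look like. The method set, the `fakeNetwork` backend, and the `checkLiveness` helper are all hypothetical names chosen for this example. They are not the SDK's actual API, only an assumption about the kind of operations an e2e test needs from a network.

```go
package main

import "errors"

// Network is a hypothetical, minimal abstraction over a running test network.
// A real interface would likely expose more (clients, validator handles, etc.).
type Network interface {
	ValidatorCount() int
	LatestHeight() (int64, error)
	WaitForHeight(h int64) (int64, error)
	Cleanup()
}

// fakeNetwork is an in-memory stand-in for a real backend
// (an in-process network, a Kubernetes-based one, a mocked consensus engine).
type fakeNetwork struct {
	validators int
	height     int64
	closed     bool
}

func newFakeNetwork(validators int) *fakeNetwork {
	return &fakeNetwork{validators: validators, height: 1}
}

func (n *fakeNetwork) ValidatorCount() int { return n.validators }

func (n *fakeNetwork) LatestHeight() (int64, error) {
	if n.closed {
		return 0, errors.New("network is shut down")
	}
	return n.height, nil
}

// WaitForHeight "produces" blocks instantly; a real backend would poll a node.
func (n *fakeNetwork) WaitForHeight(h int64) (int64, error) {
	if n.closed {
		return 0, errors.New("network is shut down")
	}
	if n.height < h {
		n.height = h
	}
	return n.height, nil
}

func (n *fakeNetwork) Cleanup() { n.closed = true }

// checkLiveness is a backend-agnostic test body: it only talks to the
// Network interface, so it can run unchanged against any implementation.
func checkLiveness(n Network, blocks int64) error {
	start, err := n.LatestHeight()
	if err != nil {
		return err
	}
	got, err := n.WaitForHeight(start + blocks)
	if err != nil {
		return err
	}
	if got < start+blocks {
		return errors.New("network did not reach the target height")
	}
	return nil
}
```

The point of the sketch is that `checkLiveness` never names a concrete backend, so a framework author would only need to provide another type that satisfies `Network`.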

alexanderbez · 2023-10-17T15:54:52Z

Specifically, I'd like to see us use more practical environments to test RCs and regression test against prior releases. Most likely Starship will help us a ton here.

Anmol1696 · 2023-10-17T19:26:24Z

@alexanderbez that is something we are working on. Here are some more details.
If there is access to a development Kubernetes cluster (which we could hook up with gh-actions), then we should be able to create a large-scale multi-node environment and, depending on resource consumption (which we can optimize for), run regression tests in parallel as well.

I am working towards full monitoring support (so we could also generate reports), as well as benchmarking and performance reports, so that we can compare releases in a much more realistic and automated fashion.

The one thing I am not super sure about is what exactly to run in the regression tests. This is where reusing testutil.Network and being able to run e2e tests (even a subset) against multiple backends (including Starship) will help a lot.

One limitation of an e2e system that runs everything as a black box is the lack of direct access to keepers and app internals. This makes testing weird edge cases slightly trickier, but it makes a lot of sense for regression tests. If we can abstract out all the special calls to the network in the testutil.Network interface, then we could implement the handlers for Starship and reuse e2e tests for regression testing as well. This could even include some admin functions (which we could run partially with CometMock).
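
One way to handle admin functions that only some backends support is an optional capability interface, checked with a Go type assertion. The sketch below is only an assumption about how that could be wired. `Backend`, `AdminBackend`, `ForceBlock`, and both backend types are hypothetical names, not part of testutil.Network, Starship, or CometMock.

```go
package main

// Backend is a hypothetical base interface every test environment implements.
type Backend interface {
	Name() string
}

// AdminBackend is a hypothetical optional capability: backends that can be
// driven directly (for example a mocked consensus engine) implement it,
// black-box backends do not.
type AdminBackend interface {
	Backend
	ForceBlock() int64 // produce one block on demand, return the new height
}

// blackBox models a backend with no admin access.
type blackBox struct{}

func (blackBox) Name() string { return "blackbox" }

// controllable models a backend whose block production the test can drive.
type controllable struct{ height int64 }

func (c *controllable) Name() string { return "controllable" }

func (c *controllable) ForceBlock() int64 {
	c.height++
	return c.height
}

// runAdminStep runs an admin-only step when the backend supports it.
// It reports whether the step ran, so a caller can skip instead of fail.
func runAdminStep(b Backend) (ran bool, height int64) {
	admin, ok := b.(AdminBackend)
	if !ok {
		return false, 0
	}
	return true, admin.ForceBlock()
}
```

With this shape, the same e2e suite could run everywhere, and tests that need admin calls would be skipped on black-box environments rather than rewritten.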

This change implements a replacement for the current simulator based on testutil/network. Most of the changes port the module-specific message generators so that they no longer rely on SimulationState and generate "real" messages, not simulator messages. The simulator driver is in simapp, as part of the IntegationTestSuite.

The new approach aims to improve simulation in two important ways:

- Simulation should more closely mimic a real network. The current simulator message delivery is implemented in parallel to non-simulator message delivery, leading to loss of fidelity and higher maintenance. One symptom is cosmos#13843.
- Simulation should be layered on top of modules, not part of modules. This means that modules should not import simulation packages, nor refer to its generator package (x/module/simulation). This should eventually fix cosmos#7622.

There are also downsides, however. Where the current simulator is too high level, testutil/network is too low level: it runs a real network of validators, which is difficult to control. For example:

- AppHashes differ between runs, because modules may depend on non-deterministic state such as block header timestamps.
- The validators run in separate goroutines, which makes it hard to query app state without introducing race conditions.
- Blocks are produced according to time, and not under the control of the test driver. This makes it hard to trigger processing of messages in particular blocks, which ruins determinism.

Some of the issues may be worked around, for example by forcing the block headers to be deterministic; however, the real fix is to make testutil/network itself deterministic, providing the Goldilocks level of simulation: close enough to a real network, yet deterministic enough to generate the same chain state for a given random seed. A deterministic testutil/network is part of cosmos#18145.

Future work includes:

- Porting of the remaining module message generators.
- Generating (and verifying) deterministic AppHashes, allowing reliable replay when a problematic message is detected. Depends on cosmos#18145.
- Save/reload of state for faster debugging cycles.
- Removal of the old simulator, most importantly the reference to it from module code.

Updates cosmos#14753 (Simulator rewrite epic)
Updates cosmos#7622 (reducing imports from modules to simulator)
Updates cosmos#13843 (using real message delivery for simulation)
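
The determinism goal described above (same random seed, same chain state) can be illustrated with a toy model. This is a sketch under stated assumptions, not how testutil/network or the simulator works: `manualChain`, `Submit`, `NextBlock`, and `simulate` are hypothetical names, the "state" is just a running SHA-256 hash, and block time is a logical counter advanced by the test driver instead of the wall clock.

```go
package main

import (
	"crypto/sha256"
	"encoding/binary"
	"encoding/hex"
	"math/rand"
)

// manualChain is a toy chain whose blocks are produced only when the test
// driver asks for them, with a logical (not wall-clock) block time.
type manualChain struct {
	height    int64
	blockTime int64 // logical seconds, advanced by a fixed step per block
	state     [32]byte
	pending   [][]byte
}

// Submit queues a message for inclusion in the next block.
func (c *manualChain) Submit(msg []byte) {
	c.pending = append(c.pending, msg)
}

// NextBlock folds height, logical time, and pending messages into the state
// hash. Nothing here depends on real time or goroutine scheduling.
func (c *manualChain) NextBlock() {
	c.height++
	c.blockTime += 5

	h := sha256.New()
	h.Write(c.state[:])
	var hdr [16]byte
	binary.BigEndian.PutUint64(hdr[:8], uint64(c.height))
	binary.BigEndian.PutUint64(hdr[8:], uint64(c.blockTime))
	h.Write(hdr[:])
	for _, m := range c.pending {
		h.Write(m)
	}
	copy(c.state[:], h.Sum(nil))
	c.pending = nil
}

// simulate generates random messages from the seed, drives the chain for the
// given number of blocks, and returns the final state hash as hex.
func simulate(seed int64, blocks int) string {
	r := rand.New(rand.NewSource(seed))
	c := &manualChain{}
	for i := 0; i < blocks; i++ {
		n := 1 + r.Intn(3) // one to three messages per block
		for j := 0; j < n; j++ {
			msg := make([]byte, 8)
			binary.BigEndian.PutUint64(msg, r.Uint64())
			c.Submit(msg)
		}
		c.NextBlock()
	}
	return hex.EncodeToString(c.state[:])
}
```

Because every input to the hash comes from the seed or from the driver, two runs with the same seed end in the same state, which is the property that would make replaying a problematic message reliable.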

robert-zaremba · 2023-11-17T12:20:27Z

I see that this task has been closed. What is the decision and the future direction?

tac0turtle · 2023-11-17T13:32:46Z

We merged the modularity change, and Starship is working on integrating the interface.

tac0turtle added T: Tests T:feature-request T:Sprint labels Oct 17, 2023

tac0turtle added this to Cosmos-SDK Oct 17, 2023

github-project-automation bot moved this to 👀 To Do in Cosmos-SDK Oct 17, 2023

tac0turtle mentioned this issue Oct 17, 2023

testing package #13580

Closed

3 tasks

elias-orijtech mentioned this issue Oct 30, 2023

x/staking/keeper: AppHash depends on non-deterministic block time #18299

Closed

elias-orijtech mentioned this issue Nov 2, 2023

feat(sim): port simulator to run on top of testutil/network #17938

Closed

tac0turtle mentioned this issue Nov 7, 2023

refactor(tests): testutil.Network as an interface #18389

Merged

20 tasks

tac0turtle assigned Anmol1696 and tac0turtle Nov 7, 2023

tac0turtle moved this from 👀 To Do to 📚 In review in Cosmos-SDK Nov 7, 2023

tac0turtle closed this as completed in #18389 Nov 8, 2023

github-project-automation bot moved this from 📚 In review to 🥳 Done in Cosmos-SDK Nov 8, 2023

tac0turtle removed this from Cosmos-SDK Nov 14, 2023
