simulate: resource population #6015

joe-p · 2024-06-05T22:59:02Z

Summary

When a user calls simulate with UnnamedResources enabled, simulate should suggest to the user how they can populate the resource arrays in their transactions to properly send the transaction group to the network.

Test Plan

Test ResourcePopulator works with simple local (not group sharing) resources
Test ResourcePopulator with group sharing
Test ResourcePopulator resource limit detection with group sharing (ie. it is able to find the correct transaction to put a resource in)
Test Simulate with ResourcePopulator functionality
Test /simulate endpoint with ResourcePopulator functionality
Write smaller tests for better ledger/simulation/resources.go coverage

codecov · 2024-06-06T11:17:49Z

Codecov Report

Attention: Patch coverage is 89.17526% with 42 lines in your changes missing coverage. Please review.

Project coverage is 52.03%. Comparing base (d52e3dd) to head (989e746).
Report is 4 commits behind head on master.

Files with missing lines	Patch %	Lines
daemon/algod/api/server/v2/utils.go	0.00%	29 Missing ⚠️
ledger/simulation/resources.go	97.40%	5 Missing and 4 partials ⚠️
ledger/simulation/simulator.go	66.66%	2 Missing and 2 partials ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #6015      +/-   ##
==========================================
+ Coverage   51.84%   52.03%   +0.19%     
==========================================
  Files         643      643              
  Lines       86384    86772     +388     
==========================================
+ Hits        44783    45152     +369     
- Misses      38737    38753      +16     
- Partials     2864     2867       +3

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Co-authored-by: Jason Paulos <jasonpaulos@users.noreply.github.com>

Co-authored-by: Jason Paulos <jasonpaulos@users.noreply.github.com> Co-authored-by: John Jannotti <jannotti@gmail.com>

joe-p · 2024-10-09T13:36:22Z

In 292a9b9 I updated the API model to avoid using map[int] but for some reason it's not encoding the response properly: msgpack decode error [pos 50]: no matching struct field found when decoding stream map with key PopulatedResourceArrays.

If I print the raw response I see this:

��last-round��txn-groups��PopulatedResourceArrays��app-budget-added��app-budget-consumed��failed-at��failure-message��transaction SSG3ROSUBRSMXPTZYOORYAXCDYURLGM5OJJF5J3LMJ74LFEWJJAA: logic eval error: invalid Account reference CV6S42NRDBJZKQDDQPUVXSD4KUJ5FSYHXENR74ZPTTIEQASD2WBRBFODRY. Details: app=1006, pc=57, opcodes=store 2; load 0; balance�txn-results��app-budget-consumed��txn-result��pool-error��txn��sig�@۔sq��3b�l3��^�J��b�=�29�IұH=� �P��I��r��-e��A��}��q��?<��txn��apid��fee��fv��gh� ��Gp��y8�d��"��\>-c�P�2�P��݆~ݢlv��snd� ��ʏe�Lz�Y��W�[�+!��O��P�Фtype�appl�version

So for some reason PopulatedResourceArrays is present where I would expect it to be omitted (and if it was included I would expect it to be populated-resource-arrays) given the fact that it's defined as

// PreEncodedSimulateTxnResult mirrors model.SimulateTransactionResult
type PreEncodedSimulateTxnResult struct {
	Txn                      PreEncodedTxInfo                        `codec:"txn-result"`
	AppBudgetConsumed        *uint64                                 `codec:"app-budget-consumed,omitempty"`
	LogicSigBudgetConsumed   *uint64                                 `codec:"logic-sig-budget-consumed,omitempty"`
	TransactionTrace         *model.SimulationTransactionExecTrace   `codec:"exec-trace,omitempty"`
	UnnamedResourcesAccessed *model.SimulateUnnamedResourcesAccessed `codec:"unnamed-resources-accessed,omitempty"`
	FixedSigner              *string                                 `codec:"fixed-signer,omitempty"`
	PopulatedResourceArrays  *model.ResourceArrays                   `codec:"populated-resource-arrays,omitempty"`
}

Any ideas on what might be happening here?

jannotti · 2024-10-09T14:29:01Z

It almost seems like the codec line was completely ignored for encoding, since it has the default name and omitempty was ineffective. Yet, decoding was surprised to see the capitalized form. I don't know the context of your testing - is there any chance you encoded that bytestream before the codec line was added, then decoded it after?

kylebeee · 2024-10-09T16:22:56Z

At a glance it seems to me like you might have a bug somewhere where you're assigning *simulation.PopulatedResourceArrays to PreEncodedSimulateTxnResult.PopulatedResourceArrays instead of *model.ResourceArray but its not super clear where that would be happening & i dont know the go-algorand code base well enough to say definitively.

Where are you printing the raw response?

joe-p · 2025-01-17T12:35:28Z

Edit2: The error is actually in simulate. TestPopulateResources/mixed_resources is currently failing.

exmpty box ref is expected because the box size is 1025

joe-p · 2025-01-21T20:53:38Z

After having this on the backburner for awhile I've come back to working on this and discovered why I was slow to make progress once I started to implement the endpoint. I was making two mistakes

I was not building algod before running the e2e tests. In hindsight this seems obvious, but I was used to go test picking up the changes automatically for me. With the e2e tests the built algod is spawned as a seperate task, so any changes to algod need to be explicitly rebuilt.
The test cache was not being properly invalidated. Most likely because of the first problem, but I was running tests and getting incorrect cached results. This lead to me making changes that actually broke things but I was under the impression they were still working. This made debugging breaking changes harder because I was breaking things without realizing it (see 035ef72 fixed by 41d63dd )

Now with 41d63dd all tests are passing, although I am experiencing an intermittent issue with database tables being locked when testing, which is seemingly causing a tracked app to be missing

--- FAIL: TestPopulatorWithGlobalResources (0.00s)
    resources_test.go:431: 
                Error Trace:    /Users/joe/git/algorand/go-algorand/ledger/simulation/resources_test.go:431
                Error:          elements differ
                            
                                extra elements in list B:
                                ([]interface {}) (len=1) {
                                 (basics.AppIndex) 3
                                }
                            
                            
                                listA:
                                ([]basics.AppIndex) (len=2) {
                                 (basics.AppIndex) 11,
                                 (basics.AppIndex) 5
                                }
                            
                            
                                listB:
                                ([]basics.AppIndex) (len=3) {
                                 (basics.AppIndex) 5,
                                 (basics.AppIndex) 11,
                                 (basics.AppIndex) 3
                                }
                Test:           TestPopulatorWithGlobalResources
time="2025-01-21T15:46:24.630756 -0500" level=warning msg="db.LoggedRetry: 5 retries (last err: database table is locked: accountbase)" file=dbutil.go function=github.com/algorand/go-algorand/util/db.LoggedRetry line=171
time="2025-01-21T15:46:24.630995 -0500" level=warning msg="db.LoggedRetry: 6 retries (last err: database table is locked: accountbase)" file=dbutil.go function=github.com/algorand/go-algorand/util/db.LoggedRetry line=171
time="2025-01-21T15:46:24.631008 -0500" level=warning msg="db.LoggedRetry: 7 retries (last err: database table is locked: accountbase)" file=dbutil.go function=github.com/algorand/go-algorand/util/db.LoggedRetry line=171
time="2025-01-21T15:46:24.631220 -0500" level=warning msg="db.LoggedRetry: 8 retries (last err: database table is locked: acctrounds)" file=dbutil.go function=github.com/algorand/go-algorand/util/db.LoggedRetry line=171

Here is a gist showing the full output with 2/10 runs failing because of the above: https://gist.github.com/joe-p/860cf28908a99db2f58c5010cb378894

I have not yet tried to reproduce on the e2e tests, but I was running them extensively last week and never saw this issue.

Once this issue is resolved the only remaining work is to make some smaller unit tests to test the "bad" cases and make sure things fail gracefully.

as per the comment, these checks should never be needed due to the logic in eval context, but felt safer to add just in case

previously non appls weren't properly added to the populator, meaning their fields were not accounted for when checking for availability. This is actually probably fine since these sorts of duplicates should be prevented by the logic in evalcontext, but as mentioned in previous commits it feels safer to check here just in case

joe-p · 2025-01-31T12:07:58Z

I believe all comments have been addressed at this point and test coverage is near 100%. The only problem is I'm still occasionally getting database table is locked when running tests locally. So far it's only happened with TestPopulatorWithGlobalResources. I tried just running this test and disabling parallel testing but I'm still seeing the same error occasionally. I believe this is just a problem with the test harness so not sure if it should be considered a blocker or not. I'd be interested to know if others can replicate.

joe-p changed the title ~~Feat/populate_resources~~ resource population Jun 5, 2024

joe-p mentioned this pull request Jun 5, 2024

ResourcePopulator joe-p/go-algorand#2

Closed

ResourcePopulator

5ba0a9a

joe-p force-pushed the feat/populate_resources branch from 466fd50 to 5ba0a9a Compare June 5, 2024 23:06

joe-p changed the title ~~resource population~~ simulate: resource population Jun 5, 2024

TestPopulatorWithGlobalResources (arrays only)

028b8c4

joe-p added 14 commits June 6, 2024 07:30

test addBox

941ba6e

use ElementsMatch

a69091c

addHolding

d9e77ef

test variable renaming

4667647

appLocals

cbd1d8a

restore default limits

d79780f

fix rekey field

e6d59ba

populate with static properties and remove zeroAddr

54cd38c

empty boxes

673b03d

ensure duplicates are removed

ceddaa2

overflow txn resources

62213a0

use ConsensusParams

f616449

fix empty box count

6a4f1ee

golangci-lint

0524c3c

algorandskiy added Enhancement external contribution labels Jun 7, 2024

algorandskiy requested review from jasonpaulos and jannotti June 7, 2024 17:13

joe-p added 5 commits June 14, 2024 16:31

Merge branch 'master' into feat/populate_resources

73e5b4c

PopulateResourceArrays in simulate (untested)

6d8c161

populate from ResourceTracker

022c565

initial TestPopulateResources

93dd4e9

group sharing and no group sharing TestPopulateResources

3f78752

gmalouf assigned joe-p Sep 19, 2024

joe-p and others added 9 commits October 2, 2024 17:30

Apply suggestions from code review

96f1af1

Co-authored-by: Jason Paulos <jasonpaulos@users.noreply.github.com>

Apply suggestions from code review

02534ae

Co-authored-by: Jason Paulos <jasonpaulos@users.noreply.github.com> Co-authored-by: John Jannotti <jannotti@gmail.com>

PopulateResourceArrays -> PopulateResources

0815e29

static -> prefilled

b4f7f54

replace ifs with switch

e665817

hasAccount short circuit logic

c9e6e5d

check for room and return error in add... methods

2ea34f2

mixed resources test

93e7b81

use arrays in txn and group result rather than map[int] for API (WIP)

292a9b9

joe-p added 2 commits October 30, 2024 15:32

only populate resources when there's no error

035ef72

only make ExtraResourceArrays if any extra resources exist

daa0e1f

joe-p added 3 commits January 17, 2025 15:11

properly check for nil err

41d63dd

expect an empty box ref

7353379

exmpty box ref is expected because the box size is 1025

Merge branch 'master' into feat/populate_resources

64fd855

joe-p added 8 commits January 28, 2025 15:57

update extra resource arrays description

db6f01b

TestPopulatorWithAlreadyAvailableResources

affa449

TestPopulatorWithNoRoom

c4091e0

check all txns to see if they have a resource before adding

cf29efb

as per the comment, these checks should never be needed due to the logic in eval context, but felt safer to add just in case

test no room for empty box ref

bc58ba6

use consensus params for each max ref

e29877d

remove printlns

989e746

joe-p marked this pull request as ready for review January 31, 2025 12:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

simulate: resource population #6015

simulate: resource population #6015

joe-p commented Jun 5, 2024 •

edited

Loading

codecov bot commented Jun 6, 2024 •

edited

Loading

joe-p commented Oct 9, 2024 •

edited

Loading

jannotti commented Oct 9, 2024

kylebeee commented Oct 9, 2024

joe-p commented Jan 17, 2025 •

edited

Loading

joe-p commented Jan 21, 2025 •

edited

Loading

joe-p commented Jan 31, 2025

simulate: resource population #6015

Are you sure you want to change the base?

simulate: resource population #6015

Conversation

joe-p commented Jun 5, 2024 • edited Loading

Summary

Test Plan

codecov bot commented Jun 6, 2024 • edited Loading

Codecov Report

joe-p commented Oct 9, 2024 • edited Loading

jannotti commented Oct 9, 2024

kylebeee commented Oct 9, 2024

joe-p commented Jan 17, 2025 • edited Loading

joe-p commented Jan 21, 2025 • edited Loading

joe-p commented Jan 31, 2025

joe-p commented Jun 5, 2024 •

edited

Loading

codecov bot commented Jun 6, 2024 •

edited

Loading

joe-p commented Oct 9, 2024 •

edited

Loading

joe-p commented Jan 17, 2025 •

edited

Loading

joe-p commented Jan 21, 2025 •

edited

Loading