campaigns: skip changeset spec creation for cached empty diffs #397

LawnGnome · 2020-11-25T21:49:21Z

It's totally valid and normal for empty diffs to be created when
executing campaign specs: sometimes you just don't want anything to
change, even though the repo matched the initial query. When this
happens, #313 added a check that prevents the changeset spec from being
created, and print a verbose mode message indicating that the repo was
skipped:

https://sourcegraph.com/github.com/sourcegraph/src-cli@d29ad54eff678d96fb7ebdf75ff95890dce6a1cf/-/blob/internal/campaigns/executor.go?utm_source=VSCode-1.1.0#L273-278

So far, so good. In #374, we made our empty diff handling even better by
caching the empty diff: this means that we don't have to recalculate
that nothing happened. Unfortunately, the check that exists in the cache
miss code path to skip changeset spec creation doesn't exist in the
cache hit code path, which means that on subsequent applications of the
campaign, a changeset spec with an empty diff will be uploaded, and
gitserver will ultimately be very grumpy.

By applying the same logic to the cache hit code path, we can filter out
these problematic changeset specs.

It's totally valid and normal for empty diffs to be created when executing campaign specs: sometimes you just don't want anything to change, even though the repo matched the initial query. When this happens, #313 added a check that prevents the changeset spec from being created, and print a verbose mode message indicating that the repo was skipped: https://sourcegraph.com/github.com/sourcegraph/src-cli@d29ad54eff678d96fb7ebdf75ff95890dce6a1cf/-/blob/internal/campaigns/executor.go?utm_source=VSCode-1.1.0#L273-278 So far, so good. In #374, we made our empty diff handling even better by caching the empty diff: this means that we don't have to recalculate that nothing happened. Unfortunately, the check that exists in the cache miss code path to skip changeset spec creation doesn't exist in the cache hit code path, which means that on subsequent applications of the campaign, a changeset spec with an empty diff will be uploaded, and gitserver will ultimately be very grumpy. By applying the same logic to the cache hit code path, we can filter out these problematic changeset specs.

eseliger

Good catch. LGTM!

eseliger · 2020-11-25T21:54:21Z

internal/campaigns/executor.go

+			// send to the server. Instead, we can just report that the task is
+			// complete and move on.
+			if len(diff) == 0 {
+				status.FinishedAt = time.Now()


I think this line could also be moved up and deduped

I thought about it, but kept it there because createChangesetSpec() really marks the end of executing the task in the normal case, and I'd like to account for that time as well.

This also means that we run all the integration tests with cold and warm caches, which should help pick up these issues in future.

LawnGnome · 2020-11-25T23:26:09Z

Just asking for a quick re-review here to glance over the test changes, since they're moderately significant.

eseliger

Great work, I like that. Just a question re the test preventing this from happening again.

eseliger · 2020-11-25T23:31:52Z

internal/campaigns/executor_test.go

+			// No changesets should be generated.
+			wantFilesChanged: map[string][]string{},


I don't 100% understand how this tests the behavior is right but I trust you that it does what we look for. Regardless this is a good test.
I assume this would have failed before the fix because the hot run is going to yield a change? Even though it was empty? Not sure why it would show any files changed when there was an empty diff.

Each entry in wantFilesChanged represents a changeset spec. In this case, since true won't change anything, the test should hit the empty diff code paths and not generate changeset specs, even though the cache is populated.

This does indeed fail on the old main:

~/trees/sourcegraph/src-cli on  main via 🐹 v1.15.5 ❯ go test -race ./... ok github.com/sourcegraph/src-cli/cmd/src 0.224s ok github.com/sourcegraph/src-cli/internal/api (cached) --- FAIL: TestExecutor_Integration (0.46s) --- FAIL: TestExecutor_Integration/empty (0.04s) executor_test.go:189: wrong number of changeset specs. want=0, have=1 --- FAIL: TestExecutor_Integration/empty/warm_cache (0.00s) testing.go:1048: test executed panic(nil) or runtime.Goexit: subtest may have called FailNow on a parent test FAIL FAIL github.com/sourcegraph/src-cli/internal/campaigns 0.565s ? github.com/sourcegraph/src-cli/internal/campaigns/graphql [no test files] ok github.com/sourcegraph/src-cli/internal/codeintel (cached) ? github.com/sourcegraph/src-cli/internal/output [no test files] ok github.com/sourcegraph/src-cli/internal/servegit (cached) ? github.com/sourcegraph/src-cli/schema [no test files] FAIL

Ahh okay makes sense. Thanks for confirming!

The previous recommended version suffered from the bug fixed by sourcegraph/src-cli#397, so we'll want to move our users past that.

mrnugget · 2020-11-26T06:54:22Z

Ah man, what a bug! Thanks for fixing this so diligently!

* campaigns: skip changeset spec creation for cached empty diffs It's totally valid and normal for empty diffs to be created when executing campaign specs: sometimes you just don't want anything to change, even though the repo matched the initial query. When this happens, #313 added a check that prevents the changeset spec from being created, and print a verbose mode message indicating that the repo was skipped: https://sourcegraph.com/github.com/sourcegraph/src-cli@d29ad54eff678d96fb7ebdf75ff95890dce6a1cf/-/blob/internal/campaigns/executor.go?utm_source=VSCode-1.1.0#L273-278 So far, so good. In #374, we made our empty diff handling even better by caching the empty diff: this means that we don't have to recalculate that nothing happened. Unfortunately, the check that exists in the cache miss code path to skip changeset spec creation doesn't exist in the cache hit code path, which means that on subsequent applications of the campaign, a changeset spec with an empty diff will be uploaded, and gitserver will ultimately be very grumpy. By applying the same logic to the cache hit code path, we can filter out these problematic changeset specs. * Extend integration tests to cover the empty diff bug. This also means that we run all the integration tests with cold and warm caches, which should help pick up these issues in future.

LawnGnome requested a review from a team November 25, 2020 21:49

eseliger approved these changes Nov 25, 2020

View reviewed changes

LawnGnome added 2 commits November 25, 2020 15:13

Extend integration tests to cover the empty diff bug.

053b133

This also means that we run all the integration tests with cold and warm caches, which should help pick up these issues in future.

Fix data race in the test cache.

ca5631f

LawnGnome requested a review from eseliger November 25, 2020 23:25

eseliger approved these changes Nov 25, 2020

View reviewed changes

LawnGnome merged commit ff776f6 into main Nov 25, 2020

LawnGnome deleted the aharvey/skip-cached-empty-changesets branch November 25, 2020 23:39

LawnGnome mentioned this pull request Nov 26, 2020

src-cli: bump the minimum version and update docs sourcegraph/sourcegraph-public-snapshot#16187

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

campaigns: skip changeset spec creation for cached empty diffs #397

campaigns: skip changeset spec creation for cached empty diffs #397

LawnGnome commented Nov 25, 2020

eseliger left a comment

eseliger Nov 25, 2020

LawnGnome Nov 25, 2020

LawnGnome commented Nov 25, 2020

eseliger left a comment

eseliger Nov 25, 2020

LawnGnome Nov 25, 2020

eseliger Nov 25, 2020

mrnugget commented Nov 26, 2020

		// No changesets should be generated.
		wantFilesChanged: map[string][]string{},

campaigns: skip changeset spec creation for cached empty diffs #397

campaigns: skip changeset spec creation for cached empty diffs #397

Conversation

LawnGnome commented Nov 25, 2020

eseliger left a comment

Choose a reason for hiding this comment

eseliger Nov 25, 2020

Choose a reason for hiding this comment

LawnGnome Nov 25, 2020

Choose a reason for hiding this comment

LawnGnome commented Nov 25, 2020

eseliger left a comment

Choose a reason for hiding this comment

eseliger Nov 25, 2020

Choose a reason for hiding this comment

LawnGnome Nov 25, 2020

Choose a reason for hiding this comment

eseliger Nov 25, 2020

Choose a reason for hiding this comment

mrnugget commented Nov 26, 2020