-
Notifications
You must be signed in to change notification settings - Fork 413
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MCO-595: Remove MCO's pending config workflow #3700
MCO-595: Remove MCO's pending config workflow #3700
Conversation
@djoshy: This pull request references MCO-595 which is a valid jira issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
Skipping CI for Draft Pull Request. |
/test unit |
/test e2e-aws-ovn-upgrade |
/test e2e-gcp-op |
92fb3aa
to
db80178
Compare
/test unit |
/test e2e-gcp-op |
1 similar comment
/test e2e-gcp-op |
db80178
to
124a9f9
Compare
/test e2e-gcp-op |
2 similar comments
/test e2e-gcp-op |
/test e2e-gcp-op |
There's a lot of subtlety here. I love the goal. The current logic is immensely confusing and hard to understand. (This strongly relates to #1190 ) That said, IIRC the idea with "pending config" is to be a "transient state" between "current" and "desired". I don't see how we can entirely remove that...it seems to me that we at least need a single bit of state that says "we're trying to transition to the desired config"? |
So, we do have that already today, done in two ways:
And when we revisited the workflow, the two did essentially the exact same thing, so there was a duplication of effort, which was fine when things went well, but if anything errored, the journal entry made it exponentially harder to fix if it was e.g. referring to a non-existent hash, even a forcefile did not remove it. So we wanted to do either or: 1. have the journal or 2. have the file indicate the transient state. We think 2 is easier to manage and less prone to errors + helps remove the MCO's dependency on journal Hopefully that makes sense, we will try to make sure that what I just described above isn't broken (no regressions due to the removal of this) |
/test e2e-gcp-op |
124a9f9
to
4d38203
Compare
/test e2e-gcp-op |
4d38203
to
c3ad9be
Compare
/test verify |
/test e2e-gcp-op |
/test e2e-aws-ovn-upgrade |
c3ad9be
to
a3d6063
Compare
/test e2e-gcp-op |
/test e2e-aws-ovn-upgrade |
1 similar comment
/test e2e-aws-ovn-upgrade |
/hold |
a3d6063
to
3221924
Compare
/hold cancel |
/test e2e-aws-ovn-upgrade |
Putting hold for qe pre-merge testing |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Overall LGTM.
/approve
I am little concerned that this change hitting some corner case regression. But I do agree with Jerry's rationale.
As Jerry has more content of this issue, will be good to get your review as well.
/retest-required |
Verified using IPI on AWS. We run a whole regression but the hypershift test cases. All cases passed. We add the qe-approved label /label qe-approved |
/unhold /retest-required |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/lgtm
Thanks for chasing this down David!
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: djoshy, sinnykumari, yuqi-zhang The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
@djoshy: all tests passed! Full PR test history. Your PR dashboard. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
- What I did
Removed all references/checks for the journal pending config writes. In some cases, I replaced them with
desiredConfig
checks and in other cases removed them entirely.- How to verify it
Upgrades tests should work without any new complaints (:
- Description for the changelog
daemon: removed pending config checks