Fix checks processing for multiple connectors #351

linkous8 · 2021-10-21T18:46:40Z

- Set the initial ready value to false so we don't report ready when no checks are run
- Reduce all progressive check event results into a single list of checks instead of only processing the first result
- Refactor non-progressive check readiness accumulation to report unready if no checks are run or any connector responds with an empty list of checks
- Fix successful Prometheus check reporting, which caused errors when logged due to curly braces in the check names
- Fix Kubernetes checks being shadowed by opsani_dev checks by renaming check methods to produce unique check IDs
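The readiness rule described above can be sketched roughly as follows. This is a minimal illustration, not the servox implementation: the `Check` class and the `is_ready` function are hypothetical names, and the input is assumed to be one list of checks per connector.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Check:
    # Hypothetical stand-in for a check result; not the servox class.
    name: str
    success: bool


def is_ready(checks_by_connector: List[List[Check]]) -> bool:
    """Report ready only if every connector ran checks and all of them passed."""
    ready = False  # initial value: nothing has run yet, so not ready
    for checks in checks_by_connector:
        if not checks:
            # A connector that responds with an empty list counts as unready.
            return False
        ready = all(check.success for check in checks)
        if not ready:
            return False
    return ready
```

With this shape, an empty input leaves `ready` at its initial false value, which matches the first item in the list above.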


linear · 2021-10-21T18:46:42Z

ENG-503 Servox CLI check command does not process all checks

While testing the HPAPlus connector (ENG-466), I ran across an issue where the opsani_dev checks were not showing up in the output and all checks were being reported as passing, which prevented envoy-proxy from being injected into the target application.

Example output of this behavior is attached:

- In servo_log1.log you can see only the checks for the HPAPlus connector were reported on
- In servo_log2.log you can see only the checks for the Kubernetes connector were reported on

After digging into the check code, I think I've found the source of the issue here:

servox/servo/cli.py

Line 1331 in 53078f3

if result := next(iter(results), None):

The logic only processes check results for the first event result in the list and ignores any subsequent results, which means it only processes the checks for a single connector before considering all checks to have passed. I have a proposed fix, which I will push for testing shortly after this ticket is created.
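To illustrate the difference, here is a small sketch contrasting the first-result pattern quoted above with a reduction over every result. The `EventResult` class, its fields, and the check names are hypothetical stand-ins, not the servox types.

```python
from dataclasses import dataclass, field
from functools import reduce
from typing import List


@dataclass
class EventResult:
    # Hypothetical stand-in: one event result per connector.
    connector: str
    checks: List[str] = field(default_factory=list)


results = [
    EventResult("kubernetes", ["kubernetes_check_a"]),
    EventResult("opsani_dev", ["opsani_dev_check_b"]),
]

# First-result pattern: only the first connector's checks are ever seen.
first = next(iter(results), None)
first_only = first.checks if first else []

# Reduction over all results: every connector's checks end up in one list.
all_checks = reduce(lambda acc, result: acc + result.checks, results, [])
```

Here `first_only` contains only the kubernetes check, while `all_checks` contains the checks from both connectors.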

servo_log1.log

servo_log2.log


ekalosak

LGTM, if I got it right, the changes are:

Multiple checks work now
Curly brackets no longer kill logger, solved using the escaped_name
Naming convention changes in the connectors
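As a rough illustration of the curly bracket item, the sketch below shows how a name containing braces can break format-style templating and how doubling the braces avoids it. It assumes the logger applies `str.format`-style substitution to messages; the `escape_braces` helper and the sample query are hypothetical, and only the name `escaped_name` comes from the discussion.

```python
def escape_braces(name: str) -> str:
    """Double curly braces so format-style templating treats them literally."""
    return name.replace("{", "{{").replace("}", "}}")


# Hypothetical check name embedding a Prometheus-style query with a label matcher.
check_name = 'Run query "rate(requests_total{job="web"}[1m])"'

# Unescaped: the braces are parsed as a replacement field and formatting fails.
try:
    ("Check passed: " + check_name).format()
    raised = False
except (KeyError, IndexError, ValueError):
    raised = True

# Escaped: the doubled braces are rendered back as literal braces.
escaped_name = escape_braces(check_name)
message = ("Check passed: " + escaped_name).format()
```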

Questions:

Does the if results check have the side effects of iter or next on the results generator? Or is the truthiness check blissfully devoid of side effects?
Why use functools when you could do sum([b.value for b in results])?

linkous8 · 2021-10-25T20:11:24Z

Corrections:

> Multiple checks work now

Checks from multiple connectors work now

> Naming convention changes in the connectors

The check functions have been renamed to avoid duplicate IDs, but there is no enforcement of a naming convention for avoiding duplicate IDs yet (a task has been added to the backlog for this).
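One way duplicate IDs can lead to shadowing is sketched below. It assumes, purely for illustration, that a check ID is derived from the check function's name and that checks are indexed by ID; the actual ID generation in servox may differ, and `make_check`, `index_by_id`, and the function names are hypothetical.

```python
from typing import Callable, Dict, List


def make_check(name: str, connector: str) -> Callable[[], str]:
    """Build a hypothetical check function that reports which connector owns it."""
    def check() -> str:
        return connector
    check.__name__ = name
    return check


def index_by_id(checks: List[Callable[[], str]]) -> Dict[str, Callable[[], str]]:
    # Assumption: the ID is the function name, so later entries overwrite earlier ones.
    return {check.__name__: check for check in checks}


# Same function name in two connectors: one entry survives.
colliding = [
    make_check("check_deployment", "kubernetes"),
    make_check("check_deployment", "opsani_dev"),
]
shadowed = index_by_id(colliding)

# Connector-specific names: both entries survive.
renamed = [
    make_check("check_kubernetes_deployment", "kubernetes"),
    make_check("check_opsani_dev_deployment", "opsani_dev"),
]
unique = index_by_id(renamed)
```

Nothing in this sketch stops a future check from reusing a name, which is the missing enforcement mentioned above.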

Answers:

> Does the if results check have the side effects of iter or next on the results generator? Or is the truthiness check blissfully devoid of side effects?

You may need to elaborate more on what you mean by the side effects, but I think the answer is that the truthiness check of the results list is devoid of side effects. The next and iter calls being replaced were the logical equivalent of other languages' firstOrDefault methods, which was the source of this bug.
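For reference, a short sketch of how truthiness and `next(iter(...))` behave in plain Python, using sample values rather than servox data. On a list neither operation consumes anything; on a generator, truthiness is always true and consumes nothing, while `next` does advance it.

```python
results = ["a", "b"]

# Truthiness of a list only checks its length; nothing is consumed.
list_is_truthy = bool(results)

# iter() on a list creates a fresh iterator, so the list itself is unchanged.
first = next(iter(results), None)
list_after = list(results)

# A generator object is always truthy, even when it would yield nothing.
empty_gen_is_truthy = bool(x for x in [])

# Checking truthiness does not advance a generator, but next() does.
gen = (x for x in ["a", "b"])
gen_is_truthy = bool(gen)
consumed = next(gen, None)
remaining = list(gen)
```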

> Why use functools when you could do sum([b.value for b in results])?

Mainly for styling reasons. @blakewatters established a precedent for avoiding comprehensions in favor of map/filter/reduce calls, which I try to follow (I assume he finds it more readable that way).
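For comparison, the two styles under discussion can be sketched on plain booleans. The values below are sample data, not the objects from the PR, and whether `b.value` is a boolean in the real code is not confirmed here.

```python
from functools import reduce

values = [True, True, False]

# reduce style: fold the booleans together with a logical and.
reduced = reduce(lambda acc, value: acc and value, values, True)

# Built-in style: all() expresses the same accumulation directly.
with_all = all(values)

# sum() over booleans counts the True values instead of combining them.
passed_count = sum(values)
```

Note that `sum` answers a different question (how many passed) than `reduce` with a logical and or `all` (did everything pass).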

ekalosak · 2021-10-25T20:21:24Z

re Blake using map(reduce(lambda)), I think he's the only one... It goes against most Python style guides iirc. I think, unless he shows up himself to deny, this is a Life of Brian moment.

e.g. Google Python style guide

linkous8 requested review from blakewatters, rstarmer, ekalosak and DanielHHowell October 21, 2021 18:46

linkous8 added 3 commits October 21, 2021 14:07

Bump timeout on flakey test

409c2fd

Further bump timeout on flakey test

ff6e29b

Fix reduce of check results

b50cb74

linkous8 marked this pull request as draft October 22, 2021 16:40

linkous8 added 4 commits October 22, 2021 13:41

Fix error from logging check name with prom query

f47885c

Rename kubernetes checks to prevent check shadowing from redundant generated check ids

ec8de2d

Adhere to check naming requirements

0c4af73

Rename opsani_dev checks to differentiate their IDs from the kubernetes checks

14f94e3

linkous8 marked this pull request as ready for review October 25, 2021 17:18

Fix integration tests failing from renamed checks

bf2f4dc

- add "kubernetes" to kubernetes test check IDs
- add "opsani_dev" to opsani dev test check IDs
- undo timeout changes to xfailing test test_rollout_check_annotations

DanielHHowell approved these changes Oct 25, 2021


ekalosak reviewed Oct 25, 2021


Merge branch 'main' into fred/eng-503-servox-cli-check-command-does-not

b859c9e

linkous8 merged commit 1ff6ab1 into main Oct 26, 2021

linkous8 deleted the fred/eng-503-servox-cli-check-command-does-not branch October 26, 2021 16:51

linkous8 mentioned this pull request Oct 26, 2021

sanitized logging message #352

Closed
