Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Flaky Test] Configuration is not available after an update for more than 10 seconds #2742

Closed
peternied opened this issue May 4, 2023 · 7 comments
Assignees
Labels
bug Something isn't working flaky-test Flaky Test issue triaged Issues labeled as 'Triaged' have been reviewed and are deemed actionable.

Comments

@peternied
Copy link
Member

Many tests are seeing failures like the following

org.opensearch.OpenSearchException: java.util.concurrent.TimeoutException: Timeout after 10SECONDS while retrieving configuration for [CONFIG](index=.opendistro_security)

Here is an example list of tests that are impacted

Tests with failures:
 - org.opensearch.security.TransportUserInjectorIntegTest.testSecurityUserInjection
 - org.opensearch.security.dlic.rest.api.legacy.LegacyNodesDnApiTests.testNodesDnApiWithPermissions
 - org.opensearch.security.multitenancy.test.TenancyDefaultTenantTests.testDefaultTenantUpdate
 - org.opensearch.security.multitenancy.test.TenancyMultitenancyEnabledTests.testMultitenancyDisabled_endToEndTest

Research done by @willyborankin that seems to indicate there is a cleanup/setup issue

I played a bit with tests and found out that if you run first TenancyDefaultTenantTests and after that TenancyPrivateTenantEnabledTests the second one will fail in JetBrains. If you re-run tests afterwards they all green. It looks like a bug in the SingleClusterTest class I will try to take a look deeper.

#2557 (comment)

Looks like this could be impacting many different pull requests:

@github-actions github-actions bot added the untriaged Require the attention of the repository maintainers and may need to be prioritized label May 4, 2023
@peternied peternied added bug Something isn't working flaky-test Flaky Test issue and removed untriaged Require the attention of the repository maintainers and may need to be prioritized labels May 4, 2023
@peternied
Copy link
Member Author

@willyborankin Would you be willing to take this issue on? If so please assign it to yourself, otherwise let me know and I'll figure out what to do next

@willyborankin
Copy link
Collaborator

Sure. I can't assigned myself since I'm not a maintainer.

@peternied
Copy link
Member Author

Thanks, I've assigned you the issue

I can't assigned myself since I'm yet not a maintainer.

Fixed that for you

@stephen-crawford
Copy link
Contributor

Hi @willyborankin and @peternied, please let me know if you need a hand getting this sorted. I know @willyborankin is very capable but I am also happy to help if needed :)

@willyborankin
Copy link
Collaborator

willyborankin commented May 8, 2023

@peternied and @scrawfor99 found out the root cause of the test flakiness. The reason is that tennacy rest endpoint implementation. It updates the sec index on each node and after the index has been updated the listener attached to the index sends client notifications to other nodes :-). I fixed it but still has some questions around REST end point itself. I will open PR most probably today.

@davidlago davidlago added the triaged Issues labeled as 'Triaged' have been reviewed and are deemed actionable. label May 8, 2023
@peternied
Copy link
Member Author

@willyborankin Is this resolved?

@willyborankin
Copy link
Collaborator

@peternied Yes I think I can close it now right?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working flaky-test Flaky Test issue triaged Issues labeled as 'Triaged' have been reviewed and are deemed actionable.
Projects
None yet
Development

No branches or pull requests

4 participants