Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Zero-Garbage ContextInstances::forEach #37529

Merged
merged 1 commit into from
Dec 8, 2023

Conversation

franz1981
Copy link
Contributor

@franz1981 franz1981 commented Dec 5, 2023

This improvement is necessary due to #36626, which has introduced a very good thing ie bytecode-generated context containers, but it's causing an Set allocation in the hot-path, which eventually lead to calling System::identityHashCode as well (which is not known to be super efficient nor fast).

The cost of Arc in plaintext Techempower for org/jboss/resteasy/reactive/server/core/ResteasyReactiveRequestContext.close was at ~2.87% and has become 6.7%, which is ~ twice as costy as it was before the mentioned pr.

The flamegraphs shows clearly the problem, before:

image

The problem was that, with a single ctx instance, the default capacity of ConcurrentHashMap was an overkill (16), and forEach and clear were forcing to iterate among all map's slots, including null-ones, twice.

#36626 has introduced a big improvement on it, but:
image

Shows that:

  • an hash map (with default capacity) and an iterator are allocated in the hot path
  • given that HashSet's are backed by a map using the values as keys, this force the ctx instances to have their own equals/hashCode implementations ore they would rely on System::identityHashCode which is not great

This pr is "fixing" the "symptom" by providing an optimized forEach method given that ContextInstances is a container itself and I see no harm in this, but still:

  1. do we really need the getAllPresent method? If is used elsewhere in the hotpath, is probably a good idea to fix there them as well IMO...
  2. I have found ApplicationContextInstancesTest: I still need to write a new test for this additional API method, which use a ctx instances with more than 1 instance in
  3. the existing generated implementation still cause lock/unlock to happen while reading the instances, which "could" still be an hot-path: I'm not fixing problems which I cannot observe, but calling volatile stores (twice, to enter/exit) in the hot path have some non-negligible performance impact, which could be improved as well. I don't plan to send the fix for it in this PR.

@quarkus-bot quarkus-bot bot added the area/arc Issue related to ARC (dependency injection) label Dec 5, 2023
@franz1981 franz1981 force-pushed the zero_gc_req_ctx_destroy branch from 4b60011 to c206da7 Compare December 5, 2023 13:00
@franz1981
Copy link
Contributor Author

@mkouba wdyt? suggestions on how to make it right?

@mkouba
Copy link
Contributor

mkouba commented Dec 5, 2023

@mkouba wdyt? suggestions on how to make it right?

I like it! I don't think that we need a default implementation though. ContextInstances is not part of the public API.

do we really need the getAllPresent method? If is used elsewhere in the hotpath, is probably a good idea to fix there them as well IMO...

I think that we need it for some use case with context propagation but it shouldn't be used in a hot path.

I have found ApplicationContextInstancesTest: I still need to write a new test for this additional API method, which use a ctx instances with more than 1 instance in

+1

@franz1981 franz1981 force-pushed the zero_gc_req_ctx_destroy branch from c206da7 to 4e85930 Compare December 6, 2023 15:25
@quarkus-bot quarkus-bot bot added area/config area/core area/dependencies Pull requests that update a dependency file area/docstyle issues related for manual docstyle review area/documentation area/security area/vertx labels Dec 6, 2023
@franz1981 franz1981 force-pushed the zero_gc_req_ctx_destroy branch from 4e85930 to 098d895 Compare December 6, 2023 15:25
@franz1981
Copy link
Contributor Author

@mkouba @geoand ready to go! I've removed the default impl and re-used the method in a place where an existing test is using it. We should be fine now!

Copy link

github-actions bot commented Dec 6, 2023

🎊 PR Preview b7887aa has been successfully built and deployed to https://quarkus-pr-main-37529-preview.surge.sh/version/main/guides/

@franz1981
Copy link
Contributor Author

@mkouba @geoand verified via running Techempower plaintext that we've moved from 3.6 M req/sec to 4 M req/sec
and now the cost for org/jboss/resteasy/reactive/server/core/ResteasyReactiveRequestContext.close is now ~2%, and as expected the 3 atomic ops (2 lock/unlock pairs and req ctx invalidation using a compareAndSet) in the hot paths are the last ones to improve

image

@geoand
Copy link
Contributor

geoand commented Dec 6, 2023

Great!!!

This comment has been minimized.

@franz1981 franz1981 force-pushed the zero_gc_req_ctx_destroy branch from 098d895 to 3ad2ee4 Compare December 6, 2023 19:56
@mkouba
Copy link
Contributor

mkouba commented Dec 7, 2023

🌘 This workflow status is outdated as a new workflow run has been triggered.

⚠️ Unable to include the stracktraces as the report was too long. See annotations below for the details. ⚠️ Unable to include the failure links as the report was too long. See annotations below for the details.

Failing Jobs - Building 098d895

Status Name Step Failures Logs Raw logs Build scan
✖ JVM Tests - JDK 17 Build Failures Logs Raw logs 🚧
✖ JVM Tests - JDK 17 Windows Build Failures Logs Raw logs 🚧
✖ JVM Tests - JDK 21 Build Failures Logs Raw logs 🚧

Failures

⚙️ JVM Tests - JDK 17 #

Just out of curiosity - what caused these failures?

@mkouba mkouba added the triage/waiting-for-ci Ready to merge when CI successfully finishes label Dec 7, 2023
@franz1981
Copy link
Contributor Author

@mkouba
It was due to https://github.com/quarkusio/quarkus/pull/37529/files#diff-3343cebc078d4441d20bca2600f2139c3b986ee734244472d40c52964758428dR44 which wasn't implemented like that but just iterating the values (without checking if present).

@franz1981 franz1981 force-pushed the zero_gc_req_ctx_destroy branch from 3ad2ee4 to eb8c8a7 Compare December 7, 2023 16:07
Copy link

quarkus-bot bot commented Dec 8, 2023

✔️ The latest workflow run for the pull request has completed successfully.

It should be safe to merge provided you have a look at the other checks in the summary.

@mkouba mkouba merged commit 653d5cc into quarkusio:main Dec 8, 2023
49 checks passed
@quarkus-bot quarkus-bot bot removed the triage/waiting-for-ci Ready to merge when CI successfully finishes label Dec 8, 2023
@quarkus-bot quarkus-bot bot added this to the 3.7 - main milestone Dec 8, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/arc Issue related to ARC (dependency injection) area/config area/core area/dependencies Pull requests that update a dependency file area/docstyle issues related for manual docstyle review area/documentation area/security area/vertx
Projects
Development

Successfully merging this pull request may close these issues.

3 participants