Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow user to provide desired percentiles for latency/other metrics #435

Closed
peteralfonsi opened this issue Jan 4, 2024 · 3 comments · Fixed by #441
Closed

Allow user to provide desired percentiles for latency/other metrics #435

peteralfonsi opened this issue Jan 4, 2024 · 3 comments · Fixed by #441
Assignees
Labels
enhancement New feature or request

Comments

@peteralfonsi
Copy link
Contributor

Is your feature request related to a problem? Please describe.

Currently, OSB only provides p50, 90, 99, 99.9, 99.99, and 100 for latency at the end of a workload. But, when adding new features to OpenSearch that might affect performance, we often also want to see other percentiles, like p0, p10, and p25.

To get more granular percentiles you have to publish metrics to a separate datastore and compute percentiles for them later, which adds a lot of complexity. If the user could provide a list of percentiles they'd like to see when running a workload through the CLI, and those percentiles were printed afterwards alongside the existing percentiles, this use case would be simpler.

Describe the solution you'd like

We could add some command-line flag, like --latency-percentiles, with a comma-separated list of additional percentiles the user wants. At the end of the run, OSB would display these percentiles as well as the default ones.

Describe alternatives you've considered

In #199, a more general solution was proposed, where users could define their own metrics. This solution would be simpler, but only covers the case where people want to see more percentiles for existing metrics.

We could also just add p0, p10, and p25 by default, but these might not be useful for everyone, and some users might want different percentiles than these.

Additional context

This would be a simpler sub-issue of #199, which is not currently being worked on.

It's also similar to #261, which requests more percentiles for throughput metrics. I don't think this is currently being worked on.

@peteralfonsi peteralfonsi added the enhancement New feature or request label Jan 4, 2024
@peteralfonsi
Copy link
Contributor Author

I would like to work on this if people think it's a good idea

@kkhatua
Copy link
Member

kkhatua commented Jan 4, 2024

These can be very useful in case of measuring cache performances where lower latencies can show greater improvements if some new cache acceleration features have been introduced.

@IanHoang
Copy link
Collaborator

Thanks for bringing attention to this @peteralfonsi. This will definitely be helpful. It might be better to have --latency-percentiles just override the default percentiles instead of appending them to the default ones. For examples, if users provide --latency-percentiles=0,10,25,50, the default will be overridden and only percentiles p0, p10, p25, and p50 will show instead of p0, p10, p25, p50, p90, p99, p99.9, p99.99, and p100. Let me know your thoughts.

Feel free to cut an implementation for this! Once this is addressed, we could use the same approach for throughput metric in #261.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
3 participants