-
Notifications
You must be signed in to change notification settings - Fork 80
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Capture and report replication lag metric #339
Comments
I will work on this feature request to add new telemeter device of segment replication statistic by adding codes to the file https://github.com/opensearch-project/opensearch-benchmark/blob/1.0.0/osbenchmark/telemetry.py#L43 to capture response of |
Please raise a PR when your work is complete. Thanks. |
Temporarily put the code I wrote here tlfeng@f2eaf20
|
The above mysterious error got solved after restarting the system, it might be an issue of the python environment. 😅 The existing code for "telemetry device" of "legacy" searchable snapshots stats and new ccr stats are direct reference. |
There are 2 problems in the above PR #346.
I also find out that adding Examples of the output:
|
Is your feature request related to a problem? Please describe.
OpenSearch has recently launched Segment Replication. Segrep includes a new API exposing metrics specific to replication performance. The most critical of these metrics is replication lag, which measures the time between a primary refreshing on a new set of segments and a replica refreshing on the same set of segments. It would be valuable to capture and report on this metric when benchmarking using the segment replication strategy.
The service currently returns the last completed lag and any ongoing lag (shard is currently syncing to a new set of segments) through its /_cat/segment_replication API.
Describe the solution you'd like
Capture and report the min/max/avg lags for a benchmark run.
Describe alternatives you've considered
Aggregate the metrics service side and invoke the API to fetch after a benchmark is completed.
Additional context
https://opensearch.org/docs/latest/api-reference/cat/cat-segment-replication/
The text was updated successfully, but these errors were encountered: