Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fixed client processing time #462

Merged

Conversation

saimedhi
Copy link
Contributor

Description

Fixed client processing time

Issues Resolved

Fixed an issue raised with client processing time addition.
#450 (comment)

Testing

[Describe how this change was tested]
I tested it in most of the cases where OpenSearch call fails.


By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Signed-off-by: saimedhi <saimedhi@amazon.com>
@saimedhi
Copy link
Contributor Author

@IanHoang, @gkamat Please take a look. This PR fixes #450 (comment)

@saimedhi
Copy link
Contributor Author

sample benchmark result when there is an error in force-merge

 opensearch-benchmark % opensearch-benchmark execute-test --distribution-version=2.11.1 --workload=geonames --test-mode --kill-running-processes

   ____                  _____                      __       ____                  __                         __
  / __ \____  ___  ____ / ___/___  ____ ___________/ /_     / __ )___  ____  _____/ /_  ____ ___  ____ ______/ /__
 / / / / __ \/ _ \/ __ \\__ \/ _ \/ __ `/ ___/ ___/ __ \   / __  / _ \/ __ \/ ___/ __ \/ __ `__ \/ __ `/ ___/ //_/
/ /_/ / /_/ /  __/ / / /__/ /  __/ /_/ / /  / /__/ / / /  / /_/ /  __/ / / / /__/ / / / / / / / / /_/ / /  / ,<
\____/ .___/\___/_/ /_/____/\___/\__,_/_/   \___/_/ /_/  /_____/\___/_/ /_/\___/_/ /_/_/ /_/ /_/\__,_/_/  /_/|_|
    /_/

[INFO] Preparing for test execution ...
[INFO] Executing test with workload [geonames], test_procedure [append-no-conflicts] and provision_config_instance ['defaults'] with version [2.11.1].

[WARNING] merges_total_time is 37 ms indicating that the cluster is not in a defined clean state. Recorded index time metrics may be misleading.
[WARNING] indexing_total_time is 1631 ms indicating that the cluster is not in a defined clean state. Recorded index time metrics may be misleading.
[WARNING] refresh_total_time is 2419 ms indicating that the cluster is not in a defined clean state. Recorded index time metrics may be misleading.
[WARNING] flush_total_time is 1190 ms indicating that the cluster is not in a defined clean state. Recorded index time metrics may be misleading.
Running delete-index                                                           [100% done]
Running create-index                                                           [100% done]
Running check-cluster-health                                                   [100% done]
Running index-append                                                           [100% done]
Running refresh-after-index                                                    [100% done]
[ERROR] no handler found for uri [/geonames/_forcemerges] and method [POST] ({'error': 'no handler found for uri [/geonames/_forcemerges] and method [POST]'})
Running force-merge                                                            [100% done]
Running refresh-after-force-merge                                              [100% done]
Running wait-until-merges-finish                                               [100% done]
Running index-stats                                                            [100% done]
Running node-stats                                                             [100% done]
Running default                                                                [100% done]
Running term                                                                   [100% done]
Running phrase                                                                 [100% done]
Running country_agg_uncached                                                   [100% done]
Running country_agg_cached                                                     [100% done]
Running scroll                                                                 [100% done]
Running expression                                                             [100% done]
Running painless_static                                                        [100% done]
Running painless_dynamic                                                       [100% done]
Running decay_geo_gauss_function_score                                         [100% done]

@IanHoang
Copy link
Collaborator

@saimedhi For future reference, please provide clearer context in PR description for others who might not have full-context on what this PR fixes. Could you provide the following as well:

  • summary results for the test ran above
  • benchmark-metrics-* document that is associated with the force merge operation

@saimedhi
Copy link
Contributor Author

@saimedhi For future reference, please provide clearer context in PR description for others who might not have full-context on what this PR fixes. Could you provide the following as well:

  • summary results for the test ran above
  • benchmark-metrics-* document that is associated with the force merge operation

summary results(created failure in forcemerge calls by making changes in opensearch-py client used in benchmarking).

opensearch-benchmark execute-test --distribution-version=2.11.1 --workload=geonames --test-mode --kill-running-processes

   ____                  _____                      __       ____                  __                         __
  / __ \____  ___  ____ / ___/___  ____ ___________/ /_     / __ )___  ____  _____/ /_  ____ ___  ____ ______/ /__
 / / / / __ \/ _ \/ __ \\__ \/ _ \/ __ `/ ___/ ___/ __ \   / __  / _ \/ __ \/ ___/ __ \/ __ `__ \/ __ `/ ___/ //_/
/ /_/ / /_/ /  __/ / / /__/ /  __/ /_/ / /  / /__/ / / /  / /_/ /  __/ / / / /__/ / / / / / / / / /_/ / /  / ,<
\____/ .___/\___/_/ /_/____/\___/\__,_/_/   \___/_/ /_/  /_____/\___/_/ /_/\___/_/ /_/_/ /_/ /_/\__,_/_/  /_/|_|
    /_/

[INFO] Preparing for test execution ...
[INFO] Executing test with workload [geonames], test_procedure [append-no-conflicts] and provision_config_instance ['defaults'] with version [2.11.1].

[WARNING] merges_total_time is 37 ms indicating that the cluster is not in a defined clean state. Recorded index time metrics may be misleading.
[WARNING] indexing_total_time is 1548 ms indicating that the cluster is not in a defined clean state. Recorded index time metrics may be misleading.
[WARNING] refresh_total_time is 2350 ms indicating that the cluster is not in a defined clean state. Recorded index time metrics may be misleading.
[WARNING] flush_total_time is 1120 ms indicating that the cluster is not in a defined clean state. Recorded index time metrics may be misleading.
Running delete-index                                                           [100% done]
Running create-index                                                           [100% done]
Running check-cluster-health                                                   [100% done]
Running index-append                                                           [100% done]
Running refresh-after-index                                                    [100% done]
[ERROR] no handler found for uri [/geonames/_forcemerges] and method [POST] ({'error': 'no handler found for uri [/geonames/_forcemerges] and method [POST]'})
Running force-merge                                                            [100% done]
Running refresh-after-force-merge                                              [100% done]
Running wait-until-merges-finish                                               [100% done]
Running index-stats                                                            [100% done]
Running node-stats                                                             [100% done]
Running default                                                                [100% done]
Running term                                                                   [100% done]
Running phrase                                                                 [100% done]
Running country_agg_uncached                                                   [100% done]
Running country_agg_cached                                                     [100% done]
Running scroll                                                                 [100% done]
Running expression                                                             [100% done]
Running painless_static                                                        [100% done]
Running painless_dynamic                                                       [100% done]
Running decay_geo_gauss_function_score                                         [100% done]
Running decay_geo_gauss_script_score                                           [100% done]
Running field_value_function_score                                             [100% done]
Running field_value_script_score                                               [100% done]
Running large_terms                                                            [100% done]
Running large_filtered_terms                                                   [100% done]
Running large_prohibited_terms                                                 [100% done]
Running desc_sort_population                                                   [100% done]
Running asc_sort_population                                                    [100% done]
Running asc_sort_with_after_population                                         [100% done]
Running desc_sort_geonameid                                                    [100% done]
Running desc_sort_with_after_geonameid                                         [100% done]
Running asc_sort_geonameid                                                     [100% done]
Running asc_sort_with_after_geonameid                                          [100% done]

------------------------------------------------------
    _______             __   _____
   / ____(_)___  ____ _/ /  / ___/_________  ________
  / /_  / / __ \/ __ `/ /   \__ \/ ___/ __ \/ ___/ _ \
 / __/ / / / / / /_/ / /   ___/ / /__/ /_/ / /  /  __/
/_/   /_/_/ /_/\__,_/_/   /____/\___/\____/_/   \___/
------------------------------------------------------
            
|                                                         Metric |                           Task |       Value |    Unit |
|---------------------------------------------------------------:|-------------------------------:|------------:|--------:|
|                     Cumulative indexing time of primary shards |                                |     0.02725 |     min |
|             Min cumulative indexing time across primary shards |                                |           0 |     min |
|          Median cumulative indexing time across primary shards |                                | 0.000433333 |     min |
|             Max cumulative indexing time across primary shards |                                |  0.00406667 |     min |
|            Cumulative indexing throttle time of primary shards |                                |           0 |     min |
|    Min cumulative indexing throttle time across primary shards |                                |           0 |     min |
| Median cumulative indexing throttle time across primary shards |                                |           0 |     min |
|    Max cumulative indexing throttle time across primary shards |                                |           0 |     min |
|                        Cumulative merge time of primary shards |                                | 0.000616667 |     min |
|                       Cumulative merge count of primary shards |                                |           6 |         |
|                Min cumulative merge time across primary shards |                                |           0 |     min |
|             Median cumulative merge time across primary shards |                                |           0 |     min |
|                Max cumulative merge time across primary shards |                                |     0.00015 |     min |
|               Cumulative merge throttle time of primary shards |                                |           0 |     min |
|       Min cumulative merge throttle time across primary shards |                                |           0 |     min |
|    Median cumulative merge throttle time across primary shards |                                |           0 |     min |
|       Max cumulative merge throttle time across primary shards |                                |           0 |     min |
|                      Cumulative refresh time of primary shards |                                |   0.0391167 |     min |
|                     Cumulative refresh count of primary shards |                                |       10867 |         |
|              Min cumulative refresh time across primary shards |                                |           0 |     min |
|           Median cumulative refresh time across primary shards |                                | 0.000283333 |     min |
|              Max cumulative refresh time across primary shards |                                |  0.00908333 |     min |
|                        Cumulative flush time of primary shards |                                |  0.00143333 |     min |
|                       Cumulative flush count of primary shards |                                |          43 |         |
|                Min cumulative flush time across primary shards |                                |           0 |     min |
|             Median cumulative flush time across primary shards |                                |           0 |     min |
|                Max cumulative flush time across primary shards |                                |  0.00143333 |     min |
|                                        Total Young Gen GC time |                                |       0.002 |       s |
|                                       Total Young Gen GC count |                                |           1 |         |
|                                          Total Old Gen GC time |                                |           0 |       s |
|                                         Total Old Gen GC count |                                |           0 |         |
|                                                     Store size |                                |  0.00159869 |      GB |
|                                                  Translog size |                                | 0.000320831 |      GB |
|                                         Heap used for segments |                                |           0 |      MB |
|                                       Heap used for doc values |                                |           0 |      MB |
|                                            Heap used for terms |                                |           0 |      MB |
|                                            Heap used for norms |                                |           0 |      MB |
|                                           Heap used for points |                                |           0 |      MB |
|                                    Heap used for stored fields |                                |           0 |      MB |
|                                                  Segment count |                                |          51 |         |
|                                                 Min Throughput |                   index-append |     10374.8 |  docs/s |
|                                                Mean Throughput |                   index-append |     10374.8 |  docs/s |
|                                              Median Throughput |                   index-append |     10374.8 |  docs/s |
|                                                 Max Throughput |                   index-append |     10374.8 |  docs/s |
|                                        50th percentile latency |                   index-append |     82.1728 |      ms |
|                                       100th percentile latency |                   index-append |     92.9563 |      ms |
|                                   50th percentile service time |                   index-append |     82.1728 |      ms |
|                                  100th percentile service time |                   index-append |     92.9563 |      ms |
|                                                     error rate |                   index-append |           0 |       % |
|                                       100th percentile latency |                    force-merge |     4.29933 |      ms |
|                                  100th percentile service time |                    force-merge |     4.29933 |      ms |
|                                                     error rate |                    force-merge |         100 |       % |
|                                                 Min Throughput |       wait-until-merges-finish |      131.99 |   ops/s |
|                                                Mean Throughput |       wait-until-merges-finish |      131.99 |   ops/s |
|                                              Median Throughput |       wait-until-merges-finish |      131.99 |   ops/s |
|                                                 Max Throughput |       wait-until-merges-finish |      131.99 |   ops/s |
|                                       100th percentile latency |       wait-until-merges-finish |     7.10388 |      ms |
|                                  100th percentile service time |       wait-until-merges-finish |     7.10388 |      ms |
|                                                     error rate |       wait-until-merges-finish |           0 |       % |
|                                                 Min Throughput |                    index-stats |      138.93 |   ops/s |
|                                                Mean Throughput |                    index-stats |      138.93 |   ops/s |
|                                              Median Throughput |                    index-stats |      138.93 |   ops/s |
|                                                 Max Throughput |                    index-stats |      138.93 |   ops/s |
|                                       100th percentile latency |                    index-stats |     11.6119 |      ms |
|                                  100th percentile service time |                    index-stats |     3.23237 |      ms |
|                                                     error rate |                    index-stats |           0 |       % |
|                                                 Min Throughput |                     node-stats |      124.81 |   ops/s |
|                                                Mean Throughput |                     node-stats |      124.81 |   ops/s |
|                                              Median Throughput |                     node-stats |      124.81 |   ops/s |
|                                                 Max Throughput |                     node-stats |      124.81 |   ops/s |
|                                       100th percentile latency |                     node-stats |     11.6207 |      ms |
|                                  100th percentile service time |                     node-stats |     3.02667 |      ms |
|                                                     error rate |                     node-stats |           0 |       % |
|                                                 Min Throughput |                        default |      182.38 |   ops/s |
|                                                Mean Throughput |                        default |      182.38 |   ops/s |
|                                              Median Throughput |                        default |      182.38 |   ops/s |
|                                                 Max Throughput |                        default |      182.38 |   ops/s |
|                                       100th percentile latency |                        default |     8.46829 |      ms |
|                                  100th percentile service time |                        default |     2.74842 |      ms |
|                                                     error rate |                        default |           0 |       % |
|                                                 Min Throughput |                           term |      191.53 |   ops/s |
|                                                Mean Throughput |                           term |      191.53 |   ops/s |
|                                              Median Throughput |                           term |      191.53 |   ops/s |
|                                                 Max Throughput |                           term |      191.53 |   ops/s |
|                                       100th percentile latency |                           term |     7.86704 |      ms |
|                                  100th percentile service time |                           term |     2.34917 |      ms |
|                                                     error rate |                           term |           0 |       % |
|                                                 Min Throughput |                         phrase |      192.61 |   ops/s |
|                                                Mean Throughput |                         phrase |      192.61 |   ops/s |
|                                              Median Throughput |                         phrase |      192.61 |   ops/s |
|                                                 Max Throughput |                         phrase |      192.61 |   ops/s |
|                                       100th percentile latency |                         phrase |     7.89125 |      ms |
|                                  100th percentile service time |                         phrase |     2.42783 |      ms |
|                                                     error rate |                         phrase |           0 |       % |
|                                                 Min Throughput |           country_agg_uncached |      179.71 |   ops/s |
|                                                Mean Throughput |           country_agg_uncached |      179.71 |   ops/s |
|                                              Median Throughput |           country_agg_uncached |      179.71 |   ops/s |
|                                                 Max Throughput |           country_agg_uncached |      179.71 |   ops/s |
|                                       100th percentile latency |           country_agg_uncached |       7.691 |      ms |
|                                  100th percentile service time |           country_agg_uncached |     1.80508 |      ms |
|                                                     error rate |           country_agg_uncached |           0 |       % |
|                                                 Min Throughput |             country_agg_cached |      172.78 |   ops/s |
|                                                Mean Throughput |             country_agg_cached |      172.78 |   ops/s |
|                                              Median Throughput |             country_agg_cached |      172.78 |   ops/s |
|                                                 Max Throughput |             country_agg_cached |      172.78 |   ops/s |
|                                       100th percentile latency |             country_agg_cached |     7.59642 |      ms |
|                                  100th percentile service time |             country_agg_cached |     1.48329 |      ms |
|                                                     error rate |             country_agg_cached |           0 |       % |
|                                                 Min Throughput |                         scroll |      109.95 | pages/s |
|                                                Mean Throughput |                         scroll |      109.95 | pages/s |
|                                              Median Throughput |                         scroll |      109.95 | pages/s |
|                                                 Max Throughput |                         scroll |      109.95 | pages/s |
|                                       100th percentile latency |                         scroll |      26.601 |      ms |
|                                  100th percentile service time |                         scroll |     7.24937 |      ms |
|                                                     error rate |                         scroll |           0 |       % |
|                                                 Min Throughput |                     expression |      163.48 |   ops/s |
|                                                Mean Throughput |                     expression |      163.48 |   ops/s |
|                                              Median Throughput |                     expression |      163.48 |   ops/s |
|                                                 Max Throughput |                     expression |      163.48 |   ops/s |
|                                       100th percentile latency |                     expression |     8.73983 |      ms |
|                                  100th percentile service time |                     expression |     2.29988 |      ms |
|                                                     error rate |                     expression |           0 |       % |
|                                                 Min Throughput |                painless_static |      172.43 |   ops/s |
|                                                Mean Throughput |                painless_static |      172.43 |   ops/s |
|                                              Median Throughput |                painless_static |      172.43 |   ops/s |
|                                                 Max Throughput |                painless_static |      172.43 |   ops/s |
|                                       100th percentile latency |                painless_static |     8.20821 |      ms |
|                                  100th percentile service time |                painless_static |     2.08642 |      ms |
|                                                     error rate |                painless_static |           0 |       % |
|                                                 Min Throughput |               painless_dynamic |       219.1 |   ops/s |
|                                                Mean Throughput |               painless_dynamic |       219.1 |   ops/s |
|                                              Median Throughput |               painless_dynamic |       219.1 |   ops/s |
|                                                 Max Throughput |               painless_dynamic |       219.1 |   ops/s |
|                                       100th percentile latency |               painless_dynamic |     6.88217 |      ms |
|                                  100th percentile service time |               painless_dynamic |     1.97271 |      ms |
|                                                     error rate |               painless_dynamic |           0 |       % |
|                                                 Min Throughput | decay_geo_gauss_function_score |      190.84 |   ops/s |
|                                                Mean Throughput | decay_geo_gauss_function_score |      190.84 |   ops/s |
|                                              Median Throughput | decay_geo_gauss_function_score |      190.84 |   ops/s |
|                                                 Max Throughput | decay_geo_gauss_function_score |      190.84 |   ops/s |
|                                       100th percentile latency | decay_geo_gauss_function_score |       7.153 |      ms |
|                                  100th percentile service time | decay_geo_gauss_function_score |     1.57658 |      ms |
|                                                     error rate | decay_geo_gauss_function_score |           0 |       % |
|                                                 Min Throughput |   decay_geo_gauss_script_score |      123.59 |   ops/s |
|                                                Mean Throughput |   decay_geo_gauss_script_score |      123.59 |   ops/s |
|                                              Median Throughput |   decay_geo_gauss_script_score |      123.59 |   ops/s |
|                                                 Max Throughput |   decay_geo_gauss_script_score |      123.59 |   ops/s |
|                                       100th percentile latency |   decay_geo_gauss_script_score |     10.1988 |      ms |
|                                  100th percentile service time |   decay_geo_gauss_script_score |     1.81088 |      ms |
|                                                     error rate |   decay_geo_gauss_script_score |           0 |       % |
|                                                 Min Throughput |     field_value_function_score |      166.09 |   ops/s |
|                                                Mean Throughput |     field_value_function_score |      166.09 |   ops/s |
|                                              Median Throughput |     field_value_function_score |      166.09 |   ops/s |
|                                                 Max Throughput |     field_value_function_score |      166.09 |   ops/s |
|                                       100th percentile latency |     field_value_function_score |     8.19325 |      ms |
|                                  100th percentile service time |     field_value_function_score |     1.86071 |      ms |
|                                                     error rate |     field_value_function_score |           0 |       % |
|                                                 Min Throughput |       field_value_script_score |      173.76 |   ops/s |
|                                                Mean Throughput |       field_value_script_score |      173.76 |   ops/s |
|                                              Median Throughput |       field_value_script_score |      173.76 |   ops/s |
|                                                 Max Throughput |       field_value_script_score |      173.76 |   ops/s |
|                                       100th percentile latency |       field_value_script_score |     9.59471 |      ms |
|                                  100th percentile service time |       field_value_script_score |     3.52008 |      ms |
|                                                     error rate |       field_value_script_score |           0 |       % |
|                                                 Min Throughput |                    large_terms |       10.38 |   ops/s |
|                                                Mean Throughput |                    large_terms |       10.38 |   ops/s |
|                                              Median Throughput |                    large_terms |       10.38 |   ops/s |
|                                                 Max Throughput |                    large_terms |       10.38 |   ops/s |
|                                       100th percentile latency |                    large_terms |     183.418 |      ms |
|                                  100th percentile service time |                    large_terms |      79.582 |      ms |
|                                                     error rate |                    large_terms |           0 |       % |
|                                                 Min Throughput |           large_filtered_terms |       16.07 |   ops/s |
|                                                Mean Throughput |           large_filtered_terms |       16.07 |   ops/s |
|                                              Median Throughput |           large_filtered_terms |       16.07 |   ops/s |
|                                                 Max Throughput |           large_filtered_terms |       16.07 |   ops/s |
|                                       100th percentile latency |           large_filtered_terms |     110.861 |      ms |
|                                  100th percentile service time |           large_filtered_terms |     42.0966 |      ms |
|                                                     error rate |           large_filtered_terms |           0 |       % |
|                                                 Min Throughput |         large_prohibited_terms |       10.45 |   ops/s |
|                                                Mean Throughput |         large_prohibited_terms |       10.45 |   ops/s |
|                                              Median Throughput |         large_prohibited_terms |       10.45 |   ops/s |
|                                                 Max Throughput |         large_prohibited_terms |       10.45 |   ops/s |
|                                       100th percentile latency |         large_prohibited_terms |     177.663 |      ms |
|                                  100th percentile service time |         large_prohibited_terms |     75.8065 |      ms |
|                                                     error rate |         large_prohibited_terms |           0 |       % |
|                                                 Min Throughput |           desc_sort_population |      283.35 |   ops/s |
|                                                Mean Throughput |           desc_sort_population |      283.35 |   ops/s |
|                                              Median Throughput |           desc_sort_population |      283.35 |   ops/s |
|                                                 Max Throughput |           desc_sort_population |      283.35 |   ops/s |
|                                       100th percentile latency |           desc_sort_population |     5.90892 |      ms |
|                                  100th percentile service time |           desc_sort_population |      2.2015 |      ms |
|                                                     error rate |           desc_sort_population |           0 |       % |
|                                                 Min Throughput |            asc_sort_population |      125.34 |   ops/s |
|                                                Mean Throughput |            asc_sort_population |      125.34 |   ops/s |
|                                              Median Throughput |            asc_sort_population |      125.34 |   ops/s |
|                                                 Max Throughput |            asc_sort_population |      125.34 |   ops/s |
|                                       100th percentile latency |            asc_sort_population |     12.8404 |      ms |
|                                  100th percentile service time |            asc_sort_population |     4.58783 |      ms |
|                                                     error rate |            asc_sort_population |           0 |       % |
|                                                 Min Throughput | asc_sort_with_after_population |      247.17 |   ops/s |
|                                                Mean Throughput | asc_sort_with_after_population |      247.17 |   ops/s |
|                                              Median Throughput | asc_sort_with_after_population |      247.17 |   ops/s |
|                                                 Max Throughput | asc_sort_with_after_population |      247.17 |   ops/s |
|                                       100th percentile latency | asc_sort_with_after_population |     8.76771 |      ms |
|                                  100th percentile service time | asc_sort_with_after_population |     4.40525 |      ms |
|                                                     error rate | asc_sort_with_after_population |           0 |       % |
|                                                 Min Throughput |            desc_sort_geonameid |      174.42 |   ops/s |
|                                                Mean Throughput |            desc_sort_geonameid |      174.42 |   ops/s |
|                                              Median Throughput |            desc_sort_geonameid |      174.42 |   ops/s |
|                                                 Max Throughput |            desc_sort_geonameid |      174.42 |   ops/s |
|                                       100th percentile latency |            desc_sort_geonameid |     9.42429 |      ms |
|                                  100th percentile service time |            desc_sort_geonameid |     3.40579 |      ms |
|                                                     error rate |            desc_sort_geonameid |           0 |       % |
|                                                 Min Throughput | desc_sort_with_after_geonameid |      279.93 |   ops/s |
|                                                Mean Throughput | desc_sort_with_after_geonameid |      279.93 |   ops/s |
|                                              Median Throughput | desc_sort_with_after_geonameid |      279.93 |   ops/s |
|                                                 Max Throughput | desc_sort_with_after_geonameid |      279.93 |   ops/s |
|                                       100th percentile latency | desc_sort_with_after_geonameid |     8.33296 |      ms |
|                                  100th percentile service time | desc_sort_with_after_geonameid |     4.53788 |      ms |
|                                                     error rate | desc_sort_with_after_geonameid |           0 |       % |
|                                                 Min Throughput |             asc_sort_geonameid |      196.25 |   ops/s |
|                                                Mean Throughput |             asc_sort_geonameid |      196.25 |   ops/s |
|                                              Median Throughput |             asc_sort_geonameid |      196.25 |   ops/s |
|                                                 Max Throughput |             asc_sort_geonameid |      196.25 |   ops/s |
|                                       100th percentile latency |             asc_sort_geonameid |     10.0462 |      ms |
|                                  100th percentile service time |             asc_sort_geonameid |     4.77104 |      ms |
|                                                     error rate |             asc_sort_geonameid |           0 |       % |
|                                                 Min Throughput |  asc_sort_with_after_geonameid |      179.22 |   ops/s |
|                                                Mean Throughput |  asc_sort_with_after_geonameid |      179.22 |   ops/s |
|                                              Median Throughput |  asc_sort_with_after_geonameid |      179.22 |   ops/s |
|                                                 Max Throughput |  asc_sort_with_after_geonameid |      179.22 |   ops/s |
|                                       100th percentile latency |  asc_sort_with_after_geonameid |      11.824 |      ms |
|                                  100th percentile service time |  asc_sort_with_after_geonameid |     5.94204 |      ms |
|                                                     error rate |  asc_sort_with_after_geonameid |           0 |       % |

[WARNING] Error rate is 100.0 for operation 'force-merge'. Please check the logs.
[WARNING] No throughput metrics available for [force-merge]. Likely cause: Error rate is 100.0%. Please check the logs.

@saimedhi
Copy link
Contributor Author

benchmark-metrics-* document

{
    "took": 2,
    "timed_out": false,
    "_shards": {
        "total": 5,
        "successful": 5,
        "skipped": 0,
        "failed": 0
    },
    "hits": {
        "total": {
            "value": 324,
            "relation": "eq"
        },
        "max_score": 1.0,
        "hits": [
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "KfFMn40BK0osdem3zqiS",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707774626873,
                    "relative-time-ms": 5.7683749999988265,
                    "test-execution-id": "b1c6c994-b57b-4afa-8298-278a4bdbc7c3",
                    "test-execution-timestamp": "20240212T214955Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "service_time",
                    "value": 4.766000000000048,
                    "unit": "ms",
                    "sample-type": "warmup",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "success": true
                    },
                    "task": "asc_sort_with_after_geonameid",
                    "operation": "asc_sort_with_after_geonameid",
                    "operation-type": "search"
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "LvFMn40BK0osdem3zqiS",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707774626878,
                    "relative-time-ms": 10.92083300000013,
                    "test-execution-id": "b1c6c994-b57b-4afa-8298-278a4bdbc7c3",
                    "test-execution-timestamp": "20240212T214955Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "client_processing_time",
                    "value": 0.22079100000027552,
                    "unit": "ms",
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "success": true
                    },
                    "task": "asc_sort_with_after_geonameid",
                    "operation": "asc_sort_with_after_geonameid",
                    "operation-type": "search"
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "5f9Mn40BBuuZzv8czmnm",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707774627449,
                    "relative-time-ms": 586.0262919999997,
                    "test-execution-id": "b1c6c994-b57b-4afa-8298-278a4bdbc7c3",
                    "test-execution-timestamp": "20240212T214955Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "node_young_gen_gc_count",
                    "value": 1,
                    "unit": null,
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "node_name": "benchmark-node-0",
                        "host_name": "127.0.0.1",
                        "os_name": "Mac OS X",
                        "os_version": "14.2.1",
                        "cpu_logical_cores": 10,
                        "jvm_vendor": "Oracle Corporation",
                        "jvm_version": "17.0.9"
                    }
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "6P9Mn40BBuuZzv8czmnm",
                "_score": 1.0,
                "_source": {
                    "name": "jvm_memory_pool_stats",
                    "young": {
                        "peak_usage": 642777088,
                        "unit": "byte"
                    },
                    "old": {
                        "peak_usage": 113386496,
                        "unit": "byte"
                    },
                    "survivor": {
                        "peak_usage": 56098816,
                        "unit": "byte"
                    },
                    "@timestamp": 1707774627449,
                    "relative-time-ms": 586.0837500000002,
                    "test-execution-id": "b1c6c994-b57b-4afa-8298-278a4bdbc7c3",
                    "test-execution-timestamp": "20240212T214955Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "node_name": "benchmark-node-0",
                        "host_name": "127.0.0.1",
                        "os_name": "Mac OS X",
                        "os_version": "14.2.1",
                        "cpu_logical_cores": 10,
                        "jvm_vendor": "Oracle Corporation",
                        "jvm_version": "17.0.9"
                    }
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "6_9Mn40BBuuZzv8czmnm",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707774627449,
                    "relative-time-ms": 586.1085840000015,
                    "test-execution-id": "b1c6c994-b57b-4afa-8298-278a4bdbc7c3",
                    "test-execution-timestamp": "20240212T214955Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "node_total_old_gen_gc_time",
                    "value": 0,
                    "unit": "ms",
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss"
                    }
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "7v9Mn40BBuuZzv8czmnm",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707774627462,
                    "relative-time-ms": 599.1520840000035,
                    "test-execution-id": "b1c6c994-b57b-4afa-8298-278a4bdbc7c3",
                    "test-execution-timestamp": "20240212T214955Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "segments_memory_in_bytes",
                    "value": 0,
                    "unit": "byte",
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss"
                    }
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "_v9Mn40BBuuZzv8czmnm",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707774627462,
                    "relative-time-ms": 599.5308749999992,
                    "test-execution-id": "b1c6c994-b57b-4afa-8298-278a4bdbc7c3",
                    "test-execution-timestamp": "20240212T214955Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "translog_size_in_bytes",
                    "value": 344490,
                    "unit": "byte",
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss"
                    }
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "UP9Mn40BBuuZzv8cjWlA",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707774610140,
                    "relative-time-ms": 10.907416000000225,
                    "test-execution-id": "b1c6c994-b57b-4afa-8298-278a4bdbc7c3",
                    "test-execution-timestamp": "20240212T214955Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "processing_time",
                    "value": 69.75395899999981,
                    "unit": "ms",
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "index": "geonames",
                        "took": 64,
                        "success": true,
                        "success-count": 125,
                        "error-count": 0
                    },
                    "task": "index-append",
                    "operation": "index-append",
                    "operation-type": "bulk"
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "W_9Mn40BBuuZzv8cjWlA",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707774610158,
                    "relative-time-ms": 27.35629199999856,
                    "test-execution-id": "b1c6c994-b57b-4afa-8298-278a4bdbc7c3",
                    "test-execution-timestamp": "20240212T214955Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "client_processing_time",
                    "value": 1.8018749999999528,
                    "unit": "ms",
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "index": "geonames",
                        "took": 76,
                        "success": true,
                        "success-count": 125,
                        "error-count": 0
                    },
                    "task": "index-append",
                    "operation": "index-append",
                    "operation-type": "bulk"
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "Xf9Mn40BBuuZzv8cjWlA",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707774610158,
                    "relative-time-ms": 26.233624999999705,
                    "test-execution-id": "b1c6c994-b57b-4afa-8298-278a4bdbc7c3",
                    "test-execution-timestamp": "20240212T214955Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "latency",
                    "value": 82.27633300000115,
                    "unit": "ms",
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "index": "geonames",
                        "took": 76,
                        "success": true,
                        "success-count": 125,
                        "error-count": 0
                    },
                    "task": "index-append",
                    "operation": "index-append",
                    "operation-type": "bulk"
                }
            }
        ]
    }
}

@IanHoang
Copy link
Collaborator

benchmark-metrics-* document

{
    "took": 2,
    "timed_out": false,
    "_shards": {
        "total": 5,
        "successful": 5,
        "skipped": 0,
        "failed": 0
    },
    "hits": {
        "total": {
            "value": 324,
            "relation": "eq"
        },
        "max_score": 1.0,
        "hits": [
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "KfFMn40BK0osdem3zqiS",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707774626873,
                    "relative-time-ms": 5.7683749999988265,
                    "test-execution-id": "b1c6c994-b57b-4afa-8298-278a4bdbc7c3",
                    "test-execution-timestamp": "20240212T214955Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "service_time",
                    "value": 4.766000000000048,
                    "unit": "ms",
                    "sample-type": "warmup",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "success": true
                    },
                    "task": "asc_sort_with_after_geonameid",
                    "operation": "asc_sort_with_after_geonameid",
                    "operation-type": "search"
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "LvFMn40BK0osdem3zqiS",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707774626878,
                    "relative-time-ms": 10.92083300000013,
                    "test-execution-id": "b1c6c994-b57b-4afa-8298-278a4bdbc7c3",
                    "test-execution-timestamp": "20240212T214955Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "client_processing_time",
                    "value": 0.22079100000027552,
                    "unit": "ms",
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "success": true
                    },
                    "task": "asc_sort_with_after_geonameid",
                    "operation": "asc_sort_with_after_geonameid",
                    "operation-type": "search"
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "5f9Mn40BBuuZzv8czmnm",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707774627449,
                    "relative-time-ms": 586.0262919999997,
                    "test-execution-id": "b1c6c994-b57b-4afa-8298-278a4bdbc7c3",
                    "test-execution-timestamp": "20240212T214955Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "node_young_gen_gc_count",
                    "value": 1,
                    "unit": null,
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "node_name": "benchmark-node-0",
                        "host_name": "127.0.0.1",
                        "os_name": "Mac OS X",
                        "os_version": "14.2.1",
                        "cpu_logical_cores": 10,
                        "jvm_vendor": "Oracle Corporation",
                        "jvm_version": "17.0.9"
                    }
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "6P9Mn40BBuuZzv8czmnm",
                "_score": 1.0,
                "_source": {
                    "name": "jvm_memory_pool_stats",
                    "young": {
                        "peak_usage": 642777088,
                        "unit": "byte"
                    },
                    "old": {
                        "peak_usage": 113386496,
                        "unit": "byte"
                    },
                    "survivor": {
                        "peak_usage": 56098816,
                        "unit": "byte"
                    },
                    "@timestamp": 1707774627449,
                    "relative-time-ms": 586.0837500000002,
                    "test-execution-id": "b1c6c994-b57b-4afa-8298-278a4bdbc7c3",
                    "test-execution-timestamp": "20240212T214955Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "node_name": "benchmark-node-0",
                        "host_name": "127.0.0.1",
                        "os_name": "Mac OS X",
                        "os_version": "14.2.1",
                        "cpu_logical_cores": 10,
                        "jvm_vendor": "Oracle Corporation",
                        "jvm_version": "17.0.9"
                    }
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "6_9Mn40BBuuZzv8czmnm",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707774627449,
                    "relative-time-ms": 586.1085840000015,
                    "test-execution-id": "b1c6c994-b57b-4afa-8298-278a4bdbc7c3",
                    "test-execution-timestamp": "20240212T214955Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "node_total_old_gen_gc_time",
                    "value": 0,
                    "unit": "ms",
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss"
                    }
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "7v9Mn40BBuuZzv8czmnm",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707774627462,
                    "relative-time-ms": 599.1520840000035,
                    "test-execution-id": "b1c6c994-b57b-4afa-8298-278a4bdbc7c3",
                    "test-execution-timestamp": "20240212T214955Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "segments_memory_in_bytes",
                    "value": 0,
                    "unit": "byte",
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss"
                    }
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "_v9Mn40BBuuZzv8czmnm",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707774627462,
                    "relative-time-ms": 599.5308749999992,
                    "test-execution-id": "b1c6c994-b57b-4afa-8298-278a4bdbc7c3",
                    "test-execution-timestamp": "20240212T214955Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "translog_size_in_bytes",
                    "value": 344490,
                    "unit": "byte",
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss"
                    }
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "UP9Mn40BBuuZzv8cjWlA",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707774610140,
                    "relative-time-ms": 10.907416000000225,
                    "test-execution-id": "b1c6c994-b57b-4afa-8298-278a4bdbc7c3",
                    "test-execution-timestamp": "20240212T214955Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "processing_time",
                    "value": 69.75395899999981,
                    "unit": "ms",
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "index": "geonames",
                        "took": 64,
                        "success": true,
                        "success-count": 125,
                        "error-count": 0
                    },
                    "task": "index-append",
                    "operation": "index-append",
                    "operation-type": "bulk"
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "W_9Mn40BBuuZzv8cjWlA",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707774610158,
                    "relative-time-ms": 27.35629199999856,
                    "test-execution-id": "b1c6c994-b57b-4afa-8298-278a4bdbc7c3",
                    "test-execution-timestamp": "20240212T214955Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "client_processing_time",
                    "value": 1.8018749999999528,
                    "unit": "ms",
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "index": "geonames",
                        "took": 76,
                        "success": true,
                        "success-count": 125,
                        "error-count": 0
                    },
                    "task": "index-append",
                    "operation": "index-append",
                    "operation-type": "bulk"
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "Xf9Mn40BBuuZzv8cjWlA",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707774610158,
                    "relative-time-ms": 26.233624999999705,
                    "test-execution-id": "b1c6c994-b57b-4afa-8298-278a4bdbc7c3",
                    "test-execution-timestamp": "20240212T214955Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "latency",
                    "value": 82.27633300000115,
                    "unit": "ms",
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "index": "geonames",
                        "took": 76,
                        "success": true,
                        "success-count": 125,
                        "error-count": 0
                    },
                    "task": "index-append",
                    "operation": "index-append",
                    "operation-type": "bulk"
                }
            }
        ]
    }
}

Do you have a sample document for any force-merges? You can filter the dashboards to look for operation-type = force-merge

@saimedhi
Copy link
Contributor Author

@IanHoang, I believe you're the best person to help ensure all possible failure scenarios are thoroughly tested. Could you please review and test each case for failure? Let me know if any changes are needed. Thank you!

Comment on lines 1678 to 1684
total_ops_unit = "ops"
request_meta_data = {"success": True}
except opensearchpy.TransportError as e:
request_context_holder.on_client_request_end()
# we *specifically* want to distinguish connection refused (a node died?) from connection timeouts
# pylint: disable=unidiomatic-typecheck
if type(e) is opensearchpy.ConnectionError:
Copy link
Collaborator

@gkamat gkamat Feb 13, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Generally, it is better to catch any exceptions raised by an API call at the point where the call is made, rather than up the call stack. That way, the caller can tailor the exceptions handled precisely, for instance, a search call may raise a different set of exceptions as compared to a force-merge. Otherwise, someone looking at the higher level call needs to dig down possible flows of control to identify exceptions that might get raised.

The current implementation of execute_single() is legacy in that it assumes it is tied to the ES/OS API. In the future, we will support other clients, that might raise other exceptions. Therefore, the strategy noted above would be preferable to the one implemented in this PR.

BTW, the execute_singe() routine should indeed capture exceptions, but they should be more general, just to handle the eventuality that lower-level modules have a bug and missed catching something they should.

In summary, the change above is functionally correct, but it is recommended that exceptions be caught at the point of call.

Copy link
Contributor Author

@saimedhi saimedhi Feb 13, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I understand your point, @gkamat. I'm not introducing new exception handling; I'm simply addressing the issue mentioned in #450 (comment).

And I believe the error handling as said here is unrelated to my PR. Lets raise a separate issue for it if required. Thank you.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The proposed change requires exception handling for the client time computation functionality, else there is a KeyError that gets raised. This should be caught at the point of the call, rather than up the stack.

The overall change does not need to be reverted; just exception handling for the current functionality needs to be added at each API call, rather than relying on the existing checks. Ideally, the checks in execute-single might be moved down, but that is not a prerequisite.

@IanHoang
Copy link
Collaborator

@gkamat and I had a discussion with @saimedhi. We'll merge this fix in and @saimedhi will open a separate issue to refactor this that will go in after OSB 1.3 release next week.

Copy link
Collaborator

@gkamat gkamat left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Noted comment by @IanHoang,

@IanHoang IanHoang merged commit 38cf15e into opensearch-project:main Feb 14, 2024
8 checks passed
@saimedhi
Copy link
Contributor Author

As discussed, I've opened an issue to refine the error handling strategy in the execute_single function.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants