Added a new metric: Client Processing Time #450

saimedhi · 2024-01-29T22:58:46Z

Description

Introduced a new metric: Client Processing Time.

Client Processing Time: The delta between total request time and service time.

Total Request Time: Defined as the duration between the runner sending a request to the client OpenSearch-py and receiving the response.

Service Time: Represents the interval from the server receiving the request to the server sending the response. Note: There was a discrepancy in the documentation regarding Service Time, and I have clarified my observations in the associated PR.

Issues Resolved

#432

Signed-off-by: saimedhi <saimedhi@amazon.com>

saimedhi · 2024-01-30T21:36:53Z

I've noticed inaccuracies in the calculation of 'Service Time' in Opensearch Benchmarks. I'll be raising an issue to address this concern. The accuracy of the Client Processing Time metric relies on the accuracy of 'Service Time' used in its computation.

IanHoang · 2024-01-31T17:12:03Z

osbenchmark/worker_coordinator/runner.py

+def time_func(func):
+    async def advised(*args, **kwargs):
+        request_context_holder.on_client_request_start()
+        rsl = await func(*args, **kwargs)


What does rsl represent? Is there a more descriptive name we could use?

@IanHoang, changed it to response

@IanHoang, kindly inform me of any additional corrections needed. I'm ready to address them promptly. Let's move forward with merging. Thank you.

Signed-off-by: saimedhi <saimedhi@amazon.com>

saimedhi · 2024-02-01T18:42:44Z

@IanHoang, @gkamat Please take a look

IanHoang · 2024-02-02T22:02:04Z

@saimedhi Overall, looks good. Did you run any tests or tests in --test-mode? If so, could you supply an example metric document from benchmark-metrics-* of external datastore?

saimedhi · 2024-02-05T20:24:17Z

@saimedhi Overall, looks good. Did you run any tests or tests in --test-mode? If so, could you supply an example metric document from benchmark-metrics-* of external datastore?

https://search-benchmarks-test-6ykfxpbqxyaslahdtm2ofk7ige.us-west-2.es.amazonaws.com/benchmark-metrics-2024-02/_search

{
    "query": {
        "match_all": {}
    }
}

{
    "took": 6,
    "timed_out": false,
    "_shards": {
        "total": 5,
        "successful": 5,
        "skipped": 0,
        "failed": 0
    },
    "hits": {
        "total": {
            "value": 324,
            "relation": "eq"
        },
        "max_score": 1.0,
        "hits": [
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "-NLreo0BAWfgjZcLzLhf",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707164289535,
                    "relative-time-ms": 9.440874999999238,
                    "test-execution-id": "6eec75c5-02ec-4bee-bad9-a0ab198d5c73",
                    "test-execution-timestamp": "20240205T201738Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "processing_time",
                    "value": 8.88333299999644,
                    "unit": "ms",
                    "sample-type": "warmup",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "success": true
                    },
                    "task": "desc_sort_geonameid",
                    "operation": "desc_sort_geonameid",
                    "operation-type": "search"
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "-dLreo0BAWfgjZcLzLhf",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707164289544,
                    "relative-time-ms": 18.039542000003905,
                    "test-execution-id": "6eec75c5-02ec-4bee-bad9-a0ab198d5c73",
                    "test-execution-timestamp": "20240205T201738Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "latency",
                    "value": 12.431333000002098,
                    "unit": "ms",
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "success": true
                    },
                    "task": "desc_sort_geonameid",
                    "operation": "desc_sort_geonameid",
                    "operation-type": "search"
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "-9Lreo0BAWfgjZcLzLhf",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707164289544,
                    "relative-time-ms": 18.039542000003905,
                    "test-execution-id": "6eec75c5-02ec-4bee-bad9-a0ab198d5c73",
                    "test-execution-timestamp": "20240205T201738Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "client_processing_time",
                    "value": 0.3031250000020691,
                    "unit": "ms",
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "success": true
                    },
                    "task": "desc_sort_geonameid",
                    "operation": "desc_sort_geonameid",
                    "operation-type": "search"
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "_NLreo0BAWfgjZcLzLhf",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707164289544,
                    "relative-time-ms": 18.039542000003905,
                    "test-execution-id": "6eec75c5-02ec-4bee-bad9-a0ab198d5c73",
                    "test-execution-timestamp": "20240205T201738Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "processing_time",
                    "value": 3.5443749999970464,
                    "unit": "ms",
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "success": true
                    },
                    "task": "desc_sort_geonameid",
                    "operation": "desc_sort_geonameid",
                    "operation-type": "search"
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "-v_reo0BBuuZzv8czmG6",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707164290143,
                    "relative-time-ms": 13.536166999998045,
                    "test-execution-id": "6eec75c5-02ec-4bee-bad9-a0ab198d5c73",
                    "test-execution-timestamp": "20240205T201738Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "latency",
                    "value": 11.079457999997544,
                    "unit": "ms",
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "success": true
                    },
                    "task": "desc_sort_with_after_geonameid",
                    "operation": "desc_sort_with_after_geonameid",
                    "operation-type": "search"
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "_f_reo0BBuuZzv8czmG6",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707164290143,
                    "relative-time-ms": 13.536166999998045,
                    "test-execution-id": "6eec75c5-02ec-4bee-bad9-a0ab198d5c73",
                    "test-execution-timestamp": "20240205T201738Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "processing_time",
                    "value": 4.338791999998648,
                    "unit": "ms",
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "success": true
                    },
                    "task": "desc_sort_with_after_geonameid",
                    "operation": "desc_sort_with_after_geonameid",
                    "operation-type": "search"
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "___reo0BBuuZzv8c0WES",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707164290724,
                    "relative-time-ms": 7.0460829999987595,
                    "test-execution-id": "6eec75c5-02ec-4bee-bad9-a0ab198d5c73",
                    "test-execution-timestamp": "20240205T201738Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "latency",
                    "value": 7.034042000000795,
                    "unit": "ms",
                    "sample-type": "warmup",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "success": true
                    },
                    "task": "asc_sort_geonameid",
                    "operation": "asc_sort_geonameid",
                    "operation-type": "search"
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "Af_reo0BBuuZzv8c0WIS",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707164290724,
                    "relative-time-ms": 7.0460829999987595,
                    "test-execution-id": "6eec75c5-02ec-4bee-bad9-a0ab198d5c73",
                    "test-execution-timestamp": "20240205T201738Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "client_processing_time",
                    "value": 0.6644999999991796,
                    "unit": "ms",
                    "sample-type": "warmup",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "success": true
                    },
                    "task": "asc_sort_geonameid",
                    "operation": "asc_sort_geonameid",
                    "operation-type": "search"
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "BP_reo0BBuuZzv8c0WIS",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707164290732,
                    "relative-time-ms": 14.581999999997208,
                    "test-execution-id": "6eec75c5-02ec-4bee-bad9-a0ab198d5c73",
                    "test-execution-timestamp": "20240205T201738Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "service_time",
                    "value": 1.9608329999982743,
                    "unit": "ms",
                    "sample-type": "normal",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "success": true
                    },
                    "task": "asc_sort_geonameid",
                    "operation": "asc_sort_geonameid",
                    "operation-type": "search"
                }
            },
            {
                "_index": "benchmark-metrics-2024-02",
                "_id": "_tLreo0BAWfgjZcL07ht",
                "_score": 1.0,
                "_source": {
                    "@timestamp": 1707164291333,
                    "relative-time-ms": 6.70079200000373,
                    "test-execution-id": "6eec75c5-02ec-4bee-bad9-a0ab198d5c73",
                    "test-execution-timestamp": "20240205T201738Z",
                    "environment": "local",
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "name": "latency",
                    "value": 6.702540999995676,
                    "unit": "ms",
                    "sample-type": "warmup",
                    "meta": {
                        "plugins": [
                            "opensearch-alerting",
                            "opensearch-anomaly-detection",
                            "opensearch-asynchronous-search",
                            "opensearch-cross-cluster-replication",
                            "opensearch-custom-codecs",
                            "opensearch-geospatial",
                            "opensearch-index-management",
                            "opensearch-job-scheduler",
                            "opensearch-knn",
                            "opensearch-ml",
                            "opensearch-neural-search",
                            "opensearch-notifications",
                            "opensearch-notifications-core",
                            "opensearch-observability",
                            "opensearch-performance-analyzer",
                            "opensearch-reports-scheduler",
                            "opensearch-security",
                            "opensearch-security-analytics",
                            "opensearch-sql"
                        ],
                        "attribute_shard_indexing_pressure_enabled": "true",
                        "source_revision": "6b1986e964d440be9137eba1413015c31c5a7752",
                        "distribution_version": "2.11.1",
                        "distribution_flavor": "oss",
                        "success": true
                    },
                    "task": "asc_sort_with_after_geonameid",
                    "operation": "asc_sort_with_after_geonameid",
                    "operation-type": "search"
                }
            }
        ]
    }
}

saimedhi · 2024-02-05T20:36:16Z

benchmark-results-2024-02

https://search-benchmarks-test-6ykfxpbqxyaslahdtm2ofk7ige.us-west-2.es.amazonaws.com/benchmark-results-2024-02/_search

{
    "query": {
        "match_all": {}
    }
}

{
    "took": 6,
    "timed_out": false,
    "_shards": {
        "total": 5,
        "successful": 5,
        "skipped": 0,
        "failed": 0
    },
    "hits": {
        "total": {
            "value": 220,
            "relation": "eq"
        },
        "max_score": 1.0,
        "hits": [
            {
                "_index": "benchmark-results-2024-02",
                "_id": "D9Lseo0BAWfgjZcLSbkR",
                "_score": 1.0,
                "_source": {
                    "benchmark-version": "1.3.0 (git revision: 9c716186)",
                    "benchmark-revision": "9c716186",
                    "environment": "local",
                    "test-execution-id": "6eec75c5-02ec-4bee-bad9-a0ab198d5c73",
                    "test-execution-timestamp": "20240205T201738Z",
                    "distribution-version": "2.11.1",
                    "distribution-flavor": "oss",
                    "user-tags": {},
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "active": true,
                    "distribution-major-version": 2,
                    "workload-revision": "6879b3d",
                    "task": "country_agg_cached",
                    "operation": "country_agg_cached",
                    "name": "client_processing_time",
                    "value": {
                        "100_0": 0.2687920033931732,
                        "mean": 0.2687920033931732,
                        "unit": "ms"
                    }
                }
            },
            {
                "_index": "benchmark-results-2024-02",
                "_id": "EtLseo0BAWfgjZcLSbkR",
                "_score": 1.0,
                "_source": {
                    "benchmark-version": "1.3.0 (git revision: 9c716186)",
                    "benchmark-revision": "9c716186",
                    "environment": "local",
                    "test-execution-id": "6eec75c5-02ec-4bee-bad9-a0ab198d5c73",
                    "test-execution-timestamp": "20240205T201738Z",
                    "distribution-version": "2.11.1",
                    "distribution-flavor": "oss",
                    "user-tags": {},
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "active": true,
                    "distribution-major-version": 2,
                    "workload-revision": "6879b3d",
                    "task": "painless_static",
                    "operation": "painless_static",
                    "name": "client_processing_time",
                    "value": {
                        "100_0": 0.27687498927116394,
                        "mean": 0.27687498927116394,
                        "unit": "ms"
                    }
                }
            },
            {
                "_index": "benchmark-results-2024-02",
                "_id": "HtLseo0BAWfgjZcLSbkR",
                "_score": 1.0,
                "_source": {
                    "benchmark-version": "1.3.0 (git revision: 9c716186)",
                    "benchmark-revision": "9c716186",
                    "environment": "local",
                    "test-execution-id": "6eec75c5-02ec-4bee-bad9-a0ab198d5c73",
                    "test-execution-timestamp": "20240205T201738Z",
                    "distribution-version": "2.11.1",
                    "distribution-flavor": "oss",
                    "user-tags": {},
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "active": true,
                    "distribution-major-version": 2,
                    "workload-revision": "6879b3d",
                    "task": "desc_sort_geonameid",
                    "operation": "desc_sort_geonameid",
                    "name": "client_processing_time",
                    "value": {
                        "100_0": 0.3031249940395355,
                        "mean": 0.3031249940395355,
                        "unit": "ms"
                    }
                }
            },
            {
                "_index": "benchmark-results-2024-02",
                "_id": "H9Lseo0BAWfgjZcLSbkR",
                "_score": 1.0,
                "_source": {
                    "benchmark-version": "1.3.0 (git revision: 9c716186)",
                    "benchmark-revision": "9c716186",
                    "environment": "local",
                    "test-execution-id": "6eec75c5-02ec-4bee-bad9-a0ab198d5c73",
                    "test-execution-timestamp": "20240205T201738Z",
                    "distribution-version": "2.11.1",
                    "distribution-flavor": "oss",
                    "user-tags": {},
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "active": true,
                    "distribution-major-version": 2,
                    "workload-revision": "6879b3d",
                    "task": "desc_sort_with_after_geonameid",
                    "operation": "desc_sort_with_after_geonameid",
                    "name": "client_processing_time",
                    "value": {
                        "100_0": 0.24741700291633606,
                        "mean": 0.24741700291633606,
                        "unit": "ms"
                    }
                }
            },
            {
                "_index": "benchmark-results-2024-02",
                "_id": "JNLseo0BAWfgjZcLSbkR",
                "_score": 1.0,
                "_source": {
                    "benchmark-version": "1.3.0 (git revision: 9c716186)",
                    "benchmark-revision": "9c716186",
                    "environment": "local",
                    "test-execution-id": "6eec75c5-02ec-4bee-bad9-a0ab198d5c73",
                    "test-execution-timestamp": "20240205T201738Z",
                    "distribution-version": "2.11.1",
                    "distribution-flavor": "oss",
                    "user-tags": {},
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "active": true,
                    "distribution-major-version": 2,
                    "workload-revision": "6879b3d",
                    "task": "index-stats",
                    "operation": "index-stats",
                    "name": "duration",
                    "value": {
                        "single": 17.28866699999898
                    }
                }
            },
            {
                "_index": "benchmark-results-2024-02",
                "_id": "JtLseo0BAWfgjZcLSbkR",
                "_score": 1.0,
                "_source": {
                    "benchmark-version": "1.3.0 (git revision: 9c716186)",
                    "benchmark-revision": "9c716186",
                    "environment": "local",
                    "test-execution-id": "6eec75c5-02ec-4bee-bad9-a0ab198d5c73",
                    "test-execution-timestamp": "20240205T201738Z",
                    "distribution-version": "2.11.1",
                    "distribution-flavor": "oss",
                    "user-tags": {},
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "active": true,
                    "distribution-major-version": 2,
                    "workload-revision": "6879b3d",
                    "task": "default",
                    "operation": "default",
                    "name": "duration",
                    "value": {
                        "single": 9.393625000001293
                    }
                }
            },
            {
                "_index": "benchmark-results-2024-02",
                "_id": "K9Lseo0BAWfgjZcLSbkR",
                "_score": 1.0,
                "_source": {
                    "benchmark-version": "1.3.0 (git revision: 9c716186)",
                    "benchmark-revision": "9c716186",
                    "environment": "local",
                    "test-execution-id": "6eec75c5-02ec-4bee-bad9-a0ab198d5c73",
                    "test-execution-timestamp": "20240205T201738Z",
                    "distribution-version": "2.11.1",
                    "distribution-flavor": "oss",
                    "user-tags": {},
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "active": true,
                    "distribution-major-version": 2,
                    "workload-revision": "6879b3d",
                    "task": "scroll",
                    "operation": "scroll",
                    "name": "duration",
                    "value": {
                        "single": 21.090707999999125
                    }
                }
            },
            {
                "_index": "benchmark-results-2024-02",
                "_id": "N9Lseo0BAWfgjZcLSbkR",
                "_score": 1.0,
                "_source": {
                    "benchmark-version": "1.3.0 (git revision: 9c716186)",
                    "benchmark-revision": "9c716186",
                    "environment": "local",
                    "test-execution-id": "6eec75c5-02ec-4bee-bad9-a0ab198d5c73",
                    "test-execution-timestamp": "20240205T201738Z",
                    "distribution-version": "2.11.1",
                    "distribution-flavor": "oss",
                    "user-tags": {},
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "active": true,
                    "distribution-major-version": 2,
                    "workload-revision": "6879b3d",
                    "task": "asc_sort_population",
                    "operation": "asc_sort_population",
                    "name": "duration",
                    "value": {
                        "single": 11.587667000000579
                    }
                }
            },
            {
                "_index": "benchmark-results-2024-02",
                "_id": "ONLseo0BAWfgjZcLSbkR",
                "_score": 1.0,
                "_source": {
                    "benchmark-version": "1.3.0 (git revision: 9c716186)",
                    "benchmark-revision": "9c716186",
                    "environment": "local",
                    "test-execution-id": "6eec75c5-02ec-4bee-bad9-a0ab198d5c73",
                    "test-execution-timestamp": "20240205T201738Z",
                    "distribution-version": "2.11.1",
                    "distribution-flavor": "oss",
                    "user-tags": {},
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "active": true,
                    "distribution-major-version": 2,
                    "workload-revision": "6879b3d",
                    "task": "asc_sort_with_after_population",
                    "operation": "asc_sort_with_after_population",
                    "name": "duration",
                    "value": {
                        "single": 9.533209000000653
                    }
                }
            },
            {
                "_index": "benchmark-results-2024-02",
                "_id": "RNLseo0BAWfgjZcLSbkR",
                "_score": 1.0,
                "_source": {
                    "benchmark-version": "1.3.0 (git revision: 9c716186)",
                    "benchmark-revision": "9c716186",
                    "environment": "local",
                    "test-execution-id": "6eec75c5-02ec-4bee-bad9-a0ab198d5c73",
                    "test-execution-timestamp": "20240205T201738Z",
                    "distribution-version": "2.11.1",
                    "distribution-flavor": "oss",
                    "user-tags": {},
                    "workload": "geonames",
                    "test_procedure": "append-no-conflicts",
                    "provision-config-instance": "defaults",
                    "active": true,
                    "distribution-major-version": 2,
                    "workload-revision": "6879b3d",
                    "task": "country_agg_uncached",
                    "operation": "country_agg_uncached",
                    "name": "error_rate",
                    "value": {
                        "single": 0.0
                    }
                }
            }
        ]
    }
}

osbenchmark/client.py

VijayanB · 2024-02-07T05:04:22Z

osbenchmark/worker_coordinator/runner.py

@@ -1212,7 +1238,9 @@ def status(v):
            # we're good with any count of relocating shards.
            expected_relocating_shards = sys.maxsize

+        request_context_holder.on_client_request_start()


@IanHoang @saimedhi We also have custom runners in workloads eg: https://github.com/opensearch-project/opensearch-benchmark-workloads/blob/9c0fdca67b0a929a7f28649c911737898609cb50/nyc_taxis/workload.py#L3

Shouldn't we update there as well?

I am not completely sure, but I tested running benchmarks against the 'nyc_taxis' workload and everything seems to be working fine.

gkamat · 2024-02-10T03:25:03Z

@saimedhi, @IanHoang there is an issue with this change. There is no error checking during the calls to the client, for instance, in the case below:

+            request_context_holder.on_client_request_start()
             await opensearch.indices.forcemerge(**merge_params)
+            request_context_holder.on_client_request_end()

So, if there is an exception thrown in that call, the client_request_end parameter will not be updated and there is an error caught upstream:

[ERROR] Cannot execute-test. Error in load generator [4]
        Cannot run task [index-append]: 'client_request_end'

This condition was suspected in a related comment. It might be best to revert this change until the fix goes in.

saimedhi · 2024-02-10T17:19:26Z

@saimedhi, @IanHoang there is an issue with this change. There is no error checking during the calls to the client, for instance, in the case below:
+            request_context_holder.on_client_request_start()
             await opensearch.indices.forcemerge(**merge_params)
+            request_context_holder.on_client_request_end()
So, if there is an exception thrown in that call, the client_request_end parameter will not be updated and there is an error caught upstream:
[ERROR] Cannot execute-test. Error in load generator [4]
        Cannot run task [index-append]: 'client_request_end'
This condition was suspected in a related comment. It might be best to revert this change until the fix goes in.

@govind, Instead of reverting we can use wrapper for all the runners. Wrapper is already in the code. Just we need to add @time_func to runners removing time_measurement within the runners.

def time_func(func):
async def advised(*args, **kwargs):
request_context_holder.on_client_request_start()
response = await func(*args, **kwargs)
request_context_holder.on_client_request_end()
return response
return advised

saimedhi · 2024-02-10T17:23:32Z

@govind, Instead of reverting we can use wrapper for all the runners. Wrapper is already in the code. Just we need to add @time_func to runners removing time_measurement within the runners.

def time_func(func): async def advised(*args, **kwargs): request_context_holder.on_client_request_start() response = await func(*args, **kwargs) request_context_holder.on_client_request_end() return response return advised

@IanHoang, @gkamat Please try to make this change. If not I will work on Tuesday. I am on-call this weekend.

gkamat · 2024-02-10T19:23:24Z

@saimedhi essentially that wrapper will need to go around the _call() method for all operations, which has the downside of reducing the accuracy of the time measurement. If you are implying the opensearch.indices.forcemerge function should be wrapped, that wrapping would need to be done either in the opensearchpy package or by redefining the function name cell after it is imported in the runner.

Furthermore, even if this were done, it does not take into account the exception handling. The call may fail withibn the client, leading to the client_request_end() call getting skipped.

All-in-all, the best and cleanest option may be to update each of the calls with the appropriate exception handling, either with a try-except or a with (context manager).

Since you are busy right now, the most expeditious option for now is probably reverting. The change can be checked-in later with additional testing. Thanks.

cc: @IanHoang

saimedhi · 2024-02-10T20:23:41Z

@gkamat, I will again try to raise PR soon. But just one confirmation, when there is error during the calls to the client both client request start and client request end should be calculated right.

gkamat · 2024-02-11T03:36:33Z

Yes, client_request_start will already have been computed before the call. All possible exceptions should be caught and then client_request_end should be set subsequently. Else, there will be an error upstream due to the unset variable.

saimedhi · 2024-02-12T09:09:42Z

@saimedhi, @IanHoang there is an issue with this change. There is no error checking during the calls to the client, for instance, in the case below:
+            request_context_holder.on_client_request_start()
             await opensearch.indices.forcemerge(**merge_params)
+            request_context_holder.on_client_request_end()
So, if there is an exception thrown in that call, the client_request_end parameter will not be updated and there is an error caught upstream:
[ERROR] Cannot execute-test. Error in load generator [4]
        Cannot run task [index-append]: 'client_request_end'
This condition was suspected in a related comment. It might be best to revert this change until the fix goes in.

@gkamat please take a look at PR462. I hope it addresses the above issue. Thank you.

VijayanB · 2024-02-24T22:54:55Z

@saimedhi, @IanHoang there is an issue with this change. There is no error checking during the calls to the client, for instance, in the case below:
+            request_context_holder.on_client_request_start()
             await opensearch.indices.forcemerge(**merge_params)
+            request_context_holder.on_client_request_end()
So, if there is an exception thrown in that call, the client_request_end parameter will not be updated and there is an error caught upstream:
[ERROR] Cannot execute-test. Error in load generator [4]
        Cannot run task [index-append]: 'client_request_end'
This condition was suspected in a related comment. It might be best to revert this change until the fix goes in.
@gkamat please take a look at PR462. I hope it addresses the above issue. Thank you.

@saimedhi If i understood correctly your PR will add client request end if any runner raises exception. However, for force merge, there won't be any exception if polling is used , see here . Did you test whether your fix will work for case where force merge time out at first call and complete is true on subsequent call ?

saimedhi · 2024-02-26T06:09:48Z

request_context_holder.on_client_request_end()

Hi @VijayanB, thank you for checking out the PR! I'm not quite sure how to replicate that scenario, but your point makes sense. Maybe we should calculate request_context_holder.on_client_request_end() before setting complete = True. What do you think?

             try:
                  request_context_holder.on_client_request_start()
                  await opensearch.indices.forcemerge(**merge_params)
                  request_context_holder.on_client_request_end()
                  complete = True
              except opensearchpy.ConnectionTimeout:
                  pass
              while not complete:
                  await asyncio.sleep(params.get("poll-period"))
                  tasks = await opensearch.tasks.list(params={"actions": "indices:admin/forcemerge"})
                  if len(tasks["nodes"]) == 0:
                      request_context_holder.on_client_request_end()             #Added here
                      # empty nodes response indicates no tasks
                      complete = True

VijayanB · 2024-02-26T16:14:42Z

Looks good to me. Thanks. Vector search using high dimensions with large number of vectors will take longer time to merge segments. I was able to reproduce for 768D, 10 M vectors. Thanks.

saimedhi added 3 commits January 29, 2024 12:46

Added client processing time metric

5aacd81

Signed-off-by: saimedhi <saimedhi@amazon.com>

Added client processing time metric

a5688a2

Signed-off-by: saimedhi <saimedhi@amazon.com>

Added client processing time metric

f912304

Signed-off-by: saimedhi <saimedhi@amazon.com>

saimedhi requested review from IanHoang and gkamat as code owners January 29, 2024 22:58

saimedhi mentioned this pull request Jan 29, 2024

Added client processing time #433

Closed

IanHoang reviewed Jan 31, 2024

View reviewed changes

saimedhi and others added 4 commits January 31, 2024 09:23

Added client processing time metric

24c66d8

Signed-off-by: saimedhi <saimedhi@amazon.com>

Added client processing time metric

777d2b2

Signed-off-by: saimedhi <saimedhi@amazon.com>

Merge branch 'opensearch-project:main' into client_time/final

d02da12

Added client processing time metric

9c71618

Signed-off-by: saimedhi <saimedhi@amazon.com>

IanHoang added the High Priority label Feb 1, 2024

IanHoang approved these changes Feb 5, 2024

View reviewed changes

IanHoang merged commit 83449b5 into opensearch-project:main Feb 5, 2024
8 checks passed

VijayanB reviewed Feb 7, 2024

View reviewed changes

osbenchmark/client.py Show resolved Hide resolved

VijayanB reviewed Feb 7, 2024

View reviewed changes

saimedhi mentioned this pull request Feb 12, 2024

Fixed client processing time #462

Merged

saimedhi mentioned this pull request Feb 26, 2024

Fix client processing time calculation in ForceMerge Runner #470

Merged

Kyle-sandeman-mrdfood mentioned this pull request Jun 18, 2024

Add request start and end parameter to sleep runner #491

Merged

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added a new metric: Client Processing Time #450

Added a new metric: Client Processing Time #450

saimedhi commented Jan 29, 2024

saimedhi commented Jan 30, 2024

IanHoang Jan 31, 2024

saimedhi Jan 31, 2024

saimedhi Jan 31, 2024

saimedhi commented Feb 1, 2024

IanHoang commented Feb 2, 2024

saimedhi commented Feb 5, 2024

saimedhi commented Feb 5, 2024

VijayanB Feb 7, 2024

saimedhi Feb 7, 2024

gkamat commented Feb 10, 2024 •

edited

Loading

saimedhi commented Feb 10, 2024

saimedhi commented Feb 10, 2024

gkamat commented Feb 10, 2024

saimedhi commented Feb 10, 2024

gkamat commented Feb 11, 2024

saimedhi commented Feb 12, 2024

VijayanB commented Feb 24, 2024 •

edited

Loading

saimedhi commented Feb 26, 2024

VijayanB commented Feb 26, 2024

Added a new metric: Client Processing Time #450

Added a new metric: Client Processing Time #450

Conversation

saimedhi commented Jan 29, 2024

Description

Issues Resolved

saimedhi commented Jan 30, 2024

IanHoang Jan 31, 2024

Choose a reason for hiding this comment

saimedhi Jan 31, 2024

Choose a reason for hiding this comment

saimedhi Jan 31, 2024

Choose a reason for hiding this comment

saimedhi commented Feb 1, 2024

IanHoang commented Feb 2, 2024

saimedhi commented Feb 5, 2024

saimedhi commented Feb 5, 2024

VijayanB Feb 7, 2024

Choose a reason for hiding this comment

saimedhi Feb 7, 2024

Choose a reason for hiding this comment

gkamat commented Feb 10, 2024 • edited Loading

saimedhi commented Feb 10, 2024

saimedhi commented Feb 10, 2024

gkamat commented Feb 10, 2024

saimedhi commented Feb 10, 2024

gkamat commented Feb 11, 2024

saimedhi commented Feb 12, 2024

VijayanB commented Feb 24, 2024 • edited Loading

saimedhi commented Feb 26, 2024

VijayanB commented Feb 26, 2024

gkamat commented Feb 10, 2024 •

edited

Loading

VijayanB commented Feb 24, 2024 •

edited

Loading