Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug]: [benchmark][cluster] search, query, flush failed in DML & DQL scenario, search error message=incomplete query result #29656

Closed
1 task done
wangting0128 opened this issue Jan 3, 2024 · 26 comments
Assignees
Labels
2.4-features kind/bug Issues or changes related a bug stale indicates no udpates for 30 days test/benchmark benchmark test triage/accepted Indicates an issue or PR is ready to be actively worked on.
Milestone

Comments

@wangting0128
Copy link
Contributor

Is there an existing issue for this?

  • I have searched the existing issues

Environment

- Milvus version: master-20231231-3f46c6d4
- Deployment mode(standalone or cluster): cluster
- MQ type(rocksmq, pulsar or kafka): pulsar    
- SDK version(e.g. pymilvus v2.0.0rc2): 2.4.0rc12
- OS(Ubuntu or CentOS): 
- CPU/Memory: 
- GPU: 
- Others:

Current Behavior

argo task: fouramf-multi-vector-t6hds

test case: test_inverted_locust_hnsw_ivf_sq8_dml_dql_cluster

server:

[2024-01-03 11:14:28,223 -  INFO - fouram]: [Base] Deploy initial state: 
I0103 09:48:13.165351     413 request.go:665] Waited for 1.155303121s due to client-side throttling, not priority and fairness, request: GET:https://kubernetes.default.svc.cluster.local/apis/certificates.k8s.io/v1?timeout=32s
NAME                                                              READY   STATUS              RESTARTS           AGE     IP              NODE         NOMINATED NODE   READINESS GATES
fouramf-multi-vr-t6hds-74-5665-etcd-0                             1/1     Running             0                  6m53s   10.104.28.35    4am-node33   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-etcd-1                             1/1     Running             0                  6m53s   10.104.15.132   4am-node20   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-etcd-2                             1/1     Running             0                  6m53s   10.104.34.206   4am-node37   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-milvus-datacoord-c559649d6-6rn22   1/1     Running             0                  6m53s   10.104.34.204   4am-node37   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-milvus-datanode-759c489fd7-znzs7   1/1     Running             1 (2m22s ago)      6m53s   10.104.30.244   4am-node38   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-milvus-indexcoord-778c5998545krc   1/1     Running             0                  6m53s   10.104.20.69    4am-node22   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-milvus-indexnode-7fffdbf75dv52gr   1/1     Running             0                  6m52s   10.104.34.202   4am-node37   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-milvus-indexnode-7fffdbf75dxwkxz   1/1     Running             0                  6m53s   10.104.17.143   4am-node23   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-milvus-proxy-5846469db9-8292b      1/1     Running             1 (2m21s ago)      6m53s   10.104.34.203   4am-node37   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-milvus-querycoord-85d86f4dbvmjlh   1/1     Running             1 (2m22s ago)      6m53s   10.104.17.142   4am-node23   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-milvus-querynode-c45cd44f8-qb78n   1/1     Running             0                  6m53s   10.104.20.70    4am-node22   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-milvus-rootcoord-6cc5949d-hj6tl    1/1     Running             1 (2m22s ago)      6m53s   10.104.33.61    4am-node36   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-minio-0                            1/1     Running             0                  6m53s   10.104.28.36    4am-node33   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-minio-1                            1/1     Running             0                  6m53s   10.104.15.133   4am-node20   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-minio-2                            1/1     Running             0                  6m52s   10.104.33.63    4am-node36   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-minio-3                            1/1     Running             0                  6m52s   10.104.24.116   4am-node29   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-bookie-0                    1/1     Running             0                  6m53s   10.104.16.95    4am-node21   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-bookie-1                    1/1     Running             0                  6m53s   10.104.18.172   4am-node25   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-bookie-2                    1/1     Running             0                  6m52s   10.104.28.39    4am-node33   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-bookie-init-47lmb           0/1     Completed           0                  6m53s   10.104.13.60    4am-node16   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-broker-0                    1/1     Running             0                  6m53s   10.104.13.61    4am-node16   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-proxy-0                     1/1     Running             0                  6m53s   10.104.13.58    4am-node16   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-pulsar-init-wcrbf           0/1     Completed           0                  6m53s   10.104.13.59    4am-node16   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-recovery-0                  1/1     Running             0                  6m53s   10.104.13.57    4am-node16   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-zookeeper-0                 1/1     Running             0                  6m53s   10.104.18.171   4am-node25   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-zookeeper-1                 1/1     Running             0                  6m2s    10.104.15.138   4am-node20   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-zookeeper-2                 1/1     Running             0                  5m18s   10.104.24.123   4am-node29   <none>           <none> (base.py:257)
[2024-01-03 11:14:28,223 -  INFO - fouram]: [Cmd Exe]  kubectl get pods  -n qa-milvus  -o wide | grep -E 'STATUS|fouramf-multi-vr-t6hds-74-5665-milvus|fouramf-multi-vr-t6hds-74-5665-minio|fouramf-multi-vr-t6hds-74-5665-etcd|fouramf-multi-vr-t6hds-74-5665-pulsar|fouramf-multi-vr-t6hds-74-5665-kafka'  (util_cmd.py:14)
[2024-01-03 11:14:37,592 -  INFO - fouram]: [CliClient] pod details of release(fouramf-multi-vr-t6hds-74-5665): 
 I0103 11:14:29.508587     545 request.go:665] Waited for 1.161791214s due to client-side throttling, not priority and fairness, request: GET:https://kubernetes.default.svc.cluster.local/apis/autoscaling/v2beta2?timeout=32s
NAME                                                              READY   STATUS             RESTARTS           AGE     IP              NODE         NOMINATED NODE   READINESS GATES
fouramf-multi-vr-t6hds-74-5665-etcd-0                             1/1     Running            0                  93m     10.104.28.35    4am-node33   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-etcd-1                             1/1     Running            0                  93m     10.104.15.132   4am-node20   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-etcd-2                             1/1     Running            0                  93m     10.104.34.206   4am-node37   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-milvus-datacoord-c559649d6-6rn22   1/1     Running            0                  93m     10.104.34.204   4am-node37   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-milvus-datanode-759c489fd7-znzs7   1/1     Running            1 (88m ago)        93m     10.104.30.244   4am-node38   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-milvus-indexcoord-778c5998545krc   1/1     Running            0                  93m     10.104.20.69    4am-node22   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-milvus-indexnode-7fffdbf75dv52gr   1/1     Running            0                  93m     10.104.34.202   4am-node37   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-milvus-indexnode-7fffdbf75dxwkxz   1/1     Running            0                  93m     10.104.17.143   4am-node23   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-milvus-proxy-5846469db9-8292b      1/1     Running            1 (88m ago)        93m     10.104.34.203   4am-node37   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-milvus-querycoord-85d86f4dbvmjlh   1/1     Running            1 (88m ago)        93m     10.104.17.142   4am-node23   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-milvus-querynode-c45cd44f8-qb78n   1/1     Running            0                  93m     10.104.20.70    4am-node22   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-milvus-rootcoord-6cc5949d-hj6tl    1/1     Running            1 (88m ago)        93m     10.104.33.61    4am-node36   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-minio-0                            1/1     Running            0                  93m     10.104.28.36    4am-node33   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-minio-1                            1/1     Running            0                  93m     10.104.15.133   4am-node20   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-minio-2                            1/1     Running            0                  93m     10.104.33.63    4am-node36   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-minio-3                            1/1     Running            0                  93m     10.104.24.116   4am-node29   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-bookie-0                    1/1     Running            0                  93m     10.104.16.95    4am-node21   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-bookie-1                    1/1     Running            0                  93m     10.104.18.172   4am-node25   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-bookie-2                    1/1     Running            0                  93m     10.104.28.39    4am-node33   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-bookie-init-47lmb           0/1     Completed          0                  93m     10.104.13.60    4am-node16   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-broker-0                    1/1     Running            0                  93m     10.104.13.61    4am-node16   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-proxy-0                     1/1     Running            0                  93m     10.104.13.58    4am-node16   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-pulsar-init-wcrbf           0/1     Completed          0                  93m     10.104.13.59    4am-node16   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-recovery-0                  1/1     Running            0                  93m     10.104.13.57    4am-node16   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-zookeeper-0                 1/1     Running            0                  93m     10.104.18.171   4am-node25   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-zookeeper-1                 1/1     Running            0                  92m     10.104.15.138   4am-node20   <none>           <none>
fouramf-multi-vr-t6hds-74-5665-pulsar-zookeeper-2                 1/1     Running            0                  91m     10.104.24.123   4am-node29   <none>           <none> (cli_client.py:132)

client logs:
fouram_log.log

search error message
search_error_message.log
截屏2024-01-03 20 44 50

query error message
query_error_message.log
截屏2024-01-03 20 46 53

flush error message
flush_error_message.log
截屏2024-01-03 20 45 54

Collection schema: {'auto_id': False, 'description': '', 'fields': [{'name': 'id', 'description': '', 'type': <DataType.INT64: 5>, 'is_primary': True, 'auto_id': False}, {'name': 'float_vector', 'description': '', 'type': <DataType.FLOAT_VECTOR: 101>, 'params': {'dim': 128}}, {'name': 'float_vector_1', 'description': '', 'type': <DataType.FLOAT_VECTOR: 101>, 'params': {'dim': 200}}, {'name': 'int8_1', 'description': '', 'type': <DataType.INT8: 2>}, {'name': 'int16_1', 'description': '', 'type': <DataType.INT16: 3>}, {'name': 'int32_1', 'description': '', 'type': <DataType.INT32: 4>}, {'name': 'int64_1', 'description': '', 'type': <DataType.INT64: 5>}, {'name': 'double_1', 'description': '', 'type': <DataType.DOUBLE: 11>}, {'name': 'float_1', 'description': '', 'type': <DataType.FLOAT: 10>}, {'name': 'varchar_1', 'description': '', 'type': <DataType.VARCHAR: 21>, 'params': {'max_length': 256}}, {'name': 'bool_1', 'description': '', 'type': <DataType.BOOL: 1>}, {'name': 'int8_2', 'description': '', 'type': <DataType.INT8: 2>}, {'name': 'int16_2', 'description': '', 'type': <DataType.INT16: 3>}, {'name': 'int32_2', 'description': '', 'type': <DataType.INT32: 4>}, {'name': 'int64_2', 'description': '', 'type': <DataType.INT64: 5>}, {'name': 'double_2', 'description': '', 'type': <DataType.DOUBLE: 11>}, {'name': 'float_2', 'description': '', 'type': <DataType.FLOAT: 10>}, {'name': 'varchar_2', 'description': '', 'type': <DataType.VARCHAR: 21>, 'params': {'max_length': 256}}, {'name': 'bool_2', 'description': '', 'type': <DataType.BOOL: 1>}]}

Expected Behavior

No response

Steps To Reproduce

1. create collection with fields:
                'float_vector': 128dim, 'float_vector_1': 200dim
                'int8_1', 'int16_1', 'int32_1', 'int64_1', 'double_1', 'float_1', 'varchar_1', 'bool_1',
                'int8_2', 'int16_2', 'int32_2', 'int64_2', 'double_2', 'float_2', 'varchar_2', 'bool_2'
            2. build indexes:
                HNSW: 'float_vector'
                IVF_SQ8: 'float_vector_1'
                scalar_default_index: 'int8_1', 'int16_1', 'int32_1', 'int64_1', 'double_1', 'float_1', 'varchar_1'
                scalar_INVERTED_index: 'int8_2', 'int16_2', 'int32_2', 'int64_2', 'double_2', 'float_2', 'varchar_2', 'bool_2'
            3. insert 5 million data
            4. flush collection
            5. build indexes again using the same params
            6. load collection
            7. concurrent request:
                - insert
                - delete
                - flush
                - load
                - search
                - query

Milvus Log

No response

Anything else?

test result:

{'server': {'deploy_tool': 'helm',
            'deploy_mode': 'cluster',
            'config_name': 'cluster_8c16m',
            'config': {'queryNode': {'resources': {'limits': {'cpu': '8.0',
                                                              'memory': '64Gi'},
                                                   'requests': {'cpu': '5.0',
                                                                'memory': '33Gi'}},
                                     'replicas': 1},
                       'indexNode': {'resources': {'limits': {'cpu': '8.0',
                                                              'memory': '16Gi'},
                                                   'requests': {'cpu': '5.0',
                                                                'memory': '9Gi'}},
                                     'replicas': 2},
                       'dataNode': {'resources': {'limits': {'cpu': '8.0',
                                                             'memory': '16Gi'},
                                                  'requests': {'cpu': '5.0',
                                                               'memory': '9Gi'}}},
                       'cluster': {'enabled': True},
                       'pulsar': {},
                       'kafka': {},
                       'minio': {'metrics': {'podMonitor': {'enabled': True}}},
                       'etcd': {'metrics': {'enabled': True,
                                            'podMonitor': {'enabled': True}}},
                       'metrics': {'serviceMonitor': {'enabled': True}},
                       'log': {'level': 'debug'},
                       'image': {'all': {'repository': 'harbor.milvus.io/milvus/milvus',
                                         'tag': 'master-20231231-3f46c6d4'}}},
            'host': 'fouramf-multi-vr-t6hds-74-5665-milvus.qa-milvus.svc.cluster.local',
            'port': '19530',
            'uri': ''},
 'client': {'test_case_type': 'ConcurrentClientBase',
            'test_case_name': 'test_inverted_locust_hnsw_ivf_sq8_dml_dql_cluster',
            'test_case_params': {'dataset_params': {'metric_type': 'L2',
                                                    'dim': 128,
                                                    'scalars_index': {'int8_1': {},
                                                                      'int16_1': {},
                                                                      'int32_1': {},
                                                                      'int64_1': {},
                                                                      'double_1': {},
                                                                      'float_1': {},
                                                                      'varchar_1': {},
                                                                      'int8_2': {'index_type': 'INVERTED'},
                                                                      'int16_2': {'index_type': 'INVERTED'},
                                                                      'int32_2': {'index_type': 'INVERTED'},
                                                                      'int64_2': {'index_type': 'INVERTED'},
                                                                      'double_2': {'index_type': 'INVERTED'},
                                                                      'float_2': {'index_type': 'INVERTED'},
                                                                      'varchar_2': {'index_type': 'INVERTED'},
                                                                      'bool_2': {'index_type': 'INVERTED'}},
                                                    'vectors_index': {'float_vector_1': {'index_type': 'IVF_SQ8',
                                                                                         'index_param': {'nlist': 1024},
                                                                                         'metric_type': 'L2'}},
                                                    'scalars_params': {'float_vector_1': {'params': {'dim': 200},
                                                                                          'other_params': {'dataset': 'text2img',
                                                                                                           'dim': 200}}},
                                                    'dataset_name': 'sift',
                                                    'dataset_size': 5000000,
                                                    'ni_per': 5000},
                                 'collection_params': {'other_fields': ['float_vector_1',
                                                                        'int8_1',
                                                                        'int16_1',
                                                                        'int32_1',
                                                                        'int64_1',
                                                                        'double_1',
                                                                        'float_1',
                                                                        'varchar_1',
                                                                        'bool_1',
                                                                        'int8_2',
                                                                        'int16_2',
                                                                        'int32_2',
                                                                        'int64_2',
                                                                        'double_2',
                                                                        'float_2',
                                                                        'varchar_2',
                                                                        'bool_2'],
                                                       'shards_num': 2},
                                 'resource_groups_params': {'reset': False},
                                 'database_user_params': {'reset_rbac': False,
                                                          'reset_db': False},
                                 'index_params': {'index_type': 'HNSW',
                                                  'index_param': {'M': 8,
                                                                  'efConstruction': 200}},
                                 'concurrent_params': {'concurrent_number': 20,
                                                       'during_time': '1h',
                                                       'interval': 20,
                                                       'spawn_rate': None},
                                 'concurrent_tasks': [{'type': 'insert',
                                                       'weight': 1,
                                                       'params': {'nb': 10,
                                                                  'timeout': 30,
                                                                  'random_id': True,
                                                                  'random_vector': True,
                                                                  'varchar_filled': False,
                                                                  'start_id': 0}},
                                                      {'type': 'delete',
                                                       'weight': 1,
                                                       'params': {'expr': '',
                                                                  'delete_length': 9,
                                                                  'timeout': 30}},
                                                      {'type': 'flush',
                                                       'weight': 1,
                                                       'params': {'timeout': 30}},
                                                      {'type': 'load',
                                                       'weight': 1,
                                                       'params': {'replica_number': 1,
                                                                  'timeout': 30}},
                                                      {'type': 'search',
                                                       'weight': 1,
                                                       'params': {'nq': 1000,
                                                                  'top_k': 1,
                                                                  'search_param': {'ef': 64},
                                                                  'expr': 'int64_1 '
                                                                          '> '
                                                                          '-1 '
                                                                          '&& '
                                                                          'id '
                                                                          '> '
                                                                          '-1',
                                                                  'guarantee_timestamp': None,
                                                                  'output_fields': ['*'],
                                                                  'ignore_growing': False,
                                                                  'timeout': 60,
                                                                  'random_data': True}},
                                                      {'type': 'query',
                                                       'weight': 1,
                                                       'params': {'ids': None,
                                                                  'expr': 'int64_1 '
                                                                          '> '
                                                                          '-1 '
                                                                          '&&  '
                                                                          'int64_2 '
                                                                          '> '
                                                                          '-1 '
                                                                          '&& ',
                                                                  'output_fields': ['*'],
                                                                  'offset': None,
                                                                  'limit': None,
                                                                  'ignore_growing': False,
                                                                  'timeout': 60,
                                                                  'random_data': True,
                                                                  'random_count': 20,
                                                                  'random_range': [2500000.0,
                                                                                   5000000],
                                                                  'field_name': 'id',
                                                                  'field_type': 'int64'}}]},
            'run_id': 2024010348859758,
            'datetime': '2024-01-03 09:41:25.534119',
            'client_version': '2.2'},
 'result': {'test_result': {'index': {'RT': 529.3387,
                                      'float_vector_1': {'RT': 200.3813},
                                      'int8_1': {'RT': 29.9841},
                                      'int16_1': {'RT': 5.0418},
                                      'int32_1': {'RT': 0.5145},
                                      'int64_1': {'RT': 0.5143},
                                      'double_1': {'RT': 0.5127},
                                      'float_1': {'RT': 0.5129},
                                      'varchar_1': {'RT': 0.5131},
                                      'int8_2': {'RT': 0.5152},
                                      'int16_2': {'RT': 6.5552},
                                      'int32_2': {'RT': 0.5133},
                                      'int64_2': {'RT': 0.5128},
                                      'double_2': {'RT': 0.5168},
                                      'float_2': {'RT': 0.5153},
                                      'varchar_2': {'RT': 0.5129},
                                      'bool_2': {'RT': 0.5128}},
                            'insert': {'total_time': 479.0253,
                                       'VPS': 10437.8621,
                                       'batch_time': 0.479,
                                       'batch': 5000},
                            'flush': {'RT': 3.5338},
                            'load': {'RT': 11.0735},
                            'Locust': {'Aggregated': {'Requests': 3607,
                                                      'Fails': 1048,
                                                      'RPS': 1.0,
                                                      'fail_s': 0.29,
                                                      'RT_max': 93448.14,
                                                      'RT_avg': 19766.88,
                                                      'TP50': 3800.0,
                                                      'TP99': 75000.0},
                                       'delete': {'Requests': 584,
                                                  'Fails': 0,
                                                  'RPS': 0.16,
                                                  'fail_s': 0.0,
                                                  'RT_max': 40938.02,
                                                  'RT_avg': 1291.7,
                                                  'TP50': 9,
                                                  'TP99': 29000.0},
                                       'flush': {'Requests': 596,
                                                 'Fails': 264,
                                                 'RPS': 0.17,
                                                 'fail_s': 0.44,
                                                 'RT_max': 63808.45,
                                                 'RT_avg': 21269.83,
                                                 'TP50': 25000.0,
                                                 'TP99': 60000.0},
                                       'insert': {'Requests': 606,
                                                  'Fails': 0,
                                                  'RPS': 0.17,
                                                  'fail_s': 0.0,
                                                  'RT_max': 33436.98,
                                                  'RT_avg': 1681.02,
                                                  'TP50': 64,
                                                  'TP99': 29000.0},
                                       'load': {'Requests': 624,
                                                'Fails': 0,
                                                'RPS': 0.17,
                                                'fail_s': 0.0,
                                                'RT_max': 38427.28,
                                                'RT_avg': 2084.82,
                                                'TP50': 10,
                                                'TP99': 32000.0},
                                       'query': {'Requests': 594,
                                                 'Fails': 313,
                                                 'RPS': 0.17,
                                                 'fail_s': 0.53,
                                                 'RT_max': 93448.14,
                                                 'RT_avg': 45908.79,
                                                 'TP50': 60000.0,
                                                 'TP99': 92000.0},
                                       'search': {'Requests': 603,
                                                  'Fails': 471,
                                                  'RPS': 0.17,
                                                  'fail_s': 0.78,
                                                  'RT_max': 60219.44,
                                                  'RT_avg': 46896.41,
                                                  'TP50': 60000.0,
                                                  'TP99': 60000.0}}}}}
@wangting0128 wangting0128 added kind/bug Issues or changes related a bug needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. test/benchmark benchmark test labels Jan 3, 2024
@wangting0128 wangting0128 added this to the 2.4.0 milestone Jan 3, 2024
@yanliang567
Copy link
Contributor

@xige-16 is it about multiple vectors ?
/assign @xige-16
/unassign

@sre-ci-robot sre-ci-robot assigned xige-16 and unassigned yanliang567 Jan 4, 2024
@yanliang567 yanliang567 added triage/accepted Indicates an issue or PR is ready to be actively worked on. and removed needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. labels Jan 4, 2024
@czs007
Copy link
Collaborator

czs007 commented Jan 8, 2024

@xige-16 @yanliang567
Multidimensional vectors also encounter the issue of MVCC, necessitating the introduction of an additional PR to address it.

sre-ci-robot pushed a commit that referenced this issue Jan 9, 2024
When the TimeTravel functionality was previously removed, it
inadvertently affected the MVCC functionality within the system. This PR
aims to reintroduce the internal MVCC functionality as follows:

1. Add MvccTimestamp to the requests of Search/Query and the results of
Search internally.
2. When the delegator receives a Query/Search request and there is no
MVCC timestamp set in the request, set the delegator's current tsafe as
the MVCC timestamp of the request. If the request already has an MVCC
timestamp, do not modify it.
3. When the Proxy handles Search and triggers the second phase ReQuery,
divide the ReQuery into different shards and pass the MVCC timestamp to
the corresponding Query requests.

issue: #29656

Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>
czs007 added a commit to czs007/milvus that referenced this issue Jan 9, 2024
When the TimeTravel functionality was previously removed, it
inadvertently affected the MVCC functionality within the system. This PR
aims to reintroduce the internal MVCC functionality as follows:

1. Add MvccTimestamp to the requests of Search/Query and the results of
Search internally.
2. When the delegator receives a Query/Search request and there is no
MVCC timestamp set in the request, set the delegator's current tsafe as
the MVCC timestamp of the request. If the request already has an MVCC
timestamp, do not modify it.
3. When the Proxy handles Search and triggers the second phase ReQuery,
divide the ReQuery into different shards and pass the MVCC timestamp to
the corresponding Query requests.

issue: milvus-io#29656

Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>
@wangting0128
Copy link
Contributor Author

Occasionally throws error: incomplete query result

截屏2024-01-09 19 12 43

argo task: fouramf-multi-vector-kfsgv
client pod name: fouramf-multi-vector-kfsgv-1041655259
milvus tag: milvus_tag=master-20240109-60e88fb8-amd64

client config:

{
     "dataset_params": {
          "metric_type": "L2",
          "dim": 128,
          "dataset_name": "sift",
          "dataset_size": 5000000,
          "ni_per": 5000
     },
     "index_params": {
          "index_type": "HNSW",
          "index_param": {
               "M": 8,
               "efConstruction": 200
          }
     },
     "concurrent_params": {
          "concurrent_number": [
               20
          ],
          "during_time": "1h",
          "interval": 20
     },
     "concurrent_tasks": [
          {
               "type": "insert",
               "weight": 1,
               "params": {
                    "nb": 10,
                    "timeout": 30,
                    "random_id": true,
                    "random_vector": true,
                    "varchar_filled": false,
                    "start_id": 0
               }
          },
          {
               "type": "delete",
               "weight": 1,
               "params": {
                    "expr": "",
                    "delete_length": 9,
                    "timeout": 30
               }
          },
          {
               "type": "flush",
               "weight": 1,
               "params": {
                    "timeout": null
               }
          },
          {
               "type": "load",
               "weight": 1,
               "params": {
                    "replica_number": 1,
                    "timeout": 30
               }
          },
          {
               "type": "search",
               "weight": 1,
               "params": {
                    "nq": 1000,
                    "top_k": 1,
                    "search_param": {
                         "ef": 64
                    },
                    "expr": "id > -1",
                    "guarantee_timestamp": null,
                    "output_fields": [
                         "*"
                    ],
                    "ignore_growing": false,
                    "timeout": null,
                    "random_data": true
               }
          },
          {
               "type": "query",
               "weight": 1,
               "params": {
                    "ids": null,
                    "expr": "id > -1  && ",
                    "output_fields": [
                         "*"
                    ],
                    "offset": null,
                    "limit": null,
                    "ignore_growing": false,
                    "timeout": null,
                    "random_data": true,
                    "random_count": 20,
                    "random_range": [
                         2500000,
                         5000000
                    ],
                    "field_name": "id",
                    "field_type": "int64"
               }
          }
     ]
}

server:

NAME                                                              READY   STATUS             RESTARTS         AGE     IP              NODE         NOMINATED NODE   READINESS GATES
verify-dml-dql-timeout-noscalar-etcd-0                            1/1     Running            0                70m     10.104.29.65    4am-node35   <none>           <none>
verify-dml-dql-timeout-noscalar-etcd-1                            1/1     Running            0                70m     10.104.15.78    4am-node20   <none>           <none>
verify-dml-dql-timeout-noscalar-etcd-2                            1/1     Running            0                70m     10.104.17.211   4am-node23   <none>           <none>
verify-dml-dql-timeout-noscalar-milvus-datacoord-6999f6cc4ttjpm   1/1     Running            0                70m     10.104.23.77    4am-node27   <none>           <none>
verify-dml-dql-timeout-noscalar-milvus-datanode-55bd9bdb4cd9gvq   1/1     Running            1 (66m ago)      70m     10.104.23.78    4am-node27   <none>           <none>
verify-dml-dql-timeout-noscalar-milvus-indexcoord-689f54fft2rp9   1/1     Running            0                70m     10.104.23.80    4am-node27   <none>           <none>
verify-dml-dql-timeout-noscalar-milvus-indexnode-67c6b57bc57zqr   1/1     Running            0                70m     10.104.18.81    4am-node25   <none>           <none>
verify-dml-dql-timeout-noscalar-milvus-indexnode-67c6b57bchrw8t   1/1     Running            0                70m     10.104.21.28    4am-node24   <none>           <none>
verify-dml-dql-timeout-noscalar-milvus-proxy-5b458df475-7sf74     1/1     Running            1 (66m ago)      70m     10.104.34.236   4am-node37   <none>           <none>
verify-dml-dql-timeout-noscalar-milvus-querycoord-6b449c94jkw4w   1/1     Running            1 (66m ago)      70m     10.104.27.60    4am-node31   <none>           <none>
verify-dml-dql-timeout-noscalar-milvus-querynode-7555d67f7ws5vf   1/1     Running            0                70m     10.104.23.79    4am-node27   <none>           <none>
verify-dml-dql-timeout-noscalar-milvus-rootcoord-7b6bd4f958grgm   1/1     Running            0                70m     10.104.27.61    4am-node31   <none>           <none>
verify-dml-dql-timeout-noscalar-minio-0                           1/1     Running            0                70m     10.104.15.76    4am-node20   <none>           <none>
verify-dml-dql-timeout-noscalar-minio-1                           1/1     Running            0                70m     10.104.17.215   4am-node23   <none>           <none>
verify-dml-dql-timeout-noscalar-minio-2                           1/1     Running            0                70m     10.104.32.176   4am-node39   <none>           <none>
verify-dml-dql-timeout-noscalar-minio-3                           1/1     Running            0                70m     10.104.28.87    4am-node33   <none>           <none>
verify-dml-dql-timeout-noscalar-pulsar-bookie-0                   1/1     Running            0                70m     10.104.29.67    4am-node35   <none>           <none>
verify-dml-dql-timeout-noscalar-pulsar-bookie-1                   1/1     Running            0                70m     10.104.20.210   4am-node22   <none>           <none>
verify-dml-dql-timeout-noscalar-pulsar-bookie-2                   1/1     Running            0                70m     10.104.15.84    4am-node20   <none>           <none>
verify-dml-dql-timeout-noscalar-pulsar-bookie-init-fmfh7          0/1     Completed          0                70m     10.104.23.81    4am-node27   <none>           <none>
verify-dml-dql-timeout-noscalar-pulsar-broker-0                   1/1     Running            0                70m     10.104.5.59     4am-node12   <none>           <none>
verify-dml-dql-timeout-noscalar-pulsar-proxy-0                    1/1     Running            0                70m     10.104.13.101   4am-node16   <none>           <none>
verify-dml-dql-timeout-noscalar-pulsar-pulsar-init-sc5mw          0/1     Completed          0                70m     10.104.30.20    4am-node38   <none>           <none>
verify-dml-dql-timeout-noscalar-pulsar-recovery-0                 1/1     Running            0                70m     10.104.16.198   4am-node21   <none>           <none>
verify-dml-dql-timeout-noscalar-pulsar-zookeeper-0                1/1     Running            0                70m     10.104.29.62    4am-node35   <none>           <none>
verify-dml-dql-timeout-noscalar-pulsar-zookeeper-1                1/1     Running            0                69m     10.104.23.84    4am-node27   <none>           <none>
verify-dml-dql-timeout-noscalar-pulsar-zookeeper-2                1/1     Running            0                69m     10.104.30.26    4am-node38   <none>           <none>

@czs007

sre-ci-robot pushed a commit that referenced this issue Jan 11, 2024
When the TimeTravel functionality was previously removed, it
inadvertently affected the MVCC functionality within the system. This PR
aims to reintroduce the internal MVCC functionality as follows:

1. Add MvccTimestamp to the requests of Search/Query and the results of
Search internally.
2. When the delegator receives a Query/Search request and there is no
MVCC timestamp set in the request, set the delegator's current tsafe as
the MVCC timestamp of the request. If the request already has an MVCC
timestamp, do not modify it.
3. When the Proxy handles Search and triggers the second phase ReQuery,
divide the ReQuery into different shards and pass the MVCC timestamp to
the corresponding Query requests.

issue: #29656
pr: #29749

Signed-off-by: zhenshan.cao <zhenshan.cao@zilliz.com>
@wangting0128
Copy link
Contributor Author

wangting0128 commented Jan 17, 2024

Hybrid search error

argo task: inverted-corn-dj9g6
image: master-20240116-2dacca7b-amd64
test case: test_inverted_locust_partitions_dml_dql_standalone

server:

NAME                                                              READY   STATUS                   RESTARTS          AGE     IP              NODE         NOMINATED NODE   READINESS GATES
inverted-corn-dj9g6-3-25-7031-etcd-0                              1/1     Running                  0                 2m13s   10.104.19.164   4am-node28   <none>           <none>
inverted-corn-dj9g6-3-25-7031-milvus-standalone-56df797f6fkng9m   1/1     Running                  0                 2m13s   10.104.23.228   4am-node27   <none>           <none>
inverted-corn-dj9g6-3-25-7031-minio-56f86b47bd-xprb4              1/1     Running                  0                 2m13s   10.104.23.229   4am-node27   <none>           <none> (base.py:257)
[2024-01-16 11:35:56,892 -  INFO - fouram]: [Cmd Exe]  kubectl get pods  -n qa-milvus  -o wide | grep -E 'STATUS|inverted-corn-dj9g6-3-25-7031-milvus|inverted-corn-dj9g6-3-25-7031-minio|inverted-corn-dj9g6-3-25-7031-etcd|inverted-corn-dj9g6-3-25-7031-pulsar|inverted-corn-dj9g6-3-25-7031-kafka'  (util_cmd.py:14)
[2024-01-16 11:36:06,197 -  INFO - fouram]: [CliClient] pod details of release(inverted-corn-dj9g6-3-25-7031): 
 I0116 11:35:58.161272     525 request.go:665] Waited for 1.17522047s due to client-side throttling, not priority and fairness, request: GET:https://kubernetes.default.svc.cluster.local/apis/milvus.io/v1alpha1?timeout=32s
NAME                                                              READY   STATUS                   RESTARTS          AGE     IP              NODE         NOMINATED NODE   READINESS GATES
inverted-corn-dj9g6-3-25-7031-etcd-0                              1/1     Running                  0                 3h6m    10.104.19.164   4am-node28   <none>           <none>
inverted-corn-dj9g6-3-25-7031-milvus-standalone-56df797f6fkng9m   1/1     Running                  0                 3h6m    10.104.23.228   4am-node27   <none>           <none>
inverted-corn-dj9g6-3-25-7031-minio-56f86b47bd-xprb4              1/1     Running                  0                 3h6m    10.104.23.229   4am-node27   <none>

client log:

截屏2024-01-17 11 58 34

test result:

{'server': {'deploy_tool': 'helm',
            'deploy_mode': 'standalone',
            'config_name': 'standalone_16c16m',
            'config': {'standalone': {'resources': {'limits': {'cpu': '16.0',
                                                               'memory': '16Gi'},
                                                    'requests': {'cpu': '9.0',
                                                                 'memory': '9Gi'}}},
                       'cluster': {'enabled': False},
                       'etcd': {'replicaCount': 1,
                                'metrics': {'enabled': True,
                                            'podMonitor': {'enabled': True}}},
                       'minio': {'mode': 'standalone',
                                 'metrics': {'podMonitor': {'enabled': True}}},
                       'pulsar': {'enabled': False},
                       'metrics': {'serviceMonitor': {'enabled': True}},
                       'log': {'level': 'debug'},
                       'image': {'all': {'repository': 'harbor.milvus.io/milvus/milvus',
                                         'tag': 'master-20240116-2dacca7b-amd64'}}},
            'host': 'inverted-corn-dj9g6-3-25-7031-milvus.qa-milvus.svc.cluster.local',
            'port': '19530',
            'uri': ''},
 'client': {'test_case_type': 'ConcurrentClientBase',
            'test_case_name': 'test_inverted_locust_partitions_dml_dql_standalone',
            'test_case_params': {'dataset_params': {'metric_type': 'L2',
                                                    'dim': 128,
                                                    'scalars_index': {'id': {'index_type': 'INVERTED'},
                                                                      'int64_1': {'index_type': 'INVERTED'}},
                                                    'extra_partitions': {'partitions': 10,
                                                                         'data_repeated': False},
                                                    'dataset_name': 'sift',
                                                    'dataset_size': 5000000,
                                                    'ni_per': 50000},
                                 'collection_params': {'other_fields': ['int64_1'],
                                                       'shards_num': 2},
                                 'resource_groups_params': {'reset': False},
                                 'database_user_params': {'reset_rbac': False,
                                                          'reset_db': False},
                                 'index_params': {'index_type': 'IVF_FLAT',
                                                  'index_param': {'nlist': 1024}},
                                 'concurrent_params': {'concurrent_number': 20,
                                                       'during_time': '3h',
                                                       'interval': 20,
                                                       'spawn_rate': None},
                                 'concurrent_tasks': [{'type': 'insert',
                                                       'weight': 1,
                                                       'params': {'nb': 10,
                                                                  'timeout': 30,
                                                                  'random_id': True,
                                                                  'random_vector': True,
                                                                  'varchar_filled': False,
                                                                  'start_id': 5000000}},
                                                      {'type': 'delete',
                                                       'weight': 1,
                                                       'params': {'expr': '',
                                                                  'delete_length': 9,
                                                                  'timeout': 30}},
                                                      {'type': 'flush',
                                                       'weight': 1,
                                                       'params': {'timeout': 180}},
                                                      {'type': 'load',
                                                       'weight': 1,
                                                       'params': {'replica_number': 1,
                                                                  'timeout': 30}},
                                                      {'type': 'search',
                                                       'weight': 1,
                                                       'params': {'nq': 1000,
                                                                  'top_k': 10,
                                                                  'search_param': {'nprobe': 16},
                                                                  'expr': None,
                                                                  'guarantee_timestamp': None,
                                                                  'output_fields': None,
                                                                  'ignore_growing': False,
                                                                  'group_by_field': None,
                                                                  'timeout': 180,
                                                                  'random_data': True}},
                                                      {'type': 'hybrid_search',
                                                       'weight': 1,
                                                       'params': {'nq': 1,
                                                                  'top_k': 10,
                                                                  'reqs': [{'search_param': {'nprobe': 16},
                                                                            'anns_field': 'float_vector',
                                                                            'top_k': 2000},
                                                                           {'search_param': {'nprobe': 32},
                                                                            'anns_field': 'float_vector',
                                                                            'expr': 'int64_1 '
                                                                                    '> '
                                                                                    '-1 '
                                                                                    '&& '
                                                                                    'id '
                                                                                    '> '
                                                                                    '-1'},
                                                                           {'search_param': {'nprobe': 64},
                                                                            'anns_field': 'float_vector',
                                                                            'expr': 'id '
                                                                                    '> '
                                                                                    '10',
                                                                            'top_k': 60}],
                                                                  'rerank': {'WeightedRanker': [0.3,
                                                                                                0.4,
                                                                                                0.3]},
                                                                  'output_fields': ['*'],
                                                                  'ignore_growing': False,
                                                                  'guarantee_timestamp': None,
                                                                  'timeout': 60,
                                                                  'random_data': True}},
                                                      {'type': 'query',
                                                       'weight': 1,
                                                       'params': {'ids': None,
                                                                  'expr': 'int64_1 '
                                                                          '> '
                                                                          '-1 '
                                                                          '&&',
                                                                  'output_fields': ['*'],
                                                                  'offset': None,
                                                                  'limit': None,
                                                                  'ignore_growing': False,
                                                                  'timeout': 180,
                                                                  'random_data': True,
                                                                  'random_count': 20,
                                                                  'random_range': [0,
                                                                                   1000000.0],
                                                                  'field_name': 'id',
                                                                  'field_type': 'int64'}}]},
            'run_id': 2024011637641753,
            'datetime': '2024-01-16 08:29:24.939510',
            'client_version': '2.4.0'},
 'result': {'test_result': {'index': {'RT': 15.205,
                                      'id': {'RT': 1.0113},
                                      'int64_1': {'RT': 0.5089}},
                            'insert': {'total_time': 147.6451,
                                       'VPS': 34073.6098,
                                       'batch_time': 1.4765,
                                       'batch': 50000.0},
                            'flush': {'RT': 2.5172},
                            'load': {'RT': 5.5751},
                            'Locust': {'Aggregated': {'Requests': 74467,
                                                      'Fails': 524,
                                                      'RPS': 6.9,
                                                      'fail_s': 0.01,
                                                      'RT_max': 42738.8,
                                                      'RT_avg': 2887.77,
                                                      'TP50': 1700.0,
                                                      'TP99': 21000.0},
                                       'delete': {'Requests': 10660,
                                                  'Fails': 0,
                                                  'RPS': 0.99,
                                                  'fail_s': 0.0,
                                                  'RT_max': 8493.77,
                                                  'RT_avg': 335.91,
                                                  'TP50': 65,
                                                  'TP99': 3200.0},
                                       'flush': {'Requests': 10589,
                                                 'Fails': 0,
                                                 'RPS': 0.98,
                                                 'fail_s': 0.0,
                                                 'RT_max': 42738.8,
                                                 'RT_avg': 9814.85,
                                                 'TP50': 8200.0,
                                                 'TP99': 30000.0},
                                       'hybrid_search': {'Requests': 10682,
                                                         'Fails': 524,
                                                         'RPS': 0.99,
                                                         'fail_s': 0.05,
                                                         'RT_max': 15746.05,
                                                         'RT_avg': 4011.19,
                                                         'TP50': 3900.0,
                                                         'TP99': 10000.0},
                                       'insert': {'Requests': 10603,
                                                  'Fails': 0,
                                                  'RPS': 0.98,
                                                  'fail_s': 0.0,
                                                  'RT_max': 9145.29,
                                                  'RT_avg': 453.96,
                                                  'TP50': 140.0,
                                                  'TP99': 3500.0},
                                       'load': {'Requests': 10678,
                                                'Fails': 0,
                                                'RPS': 0.99,
                                                'fail_s': 0.0,
                                                'RT_max': 12928.48,
                                                'RT_avg': 597.84,
                                                'TP50': 120.0,
                                                'TP99': 4900.0},
                                       'query': {'Requests': 10619,
                                                 'Fails': 0,
                                                 'RPS': 0.98,
                                                 'fail_s': 0.0,
                                                 'RT_max': 15294.76,
                                                 'RT_avg': 2539.39,
                                                 'TP50': 2300.0,
                                                 'TP99': 8200.0},
                                       'search': {'Requests': 10636,
                                                  'Fails': 0,
                                                  'RPS': 0.98,
                                                  'fail_s': 0.0,
                                                  'RT_max': 12363.5,
                                                  'RT_avg': 2493.7,
                                                  'TP50': 2300.0,
                                                  'TP99': 7300.0}}}}}

@xige-16

@wangting0128
Copy link
Contributor Author

Occasionally throws error: incomplete query result

argo task:fouramf-multi-vector-f2hrf
test case name:test_hybrid_search_locust_dql_dml_partition_cluster

server:

NAME                                                              READY   STATUS        RESTARTS        AGE     IP              NODE         NOMINATED NODE   READINESS GATES
verify-issue-30276-2-etcd-0                                       1/1     Running       0               4h7m    10.104.24.50    4am-node29   <none>           <none>
verify-issue-30276-2-etcd-1                                       1/1     Running       0               4h7m    10.104.25.103   4am-node30   <none>           <none>
verify-issue-30276-2-etcd-2                                       1/1     Running       0               4h7m    10.104.31.65    4am-node34   <none>           <none>
verify-issue-30276-2-milvus-datacoord-84967b9d97-d4mh6            1/1     Running       0               4h7m    10.104.26.148   4am-node32   <none>           <none>
verify-issue-30276-2-milvus-datanode-758f7b4694-jcgkk             1/1     Running       1 (4h3m ago)    4h7m    10.104.26.145   4am-node32   <none>           <none>
verify-issue-30276-2-milvus-indexcoord-6884b47fd8-hpchg           1/1     Running       0               4h7m    10.104.26.146   4am-node32   <none>           <none>
verify-issue-30276-2-milvus-indexnode-586c6d6449-lbmlv            1/1     Running       0               4h7m    10.104.6.173    4am-node13   <none>           <none>
verify-issue-30276-2-milvus-indexnode-586c6d6449-pgzg5            1/1     Running       0               4h7m    10.104.4.159    4am-node11   <none>           <none>
verify-issue-30276-2-milvus-indexnode-586c6d6449-w4nzt            1/1     Running       0               4h7m    10.104.5.173    4am-node12   <none>           <none>
verify-issue-30276-2-milvus-indexnode-586c6d6449-zsjws            1/1     Running       0               4h7m    10.104.26.147   4am-node32   <none>           <none>
verify-issue-30276-2-milvus-proxy-949bb7b6d-r8gpd                 1/1     Running       0               4h7m    10.104.9.22     4am-node14   <none>           <none>
verify-issue-30276-2-milvus-querycoord-657d7f8dd7-gzrp4           1/1     Running       1 (4h3m ago)    4h7m    10.104.26.149   4am-node32   <none>           <none>
verify-issue-30276-2-milvus-querynode-6cf9bd4789-bxjsv            1/1     Running       2 (4h5m ago)    4h7m    10.104.13.118   4am-node16   <none>           <none>
verify-issue-30276-2-milvus-querynode-6cf9bd4789-dx9cq            1/1     Running       2 (4h6m ago)    4h7m    10.104.9.24     4am-node14   <none>           <none>
verify-issue-30276-2-milvus-rootcoord-66cdc47dd9-r7hc9            1/1     Running       0               4h7m    10.104.9.23     4am-node14   <none>           <none>
verify-issue-30276-2-minio-0                                      1/1     Running       0               4h7m    10.104.24.49    4am-node29   <none>           <none>
verify-issue-30276-2-minio-1                                      1/1     Running       0               4h7m    10.104.25.102   4am-node30   <none>           <none>
verify-issue-30276-2-minio-2                                      1/1     Running       0               4h7m    10.104.31.66    4am-node34   <none>           <none>
verify-issue-30276-2-minio-3                                      1/1     Running       0               4h7m    10.104.15.123   4am-node20   <none>           <none>
verify-issue-30276-2-pulsar-bookie-0                              1/1     Running       0               4h7m    10.104.24.51    4am-node29   <none>           <none>
verify-issue-30276-2-pulsar-bookie-1                              1/1     Running       0               4h7m    10.104.25.104   4am-node30   <none>           <none>
verify-issue-30276-2-pulsar-bookie-2                              1/1     Running       0               4h7m    10.104.31.69    4am-node34   <none>           <none>
verify-issue-30276-2-pulsar-bookie-init-l5gmv                     0/1     Completed     0               4h7m    10.104.6.172    4am-node13   <none>           <none>
verify-issue-30276-2-pulsar-broker-0                              1/1     Running       0               4h7m    10.104.1.126    4am-node10   <none>           <none>
verify-issue-30276-2-pulsar-proxy-0                               1/1     Running       0               4h7m    10.104.24.46    4am-node29   <none>           <none>
verify-issue-30276-2-pulsar-pulsar-init-pvfnm                     0/1     Completed     0               4h7m    10.104.6.171    4am-node13   <none>           <none>
verify-issue-30276-2-pulsar-recovery-0                            1/1     Running       0               4h7m    10.104.14.52    4am-node18   <none>           <none>
verify-issue-30276-2-pulsar-zookeeper-0                           1/1     Running       0               4h7m    10.104.28.198   4am-node33   <none>           <none>
verify-issue-30276-2-pulsar-zookeeper-1                           1/1     Running       0               4h6m    10.104.24.53    4am-node29   <none>           <none>
verify-issue-30276-2-pulsar-zookeeper-2                           1/1     Running       0               4h6m    10.104.15.125   4am-node20   <none>           <none>

client logs:
client.log
截屏2024-01-26 14 26 48
(partition: create->insert->flush->index again->load->search->release->search error->drop)
search error expected:partition not loaded

test result:

{'server': {'deploy_tool': 'helm',
            'deploy_mode': 'cluster',
            'config_name': 'cluster_2c8m',
            'config': {'queryNode': {'resources': {'limits': {'cpu': 32,
                                                              'memory': '64Gi'},
                                                   'requests': {'cpu': 17,
                                                                'memory': '33Gi'}},
                                     'replicas': 2},
                       'indexNode': {'resources': {'limits': {'cpu': '8.0',
                                                              'memory': '8Gi'},
                                                   'requests': {'cpu': '5.0',
                                                                'memory': '5Gi'}},
                                     'replicas': 4},
                       'dataNode': {'resources': {'limits': {'cpu': '2.0',
                                                             'memory': '8Gi'},
                                                  'requests': {'cpu': '2.0',
                                                               'memory': '5Gi'}}},
                       'cluster': {'enabled': True},
                       'pulsar': {},
                       'kafka': {},
                       'minio': {'metrics': {'podMonitor': {'enabled': True}}},
                       'etcd': {'metrics': {'enabled': True,
                                            'podMonitor': {'enabled': True}}},
                       'metrics': {'serviceMonitor': {'enabled': True}},
                       'log': {'level': 'debug'},
                       'image': {'all': {'repository': 'harbor.milvus.io/milvus/milvus',
                                         'tag': 'master-20240126-a847d56a-amd64'}}},
            'host': 'verify-issue-30276-2-milvus.qa-milvus.svc.cluster.local',
            'port': '19530',
            'uri': ''},
 'client': {'test_case_type': 'ConcurrentClientBase',
            'test_case_name': 'test_hybrid_search_locust_dql_dml_partition_cluster',
            'test_case_params': {'dataset_params': {'metric_type': 'L2',
                                                    'dim': 128,
                                                    'scalars_index': {'id': {},
                                                                      'int64_1': {'index_type': 'INVERTED'},
                                                                      'varchar_1': {'index_type': 'INVERTED'}},
                                                    'vectors_index': {'float_vector_1': {'index_type': 'HNSW',
                                                                                         'index_param': {'M': 8,
                                                                                                         'efConstruction': 200},
                                                                                         'metric_type': 'L2'},
                                                                      'float_vector_2': {'index_type': 'DISKANN',
                                                                                         'index_param': {},
                                                                                         'metric_type': 'IP'},
                                                                      'float_vector_3': {'index_type': 'IVF_SQ8',
                                                                                         'index_param': {'nlist': 2048},
                                                                                         'metric_type': 'L2'}},
                                                    'scalars_params': {'float_vector_1': {'params': {'dim': 128},
                                                                                          'other_params': {'dataset': 'sift',
                                                                                                           'dim': 128}},
                                                                       'float_vector_2': {'params': {'dim': 128},
                                                                                          'other_params': {'dataset': 'sift',
                                                                                                           'dim': 128}},
                                                                       'float_vector_3': {'params': {'dim': 128},
                                                                                          'other_params': {'dataset': 'sift',
                                                                                                           'dim': 128}}},
                                                    'extra_partitions': {'partitions': 10,
                                                                         'data_repeated': False},
                                                    'dataset_name': 'sift',
                                                    'dataset_size': 1000000,
                                                    'ni_per': 10000},
                                 'collection_params': {'other_fields': ['float_vector_1',
                                                                        'float_vector_2',
                                                                        'float_vector_3',
                                                                        'int64_1',
                                                                        'varchar_1'],
                                                       'shards_num': 2},
                                 'resource_groups_params': {'reset': False},
                                 'database_user_params': {'reset_rbac': False,
                                                          'reset_db': False},
                                 'index_params': {'index_type': 'IVF_FLAT',
                                                  'index_param': {'nlist': 1024}},
                                 'concurrent_params': {'concurrent_number': 20,
                                                       'during_time': '3h',
                                                       'interval': 20,
                                                       'spawn_rate': None},
                                 'concurrent_tasks': [{'type': 'scene_test_partition',
                                                       'weight': 1,
                                                       'params': {'data_size': 3000,
                                                                  'ni': 3000,
                                                                  'nq': 1,
                                                                  'search_param': {'nprobe': 64},
                                                                  'limit': 1,
                                                                  'expr': None,
                                                                  'output_fields': ['*'],
                                                                  'guarantee_timestamp': None,
                                                                  'timeout': 600}},
                                                      {'type': 'search',
                                                       'weight': 8,
                                                       'params': {'nq': 10000,
                                                                  'top_k': 1,
                                                                  'search_param': {'nprobe': 1000},
                                                                  'expr': 'int64_1 '
                                                                          '>= '
                                                                          '0',
                                                                  'guarantee_timestamp': None,
                                                                  'output_fields': None,
                                                                  'ignore_growing': False,
                                                                  'group_by_field': None,
                                                                  'timeout': 600,
                                                                  'random_data': True}},
                                                      {'type': 'hybrid_search',
                                                       'weight': 8,
                                                       'params': {'nq': 1,
                                                                  'top_k': 100,
                                                                  'reqs': [{'search_param': {'nprobe': 128},
                                                                            'anns_field': 'float_vector',
                                                                            'top_k': 100},
                                                                           {'search_param': {'ef': 64},
                                                                            'anns_field': 'float_vector_1',
                                                                            'top_k': 10},
                                                                           {'search_param': {'search_list': 32},
                                                                            'anns_field': 'float_vector_2',
                                                                            'top_k': 30},
                                                                           {'search_param': {'nprobe': 16},
                                                                            'anns_field': 'float_vector_3',
                                                                            'top_k': 400}],
                                                                  'rerank': {'WeightedRanker': [0.85,
                                                                                                0.95,
                                                                                                0.51,
                                                                                                0.32]},
                                                                  'output_fields': ['*'],
                                                                  'ignore_growing': False,
                                                                  'guarantee_timestamp': None,
                                                                  'timeout': 600,
                                                                  'random_data': True}},
                                                      {'type': 'query',
                                                       'weight': 1,
                                                       'params': {'ids': None,
                                                                  'expr': 'int64_1 '
                                                                          '> '
                                                                          '-1 '
                                                                          '&& ',
                                                                  'output_fields': ['*'],
                                                                  'offset': None,
                                                                  'limit': None,
                                                                  'ignore_growing': False,
                                                                  'timeout': 600,
                                                                  'random_data': True,
                                                                  'random_count': 20,
                                                                  'random_range': [0,
                                                                                   100000],
                                                                  'field_name': 'id',
                                                                  'field_type': 'int64'}}]},
            'run_id': 2024012653722500,
            'datetime': '2024-01-26 02:16:12.489873',
            'client_version': '2.2'},
 'result': {'test_result': {'index': {'RT': 126.8697,
                                      'float_vector_1': {'RT': 0.5233},
                                      'float_vector_2': {'RT': 10.6549},
                                      'float_vector_3': {'RT': 0.526},
                                      'id': {'RT': 0.5237},
                                      'int64_1': {'RT': 0.5491},
                                      'varchar_1': {'RT': 0.5164}},
                            'insert': {'total_time': 158.5121,
                                       'VPS': 6315.5462,
                                       'batch_time': 1.5851,
                                       'batch': 10000.0},
                            'flush': {'RT': 3.0741},
                            'load': {'RT': 8.1257},
                            'Locust': {'Aggregated': {'Requests': 484,
                                                      'Fails': 89,
                                                      'RPS': 0.04,
                                                      'fail_s': 0.18,
                                                      'RT_max': 6864444.33,
                                                      'RT_avg': 387593.75,
                                                      'TP50': 274000.0,
                                                      'TP99': 5615000.0},
                                       'hybrid_search': {'Requests': 225,
                                                         'Fails': 87,
                                                         'RPS': 0.02,
                                                         'fail_s': 0.39,
                                                         'RT_max': 546798.83,
                                                         'RT_avg': 183661.38,
                                                         'TP50': 172000.0,
                                                         'TP99': 522000.0},
                                       'query': {'Requests': 25,
                                                 'Fails': 0,
                                                 'RPS': 0.0,
                                                 'fail_s': 0.0,
                                                 'RT_max': 1216790.76,
                                                 'RT_avg': 454530.48,
                                                 'TP50': 373000.0,
                                                 'TP99': 1217000.0},
                                       'scene_test_partition': {'Requests': 15,
                                                                'Fails': 0,
                                                                'RPS': 0.0,
                                                                'fail_s': 0.0,
                                                                'RT_max': 6864444.33,
                                                                'RT_avg': 4663460.33,
                                                                'TP50': 4650000.0,
                                                                'TP99': 6864000.0},
                                       'search': {'Requests': 219,
                                                  'Fails': 2,
                                                  'RPS': 0.02,
                                                  'fail_s': 0.01,
                                                  'RT_max': 600764.75,
                                                  'RT_avg': 296604.56,
                                                  'TP50': 287000.0,
                                                  'TP99': 560000.0}}}}}

test steps:

        concurrent test and calculation of RT and QPS

        :purpose:  `DQL & DML(partition)`
            verify concurrent DQL & DML(partition) scenario,
            which has 4 vector fields(IVF_FLAT, HNSW, DISKANN, IVF_SQ8) and scalar fields: `int64_1`, `varchar_1`

        :test steps:
            1. create collection with fields:
                'float_vector': 128dim,
                'float_vector_1': 128dim,
                'float_vector_2': 128dim,
                'float_vector_3': 128dim,
                scalar field: int64_1, varchar_1
            2. build indexes:
                IVF_FLAT: 'float_vector'
                HNSW: 'float_vector_1',
                DISKANN: 'float_vector_2'
                IVF_SQ8: 'float_vector_3'
                INVERTED: 'int64_1', 'varchar_1'
                default scalar index: 'id'
            3. insert 1m data into 10 partitions
            4. flush collection
            5. build indexes again using the same params
            6. load collection
                replica: 1
            7. concurrent request:
                - scene_test_partition
                    (partition: create->insert->flush->index again->load->search->release->search error->drop)
                - search
                - hybrid_search
                - query

sre-ci-robot pushed a commit that referenced this issue Feb 1, 2024
issue: #29656
/kind bug

Signed-off-by: xige-16 <xi.ge@zilliz.com>

---------

Signed-off-by: xige-16 <xi.ge@zilliz.com>
@wangting0128
Copy link
Contributor Author

Recurrent

argo task:inverted-corn-1706976000
test case name:test_inverted_locust_hnsw_ivf_sq8_dml_dql_cluster
milvus tag: master-20240203-7ce876a0

server:

NAME                                                              READY   STATUS             RESTARTS          AGE     IP              NODE         NOMINATED NODE   READINESS GATES
inverted-corn-176000-7-26-7391-etcd-0                             1/1     Running            0                 8m5s    10.104.29.69    4am-node35   <none>           <none>
inverted-corn-176000-7-26-7391-etcd-1                             1/1     Running            0                 8m5s    10.104.25.243   4am-node30   <none>           <none>
inverted-corn-176000-7-26-7391-etcd-2                             1/1     Running            0                 8m5s    10.104.23.231   4am-node27   <none>           <none>
inverted-corn-176000-7-26-7391-milvus-datacoord-779cf56bb-x4d56   1/1     Running            0                 8m5s    10.104.4.175    4am-node11   <none>           <none>
inverted-corn-176000-7-26-7391-milvus-datanode-566bf9bbc7-2vqsp   1/1     Running            1 (3m35s ago)     8m5s    10.104.28.46    4am-node33   <none>           <none>
inverted-corn-176000-7-26-7391-milvus-indexcoord-77d5f45d4m47gw   1/1     Running            0                 8m5s    10.104.4.174    4am-node11   <none>           <none>
inverted-corn-176000-7-26-7391-milvus-indexnode-8698875cd662rzd   1/1     Running            0                 8m5s    10.104.12.151   4am-node17   <none>           <none>
inverted-corn-176000-7-26-7391-milvus-indexnode-8698875cd6fs8ls   1/1     Running            0                 8m5s    10.104.6.122    4am-node13   <none>           <none>
inverted-corn-176000-7-26-7391-milvus-proxy-6f94cb58bf-6qrsz      1/1     Running            1 (3m34s ago)     8m5s    10.104.24.223   4am-node29   <none>           <none>
inverted-corn-176000-7-26-7391-milvus-querycoord-66b5d8854x2p9n   1/1     Running            1 (3m35s ago)     8m5s    10.104.4.172    4am-node11   <none>           <none>
inverted-corn-176000-7-26-7391-milvus-querynode-6c7f875fcbcl7dj   1/1     Running            0                 8m5s    10.104.24.224   4am-node29   <none>           <none>
inverted-corn-176000-7-26-7391-milvus-rootcoord-75979c9dc4l4mcf   1/1     Running            1 (3m34s ago)     8m5s    10.104.12.150   4am-node17   <none>           <none>
inverted-corn-176000-7-26-7391-minio-0                            1/1     Running            0                 8m5s    10.104.25.244   4am-node30   <none>           <none>
inverted-corn-176000-7-26-7391-minio-1                            1/1     Running            0                 8m5s    10.104.29.68    4am-node35   <none>           <none>
inverted-corn-176000-7-26-7391-minio-2                            1/1     Running            0                 8m5s    10.104.33.27    4am-node36   <none>           <none>
inverted-corn-176000-7-26-7391-minio-3                            1/1     Running            0                 8m4s    10.104.23.230   4am-node27   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-bookie-0                    1/1     Running            0                 8m5s    10.104.25.245   4am-node30   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-bookie-1                    1/1     Running            0                 8m4s    10.104.23.233   4am-node27   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-bookie-2                    1/1     Running            0                 8m4s    10.104.19.177   4am-node28   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-bookie-init-2ljjh           0/1     Completed          0                 8m5s    10.104.1.223    4am-node10   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-broker-0                    1/1     Running            0                 8m5s    10.104.1.221    4am-node10   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-proxy-0                     1/1     Running            0                 8m5s    10.104.4.173    4am-node11   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-pulsar-init-j56fp           0/1     Completed          0                 8m5s    10.104.1.222    4am-node10   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-recovery-0                  1/1     Running            0                 8m5s    10.104.23.221   4am-node27   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-zookeeper-0                 1/1     Running            0                 8m5s    10.104.29.70    4am-node35   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-zookeeper-1                 1/1     Running            0                 7m20s   10.104.16.138   4am-node21   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-zookeeper-2                 1/1     Running            0                 6m9s    10.104.20.139   4am-node22   <none>           <none> (base.py:257)
[2024-02-03 19:46:51,339 -  INFO - fouram]: [Cmd Exe]  kubectl get pods  -n qa-milvus  -o wide | grep -E 'STATUS|inverted-corn-176000-7-26-7391-milvus|inverted-corn-176000-7-26-7391-minio|inverted-corn-176000-7-26-7391-etcd|inverted-corn-176000-7-26-7391-pulsar|inverted-corn-176000-7-26-7391-kafka|inverted-corn-176000-7-26-7391-log|inverted-corn-176000-7-26-7391-tikv'  (util_cmd.py:14)
[2024-02-03 19:47:01,190 -  INFO - fouram]: [CliClient] pod details of release(inverted-corn-176000-7-26-7391): 
 I0203 19:46:52.617342     511 request.go:665] Waited for 1.160595299s due to client-side throttling, not priority and fairness, request: GET:https://kubernetes.default.svc.cluster.local/api/v1?timeout=32s
NAME                                                              READY   STATUS             RESTARTS         AGE     IP              NODE         NOMINATED NODE   READINESS GATES
inverted-corn-176000-7-26-7391-etcd-0                             1/1     Running            0                3h44m   10.104.29.69    4am-node35   <none>           <none>
inverted-corn-176000-7-26-7391-etcd-1                             1/1     Running            0                3h44m   10.104.25.243   4am-node30   <none>           <none>
inverted-corn-176000-7-26-7391-etcd-2                             1/1     Running            0                3h44m   10.104.23.231   4am-node27   <none>           <none>
inverted-corn-176000-7-26-7391-milvus-datacoord-779cf56bb-x4d56   1/1     Running            0                3h44m   10.104.4.175    4am-node11   <none>           <none>
inverted-corn-176000-7-26-7391-milvus-datanode-566bf9bbc7-2vqsp   1/1     Running            1 (3h40m ago)    3h44m   10.104.28.46    4am-node33   <none>           <none>
inverted-corn-176000-7-26-7391-milvus-indexcoord-77d5f45d4m47gw   1/1     Running            0                3h44m   10.104.4.174    4am-node11   <none>           <none>
inverted-corn-176000-7-26-7391-milvus-indexnode-8698875cd662rzd   1/1     Running            0                3h44m   10.104.12.151   4am-node17   <none>           <none>
inverted-corn-176000-7-26-7391-milvus-indexnode-8698875cd6fs8ls   1/1     Running            0                3h44m   10.104.6.122    4am-node13   <none>           <none>
inverted-corn-176000-7-26-7391-milvus-proxy-6f94cb58bf-6qrsz      1/1     Running            1 (3h40m ago)    3h44m   10.104.24.223   4am-node29   <none>           <none>
inverted-corn-176000-7-26-7391-milvus-querycoord-66b5d8854x2p9n   1/1     Running            1 (3h40m ago)    3h44m   10.104.4.172    4am-node11   <none>           <none>
inverted-corn-176000-7-26-7391-milvus-querynode-6c7f875fcbcl7dj   1/1     Running            0                3h44m   10.104.24.224   4am-node29   <none>           <none>
inverted-corn-176000-7-26-7391-milvus-rootcoord-75979c9dc4l4mcf   1/1     Running            1 (3h40m ago)    3h44m   10.104.12.150   4am-node17   <none>           <none>
inverted-corn-176000-7-26-7391-minio-0                            1/1     Running            0                3h44m   10.104.25.244   4am-node30   <none>           <none>
inverted-corn-176000-7-26-7391-minio-1                            1/1     Running            0                3h44m   10.104.29.68    4am-node35   <none>           <none>
inverted-corn-176000-7-26-7391-minio-2                            1/1     Running            0                3h44m   10.104.33.27    4am-node36   <none>           <none>
inverted-corn-176000-7-26-7391-minio-3                            1/1     Running            0                3h44m   10.104.23.230   4am-node27   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-bookie-0                    1/1     Running            0                3h44m   10.104.25.245   4am-node30   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-bookie-1                    1/1     Running            0                3h44m   10.104.23.233   4am-node27   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-bookie-2                    1/1     Running            0                3h44m   10.104.19.177   4am-node28   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-bookie-init-2ljjh           0/1     Completed          0                3h44m   10.104.1.223    4am-node10   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-broker-0                    1/1     Running            0                3h44m   10.104.1.221    4am-node10   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-proxy-0                     1/1     Running            0                3h44m   10.104.4.173    4am-node11   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-pulsar-init-j56fp           0/1     Completed          0                3h44m   10.104.1.222    4am-node10   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-recovery-0                  1/1     Running            0                3h44m   10.104.23.221   4am-node27   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-zookeeper-0                 1/1     Running            0                3h44m   10.104.29.70    4am-node35   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-zookeeper-1                 1/1     Running            0                3h44m   10.104.16.138   4am-node21   <none>           <none>
inverted-corn-176000-7-26-7391-pulsar-zookeeper-2                 1/1     Running            0                3h43m   10.104.20.139   4am-node22   <none>           <none> 

client pod name: inverted-corn-1706976000-2027595845
client log:
截屏2024-02-04 10 38 17

截屏2024-02-04 10 38 59

test steps:

        concurrent test and calculation of RT and QPS

        :purpose:  `vector: memory index`
            verify concurrent DML & DQL scenario which has 2 float_vector fields & 16 scalar fields

        :test steps:
            1. create collection with fields:
                'float_vector': 128dim,
                'float_vector_1': 200dim,
                'int8_1', 'int16_1', 'int32_1', 'int64_1', 'double_1', 'float_1', 'varchar_1', 'bool_1',
                'int8_2', 'int16_2', 'int32_2', 'int64_2', 'double_2', 'float_2', 'varchar_2', 'bool_2'
            2. build indexes:
                HNSW: 'float_vector'
                IVF_SQ8: 'float_vector_1'
                scalar_default_index: 'int8_1', 'int16_1', 'int32_1', 'int64_1', 'double_1', 'float_1', 'varchar_1'
                scalar_INVERTED_index: 'int8_2', 'int16_2', 'int32_2', 'int64_2', 'double_2', 'float_2', 'varchar_2', 'bool_2'
            3. insert 5 million data
            4. flush collection
            5. build indexes again using the same params
            6. load collection
            7. concurrent request:
                - insert
                - delete
                - flush
                - load
                - search
                - hybrid_search
                - query

test result:

{'server': {'deploy_tool': 'helm',
            'deploy_mode': 'cluster',
            'config_name': 'cluster_8c16m',
            'config': {'queryNode': {'resources': {'limits': {'cpu': '16.0',
                                                              'memory': '64Gi'},
                                                   'requests': {'cpu': '9.0',
                                                                'memory': '33Gi'}},
                                     'replicas': 1},
                       'indexNode': {'resources': {'limits': {'cpu': '8.0',
                                                              'memory': '16Gi'},
                                                   'requests': {'cpu': '5.0',
                                                                'memory': '9Gi'}},
                                     'replicas': 2},
                       'dataNode': {'resources': {'limits': {'cpu': '8.0',
                                                             'memory': '16Gi'},
                                                  'requests': {'cpu': '5.0',
                                                               'memory': '9Gi'}}},
                       'cluster': {'enabled': True},
                       'pulsar': {},
                       'kafka': {},
                       'minio': {'metrics': {'podMonitor': {'enabled': True}}},
                       'etcd': {'metrics': {'enabled': True,
                                            'podMonitor': {'enabled': True}}},
                       'metrics': {'serviceMonitor': {'enabled': True}},
                       'log': {'level': 'debug'},
                       'image': {'all': {'repository': 'harbor.milvus.io/milvus/milvus',
                                         'tag': 'master-20240203-7ce876a0'}}},
            'host': 'inverted-corn-176000-7-26-7391-milvus.qa-milvus.svc.cluster.local',
            'port': '19530',
            'uri': ''},
 'client': {'test_case_type': 'ConcurrentClientBase',
            'test_case_name': 'test_inverted_locust_hnsw_ivf_sq8_dml_dql_cluster',
            'test_case_params': {'dataset_params': {'metric_type': 'L2',
                                                    'dim': 128,
                                                    'scalars_index': {'int8_1': {},
                                                                      'int16_1': {},
                                                                      'int32_1': {},
                                                                      'int64_1': {},
                                                                      'double_1': {},
                                                                      'float_1': {},
                                                                      'varchar_1': {},
                                                                      'int8_2': {'index_type': 'INVERTED'},
                                                                      'int16_2': {'index_type': 'INVERTED'},
                                                                      'int32_2': {'index_type': 'INVERTED'},
                                                                      'int64_2': {'index_type': 'INVERTED'},
                                                                      'double_2': {'index_type': 'INVERTED'},
                                                                      'float_2': {'index_type': 'INVERTED'},
                                                                      'varchar_2': {'index_type': 'INVERTED'},
                                                                      'bool_2': {'index_type': 'INVERTED'}},
                                                    'vectors_index': {'float_vector_1': {'index_type': 'IVF_SQ8',
                                                                                         'index_param': {'nlist': 1024},
                                                                                         'metric_type': 'L2'}},
                                                    'scalars_params': {'float_vector_1': {'params': {'dim': 200},
                                                                                          'other_params': {'dataset': 'text2img',
                                                                                                           'dim': 200}}},
                                                    'dataset_name': 'sift',
                                                    'dataset_size': 5000000,
                                                    'ni_per': 5000},
                                 'collection_params': {'other_fields': ['float_vector_1',
                                                                        'int8_1',
                                                                        'int16_1',
                                                                        'int32_1',
                                                                        'int64_1',
                                                                        'double_1',
                                                                        'float_1',
                                                                        'varchar_1',
                                                                        'bool_1',
                                                                        'int8_2',
                                                                        'int16_2',
                                                                        'int32_2',
                                                                        'int64_2',
                                                                        'double_2',
                                                                        'float_2',
                                                                        'varchar_2',
                                                                        'bool_2'],
                                                       'shards_num': 2},
                                 'resource_groups_params': {'reset': False},
                                 'database_user_params': {'reset_rbac': False,
                                                          'reset_db': False},
                                 'index_params': {'index_type': 'HNSW',
                                                  'index_param': {'M': 8,
                                                                  'efConstruction': 200}},
                                 'concurrent_params': {'concurrent_number': 20,
                                                       'during_time': '3h',
                                                       'interval': 20,
                                                       'spawn_rate': None},
                                 'concurrent_tasks': [{'type': 'insert',
                                                       'weight': 1,
                                                       'params': {'nb': 10,
                                                                  'timeout': 30,
                                                                  'random_id': True,
                                                                  'random_vector': True,
                                                                  'varchar_filled': False,
                                                                  'start_id': 5000000}},
                                                      {'type': 'delete',
                                                       'weight': 1,
                                                       'params': {'expr': '',
                                                                  'delete_length': 9,
                                                                  'timeout': 30}},
                                                      {'type': 'flush',
                                                       'weight': 1,
                                                       'params': {'timeout': 600}},
                                                      {'type': 'load',
                                                       'weight': 1,
                                                       'params': {'replica_number': 1,
                                                                  'timeout': 30}},
                                                      {'type': 'search',
                                                       'weight': 1,
                                                       'params': {'nq': 1000,
                                                                  'top_k': 1,
                                                                  'search_param': {'ef': 64},
                                                                  'expr': 'int64_1 '
                                                                          '> '
                                                                          '-1 '
                                                                          '&& '
                                                                          'id '
                                                                          '> '
                                                                          '-1',
                                                                  'guarantee_timestamp': None,
                                                                  'partition_names': None,
                                                                  'output_fields': ['*'],
                                                                  'ignore_growing': False,
                                                                  'group_by_field': None,
                                                                  'timeout': 180,
                                                                  'random_data': True}},
                                                      {'type': 'hybrid_search',
                                                       'weight': 1,
                                                       'params': {'nq': 1,
                                                                  'top_k': 10,
                                                                  'reqs': [{'search_param': {'nprobe': 16},
                                                                            'anns_field': 'float_vector_1',
                                                                            'expr': 'varchar_1 '
                                                                                    'like '
                                                                                    '"0%" '
                                                                                    '&& '
                                                                                    'bool_2 '
                                                                                    '== '
                                                                                    'True',
                                                                            'top_k': 2000},
                                                                           {'search_param': {'ef': 128},
                                                                            'anns_field': 'float_vector',
                                                                            'expr': 'int64_1 '
                                                                                    '< '
                                                                                    '100000 '
                                                                                    '&& '
                                                                                    'float_2 '
                                                                                    '> '
                                                                                    '10.0'}],
                                                                  'rerank': {'WeightedRanker': [0.5,
                                                                                                0.5]},
                                                                  'output_fields': ['*'],
                                                                  'ignore_growing': False,
                                                                  'guarantee_timestamp': None,
                                                                  'partition_names': None,
                                                                  'timeout': 60,
                                                                  'random_data': True}},
                                                      {'type': 'query',
                                                       'weight': 1,
                                                       'params': {'ids': None,
                                                                  'expr': 'int64_1 '
                                                                          '> '
                                                                          '-1 '
                                                                          '&&  '
                                                                          'int64_2 '
                                                                          '> '
                                                                          '-1 '
                                                                          '&& ',
                                                                  'output_fields': ['*'],
                                                                  'offset': None,
                                                                  'limit': None,
                                                                  'ignore_growing': False,
                                                                  'partition_names': None,
                                                                  'timeout': 180,
                                                                  'random_data': True,
                                                                  'random_count': 20,
                                                                  'random_range': [2500000.0,
                                                                                   5000000],
                                                                  'field_name': 'id',
                                                                  'field_type': 'int64'}}]},
            'run_id': 2024020361218732,
            'datetime': '2024-02-03 16:02:01.217562',
            'client_version': '2.4.0'},
 'result': {'test_result': {'index': {'RT': 730.1371,
                                      'float_vector_1': {'RT': 442.7606},
                                      'int8_1': {'RT': 185.0674},
                                      'int16_1': {'RT': 5.0524},
                                      'int32_1': {'RT': 41.8555},
                                      'int64_1': {'RT': 1.0221},
                                      'double_1': {'RT': 0.5196},
                                      'float_1': {'RT': 0.5153},
                                      'varchar_1': {'RT': 0.5252},
                                      'int8_2': {'RT': 0.5165},
                                      'int16_2': {'RT': 0.5238},
                                      'int32_2': {'RT': 0.5174},
                                      'int64_2': {'RT': 0.5156},
                                      'double_2': {'RT': 0.6054},
                                      'float_2': {'RT': 0.5234},
                                      'varchar_2': {'RT': 0.516},
                                      'bool_2': {'RT': 0.52}},
                            'insert': {'total_time': 461.2794,
                                       'VPS': 10839.4175,
                                       'batch_time': 0.4613,
                                       'batch': 5000},
                            'flush': {'RT': 3.0305},
                            'load': {'RT': 16.1175},
                            'Locust': {'Aggregated': {'Requests': 58670,
                                                      'Fails': 920,
                                                      'RPS': 5.43,
                                                      'fail_s': 0.02,
                                                      'RT_max': 57018.29,
                                                      'RT_avg': 3643.46,
                                                      'TP50': 1400.0,
                                                      'TP99': 30000.0},
                                       'delete': {'Requests': 8200,
                                                  'Fails': 0,
                                                  'RPS': 0.76,
                                                  'fail_s': 0.0,
                                                  'RT_max': 13360.85,
                                                  'RT_avg': 462.72,
                                                  'TP50': 65,
                                                  'TP99': 6800.0},
                                       'flush': {'Requests': 8288,
                                                 'Fails': 0,
                                                 'RPS': 0.77,
                                                 'fail_s': 0.0,
                                                 'RT_max': 57018.29,
                                                 'RT_avg': 8563.04,
                                                 'TP50': 7100.0,
                                                 'TP99': 36000.0},
                                       'hybrid_search': {'Requests': 8493,
                                                         'Fails': 0,
                                                         'RPS': 0.79,
                                                         'fail_s': 0.0,
                                                         'RT_max': 17352.75,
                                                         'RT_avg': 2855.89,
                                                         'TP50': 1900.0,
                                                         'TP99': 11000.0},
                                       'insert': {'Requests': 8412,
                                                  'Fails': 0,
                                                  'RPS': 0.78,
                                                  'fail_s': 0.0,
                                                  'RT_max': 14171.44,
                                                  'RT_avg': 745.84,
                                                  'TP50': 340.0,
                                                  'TP99': 7800.0},
                                       'load': {'Requests': 8535,
                                                'Fails': 918,
                                                'RPS': 0.79,
                                                'fail_s': 0.11,
                                                'RT_max': 36891.34,
                                                'RT_avg': 5928.53,
                                                'TP50': 780.0,
                                                'TP99': 30000.0},
                                       'query': {'Requests': 8459,
                                                 'Fails': 0,
                                                 'RPS': 0.78,
                                                 'fail_s': 0.0,
                                                 'RT_max': 19431.08,
                                                 'RT_avg': 2737.01,
                                                 'TP50': 1700.0,
                                                 'TP99': 13000.0},
                                       'search': {'Requests': 8283,
                                                  'Fails': 2,
                                                  'RPS': 0.77,
                                                  'fail_s': 0.0,
                                                  'RT_max': 19359.4,
                                                  'RT_avg': 4191.2,
                                                  'TP50': 3100.0,
                                                  'TP99': 14000.0}}}}}

@czs007

@zhagnlu
Copy link
Contributor

zhagnlu commented Feb 7, 2024

image
diff search and requery segments. requery lack 10 segments.
and this ten segment are compacted to a single segment, and target segment has no record, show as figure:
image
so, lack some search record although requery have same mvcc timestamp

@zhagnlu
Copy link
Contributor

zhagnlu commented Feb 7, 2024

#30576

Copy link

stale bot commented Mar 10, 2024

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Rotten issues close after 30d of inactivity. Reopen the issue with /reopen.

@stale stale bot added the stale indicates no udpates for 30 days label Mar 10, 2024
sre-ci-robot pushed a commit that referenced this issue Mar 18, 2024
…results (#31343)

During requery, segments may change (e.g., due to compaction), so we
need to return specific error codes when encountering incomplete requery
results. Clients can then retry to avoid this issue.

issue: #29656

Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
sre-ci-robot pushed a commit to milvus-io/milvus-sdk-go that referenced this issue Mar 18, 2024
…ry (#675)

1. Support retry on milvus errors.
2. Retry on ErrInconsistentRequery for search and hybrid search
operations.

issue: milvus-io/milvus#29656

related milvus pr: milvus-io/milvus#31343

---------

Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
sre-ci-robot pushed a commit that referenced this issue Mar 18, 2024
…results (#31343) (#31359)

During requery, segments may change (e.g., due to compaction), so we
need to return specific error codes when encountering incomplete requery
results. Clients can then retry to avoid this issue.

issue: #29656

pr: #31343

Signed-off-by: bigsheeper <yihao.dai@zilliz.com>
@bigsheeper
Copy link
Contributor

/assign @wangting0128
please help to make a verify

@wangting0128
Copy link
Contributor Author

occurred again

argo task:inverted-corn-vjvws
test case name:test_inverted_locust_hnsw_ivf_sq8_dml_dql_cluster
image:2.4-20240319-0dbc0f2c-amd64

server:

NAME                                                              READY   STATUS            RESTARTS        AGE     IP              NODE         NOMINATED NODE   READINESS GATES
inverted-corn-vjvws-7-68-5635-etcd-0                              1/1     Running           0               6m42s   10.104.31.51    4am-node34   <none>           <none>
inverted-corn-vjvws-7-68-5635-etcd-1                              1/1     Running           0               6m42s   10.104.18.143   4am-node25   <none>           <none>
inverted-corn-vjvws-7-68-5635-etcd-2                              1/1     Running           0               6m42s   10.104.27.211   4am-node31   <none>           <none>
inverted-corn-vjvws-7-68-5635-milvus-datacoord-59b89f769f-2p8fn   1/1     Running           0               6m42s   10.104.5.239    4am-node12   <none>           <none>
inverted-corn-vjvws-7-68-5635-milvus-datanode-7fcc7b66f4-d57cv    1/1     Running           0               6m42s   10.104.13.128   4am-node16   <none>           <none>
inverted-corn-vjvws-7-68-5635-milvus-indexcoord-6768548-nk89x     1/1     Running           0               6m42s   10.104.13.127   4am-node16   <none>           <none>
inverted-corn-vjvws-7-68-5635-milvus-indexnode-754dd7b5cc-ddfwz   1/1     Running           0               6m42s   10.104.14.199   4am-node18   <none>           <none>
inverted-corn-vjvws-7-68-5635-milvus-indexnode-754dd7b5cc-sp7xc   1/1     Running           0               6m42s   10.104.5.242    4am-node12   <none>           <none>
inverted-corn-vjvws-7-68-5635-milvus-proxy-648ccfd48-wjz7t        1/1     Running           1 (2m11s ago)   6m42s   10.104.5.240    4am-node12   <none>           <none>
inverted-corn-vjvws-7-68-5635-milvus-querycoord-5774d75b47nnncg   1/1     Running           1 (2m12s ago)   6m42s   10.104.5.238    4am-node12   <none>           <none>
inverted-corn-vjvws-7-68-5635-milvus-querynode-7b7ddbd46c-6srhx   1/1     Running           0               6m42s   10.104.1.90     4am-node10   <none>           <none>
inverted-corn-vjvws-7-68-5635-milvus-rootcoord-7978cdd979-hhx8q   1/1     Running           0               6m42s   10.104.1.89     4am-node10   <none>           <none>
inverted-corn-vjvws-7-68-5635-minio-0                             1/1     Running           0               6m42s   10.104.18.142   4am-node25   <none>           <none>
inverted-corn-vjvws-7-68-5635-minio-1                             1/1     Running           0               6m42s   10.104.24.15    4am-node29   <none>           <none>
inverted-corn-vjvws-7-68-5635-minio-2                             1/1     Running           0               6m42s   10.104.27.210   4am-node31   <none>           <none>
inverted-corn-vjvws-7-68-5635-minio-3                             1/1     Running           0               6m41s   10.104.20.139   4am-node22   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-bookie-0                     1/1     Running           0               6m42s   10.104.24.16    4am-node29   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-bookie-1                     1/1     Running           0               6m42s   10.104.20.138   4am-node22   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-bookie-2                     1/1     Running           0               6m41s   10.104.25.97    4am-node30   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-bookie-init-4btfg            0/1     Completed         0               6m42s   10.104.20.130   4am-node22   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-broker-0                     1/1     Running           0               6m42s   10.104.31.46    4am-node34   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-proxy-0                      1/1     Running           0               6m42s   10.104.4.210    4am-node11   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-pulsar-init-xxbcf            0/1     Completed         0               6m42s   10.104.20.131   4am-node22   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-recovery-0                   1/1     Running           0               6m42s   10.104.20.132   4am-node22   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-zookeeper-0                  1/1     Running           0               6m42s   10.104.31.50    4am-node34   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-zookeeper-1                  1/1     Running           0               6m1s    10.104.24.24    4am-node29   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-zookeeper-2                  1/1     Running           0               5m7s    10.104.34.153   4am-node37   <none>           <none> (base.py:257)
[2024-03-19 06:20:06,297 -  INFO - fouram]: [Cmd Exe]  kubectl get pods  -n qa-milvus  -o wide | grep -E 'NAME|inverted-corn-vjvws-7-68-5635-milvus|inverted-corn-vjvws-7-68-5635-minio|inverted-corn-vjvws-7-68-5635-etcd|inverted-corn-vjvws-7-68-5635-pulsar|inverted-corn-vjvws-7-68-5635-zookeeper|inverted-corn-vjvws-7-68-5635-kafka|inverted-corn-vjvws-7-68-5635-log|inverted-corn-vjvws-7-68-5635-tikv'  (util_cmd.py:14)
[2024-03-19 06:20:16,183 -  INFO - fouram]: [CliClient] pod details of release(inverted-corn-vjvws-7-68-5635): 
 I0319 06:20:07.564534     516 request.go:665] Waited for 1.17218291s due to client-side throttling, not priority and fairness, request: GET:https://kubernetes.default.svc.cluster.local/apis/authorization.k8s.io/v1?timeout=32s
NAME                                                              READY   STATUS             RESTARTS        AGE     IP              NODE         NOMINATED NODE   READINESS GATES
inverted-corn-vjvws-7-68-5635-etcd-0                              1/1     Running            0               3h45m   10.104.31.51    4am-node34   <none>           <none>
inverted-corn-vjvws-7-68-5635-etcd-1                              1/1     Running            0               3h45m   10.104.18.143   4am-node25   <none>           <none>
inverted-corn-vjvws-7-68-5635-etcd-2                              1/1     Running            0               3h45m   10.104.27.211   4am-node31   <none>           <none>
inverted-corn-vjvws-7-68-5635-milvus-datacoord-59b89f769f-2p8fn   1/1     Running            0               3h45m   10.104.5.239    4am-node12   <none>           <none>
inverted-corn-vjvws-7-68-5635-milvus-datanode-7fcc7b66f4-d57cv    1/1     Running            0               3h45m   10.104.13.128   4am-node16   <none>           <none>
inverted-corn-vjvws-7-68-5635-milvus-indexcoord-6768548-nk89x     1/1     Running            0               3h45m   10.104.13.127   4am-node16   <none>           <none>
inverted-corn-vjvws-7-68-5635-milvus-indexnode-754dd7b5cc-ddfwz   1/1     Running            1 (3h38m ago)   3h45m   10.104.14.199   4am-node18   <none>           <none>
inverted-corn-vjvws-7-68-5635-milvus-indexnode-754dd7b5cc-sp7xc   1/1     Running            0               3h45m   10.104.5.242    4am-node12   <none>           <none>
inverted-corn-vjvws-7-68-5635-milvus-proxy-648ccfd48-wjz7t        1/1     Running            1 (3h41m ago)   3h45m   10.104.5.240    4am-node12   <none>           <none>
inverted-corn-vjvws-7-68-5635-milvus-querycoord-5774d75b47nnncg   1/1     Running            1 (3h41m ago)   3h45m   10.104.5.238    4am-node12   <none>           <none>
inverted-corn-vjvws-7-68-5635-milvus-querynode-7b7ddbd46c-6srhx   1/1     Running            0               3h45m   10.104.1.90     4am-node10   <none>           <none>
inverted-corn-vjvws-7-68-5635-milvus-rootcoord-7978cdd979-hhx8q   1/1     Running            0               3h45m   10.104.1.89     4am-node10   <none>           <none>
inverted-corn-vjvws-7-68-5635-minio-0                             1/1     Running            0               3h45m   10.104.18.142   4am-node25   <none>           <none>
inverted-corn-vjvws-7-68-5635-minio-1                             1/1     Running            0               3h45m   10.104.24.15    4am-node29   <none>           <none>
inverted-corn-vjvws-7-68-5635-minio-2                             1/1     Running            0               3h45m   10.104.27.210   4am-node31   <none>           <none>
inverted-corn-vjvws-7-68-5635-minio-3                             1/1     Running            0               3h45m   10.104.20.139   4am-node22   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-bookie-0                     1/1     Running            0               3h45m   10.104.24.16    4am-node29   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-bookie-1                     1/1     Running            0               3h45m   10.104.20.138   4am-node22   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-bookie-2                     1/1     Running            0               3h45m   10.104.25.97    4am-node30   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-bookie-init-4btfg            0/1     Completed          0               3h45m   10.104.20.130   4am-node22   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-broker-0                     1/1     Running            0               3h45m   10.104.31.46    4am-node34   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-proxy-0                      1/1     Running            0               3h45m   10.104.4.210    4am-node11   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-pulsar-init-xxbcf            0/1     Completed          0               3h45m   10.104.20.131   4am-node22   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-recovery-0                   1/1     Running            0               3h45m   10.104.20.132   4am-node22   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-zookeeper-0                  1/1     Running            0               3h45m   10.104.31.50    4am-node34   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-zookeeper-1                  1/1     Running            0               3h45m   10.104.24.24    4am-node29   <none>           <none>
inverted-corn-vjvws-7-68-5635-pulsar-zookeeper-2                  1/1     Running            0               3h44m   10.104.34.153   4am-node37   <none>           <none>

client pod name: inverted-corn-vjvws-3305228201
client log:
client.log

截屏2024-03-19 14 58 10

test steps:

        concurrent test and calculation of RT and QPS

        :purpose:  `vector: memory index`
            verify concurrent DML & DQL scenario which has 2 float_vector fields & 16 scalar fields

        :test steps:
            1. create collection with fields:
                'float_vector': 128dim,
                'float_vector_1': 200dim,
                'int8_1', 'int16_1', 'int32_1', 'int64_1', 'double_1', 'float_1', 'varchar_1', 'bool_1',
                'int8_2', 'int16_2', 'int32_2', 'int64_2', 'double_2', 'float_2', 'varchar_2', 'bool_2'
            2. build indexes:
                HNSW: 'float_vector'
                IVF_SQ8: 'float_vector_1'
                scalar_default_index: 'int8_1', 'int16_1', 'int32_1', 'int64_1', 'double_1', 'float_1', 'varchar_1'
                scalar_INVERTED_index: 'int8_2', 'int16_2', 'int32_2', 'int64_2', 'double_2', 'float_2', 'varchar_2', 'bool_2'
            3. insert 5 million data
            4. flush collection
            5. build indexes again using the same params
            6. load collection
            7. concurrent request:
                - insert
                - delete
                - flush
                - load
                - search
                - hybrid_search
                - query

test result:

[2024-03-19 06:19:26,924 -  INFO - fouram]: Print locust final stats. (locust_runner.py:56)
[2024-03-19 06:19:26,925 -  INFO - fouram]: Type     Name                                                                          # reqs      # fails |    Avg     Min     Max    Med |   req/s  failures/s (stats.py:789)
[2024-03-19 06:19:26,925 -  INFO - fouram]: --------|----------------------------------------------------------------------------|-------|-------------|-------|-------|-------|-------|--------|----------- (stats.py:789)
[2024-03-19 06:19:26,925 -  INFO - fouram]: grpc     delete                                                                          7440     0(0.00%) |    861       4   20963    190 |    0.69        0.00 (stats.py:789)
[2024-03-19 06:19:26,925 -  INFO - fouram]: grpc     flush                                                                           7524     0(0.00%) |   9032     286   54936   7700 |    0.70        0.00 (stats.py:789)
[2024-03-19 06:19:26,925 -  INFO - fouram]: grpc     hybrid_search                                                                   7469     0(0.00%) |   4733     201   25410   4000 |    0.69        0.00 (stats.py:789)
[2024-03-19 06:19:26,925 -  INFO - fouram]: grpc     insert                                                                          7397     0(0.00%) |   1304      37   22339    370 |    0.68        0.00 (stats.py:789)
[2024-03-19 06:19:26,925 -  INFO - fouram]: grpc     load                                                                            7473     0(0.00%) |   1662       7   28010    400 |    0.69        0.00 (stats.py:789)
[2024-03-19 06:19:26,925 -  INFO - fouram]: grpc     query                                                                           7260     0(0.00%) |   4616     181   25707   3600 |    0.67        0.00 (stats.py:789)
[2024-03-19 06:19:26,925 -  INFO - fouram]: grpc     search                                                                          7294    53(0.73%) |   6726     317   25273   5900 |    0.68        0.00 (stats.py:789)
[2024-03-19 06:19:26,925 -  INFO - fouram]: --------|----------------------------------------------------------------------------|-------|-------------|-------|-------|-------|-------|--------|----------- (stats.py:789)
[2024-03-19 06:19:26,925 -  INFO - fouram]:          Aggregated                                                                     51857    53(0.10%) |   4134       4   54936   2800 |    4.80        0.00 (stats.py:789)
[2024-03-19 06:19:26,926 -  INFO - fouram]:  (stats.py:790)
[2024-03-19 06:19:26,930 -  INFO - fouram]: [PerfTemplate] Report data: 
{'server': {'deploy_tool': 'helm',
            'deploy_mode': 'cluster',
            'config_name': 'cluster_8c16m',
            'config': {'queryNode': {'resources': {'limits': {'cpu': '16.0',
                                                              'memory': '64Gi'},
                                                   'requests': {'cpu': '9.0',
                                                                'memory': '33Gi'}},
                                     'replicas': 1},
                       'indexNode': {'resources': {'limits': {'cpu': '8.0',
                                                              'memory': '16Gi'},
                                                   'requests': {'cpu': '5.0',
                                                                'memory': '9Gi'}},
                                     'replicas': 2},
                       'dataNode': {'resources': {'limits': {'cpu': '8.0',
                                                             'memory': '16Gi'},
                                                  'requests': {'cpu': '5.0',
                                                               'memory': '9Gi'}}},
                       'cluster': {'enabled': True},
                       'pulsar': {},
                       'kafka': {},
                       'minio': {'metrics': {'podMonitor': {'enabled': True}}},
                       'etcd': {'metrics': {'enabled': True,
                                            'podMonitor': {'enabled': True}}},
                       'metrics': {'serviceMonitor': {'enabled': True}},
                       'log': {'level': 'debug'},
                       'image': {'all': {'repository': 'harbor.milvus.io/milvus/milvus',
                                         'tag': '2.4-20240319-0dbc0f2c-amd64'}}},
            'host': 'inverted-corn-vjvws-7-68-5635-milvus.qa-milvus.svc.cluster.local',
            'port': '19530',
            'uri': ''},
 'client': {'test_case_type': 'ConcurrentClientBase',
            'test_case_name': 'test_inverted_locust_hnsw_ivf_sq8_dml_dql_cluster',
            'test_case_params': {'dataset_params': {'metric_type': 'L2',
                                                    'dim': 128,
                                                    'scalars_index': {'int8_1': {},
                                                                      'int16_1': {},
                                                                      'int32_1': {},
                                                                      'int64_1': {},
                                                                      'double_1': {},
                                                                      'float_1': {},
                                                                      'varchar_1': {},
                                                                      'int8_2': {'index_type': 'INVERTED'},
                                                                      'int16_2': {'index_type': 'INVERTED'},
                                                                      'int32_2': {'index_type': 'INVERTED'},
                                                                      'int64_2': {'index_type': 'INVERTED'},
                                                                      'double_2': {'index_type': 'INVERTED'},
                                                                      'float_2': {'index_type': 'INVERTED'},
                                                                      'varchar_2': {'index_type': 'INVERTED'},
                                                                      'bool_2': {'index_type': 'INVERTED'}},
                                                    'vectors_index': {'float_vector_1': {'index_type': 'IVF_SQ8',
                                                                                         'index_param': {'nlist': 1024},
                                                                                         'metric_type': 'L2'}},
                                                    'scalars_params': {'float_vector_1': {'params': {'dim': 200},
                                                                                          'other_params': {'dataset': 'text2img',
                                                                                                           'dim': 200}}},
                                                    'dataset_name': 'sift',
                                                    'dataset_size': 5000000,
                                                    'ni_per': 5000},
                                 'collection_params': {'other_fields': ['float_vector_1',
                                                                        'int8_1',
                                                                        'int16_1',
                                                                        'int32_1',
                                                                        'int64_1',
                                                                        'double_1',
                                                                        'float_1',
                                                                        'varchar_1',
                                                                        'bool_1',
                                                                        'int8_2',
                                                                        'int16_2',
                                                                        'int32_2',
                                                                        'int64_2',
                                                                        'double_2',
                                                                        'float_2',
                                                                        'varchar_2',
                                                                        'bool_2'],
                                                       'shards_num': 2},
                                 'resource_groups_params': {'reset': False},
                                 'database_user_params': {'reset_rbac': False,
                                                          'reset_db': False},
                                 'index_params': {'index_type': 'HNSW',
                                                  'index_param': {'M': 8,
                                                                  'efConstruction': 200}},
                                 'concurrent_params': {'concurrent_number': 20,
                                                       'during_time': '3h',
                                                       'interval': 20,
                                                       'spawn_rate': None},
                                 'concurrent_tasks': [{'type': 'insert',
                                                       'weight': 1,
                                                       'params': {'nb': 10,
                                                                  'timeout': 30,
                                                                  'random_id': True,
                                                                  'random_vector': True,
                                                                  'varchar_filled': False,
                                                                  'start_id': 5000000}},
                                                      {'type': 'delete',
                                                       'weight': 1,
                                                       'params': {'expr': '',
                                                                  'delete_length': 9,
                                                                  'timeout': 30}},
                                                      {'type': 'flush',
                                                       'weight': 1,
                                                       'params': {'timeout': 600}},
                                                      {'type': 'load',
                                                       'weight': 1,
                                                       'params': {'replica_number': 1,
                                                                  'timeout': 180}},
                                                      {'type': 'search',
                                                       'weight': 1,
                                                       'params': {'nq': 1000,
                                                                  'top_k': 1,
                                                                  'search_param': {'ef': 64},
                                                                  'expr': 'int64_1 '
                                                                          '> '
                                                                          '-1 '
                                                                          '&& '
                                                                          'id '
                                                                          '> '
                                                                          '-1',
                                                                  'guarantee_timestamp': None,
                                                                  'partition_names': None,
                                                                  'output_fields': ['*'],
                                                                  'ignore_growing': False,
                                                                  'group_by_field': None,
                                                                  'timeout': 180,
                                                                  'random_data': True}},
                                                      {'type': 'hybrid_search',
                                                       'weight': 1,
                                                       'params': {'nq': 1,
                                                                  'top_k': 10,
                                                                  'reqs': [{'search_param': {'nprobe': 16},
                                                                            'anns_field': 'float_vector_1',
                                                                            'expr': 'varchar_1 '
                                                                                    'like '
                                                                                    '"0%" '
                                                                                    '&& '
                                                                                    'bool_2 '
                                                                                    '== '
                                                                                    'True',
                                                                            'top_k': 2000},
                                                                           {'search_param': {'ef': 128},
                                                                            'anns_field': 'float_vector',
                                                                            'expr': 'int64_1 '
                                                                                    '< '
                                                                                    '100000 '
                                                                                    '&& '
                                                                                    'float_2 '
                                                                                    '> '
                                                                                    '10.0'}],
                                                                  'rerank': {'WeightedRanker': [0.5,
                                                                                                0.5]},
                                                                  'output_fields': ['*'],
                                                                  'ignore_growing': False,
                                                                  'guarantee_timestamp': None,
                                                                  'partition_names': None,
                                                                  'timeout': 60,
                                                                  'random_data': True}},
                                                      {'type': 'query',
                                                       'weight': 1,
                                                       'params': {'ids': None,
                                                                  'expr': 'int64_1 '
                                                                          '> '
                                                                          '-1 '
                                                                          '&&  '
                                                                          'int64_2 '
                                                                          '> '
                                                                          '-1 '
                                                                          '&& ',
                                                                  'output_fields': ['*'],
                                                                  'offset': None,
                                                                  'limit': None,
                                                                  'ignore_growing': False,
                                                                  'partition_names': None,
                                                                  'timeout': 180,
                                                                  'random_data': True,
                                                                  'random_count': 20,
                                                                  'random_range': [2500000.0,
                                                                                   5000000],
                                                                  'field_name': 'id',
                                                                  'field_type': 'int64'}}]},
            'run_id': 2024031956542136,
            'datetime': '2024-03-19 02:34:14.948248',
            'client_version': '2.4.0'},
 'result': {'test_result': {'index': {'RT': 712.5351,
                                      'float_vector_1': {'RT': 446.2326},
                                      'int8_1': {'RT': 249.0921},
                                      'int16_1': {'RT': 107.561},
                                      'int32_1': {'RT': 34.1335},
                                      'int64_1': {'RT': 0.5643},
                                      'double_1': {'RT': 0.5215},
                                      'float_1': {'RT': 5.1502},
                                      'varchar_1': {'RT': 1.0534},
                                      'int8_2': {'RT': 0.5715},
                                      'int16_2': {'RT': 1.0273},
                                      'int32_2': {'RT': 0.5708},
                                      'int64_2': {'RT': 0.5262},
                                      'double_2': {'RT': 2.0815},
                                      'float_2': {'RT': 0.5207},
                                      'varchar_2': {'RT': 0.5636},
                                      'bool_2': {'RT': 0.5186}},
                            'insert': {'total_time': 468.7792,
                                       'VPS': 10666.0022,
                                       'batch_time': 0.4688,
                                       'batch': 5000},
                            'flush': {'RT': 3.5322},
                            'load': {'RT': 22.6683},
                            'Locust': {'Aggregated': {'Requests': 51857,
                                                      'Fails': 53,
                                                      'RPS': 4.8,
                                                      'fail_s': 0.0,
                                                      'RT_max': 54936.41,
                                                      'RT_avg': 4134.01,
                                                      'TP50': 2800.0,
                                                      'TP99': 20000.0},
                                       'delete': {'Requests': 7440,
                                                  'Fails': 0,
                                                  'RPS': 0.69,
                                                  'fail_s': 0.0,
                                                  'RT_max': 20963.27,
                                                  'RT_avg': 861.11,
                                                  'TP50': 190.0,
                                                  'TP99': 9400.0},
                                       'flush': {'Requests': 7524,
                                                 'Fails': 0,
                                                 'RPS': 0.7,
                                                 'fail_s': 0.0,
                                                 'RT_max': 54936.41,
                                                 'RT_avg': 9032.73,
                                                 'TP50': 7700.0,
                                                 'TP99': 30000.0},
                                       'hybrid_search': {'Requests': 7469,
                                                         'Fails': 0,
                                                         'RPS': 0.69,
                                                         'fail_s': 0.0,
                                                         'RT_max': 25410.87,
                                                         'RT_avg': 4733.72,
                                                         'TP50': 4000.0,
                                                         'TP99': 16000.0},
                                       'insert': {'Requests': 7397,
                                                  'Fails': 0,
                                                  'RPS': 0.68,
                                                  'fail_s': 0.0,
                                                  'RT_max': 22339.22,
                                                  'RT_avg': 1304.75,
                                                  'TP50': 370.0,
                                                  'TP99': 12000.0},
                                       'load': {'Requests': 7473,
                                                'Fails': 0,
                                                'RPS': 0.69,
                                                'fail_s': 0.0,
                                                'RT_max': 28010.94,
                                                'RT_avg': 1662.48,
                                                'TP50': 400.0,
                                                'TP99': 13000.0},
                                       'query': {'Requests': 7260,
                                                 'Fails': 0,
                                                 'RPS': 0.67,
                                                 'fail_s': 0.0,
                                                 'RT_max': 25707.62,
                                                 'RT_avg': 4616.39,
                                                 'TP50': 3600.0,
                                                 'TP99': 18000.0},
                                       'search': {'Requests': 7294,
                                                  'Fails': 53,
                                                  'RPS': 0.68,
                                                  'fail_s': 0.01,
                                                  'RT_max': 25273.71,
                                                  'RT_avg': 6726.39,
                                                  'TP50': 5900.0,
                                                  'TP99': 20000.0}}}}} 

@bigsheeper

@wangting0128
Copy link
Contributor Author

occurred again

argo task:inverted-corn-vjvws
test case name:test_inverted_locust_hnsw_diskann_dml_dql_cluster
image:2.4-20240319-0dbc0f2c-amd64

server:

NAME                                                              READY   STATUS             RESTARTS        AGE     IP              NODE         NOMINATED NODE   READINESS GATES
inverted-corn-vjvws-8-25-8270-etcd-0                              1/1     Running            0               4h22m   10.104.23.61    4am-node27   <none>           <none>
inverted-corn-vjvws-8-25-8270-etcd-1                              1/1     Running            0               4h22m   10.104.34.154   4am-node37   <none>           <none>
inverted-corn-vjvws-8-25-8270-etcd-2                              1/1     Running            0               4h22m   10.104.24.32    4am-node29   <none>           <none>
inverted-corn-vjvws-8-25-8270-milvus-datacoord-75bf5c87c5-4pfwb   1/1     Running            0               4h22m   10.104.14.202   4am-node18   <none>           <none>
inverted-corn-vjvws-8-25-8270-milvus-datanode-5db49fdc66-grc2r    1/1     Running            1 (4h17m ago)   4h22m   10.104.14.203   4am-node18   <none>           <none>
inverted-corn-vjvws-8-25-8270-milvus-indexcoord-5984b6757bnlckl   1/1     Running            0               4h22m   10.104.31.56    4am-node34   <none>           <none>
inverted-corn-vjvws-8-25-8270-milvus-indexnode-84c9b574c7-8fgv6   1/1     Running            0               4h22m   10.104.32.227   4am-node39   <none>           <none>
inverted-corn-vjvws-8-25-8270-milvus-indexnode-84c9b574c7-crkpv   1/1     Running            0               4h22m   10.104.20.142   4am-node22   <none>           <none>
inverted-corn-vjvws-8-25-8270-milvus-indexnode-84c9b574c7-cvqkt   1/1     Running            0               4h22m   10.104.4.226    4am-node11   <none>           <none>
inverted-corn-vjvws-8-25-8270-milvus-indexnode-84c9b574c7-m74dw   1/1     Running            0               4h22m   10.104.23.57    4am-node27   <none>           <none>
inverted-corn-vjvws-8-25-8270-milvus-proxy-7dc64d7964-jnf58       1/1     Running            1 (4h17m ago)   4h22m   10.104.23.54    4am-node27   <none>           <none>
inverted-corn-vjvws-8-25-8270-milvus-querycoord-c8f5c89f9-nnpts   1/1     Running            1 (4h17m ago)   4h22m   10.104.14.201   4am-node18   <none>           <none>
inverted-corn-vjvws-8-25-8270-milvus-querynode-5f5b7cbb55-jb9ch   1/1     Running            0               4h22m   10.104.14.204   4am-node18   <none>           <none>
inverted-corn-vjvws-8-25-8270-milvus-querynode-5f5b7cbb55-z8r8j   1/1     Running            0               4h22m   10.104.31.57    4am-node34   <none>           <none>
inverted-corn-vjvws-8-25-8270-milvus-rootcoord-7df74877f-5vvfk    1/1     Running            1 (4h17m ago)   4h22m   10.104.14.200   4am-node18   <none>           <none>
inverted-corn-vjvws-8-25-8270-minio-0                             1/1     Running            0               4h22m   10.104.34.147   4am-node37   <none>           <none>
inverted-corn-vjvws-8-25-8270-minio-1                             1/1     Running            0               4h22m   10.104.25.105   4am-node30   <none>           <none>
inverted-corn-vjvws-8-25-8270-minio-2                             1/1     Running            0               4h22m   10.104.18.147   4am-node25   <none>           <none>
inverted-corn-vjvws-8-25-8270-minio-3                             1/1     Running            0               4h22m   10.104.24.33    4am-node29   <none>           <none>
inverted-corn-vjvws-8-25-8270-pulsar-bookie-0                     1/1     Running            0               4h22m   10.104.24.29    4am-node29   <none>           <none>
inverted-corn-vjvws-8-25-8270-pulsar-bookie-1                     1/1     Running            0               4h22m   10.104.16.102   4am-node21   <none>           <none>
inverted-corn-vjvws-8-25-8270-pulsar-bookie-2                     1/1     Running            0               4h22m   10.104.25.108   4am-node30   <none>           <none>
inverted-corn-vjvws-8-25-8270-pulsar-bookie-init-z9mj4            0/1     Completed          0               4h22m   10.104.4.225    4am-node11   <none>           <none>
inverted-corn-vjvws-8-25-8270-pulsar-broker-0                     1/1     Running            0               4h22m   10.104.4.221    4am-node11   <none>           <none>
inverted-corn-vjvws-8-25-8270-pulsar-proxy-0                      1/1     Running            0               4h22m   10.104.4.223    4am-node11   <none>           <none>
inverted-corn-vjvws-8-25-8270-pulsar-pulsar-init-k42gd            0/1     Completed          0               4h22m   10.104.4.224    4am-node11   <none>           <none>
inverted-corn-vjvws-8-25-8270-pulsar-recovery-0                   1/1     Running            0               4h22m   10.104.4.222    4am-node11   <none>           <none>
inverted-corn-vjvws-8-25-8270-pulsar-zookeeper-0                  1/1     Running            0               4h22m   10.104.23.59    4am-node27   <none>           <none>
inverted-corn-vjvws-8-25-8270-pulsar-zookeeper-1                  1/1     Running            0               4h21m   10.104.34.156   4am-node37   <none>           <none>
inverted-corn-vjvws-8-25-8270-pulsar-zookeeper-2                  1/1     Running            0               4h20m   10.104.24.40    4am-node29   <none>           <none> 

client pod name: inverted-corn-vjvws-2315877231
client log:
client.log

截屏2024-03-19 15 07 52

test steps:

        concurrent test and calculation of RT and QPS

        :purpose:  `vector: memory and disk index`
            verify concurrent DML & DQL scenario which has 4 float_vector fields & 16 scalar fields

        :test steps:
            1. create collection with fields:
                'float_vector': 128dim,
                'float_vector_1': 128dim,
                'float_vector_2': 200dim,
                'float_vector_3': 200dim,
                'int8_1', 'int16_1', 'int32_1', 'int64_1', 'double_1', 'float_1', 'varchar_1', 'bool_1',
                'int8_2', 'int16_2', 'int32_2', 'int64_2', 'double_2', 'float_2', 'varchar_2', 'bool_2'
            2. build indexes:
                HNSW: 'float_vector'
                DIAKANN_IP: 'float_vector_1'
                HNSW: 'float_vector_2'
                DIAKANN_L2: 'float_vector_3'
                scalar_default_index: 'int8_1', 'int16_1', 'int32_1', 'int64_1', 'double_1', 'float_1', 'varchar_1'
                scalar_INVERTED_index: 'int8_2', 'int16_2', 'int32_2', 'int64_2', 'double_2', 'float_2', 'varchar_2', 'bool_2'
            3. insert 5 million data
            4. flush collection
            5. build indexes again using the same params
            6. load collection
            7. concurrent request:
                - insert
                - delete
                - flush
                - load
                - search
                - hybrid_search
                - query

test result:

[2024-03-19 06:56:50,352 -  INFO - fouram]: Print locust final stats. (locust_runner.py:56)
[2024-03-19 06:56:50,353 -  INFO - fouram]: Type     Name                                                                          # reqs      # fails |    Avg     Min     Max    Med |   req/s  failures/s (stats.py:789)
[2024-03-19 06:56:50,353 -  INFO - fouram]: --------|----------------------------------------------------------------------------|-------|-------------|-------|-------|-------|-------|--------|----------- (stats.py:789)
[2024-03-19 06:56:50,354 -  INFO - fouram]: grpc     delete                                                                          9072     0(0.00%) |    940       4   15478    380 |    0.84        0.00 (stats.py:789)
[2024-03-19 06:56:50,354 -  INFO - fouram]: grpc     flush                                                                           9184     0(0.00%) |   7880     273   36730   7200 |    0.85        0.00 (stats.py:789)
[2024-03-19 06:56:50,354 -  INFO - fouram]: grpc     hybrid_search                                                                   8974     3(0.03%) |   3209     179   16929   2900 |    0.83        0.00 (stats.py:789)
[2024-03-19 06:56:50,354 -  INFO - fouram]: grpc     insert                                                                          9102     0(0.00%) |   1191      53   15468    590 |    0.84        0.00 (stats.py:789)
[2024-03-19 06:56:50,354 -  INFO - fouram]: grpc     load                                                                            9180     0(0.00%) |   2047      10   23619    980 |    0.85        0.00 (stats.py:789)
[2024-03-19 06:56:50,354 -  INFO - fouram]: grpc     query                                                                           9061     0(0.00%) |   3293     120   17611   2500 |    0.84        0.00 (stats.py:789)
[2024-03-19 06:56:50,354 -  INFO - fouram]: grpc     search                                                                          9024    12(0.13%) |   4516     365   17101   4500 |    0.84        0.00 (stats.py:789)
[2024-03-19 06:56:50,354 -  INFO - fouram]: --------|----------------------------------------------------------------------------|-------|-------------|-------|-------|-------|-------|--------|----------- (stats.py:789)
[2024-03-19 06:56:50,355 -  INFO - fouram]:          Aggregated                                                                     63597    15(0.02%) |   3301       4   36730   2200 |    5.89        0.00 (stats.py:789)
[2024-03-19 06:56:50,355 -  INFO - fouram]:  (stats.py:790)
[2024-03-19 06:56:50,362 -  INFO - fouram]: [PerfTemplate] Report data: 
{'server': {'deploy_tool': 'helm',
            'deploy_mode': 'cluster',
            'config_name': 'cluster_8c16m',
            'config': {'queryNode': {'resources': {'limits': {'cpu': '16.0',
                                                              'memory': '64Gi'},
                                                   'requests': {'cpu': '9.0',
                                                                'memory': '33Gi'}},
                                     'replicas': 2},
                       'indexNode': {'resources': {'limits': {'cpu': '8.0',
                                                              'memory': '16Gi'},
                                                   'requests': {'cpu': '5.0',
                                                                'memory': '9Gi'}},
                                     'replicas': 4},
                       'dataNode': {'resources': {'limits': {'cpu': '8.0',
                                                             'memory': '16Gi'},
                                                  'requests': {'cpu': '5.0',
                                                               'memory': '9Gi'}}},
                       'cluster': {'enabled': True},
                       'pulsar': {},
                       'kafka': {},
                       'minio': {'metrics': {'podMonitor': {'enabled': True}}},
                       'etcd': {'metrics': {'enabled': True,
                                            'podMonitor': {'enabled': True}}},
                       'metrics': {'serviceMonitor': {'enabled': True}},
                       'log': {'level': 'debug'},
                       'image': {'all': {'repository': 'harbor.milvus.io/milvus/milvus',
                                         'tag': '2.4-20240319-0dbc0f2c-amd64'}}},
            'host': 'inverted-corn-vjvws-8-25-8270-milvus.qa-milvus.svc.cluster.local',
            'port': '19530',
            'uri': ''},
 'client': {'test_case_type': 'ConcurrentClientBase',
            'test_case_name': 'test_inverted_locust_hnsw_diskann_dml_dql_cluster',
            'test_case_params': {'dataset_params': {'metric_type': 'L2',
                                                    'dim': 128,
                                                    'scalars_index': {'int8_1': {},
                                                                      'int16_1': {},
                                                                      'int32_1': {},
                                                                      'int64_1': {},
                                                                      'double_1': {},
                                                                      'float_1': {},
                                                                      'varchar_1': {},
                                                                      'int8_2': {'index_type': 'INVERTED'},
                                                                      'int16_2': {'index_type': 'INVERTED'},
                                                                      'int32_2': {'index_type': 'INVERTED'},
                                                                      'int64_2': {'index_type': 'INVERTED'},
                                                                      'double_2': {'index_type': 'INVERTED'},
                                                                      'float_2': {'index_type': 'INVERTED'},
                                                                      'varchar_2': {'index_type': 'INVERTED'},
                                                                      'bool_2': {'index_type': 'INVERTED'}},
                                                    'vectors_index': {'float_vector_1': {'index_type': 'DISKANN',
                                                                                         'index_param': {},
                                                                                         'metric_type': 'IP'},
                                                                      'float_vector_2': {'index_type': 'HNSW',
                                                                                         'index_param': {'M': 8,
                                                                                                         'efConstruction': 200},
                                                                                         'metric_type': 'L2'},
                                                                      'float_vector_3': {'index_type': 'DISKANN',
                                                                                         'index_param': {},
                                                                                         'metric_type': 'L2'}},
                                                    'scalars_params': {'float_vector_1': {'params': {'dim': 128},
                                                                                          'other_params': {'dataset': 'sift',
                                                                                                           'dim': 128}},
                                                                       'float_vector_2': {'params': {'dim': 200},
                                                                                          'other_params': {'dataset': 'text2img',
                                                                                                           'dim': 200}},
                                                                       'float_vector_3': {'params': {'dim': 200},
                                                                                          'other_params': {'dataset': 'text2img',
                                                                                                           'dim': 200}}},
                                                    'dataset_name': 'sift',
                                                    'dataset_size': 5000000,
                                                    'ni_per': 5000},
                                 'collection_params': {'other_fields': ['float_vector_1',
                                                                        'float_vector_2',
                                                                        'float_vector_3',
                                                                        'int8_1',
                                                                        'int16_1',
                                                                        'int32_1',
                                                                        'int64_1',
                                                                        'double_1',
                                                                        'float_1',
                                                                        'varchar_1',
                                                                        'bool_1',
                                                                        'int8_2',
                                                                        'int16_2',
                                                                        'int32_2',
                                                                        'int64_2',
                                                                        'double_2',
                                                                        'float_2',
                                                                        'varchar_2',
                                                                        'bool_2'],
                                                       'shards_num': 2},
                                 'resource_groups_params': {'reset': False},
                                 'database_user_params': {'reset_rbac': False,
                                                          'reset_db': False},
                                 'index_params': {'index_type': 'HNSW',
                                                  'index_param': {'M': 8,
                                                                  'efConstruction': 200}},
                                 'concurrent_params': {'concurrent_number': 20,
                                                       'during_time': '3h',
                                                       'interval': 20,
                                                       'spawn_rate': None},
                                 'concurrent_tasks': [{'type': 'insert',
                                                       'weight': 1,
                                                       'params': {'nb': 10,
                                                                  'timeout': 30,
                                                                  'random_id': True,
                                                                  'random_vector': True,
                                                                  'varchar_filled': False,
                                                                  'start_id': 5000000}},
                                                      {'type': 'delete',
                                                       'weight': 1,
                                                       'params': {'expr': '',
                                                                  'delete_length': 9,
                                                                  'timeout': 30}},
                                                      {'type': 'flush',
                                                       'weight': 1,
                                                       'params': {'timeout': 180}},
                                                      {'type': 'load',
                                                       'weight': 1,
                                                       'params': {'replica_number': 1,
                                                                  'timeout': 180}},
                                                      {'type': 'search',
                                                       'weight': 1,
                                                       'params': {'nq': 1000,
                                                                  'top_k': 1,
                                                                  'search_param': {'ef': 64},
                                                                  'expr': 'int64_1 '
                                                                          '> '
                                                                          '-1 '
                                                                          '&& '
                                                                          'id '
                                                                          '> '
                                                                          '-1',
                                                                  'guarantee_timestamp': None,
                                                                  'partition_names': None,
                                                                  'output_fields': ['*'],
                                                                  'ignore_growing': False,
                                                                  'group_by_field': None,
                                                                  'timeout': 180,
                                                                  'random_data': True}},
                                                      {'type': 'hybrid_search',
                                                       'weight': 1,
                                                       'params': {'nq': 1,
                                                                  'top_k': 10,
                                                                  'reqs': [{'search_param': {'ef': 1280},
                                                                            'anns_field': 'float_vector',
                                                                            'expr': 'int64_1 '
                                                                                    '< '
                                                                                    '100000 '
                                                                                    '&& '
                                                                                    'float_2 '
                                                                                    '> '
                                                                                    '10.0',
                                                                            'top_k': 1000},
                                                                           {'search_param': {'search_list': 30},
                                                                            'anns_field': 'float_vector_1',
                                                                            'expr': 'varchar_1 '
                                                                                    'like '
                                                                                    '"0%" '
                                                                                    '&& '
                                                                                    'bool_2 '
                                                                                    '== '
                                                                                    'True'},
                                                                           {'search_param': {'ef': 1024},
                                                                            'anns_field': 'float_vector_2',
                                                                            'expr': 'int8_1 '
                                                                                    '< '
                                                                                    '64 '
                                                                                    '&& '
                                                                                    'bool_1 '
                                                                                    '== '
                                                                                    'False',
                                                                            'top_k': 1009},
                                                                           {'search_param': {'search_list': 40},
                                                                            'anns_field': 'float_vector_3',
                                                                            'expr': 'int8_2 '
                                                                                    '> '
                                                                                    '64 '
                                                                                    '|| '
                                                                                    'double_2 '
                                                                                    '> '
                                                                                    '1000000.0'}],
                                                                  'rerank': {'RRFRanker': []},
                                                                  'output_fields': ['*'],
                                                                  'ignore_growing': False,
                                                                  'guarantee_timestamp': None,
                                                                  'partition_names': None,
                                                                  'timeout': 60,
                                                                  'random_data': True}},
                                                      {'type': 'query',
                                                       'weight': 1,
                                                       'params': {'ids': None,
                                                                  'expr': 'int64_1 '
                                                                          '> '
                                                                          '-1 '
                                                                          '&&  '
                                                                          'int64_2 '
                                                                          '> '
                                                                          '-1 '
                                                                          '&& ',
                                                                  'output_fields': ['*'],
                                                                  'offset': None,
                                                                  'limit': None,
                                                                  'ignore_growing': False,
                                                                  'partition_names': None,
                                                                  'timeout': 180,
                                                                  'random_data': True,
                                                                  'random_count': 20,
                                                                  'random_range': [2500000.0,
                                                                                   5000000],
                                                                  'field_name': 'id',
                                                                  'field_type': 'int64'}}]},
            'run_id': 2024031957479767,
            'datetime': '2024-03-19 02:35:47.977011',
            'client_version': '2.4.0'},
 'result': {'test_result': {'index': {'RT': 908.1076,
                                      'float_vector_1': {'RT': 766.9575},
                                      'float_vector_2': {'RT': 184.6956},
                                      'float_vector_3': {'RT': 141.2526},
                                      'int8_1': {'RT': 1.0444},
                                      'int16_1': {'RT': 0.5228},
                                      'int32_1': {'RT': 0.536},
                                      'int64_1': {'RT': 14.8703},
                                      'double_1': {'RT': 1.062},
                                      'float_1': {'RT': 0.5643},
                                      'varchar_1': {'RT': 0.5243},
                                      'int8_2': {'RT': 0.5235},
                                      'int16_2': {'RT': 0.5335},
                                      'int32_2': {'RT': 0.5247},
                                      'int64_2': {'RT': 0.5271},
                                      'double_2': {'RT': 0.5286},
                                      'float_2': {'RT': 0.5708},
                                      'varchar_2': {'RT': 0.5385},
                                      'bool_2': {'RT': 0.5454}},
                            'insert': {'total_time': 1458.7467,
                                       'VPS': 3427.5999,
                                       'batch_time': 1.4587,
                                       'batch': 5000},
                            'flush': {'RT': 3.5673},
                            'load': {'RT': 156.4908},
                            'Locust': {'Aggregated': {'Requests': 63597,
                                                      'Fails': 15,
                                                      'RPS': 5.89,
                                                      'fail_s': 0.0,
                                                      'RT_max': 36730.32,
                                                      'RT_avg': 3301.02,
                                                      'TP50': 2200.0,
                                                      'TP99': 15000.0},
                                       'delete': {'Requests': 9072,
                                                  'Fails': 0,
                                                  'RPS': 0.84,
                                                  'fail_s': 0.0,
                                                  'RT_max': 15478.88,
                                                  'RT_avg': 940.2,
                                                  'TP50': 380.0,
                                                  'TP99': 7600.0},
                                       'flush': {'Requests': 9184,
                                                 'Fails': 0,
                                                 'RPS': 0.85,
                                                 'fail_s': 0.0,
                                                 'RT_max': 36730.32,
                                                 'RT_avg': 7880.55,
                                                 'TP50': 7200.0,
                                                 'TP99': 22000.0},
                                       'hybrid_search': {'Requests': 8974,
                                                         'Fails': 3,
                                                         'RPS': 0.83,
                                                         'fail_s': 0.0,
                                                         'RT_max': 16929.99,
                                                         'RT_avg': 3209.13,
                                                         'TP50': 2900.0,
                                                         'TP99': 9600.0},
                                       'insert': {'Requests': 9102,
                                                  'Fails': 0,
                                                  'RPS': 0.84,
                                                  'fail_s': 0.0,
                                                  'RT_max': 15468.22,
                                                  'RT_avg': 1191.24,
                                                  'TP50': 590.0,
                                                  'TP99': 7900.0},
                                       'load': {'Requests': 9180,
                                                'Fails': 0,
                                                'RPS': 0.85,
                                                'fail_s': 0.0,
                                                'RT_max': 23619.56,
                                                'RT_avg': 2047.08,
                                                'TP50': 980.0,
                                                'TP99': 10000.0},
                                       'query': {'Requests': 9061,
                                                 'Fails': 0,
                                                 'RPS': 0.84,
                                                 'fail_s': 0.0,
                                                 'RT_max': 17611.04,
                                                 'RT_avg': 3293.67,
                                                 'TP50': 2500.0,
                                                 'TP99': 12000.0},
                                       'search': {'Requests': 9024,
                                                  'Fails': 12,
                                                  'RPS': 0.84,
                                                  'fail_s': 0.0,
                                                  'RT_max': 17101.12,
                                                  'RT_avg': 4516.03,
                                                  'TP50': 4500.0,
                                                  'TP99': 12000.0}}}}}

@bigsheeper
Copy link
Contributor

Please use the latest version of pymilvus, as the retry logic for this error is only available in the newest version (> 2.4.0rc60). @wangting0128

nianliuu added a commit to milvus-io/milvus-sdk-java that referenced this issue Mar 21, 2024
Signed-off-by: Nian Liu <nian.liu@zilliz.com>
@wangting0128
Copy link
Contributor Author

verification passed

pymilvus version: 2.4.0rc66

Since this fix is on the SDK side, please synchronize each SDK owner to add retry logic simultaneously.

notifications:
Restful API cannot solve this problem

@czs007 @bigsheeper

sre-ci-robot pushed a commit to milvus-io/milvus-sdk-java that referenced this issue Mar 27, 2024
#811)

Signed-off-by: Nian Liu <nian.liu@zilliz.com>
nianliuu added a commit to milvus-io/milvus-sdk-java that referenced this issue Mar 27, 2024
#811)

Signed-off-by: Nian Liu <nian.liu@zilliz.com>
(cherry picked from commit 81219a8)
@Sun-Qiming
Copy link

verification passed

pymilvus version: 2.4.0rc66

Since this fix is on the SDK side, please synchronize each SDK owner to add retry logic simultaneously.

notifications: Restful API cannot solve this problem

@czs007 @bigsheeper

so , how can i get the pymilvus 2.4.0rc66 installation package or source code ? thanks

@yanliang567
Copy link
Contributor

2.4.0rc66

only for test purpose please: https://test.pypi.org/project/pymilvus/2.4.0rc66/

@Sun-Qiming
Copy link

Sun-Qiming commented Apr 11, 2024

2.4.0rc66

only for test purpose please: https://test.pypi.org/project/pymilvus/2.4.0rc66/

thank you but i still get error when i use hybrid_search() in a collection with more than 2 vector field : code 2200, message=retry run out of 75 retry times, message=incomplete query result .........

i do not understand why? It seems to happen with large amounts of data (now my data less then 2 million)

@wangting0128
Copy link
Contributor Author

2.4.0rc66

only for test purpose please: https://test.pypi.org/project/pymilvus/2.4.0rc66/

thank you but i still get error when i use hybrid_search() in a collection with more than 2 vector field : code 2200, message=retry run out of 75 retry times, message=incomplete query result .........

i do not understand why? It seems to happen with large amounts of data (now my data less then 2 million)

Could you please provide the version of milvus you are using?

@Sun-Qiming
Copy link

Sun-Qiming commented Apr 12, 2024

2.4.0rc66

only for test purpose please: https://test.pypi.org/project/pymilvus/2.4.0rc66/

thank you but i still get error when i use hybrid_search() in a collection with more than 2 vector field : code 2200, message=retry run out of 75 retry times, message=incomplete query result .........
i do not understand why? It seems to happen with large amounts of data (now my data less then 2 million)

Could you please provide the version of milvus you are using?

milvus:2.4.0-rc1
pymilvus: 2.4.0、2.4.0.rc66、2.5.0.rc11、pymilvuspowder-2.5.0-rc6 (all fail)
my collection: about 10 varchar fields, 3 floatvector fields(768 dim),IVF_FLAT index(nlist=2048,L2)
hybrid_search(nprobe=128)
data 1.8 million

search() function works well, but hybrid_search fails all the time

@yanliang567
Copy link
Contributor

i suggest you retry on latest v2.3.13 or 2.4, and if it reproduced to you , please file a new issue with full logs attached.

@Sun-Qiming
Copy link

i suggest you retry on latest v2.3.13 or 2.4, and if it reproduced to you , please file a new issue with full logs attached.

does milvus v2.3.13 support multi-vector search? i need this feature in my task.
and i just seen milvus 2.4.0-rc1 in the releases, where can i get the v2.4?
thanks a lot !

@yanliang567 yanliang567 modified the milestones: 2.4.0, 2.4.1 Apr 18, 2024
@yanliang567 yanliang567 modified the milestones: 2.4.1, 2.4.2 May 7, 2024
Copy link

stale bot commented Jun 10, 2024

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Rotten issues close after 30d of inactivity. Reopen the issue with /reopen.

@stale stale bot added the stale indicates no udpates for 30 days label Jun 10, 2024
@stale stale bot closed this as completed Jul 1, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
2.4-features kind/bug Issues or changes related a bug stale indicates no udpates for 30 days test/benchmark benchmark test triage/accepted Indicates an issue or PR is ready to be actively worked on.
Projects
None yet
Development

No branches or pull requests

8 participants