Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Incorporate searchLimit in dedupe cacheKey #15185

Merged
merged 1 commit into from
Sep 2, 2019

Conversation

eileenmcnaughton
Copy link
Contributor

@eileenmcnaughton eileenmcnaughton commented Sep 1, 2019

Overview

Fixes a bug whereby changing the search limit after a search has been performed does not result in a new search

Before

Altering the limit in the url after the initial retrieval has no effect

Screen Shot 2019-09-02 at 11 47 22 AM

This can also be accessed via the api - without this patch running these 2 in sequence will result in the second call 'respecting' the limit from the first

    $pairs = $this->callAPISuccess('Dedupe', 'getduplicates', [
      'rule_group_id' => 1,
      'search_limit' => 500,
    ]);
    $pairs = $this->callAPISuccess('Dedupe', 'getduplicates', [
      'rule_group_id' => 1,
      'search_limit' => 5,
    ]);

After

Search limit is included in the cacheKey so if it changes the cache is refreshed.

Technical Details

When accessing dedupes by the api call or on the dedupe screen it's possible to pass in
a searchLimit param. This works like the group limit in that it limits the number of
contacts for whom a match is sought.

For example if there are 2million contacts in the database
and you have a search limit of 0 then it will look for duplicates for all 2 million. (unset
is the same as 0). If you have a search limit of 1000 it will look for matches for the first
1000 contacts that match the criteria (criteria could be the group or other criteria passed in
via the url although the api is the most obvious way to pass in criteria)

Note there is a separate limit (sometimes called batch limit) that limits results from within
the found matches. If you pass in ['options' => ['limit' => 10]] via the api this will return 10 results if 10 or more exist. However, this limit has no effect on building the cache.

To test the searchLimit it is possible to add limit=5 to the url generated in the
url by findContacts. Without this patch changing the limit once the search has been done
will not alter the results as the limit is not part of the cachekey - this patch
changes that.

Comments

@pfigel

@civibot
Copy link

civibot bot commented Sep 1, 2019

(Standard links)

@civibot civibot bot added the master label Sep 1, 2019
@eileenmcnaughton eileenmcnaughton force-pushed the dedupe10 branch 4 times, most recently from 73dd903 to 2394f39 Compare September 2, 2019 03:16
Copy link
Contributor

@pfigel pfigel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

One question (see comment), otherwise this LGTM! 👍

(CiviCRM Review Template WORD-1.2)

  • General standards
    • (r-explain) PASS
    • (r-user) PASS: Limit being respected and results being refreshed makes sense to users
    • (r-doc) PASS
    • (r-run) PASS: Tested via API & UI
  • Developer standards
    • (r-tech) PASS: Signature change of internal code, shows no relevant matches in universe.
    • (r-code) Undecided: See comment
    • (r-maint) PASS: Some existing test coverage
    • (r-test) PASS

@@ -107,7 +107,8 @@ function civicrm_api3_dedupe_getstatistics($params) {
$params['rule_group_id'],
CRM_Utils_Array::value('group_id', $params),
CRM_Utils_Array::value('criteria', $params, []),
CRM_Utils_Array::value('check_permissions', $params, [])
CRM_Utils_Array::value('check_permissions', $params, []),
CRM_Utils_Array::value('search_limit', $params, [])
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do these fallbacks make sense? Feels like it should be TRUE or FALSE for check_permissions (not sure which, [] seems to have worked so far but that looks weird) and 0 for search_limit.

When accessing dedupes by the api call or on the dedupe screen it's possible to pass in
a searchLimit param. This works like the group limit in that it limits the number of
contacts for whom a match is sought. For example if there are 2million contacts in the database
and you have a search limit of 0 then it will look for duplicates for all 2 million. (unset
is the same as 0). If you have a search limit of 1000 it will look for matches for the first
1000 contacts that match the criteria (criteria could be the group or other criteria passed in
via the url although the api is the most obvious way to pass in criteria)

Note there is a separate limit (sometimes called batch limit) that limits results from within
the found matches.

To test the searchLimit it is possible to add limit=5 to the url generated in the
url by findContacts. Without this patch changing the limit once the search has been done
will not alter the results as the limit is not part of the cachekey - this patch
changes that.
@eileenmcnaughton
Copy link
Contributor Author

Thanks @pfigel I changed search_limit to default to 0 and check_permissions to use !empty($params['check_permissions']) - which is consistent with elsewhere. Adding merge-on-pass

@eileenmcnaughton eileenmcnaughton merged commit 0ceb153 into civicrm:master Sep 2, 2019
@eileenmcnaughton eileenmcnaughton deleted the dedupe10 branch September 2, 2019 20:58
wmfgerrit pushed a commit to wikimedia/wikimedia-fundraising-crm that referenced this pull request Sep 4, 2019
This can safely be deployed in advance of the upgrade as it will be ignored

civicrm/civicrm-core#15185

Change-Id: Id1ce52fa786ffe93b6829d68ade0d7afd902e68b
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants