use Cmr-search-after instead of paging #87

wallinb · 2020-06-18T17:18:32Z

Unfortunately EGI does not support scrolling yet, so this only updates granule search.

Some thoughts: Ordering and querying could be less coupled than they are now e.g. place_order calls get_avail to know how many granules will be ordered, but the sub-setting service will do the similar search itself to fulfill the order. In some edge cases these two searches can become inconsistent with each other, especially now that one uses scrolling and one uses paging. At the moment it is the only way to know/estimate the number of pages/orders necessary so I left it as is. Maybe long term we want to separate 'querying' functionality from 'ordering' into separate classes or something?

Should be using reg_a.granules.avail instead of reg_a.granules

Reformat CONTRIBUTORS.rst, add myself, fix typo

Better to not depend on 'page_num' from previous query (especially since using scrolling now)

icepyx/tests/test_granules.py

requirements.txt

Updated Contributers, added my name

review-notebook-app · 2022-02-02T20:21:05Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

codecov-commenter · 2022-02-02T21:41:47Z

Codecov Report

Merging #87 (02481ae) into development (a9c7813) will increase coverage by 0.00%.
The diff coverage is 52.63%.

@@             Coverage Diff              @@
##           development      #87   +/-   ##
============================================
  Coverage        55.45%   55.45%           
============================================
  Files               28       28           
  Lines             1951     1960    +9     
  Branches           404      406    +2     
============================================
+ Hits              1082     1087    +5     
- Misses             800      804    +4     
  Partials            69       69

Impacted Files	Coverage Δ
icepyx/core/APIformatting.py	`75.64% <ø> (-0.16%)`	⬇️
icepyx/core/query.py	`51.76% <ø> (ø)`
icepyx/tests/is2class_query.py	`21.73% <0.00%> (ø)`
icepyx/core/granules.py	`38.42% <52.94%> (+1.11%)`	⬆️
icepyx/tests/test_granules.py	`100.00% <100.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update a9c7813...02481ae. Read the comment docs.

JessicaS11 · 2022-02-02T21:55:35Z

See #22 (comment) for an update on why the switch from cmr scrolling to cmr-search-after.

Note: the PR-linked RTDs build is failing because of a recent pip release combined with the fact that you can't wipe the build environment for PRs. A branch build shows they pass with no issues.

JessicaS11 · 2022-02-02T21:57:50Z

@betolink Do you think you could review this PR (or recommend someone who can)?

betolink · 2022-02-03T16:07:52Z

Hi @JessicaS11 I can take a look this week, maybe Andy would be another person to ask as well. One thing, Search-After is great for performance on the CMR side but it is not compatible with deep pagination.

i.e. this would be an anti-pattern for an initial search:

reqparams = {'page_size': 2000, 'page_num': 3}

Are the pagination parameters exposed to the users or it's an internal Icepyx interface? As a user I don't really need to access say page 565, usually we just want to know how many granules we get back from CMR and then download them all. The PR Looks good!

JessicaS11 · 2022-02-03T16:46:25Z

Are the pagination parameters exposed to the users or it's an internal Icepyx interface?

Primarily the latter. With this PR, page_num is not submitted during the CMR query step, instead using Search_After to collect available results. page_numIS used by icepyx during ordering (and download) to iterate through the orders, since to my understanding Search_After is not available outside of CMR queries. reqparams, and thus page_num, is not hidden to the user, but my guess is nobody really pays it any mind (and if they don't view reqparams after placing an order, they'll probably never see page_num since it's not a required parameter for the CMR query step).

In general, I tried to design icepyx so that users had 100% control over (and visibility of) any request/parameters being sent to an endpoint if they want it (and to help it map clearly to the CMR docs for programmatic data access), but don't actually have to be bothered with the parameter dictionaries and request syntax/parsing otherwise.

This PR has been updated to use a cmr-search-after; this review is outdated.

betolink

It's looking great, there won't be a noticeable performance difference for Icepyx but CMR will handle these big queries better than just using pagination. I'll wait for Andy to take a look before approving it. Great work!

icepyx/tests/test_granules.py

betolink

I think this PR is ready to be merged and tested in the dev branch. Looks like regular pagination and search-after will coexist and will be transparent to the users so that's great.

weiji14 and others added 20 commits June 17, 2020 09:45

Let travis test everything in icepyx

94acb79

Fix test_granule_info

74dd0e3

Fix test_correct_granule_list_returned

929594e

Should be using reg_a.granules.avail instead of reg_a.granules

Reformat CONTRIBUTORS.rst, add myself, fix typo

8026be1

Test everything except behind_NSIDC_API_login.py

58ed1ea

add pr builds to travis

16f83e8

Merge branch 'development' into development

8da8239

Merge pull request #76 from dshean/development

f4aee1d

Reformat CONTRIBUTORS.rst, add myself, fix typo

Add self to contributors

3319680

Replace paging with scrolling in granule cmr query

3760d43

Update place_order method paging

a9bbdd2

Better to not depend on 'page_num' from previous query (especially since using scrolling now)

Cleanup in granules

e49e1b8

Fix errors in tests

8925f75

Rename and update tests

5e4187a

Remove datetime from requirements (builtin)

a951f6f

codeblock defined as none is not compatible with pypi markup

fcbe215

Added T Johnson to list of contributors

823cf34

Add codecov

aefbf6f

Updated Contributers, added my name

e948fa6

Add dev requirements and specify travis install

9a7eab6

wallinb requested review from JessicaS11 and liuzheng-arctic June 18, 2020 17:18

wallinb added 2 commits June 18, 2020 11:19

Merge branch 'development' into cmr-scrolling-search

f6c064f

Update CONTRIBUTORS.rst

f66388e

wallinb commented Jun 18, 2020

View reviewed changes

icepyx/tests/test_granules.py Outdated Show resolved Hide resolved

wallinb commented Jun 18, 2020

View reviewed changes

requirements.txt Outdated Show resolved Hide resolved

Bruce Wallin and others added 4 commits June 18, 2020 12:05

Fix codecov badge links (use icesat2py repo)

b71d574

Updated installation instructions

35444ec

Changed installation instructions on readthedocs

96bab86

Merge pull request #84 from icesat2py/anna_dev2

ca0b4fe

Updated Contributers, added my name

JessicaS11 added 4 commits February 2, 2022 12:43

fix missed merge conflicts

b37be51

manual updates from commit history

5119cdb

fix missed merge conflicts

009755f

turn 'scroll' into a reqparam for CMR searches

da465bd

switch from scrolling to CMR-Search-After

a68d71f

JessicaS11 added 2 commits February 2, 2022 16:43

remove missed scroll kwargs

e9a7d2a

remove changes to example I had checked out but committed anyway

a67848b

JessicaS11 linked an issue Feb 2, 2022 that may be closed by this pull request

interact with NSIDC API without paging #22

Closed

JessicaS11 mentioned this pull request Feb 2, 2022

interact with NSIDC API without paging #22

Closed

JessicaS11 changed the title ~~Cmr scrolling search~~ use Cmr-search-after instead of paging Feb 2, 2022

JessicaS11 requested review from andypbarrett and removed request for liuzheng-arctic February 3, 2022 16:47

JessicaS11 mentioned this pull request Feb 3, 2022

add page size checks for sync/async request mode #243

Open

add ability to only order one page by specifying 'page-num'

4c8e379

JessicaS11 mentioned this pull request Feb 3, 2022

reqparams page_num gets overwritten when placing order #188

Closed

JessicaS11 linked an issue Feb 3, 2022 that may be closed by this pull request

reqparams page_num gets overwritten when placing order #188

Closed

betolink reviewed Feb 4, 2022

View reviewed changes

icepyx/tests/test_granules.py Show resolved Hide resolved

Merge branch 'development' into cmr-scrolling-search

02481ae

betolink approved these changes Feb 8, 2022

View reviewed changes

betolink and others added 2 commits February 8, 2022 17:19

GitHub action UML generation auto-update

fe9ca49

undo unintentional example commit

fa8bb68

JessicaS11 merged commit e424fed into development Feb 9, 2022

JessicaS11 deleted the cmr-scrolling-search branch February 9, 2022 17:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

use Cmr-search-after instead of paging #87

use Cmr-search-after instead of paging #87

wallinb commented Jun 18, 2020

review-notebook-app bot commented Feb 2, 2022

codecov-commenter commented Feb 2, 2022 •

edited

Loading

JessicaS11 commented Feb 2, 2022

JessicaS11 commented Feb 2, 2022

betolink commented Feb 3, 2022

JessicaS11 commented Feb 3, 2022

betolink left a comment

betolink left a comment

use Cmr-search-after instead of paging #87

use Cmr-search-after instead of paging #87

Conversation

wallinb commented Jun 18, 2020

review-notebook-app bot commented Feb 2, 2022

codecov-commenter commented Feb 2, 2022 • edited Loading

Codecov Report

JessicaS11 commented Feb 2, 2022

JessicaS11 commented Feb 2, 2022

betolink commented Feb 3, 2022

JessicaS11 commented Feb 3, 2022

betolink left a comment

Choose a reason for hiding this comment

betolink left a comment

Choose a reason for hiding this comment

codecov-commenter commented Feb 2, 2022 •

edited

Loading