-
-
Notifications
You must be signed in to change notification settings - Fork 18.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
issue #27642 - timedelta merge asof with tolerance #27650
issue #27642 - timedelta merge asof with tolerance #27650
Conversation
My first PR, please be gentle. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you add a release note to doc/source/whatsnew/v0.25.1.rst
?
We'll also want a test ensuring this continues to work in the future. Can you simplify your example from #27642 and make it a unit test? Make sure you include a reference to the GitHub issue in a comment in the test.
is_datetime64_dtype(lt) | ||
or is_datetime64tz_dtype(lt) | ||
or is_timedelta64_dtype(lt) | ||
): | ||
if not isinstance(self.tolerance, Timedelta): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This should probably be if not isinstance(self.tolerance, datetime.timedelta)
so users can pass datetime.timedelta
objects. If you could add a test case for that situation that'd be great.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
see commit "mroeschke: this test is for you 🚀"
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
the test fails
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we'll need a different PR for this test I think, it fails at this conversion check.
pandas/pandas/core/reshape/merge.py
Lines 1704 to 1710 in 143bc34
# initial type conversion as needed | |
if needs_i8_conversion(left_values): | |
left_values = left_values.view("i8") | |
right_values = right_values.view("i8") | |
if tolerance is not None: | |
tolerance = tolerance.value | |
With tolerance = datetime.timedelta -> ERROR: has no attribute "value"
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
datetime.timedelta
objects do not have nanoseconds like pd.Timedelta
Another check and conversion is necessary to convert datetime.timedelta
correctly to nanoseconds similarly to pd.Timedelta.value
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Okay thanks for investigating. Yes, I'd be great to address in another issues/PR.
Does anyone understand why my test fails in python3.5? |
On phone so haven’t checked logs but if 3.5 only most likely the construction of data frames from Dict don’t have a defined order.
Probably easiest to construct from lists
…Sent from my iPhone
On Aug 8, 2019, at 8:27 AM, ianzur ***@***.***> wrote:
Does anyone understand why my test fails in python3.5?
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub, or mute the thread.
|
@ianzur you may want to set up a 3.5 env locally for testing that. |
Timedelta("1day"), | ||
pytest.param( | ||
datetime.timedelta(days=1), | ||
marks=pytest.mark.xfail(reason="not implemented", strict=True), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@ianzur Do we have an open issue to implement this?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No, not that I know of.
…e asof with tolerance
Could you open one with an example of the failure right now?
…On Thu, Aug 22, 2019 at 8:27 AM ianzur ***@***.***> wrote:
***@***.**** commented on this pull request.
------------------------------
In pandas/tests/reshape/merge/test_merge_asof.py
<#27650 (comment)>:
> @@ -588,14 +590,23 @@ def test_non_sorted(self):
# ok, though has dupes
merge_asof(trades, self.quotes, on="time", by="ticker")
- def test_tolerance(self):
+ @pytest.mark.parametrize(
+ "tolerance",
+ [
+ Timedelta("1day"),
+ pytest.param(
+ datetime.timedelta(days=1),
+ marks=pytest.mark.xfail(reason="not implemented", strict=True),
No, not that I know of.
—
You are receiving this because you modified the open/close state.
Reply to this email directly, view it on GitHub
<#27650?email_source=notifications&email_token=AAKAOIU5BQ66CXKQS367NEDQF2H55A5CNFSM4IHWN4YKYY3PNVWWK3TUL52HS4DFWFIHK3DMKJSXC5LFON2FEZLWNFSXPKTDN5WW2ZLOORPWSZGOCCME7GY#discussion_r316676824>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AAKAOIR2IAWBEU2N2CDI5TTQF2H55ANCNFSM4IHWN4YA>
.
|
I can later today, I'll @you.
On Thu, Aug 22, 2019 at 9:03 AM Tom Augspurger <notifications@github.com>
wrote:
… Could you open one with an example of the failure right now?
On Thu, Aug 22, 2019 at 8:27 AM ianzur ***@***.***> wrote:
> ***@***.**** commented on this pull request.
> ------------------------------
>
> In pandas/tests/reshape/merge/test_merge_asof.py
> <#27650 (comment)>:
>
> > @@ -588,14 +590,23 @@ def test_non_sorted(self):
> # ok, though has dupes
> merge_asof(trades, self.quotes, on="time", by="ticker")
>
> - def test_tolerance(self):
> + @pytest.mark.parametrize(
> + "tolerance",
> + [
> + Timedelta("1day"),
> + pytest.param(
> + datetime.timedelta(days=1),
> + marks=pytest.mark.xfail(reason="not implemented", strict=True),
>
> No, not that I know of.
>
> —
> You are receiving this because you modified the open/close state.
> Reply to this email directly, view it on GitHub
> <
#27650?email_source=notifications&email_token=AAKAOIU5BQ66CXKQS367NEDQF2H55A5CNFSM4IHWN4YKYY3PNVWWK3TUL52HS4DFWFIHK3DMKJSXC5LFON2FEZLWNFSXPKTDN5WW2ZLOORPWSZGOCCME7GY#discussion_r316676824
>,
> or mute the thread
> <
https://github.com/notifications/unsubscribe-auth/AAKAOIR2IAWBEU2N2CDI5TTQF2H55ANCNFSM4IHWN4YA
>
> .
>
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#27650?email_source=notifications&email_token=AICYMWLKCMZOYWV7C5F4QBTQF2MCFA5CNFSM4IHWN4YKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD45GEOI#issuecomment-523919929>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AICYMWOVBXYT7QB5BPLVE6DQF2MCFANCNFSM4IHWN4YA>
.
|
* issue pandas-dev#27642 - timedelta merge asof with tolerance
* master: (40 commits) DOC: Fix GL01 and GL02 errors in the docstrings (pandas-dev#27988) Remove Encoding of values in char** For Labels (pandas-dev#27618) TYPING: more type hints for io.formats.printing (pandas-dev#27765) TST: fix compression tests when run without virtualenv/condaenv (pandas-dev#28051) DOC: Start 0.25.2 (pandas-dev#28111) DOC: Fix docstrings lack of punctuation (pandas-dev#28031) DOC: Remove alias for numpy.random.randn from the docs (pandas-dev#28082) DOC: update GroupBy.head()/tail() documentation (pandas-dev#27844) BUG: timedelta merge asof with tolerance (pandas-dev#27650) BUG: Series.rename raises error on values accepted by Series construc… (pandas-dev#27814) Preserve index when setting new column on empty dataframe. (pandas-dev#26471) BUG: Fixed groupby quantile for listlike q (pandas-dev#27827) BUG: iter with readonly values, closes pandas-dev#28055 (pandas-dev#28074) TST: non-strict xfail for period test (pandas-dev#28072) DOC: Update whatsnew (pandas-dev#28073) CI: disable codecov (pandas-dev#28065) CI: Set SHA for codecov upload (pandas-dev#28067) BUG: Correct the previous bug fixing on xlim for plotting (pandas-dev#28059) CI: Add pip dependence explicitly (pandas-dev#28008) DOC: Change document code prun in a row (pandas-dev#28029) ...
black pandas
git diff upstream/master -u -- "*.py" | flake8 --diff