TST: added test for pd.where overflow error GH31687 #49926

mliu08 · 2022-11-27T06:46:26Z

Added a test for pd.where to /indexing/test_where.py

closes pd.where OverflowError with large numbers #31687
tests added / passed
passes all pre-commit code checks
Added type annotations to new arguments/methods/functions.
Added an entry in the latest doc/source/whatsnew/vX.X.X.rst file if fixing a bug or adding a new feature.

MarcoGorelli

thanks for working on this!

MarcoGorelli · 2022-11-27T12:21:30Z

pandas/tests/frame/indexing/test_where.py

+
+
+@pytest.mark.parametrize(
+    "replacement", [0.001, True, "snake", DATETIME_JAN_1_1900_OPTIONAL_TZ]


DATETIME_JAN_1_1900_OPTIONAL_TZ is gonna be a LazyStrategy object here, which I guess isn't quite what you wanted, I think you'll need @given to use hypothesis tests
Not sure we need that though, just some date should be fine if you want to test using a date as a replacement

Also, the original issue used None, shall we include that?

You're right, I didn't want a LazyStrategy object! I'll change that to a date, and also include None to align better with the original issue. I was also wondering if there was already a quick way to test all scalar types rather than having to use a long parametrized list. Or would that be overkill?

yeah I think just keeping the original snippet from the issue as a test should be fine

MarcoGorelli · 2022-11-27T12:28:18Z

pandas/tests/frame/indexing/test_where.py

+)
+def test_where_int_overflow(replacement):
+    # GH 31687
+    df = DataFrame([[1.0, 2e19, "nine"], [np.nan, 0.1, None]])


ideally for regression tests it's better to keep them as close as possible to the original - any reason to use 2e19 instead of 2e25?

I think I reduced it down to the smallest order of magnitude that still had the problem in pandas 1.4.1, but I don't have strong feelings about using that instead of the 2e25 from the original.

MarcoGorelli

Thanks @mliu08 !

TST: added test for pd.where overflow error GH31687

af2dfbf

MarcoGorelli requested changes Nov 27, 2022

View reviewed changes

mliu08 added 2 commits November 27, 2022 15:35

updated to better match original issue

48f81cd

typo'd datetime.date

abc954c

mliu08 force-pushed the pd.where-overflow-error-test branch from abc954c to 48f81cd Compare November 28, 2022 00:31

removed date object from test

f6ce291

MarcoGorelli approved these changes Nov 28, 2022

View reviewed changes

MarcoGorelli added the Needs Tests Unit test(s) needed to prevent regressions label Nov 28, 2022

MarcoGorelli added this to the 2.0 milestone Nov 28, 2022

MarcoGorelli merged commit 494025c into pandas-dev:main Nov 28, 2022

mliu08 deleted the pd.where-overflow-error-test branch November 29, 2022 01:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TST: added test for pd.where overflow error GH31687 #49926

TST: added test for pd.where overflow error GH31687 #49926

mliu08 commented Nov 27, 2022

MarcoGorelli left a comment

MarcoGorelli Nov 27, 2022

mliu08 Nov 27, 2022

MarcoGorelli Nov 27, 2022

MarcoGorelli Nov 27, 2022

mliu08 Nov 27, 2022

MarcoGorelli left a comment



		@pytest.mark.parametrize(
		"replacement", [0.001, True, "snake", DATETIME_JAN_1_1900_OPTIONAL_TZ]

TST: added test for pd.where overflow error GH31687 #49926

TST: added test for pd.where overflow error GH31687 #49926

Conversation

mliu08 commented Nov 27, 2022

MarcoGorelli left a comment

Choose a reason for hiding this comment

MarcoGorelli Nov 27, 2022

Choose a reason for hiding this comment

mliu08 Nov 27, 2022

Choose a reason for hiding this comment

MarcoGorelli Nov 27, 2022

Choose a reason for hiding this comment

MarcoGorelli Nov 27, 2022

Choose a reason for hiding this comment

mliu08 Nov 27, 2022

Choose a reason for hiding this comment

MarcoGorelli left a comment

Choose a reason for hiding this comment