Timedelta always returning False for equality tests for incompatible types #20829

Sup3rGeo · 2018-04-26T11:47:43Z

Code Sample, a copy-pastable example if possible

import datetime
import pandas

class CustomClass:

    def __init__(self):
        pass
       
    def __eq__(self, other):
        raise Exception("Custom Class eq")

        
custom = CustomClass()
var1 = datetime.timedelta(seconds=1)
var2 = pandas.Timedelta("1s")    

# Following code raises CustomClass exception
var1 == custom

# Following code returns False, does not call CustomClass.__eq__
var2 == custom

Problem description

I am trying to implement a version of pytest.approx for Timedeltas, which basically returns a class with a custom __eq__ implementation.

`Timedelta("5s") == AproxTimedelta(...)

However it does not work with pandas Timedelta because it always return False, so the __eq__ of AproxTimedelta object is never called.

This is not the behavior for python datetime timedelta, as can be seen in the example script.

Expected Output

Exception("Custom Class eq") raised for both cases

Output of `pd.show_versions()`

INSTALLED VERSIONS

commit: None
python: 3.6.3.final.0
python-bits: 64
OS: Windows
OS-release: 10
machine: AMD64
processor: Intel64 Family 6 Model 158 Stepping 9, GenuineIntel
byteorder: little
LC_ALL: None
LANG: None
LOCALE: None.None
pandas: 0.22.0
pytest: 3.5.0
pip: 10.0.0
setuptools: 38.5.1
Cython: 0.28.2
numpy: 1.14.1
scipy: 1.0.0
pyarrow: None
xarray: None
IPython: 6.3.1
sphinx: 1.7.1
patsy: None
dateutil: 2.6.1
pytz: 2017.2
blosc: None
bottleneck: None
tables: None
numexpr: None
feather: None
matplotlib: 2.1.1
openpyxl: 1.7.0
xlrd: 1.1.0
xlwt: None
xlsxwriter: None
lxml: None
bs4: None
html5lib: None
sqlalchemy: None
pymysql: None
psycopg2: None
jinja2: 2.10
s3fs: None
fastparquet: None
pandas_gbq: None
pandas_datareader: None

The text was updated successfully, but these errors were encountered:

TomAugspurger · 2018-04-26T13:28:17Z

Can you try on master?

custom = CustomClass()
var1 = datetime.timedelta(seconds=1)
var2 = pandas.Timedelta("1s")

# Following code raises CustomClass exception
var1 == custom

# Following code returns False, does not call CustomClass.__eq__
var2 == custom

## -- End pasted text --
---------------------------------------------------------------------------
Exception                                 Traceback (most recent call last)
<ipython-input-1-82804cdda1a5> in <module>()
     16
     17 # Following code raises CustomClass exception
---> 18 var1 == custom
     19
     20 # Following code returns False, does not call CustomClass.__eq__

<ipython-input-1-82804cdda1a5> in __eq__(self, other)
      8
      9     def __eq__(self, other):
---> 10         raise Exception("Custom Class eq")
     11
     12

Exception: Custom Class eq

Sup3rGeo · 2018-04-26T15:12:04Z

Same thing on master.

Note that in the example we want exceptions both in var1 == custom and var2 == custom.

TomAugspurger · 2018-04-26T15:38:32Z

Ah sorry I didn't get to that because of the first exception.

cc @jbrockmendel whose been active here recently. I think this is may be non-trivial to support.

jbrockmendel · 2018-04-26T15:44:42Z

Tom is right, this would be non-trivial to support. There is no branch of Timedelta.__eq__ (actually Timedelta.__richcmp__) that returns NotImplemented.

That said, if you were careful to always write custom == td instead of td == custom then it custom.__eq__ would get called first instead of td.__eq__.

Sup3rGeo · 2018-05-29T15:37:33Z

@jbrockmendel I was taking a look at the code, shouldn't it be just a matter of changing this branch:

pandas/pandas/_libs/tslibs/timedeltas.pyx

Lines 695 to 702 in c85ab08

    
           else: 
        
               if op == Py_EQ: 
        
                   return False 
        
               elif op == Py_NE: 
        
                   return True 
        
               raise TypeError('Cannot compare type {!r} with type ' \ 
        
                               '{!r}'.format(type(self).__name__, 
        
                                             type(other).__name__))

To return NotImplemented ?

 else: 
     return NotImplemented

jbrockmendel · 2018-05-29T18:08:40Z

It might be reasonable to return NotImplemented instead of raising TypeError (though tracking down the affected tests and changing the expected error message would be a hassle), but the Py_EQ and Py_NE branches we wouldn't want to change because we need to accommodate non-custom classes.

Also if ApproxTimedelta happens to subclass timedelta, then the comparison wouldn't go through the linked code, but would instead go through lines 674 and 704.

Have you considered using mock to patch Timedelta.__eq__? That might be easier on your end.

Sup3rGeo · 2018-05-30T11:54:35Z

but the Py_EQ and Py_NE branches we wouldn't want to change because we need to accommodate non-custom classes.

Could you elaborate a bit more on than? What would be an example?

Have you considered using mock to patch Timedelta.eq? That might be easier on your end.

Yep that's a good idea for time being!

Another question related to it: If I also happen to have a custom Timedelta subclass (for instance allowing to sum a timedelta with "1s" directly). Is there an easy way to make pandas use this custom when working with TimedeltaIndexes?

jbrockmendel · 2018-05-30T17:06:49Z

Could you elaborate a bit more on than? What would be an example?

Timedelta(whatever) == 6 --> False
Timedelta(whatever) != "foo" --> True

Is there an easy way to make pandas use this custom when working with TimedeltaIndexes?

Try pd.Index([custom_timedelta], dtype='o')

Sup3rGeo · 2018-05-30T21:42:37Z

I thought python would fallback automatically to identity comparison if both classes returned NotImplemented for a comparison.

Is it a different situation for pandas Timedelta?

jbrockmendel · 2018-05-31T15:57:04Z

@Sup3rGeo Off the top of my head I can't think of any reason why Timedelta would behave differently, but the check-for-eq-then-ne-then-raise pattern is present in a handful of places in pandas.

Maybe try making the substitution you have in mind and see if it causes any test failures?

Sup3rGeo · 2018-06-01T16:00:05Z

Will do it!

Just as quick snippet:

import pandas as pd

def CustomClass(object):
    def __eq__(self, other):
        print("Custom class __eq__")
        return NotImplemented

pd.Timedelta("1s") == 6 # False
pd.Timedelta("1s") == "foo" # False
pd.Timedelta("1s") == CustomClass() # False

def new_eq(self,other):
    return NotImplemented

pd.Timedelta.__eq__ = new_eq

pd.Timedelta("1s") == 6 # still False
pd.Timedelta("1s") == "foo" # still False
pd.Timedelta("1s") == CustomClass() # still False, but prints custom class eq

TomAugspurger added the Timedelta Timedelta data type label Apr 26, 2018

Sup3rGeo mentioned this issue Jun 8, 2018

Bugfix timedelta notimplemented eq #21394

Merged

4 tasks

gfyoung added the Bug label Jun 8, 2018

jreback added this to the 0.24.0 milestone Jun 13, 2018

jbrockmendel closed this as completed in #21394 Oct 24, 2018

jbrockmendel mentioned this issue Nov 14, 2018

Timedelta Comparisons Inconsistent #23684

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Timedelta always returning False for equality tests for incompatible types #20829

Timedelta always returning False for equality tests for incompatible types #20829

Sup3rGeo commented Apr 26, 2018 •

edited

Loading

INSTALLED VERSIONS

TomAugspurger commented Apr 26, 2018 •

edited

Loading

Sup3rGeo commented Apr 26, 2018

TomAugspurger commented Apr 26, 2018

jbrockmendel commented Apr 26, 2018

Sup3rGeo commented May 29, 2018

jbrockmendel commented May 29, 2018

Sup3rGeo commented May 30, 2018

jbrockmendel commented May 30, 2018

Sup3rGeo commented May 30, 2018

jbrockmendel commented May 31, 2018

Sup3rGeo commented Jun 1, 2018

Timedelta always returning False for equality tests for incompatible types #20829

Timedelta always returning False for equality tests for incompatible types #20829

Comments

Sup3rGeo commented Apr 26, 2018 • edited Loading

Code Sample, a copy-pastable example if possible

Problem description

Expected Output

Output of pd.show_versions()

INSTALLED VERSIONS

TomAugspurger commented Apr 26, 2018 • edited Loading

Sup3rGeo commented Apr 26, 2018

TomAugspurger commented Apr 26, 2018

jbrockmendel commented Apr 26, 2018

Sup3rGeo commented May 29, 2018

jbrockmendel commented May 29, 2018

Sup3rGeo commented May 30, 2018

jbrockmendel commented May 30, 2018

Sup3rGeo commented May 30, 2018

jbrockmendel commented May 31, 2018

Sup3rGeo commented Jun 1, 2018

Sup3rGeo commented Apr 26, 2018 •

edited

Loading

Output of `pd.show_versions()`

TomAugspurger commented Apr 26, 2018 •

edited

Loading