Groupby with matching column and index name emits spurious warning #17383

TomAugspurger · 2017-08-30T18:45:17Z

Which of these should raise a FutureWarning? I think the idea was to use pd.Grouper(name) to disambiguate, in which case In [21] should not raise a warning.

In [17]: df = pd.DataFrame({"A": [1] * 5 + [2] * 5, "B": ['a', 'b'] * 5, 'C': range(10)}, index=pd.Index(range(10), name='A'))

In [19]: _ = df.groupby('A').mean()
/Users/taugspurger/.virtualenvs/pandas-dev/bin/ipython:1: FutureWarning: 'A' is both a column name and an index level.
Defaulting to column but this will raise an ambiguity error in a future version
  #!/Users/taugspurger/Envs/pandas-dev/bin/python3.6

In [20]: _ = df.groupby(['A']).mean()
/Users/taugspurger/.virtualenvs/pandas-dev/bin/ipython:1: FutureWarning: 'A' is both a column name and an index level.
Defaulting to column but this will raise an ambiguity error in a future version
  #!/Users/taugspurger/Envs/pandas-dev/bin/python3.6

In [21]: _ = df.groupby(pd.Grouper('A')).mean()
/Users/taugspurger/Envs/pandas-dev/lib/python3.6/site-packages/pandas/pandas/core/groupby.py:1699: FutureWarning: 'A' is both a column name and an index level.
Defaulting to column but this will raise an ambiguity error in a future version
  return klass(obj, by, **kwds)

In [22]: _ = df.groupby([pd.Grouper('A')]).mean()

cc @jmmease


jonmmease · 2017-08-30T19:40:38Z

Yeah, I think [21] shouldn't raise a warning
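
To make the expected behavior concrete, here is a minimal sketch in plain Python (not pandas' actual groupby code): a bare string that names both a column and an index level triggers the FutureWarning, while an explicit key wrapper stays silent. `ExplicitKey` and `resolve_group_key` are hypothetical names made up for this illustration; `ExplicitKey` only stands in for the role pd.Grouper('A') plays in In [21].

```python
import warnings
from dataclasses import dataclass


@dataclass(frozen=True)
class ExplicitKey:
    """Hypothetical stand-in for pd.Grouper('A'): the caller explicitly asks for a column."""
    key: str


def resolve_group_key(by, columns, index_names):
    """Return ("column" or "index", name) for one grouping key.

    Hypothetical helper for illustration only; not part of pandas.
    """
    if isinstance(by, ExplicitKey):
        # The caller already disambiguated, so no warning is emitted.
        if by.key not in columns:
            raise KeyError(by.key)
        return ("column", by.key)

    in_columns = by in columns
    in_index = by in index_names
    if in_columns and in_index:
        # Ambiguous bare name: warn and fall back to the column,
        # mirroring the message shown in the session above.
        warnings.warn(
            f"'{by}' is both a column name and an index level.\n"
            "Defaulting to column but this will raise an ambiguity error "
            "in a future version",
            FutureWarning,
            stacklevel=2,
        )
        return ("column", by)
    if in_columns:
        return ("column", by)
    if in_index:
        return ("index", by)
    raise KeyError(by)
```

Under this sketch, resolve_group_key("A", ["A", "B", "C"], ["A"]) warns, while resolve_group_key(ExplicitKey("A"), ["A", "B", "C"], ["A"]) does not.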

jonmmease · 2017-09-11T22:20:55Z

I'll try to take a look at this soon. Any reason why the df.groupby(grp_expr) operation shouldn't just always go down the df.groupby([grp_expr]) path?

I'd like to see if we could just always turn scalar by args into single element lists and get rid of the non-list logic. Thoughts?
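
Roughly what I have in mind, as a plain-Python sketch rather than the real groupby internals: wrap any non-list `by` in a one-element list up front, so only the list path remains. `normalize_by` and `group_rows` are hypothetical names for illustration, and rows are modeled as plain dicts instead of a DataFrame.

```python
from collections import defaultdict


def normalize_by(by):
    """Wrap a scalar grouping argument in a single-element list.

    Hypothetical helper. Only a list counts as "several keys" here;
    strings, tuples, and other objects are treated as one key.
    """
    return by if isinstance(by, list) else [by]


def group_rows(rows, by):
    """Group a list of dict rows by one or more keys, always via the list path."""
    keys = normalize_by(by)
    groups = defaultdict(list)
    for row in rows:
        # The group label is always a tuple, even for a single key.
        groups[tuple(row[k] for k in keys)].append(row)
    return dict(groups)
```

With this shape, group_rows(rows, "A") and group_rows(rows, ["A"]) go through identical code, so they cannot diverge the way In [21] and In [22] do.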

jreback · 2017-09-12T00:42:16Z

A scalar and a single-element list should behave the same.
A separate PR for that would be fine.

jonmmease · 2017-10-10T21:12:04Z

@jreback @TomAugspurger See #17843
FYI, I didn't end up needing to make the change discussed above (treating all groupers as lists)

TomAugspurger added the API Design and Groupby labels Aug 30, 2017

TomAugspurger added this to the 0.21.0 milestone Aug 30, 2017

TomAugspurger added the Difficulty Novice label Aug 30, 2017

jreback modified the milestones: 0.21.0, Next Major Release Sep 23, 2017

jonmmease mentioned this issue Oct 10, 2017

Refactor index-as-string groupby tests and fix spurious warning (Bug 17383) #17843 (merged)

TomAugspurger added the good first issue label Oct 11, 2017

jreback modified the milestones: Next Major Release, 0.21.0 Oct 14, 2017

jreback closed this as completed in #17843 Oct 14, 2017
