COMPAT: Iteration should always yield a python scalar #17491

jreback · 2017-09-10T21:58:03Z

xref #10904
closes #13236
closes #13258
xref #14216

codecov · 2017-09-11T00:00:35Z

Codecov Report

Merging #17491 into master will increase coverage by <.01%.
The diff coverage is 94.73%.

@@            Coverage Diff             @@
##           master   #17491      +/-   ##
==========================================
+ Coverage   91.15%   91.15%   +<.01%     
==========================================
  Files         163      163              
  Lines       49534    49540       +6     
==========================================
+ Hits        45153    45160       +7     
+ Misses       4381     4380       -1

Flag	Coverage Δ
#multiple	`88.94% <94.73%> (+0.02%)`	⬆️
#single	`40.22% <57.89%> (-0.07%)`	⬇️

Impacted Files	Coverage Δ
pandas/core/series.py	`94.92% <ø> (-0.03%)`	⬇️
pandas/core/indexes/base.py	`96.28% <ø> (-0.01%)`	⬇️
pandas/core/base.py	`96.01% <100%> (+0.05%)`	⬆️
pandas/core/indexes/category.py	`98.54% <100%> (ø)`	⬆️
pandas/core/categorical.py	`95.51% <100%> (+0.01%)`	⬆️
pandas/core/sparse/array.py	`91.3% <85.71%> (-0.12%)`	⬇️
pandas/io/gbq.py	`25% <0%> (-58.34%)`	⬇️
pandas/core/frame.py	`97.72% <0%> (-0.1%)`	⬇️
pandas/core/indexes/datetimes.py	`95.43% <0%> (-0.1%)`	⬇️
pandas/plotting/_converter.py	`65.05% <0%> (+1.81%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update e6aed2e...2ebcbfc. Read the comment docs.

codecov · 2017-09-11T00:00:43Z

Codecov Report

Merging #17491 into master will increase coverage by 0.01%.
The diff coverage is 94.73%.

@@            Coverage Diff             @@
##           master   #17491      +/-   ##
==========================================
+ Coverage   91.15%   91.16%   +0.01%     
==========================================
  Files         163      163              
  Lines       49534    49543       +9     
==========================================
+ Hits        45153    45168      +15     
+ Misses       4381     4375       -6

Flag	Coverage Δ
#multiple	`88.95% <94.73%> (+0.03%)`	⬆️
#single	`40.21% <57.89%> (-0.07%)`	⬇️

Impacted Files	Coverage Δ
pandas/core/series.py	`94.92% <ø> (-0.03%)`	⬇️
pandas/core/indexes/base.py	`96.28% <ø> (-0.01%)`	⬇️
pandas/core/indexes/category.py	`98.54% <100%> (ø)`	⬆️
pandas/core/categorical.py	`95.51% <100%> (+0.01%)`	⬆️
pandas/core/base.py	`96.01% <100%> (+0.05%)`	⬆️
pandas/core/sparse/array.py	`91.3% <85.71%> (-0.12%)`	⬇️
pandas/io/gbq.py	`25% <0%> (-58.34%)`	⬇️
pandas/core/frame.py	`97.77% <0%> (-0.05%)`	⬇️
pandas/core/groupby.py	`92.22% <0%> (+0.01%)`	⬆️
pandas/core/reshape/pivot.py	`96.35% <0%> (+0.99%)`	⬆️
... and 1 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 9a84274...05f8a6f. Read the comment docs.

xref pandas-dev#10904 closes pandas-dev#13236 closes pandas-dev#13256 xref pandas-dev#14216

jorisvandenbossche

Added some comments.

(I know you didn't directly merge it but only after a couple of days, but I think for such api changes we should wait until at least some other core dev had the time to review, or explicitly ping us)

jorisvandenbossche · 2017-09-12T15:25:03Z

doc/source/whatsnew/v0.21.0.txt

+
+Previously:
+
+.. code-block:: python


python -> ipython

jorisvandenbossche · 2017-09-12T15:25:12Z

doc/source/whatsnew/v0.21.0.txt

+
+.. ipython:: python
+
+   s = Series([1, 2, 3])


Series -> pd.Series

jorisvandenbossche · 2017-09-12T15:25:33Z

doc/source/whatsnew/v0.21.0.txt

+Iteration of Series/Index will now return python scalars
+^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+
+Previously, when using certain iteration methods for a ``Series`` with dtype ``int`` or ``float``, you would receive a ``numpy`` scalar, e.g. a ``np.int64``, rather than a python ``int``. Issue (:issue:`10904`) corrected this for ``Series.tolist()`` and ``list(Series)``. This change makes all iteration methods consistent, in particular, for ``__iter__()`` and ``.map()``; note that this only affect int/float dtypes. (:issue:`13236`, :issue:`13258`, :issue:`14216`).


this only affect -> this only affects

jorisvandenbossche · 2017-09-12T15:26:08Z

doc/source/whatsnew/v0.21.0.txt

+
+Previously:
+
+.. code-block:: python


python -> ipython

jorisvandenbossche · 2017-09-12T15:28:37Z

pandas/core/base.py

@@ -884,6 +890,21 @@ def argmin(self, axis=None):
        """
        return nanops.nanargmin(self.values)

+    def tolist(self):
+        """
+        return a list of the values; box to scalars


Can you put a bit more explanation that python scalar types are returned?

jorisvandenbossche · 2017-09-12T15:30:00Z

pandas/core/base.py

+        if is_datetimelike(self):
+            return (_maybe_box_datetimelike(x) for x in self._values)
+        else:
+            return iter(self._values.tolist())


For the tolist implementation, this seems a bit double work: values are converted to list, then iterateted over, and then again converted to list.

jorisvandenbossche · 2017-09-12T15:33:12Z

pandas/core/indexes/category.py

@@ -253,6 +253,10 @@ def get_values(self):
        """ return the underlying data as an ndarray """
        return self._data.get_values()

+    def __iter__(self):
+        """ iterate like Categorical """
+        return self._data.__iter__()


this feels not clean. The tolist of Categorical should already ensure this?

jorisvandenbossche · 2017-09-12T15:38:41Z

pandas/tests/test_base.py

+        # gh-10904
+        # gh-13258
+        # coerce iteration to underlying python / pandas types
+        s = typ([1], dtype=dtype)


I would make a separate construction for object/category, because the test is not ensuring this is correct. For example on master, a categorical series of integers will box to np.int64, but a np.int64 scalar passes the ininstance(.., object) test

jorisvandenbossche · 2017-09-12T15:40:46Z

pandas/tests/test_base.py

+                              Timestamp('2000-12-31')])
+
+        result = method(i)[0]
+        assert isinstance(result, Timestamp)


can you add test for Series.iteritems and DataFrame.itertuples/iterrows as well?

these are in tests/frame/test_api.py already

Yes, but you said this PR changed the behaviour of itertuples ? (#13468 (comment)) Then we should have a test for that?

and that's all well tested see
frame/test_api

ah, missed that one-line change in existing test in the diff. Thanks for the clarification!

xref pandas-dev#10904 closes pandas-dev#13236 closes pandas-dev#13256 xref pandas-dev#14216

)

xref pandas-dev#10904 closes pandas-dev#13236 closes pandas-dev#13256 xref pandas-dev#14216

)

jreback added API Design Compat pandas objects compatability with Numpy or Python functions Dtype Conversions Unexpected or buggy dtype conversions labels Sep 10, 2017

jreback added this to the 0.21.0 milestone Sep 10, 2017

jreback force-pushed the map branch from 86231bf to 2ebcbfc Compare September 11, 2017 00:00

jreback force-pushed the map branch 4 times, most recently from c0fd989 to 6a02e4f Compare September 11, 2017 11:13

COMPAT: Iteration should always yield a python scalar

05f8a6f

xref pandas-dev#10904 closes pandas-dev#13236 closes pandas-dev#13256 xref pandas-dev#14216

jreback force-pushed the map branch from 6a02e4f to 05f8a6f Compare September 12, 2017 10:29

jreback mentioned this pull request Sep 12, 2017

iterrows: when upcasting to object, values are converted to python types #13468

Open

jreback merged commit 83436af into pandas-dev:master Sep 12, 2017

jorisvandenbossche reviewed Sep 12, 2017

View reviewed changes

dwyatte mentioned this pull request Sep 12, 2017

Inconsistent type casting between DataFrame and Series #14216

Closed

jreback added a commit to jreback/pandas that referenced this pull request Sep 12, 2017

COMPAT: followup to pandas-dev#17491

8164da0

jreback added a commit to jreback/pandas that referenced this pull request Sep 13, 2017

COMPAT: followup to pandas-dev#17491

c3fc62e

jreback added a commit to jreback/pandas that referenced this pull request Sep 13, 2017

COMPAT: followup to pandas-dev#17491

03af10c

jreback added a commit that referenced this pull request Sep 13, 2017

COMPAT: followup to #17491 (#17503)

eef810e

jreback pushed a commit that referenced this pull request Sep 17, 2017

DOC: fixes after #17503 and #17491 (#17541)

98f05eb

nateyoder mentioned this pull request Oct 31, 2017

Allow indices to be mapped through through dictionaries or series #15081

Merged

4 tasks

TomAugspurger mentioned this pull request Nov 7, 2017

test_constrained fail statsmodels/statsmodels#4108

Closed

alanbato pushed a commit to alanbato/pandas that referenced this pull request Nov 10, 2017

COMPAT: Iteration should always yield a python scalar (pandas-dev#17491)

c7e4654

xref pandas-dev#10904 closes pandas-dev#13236 closes pandas-dev#13256 xref pandas-dev#14216

alanbato pushed a commit to alanbato/pandas that referenced this pull request Nov 10, 2017

COMPAT: followup to pandas-dev#17491 (pandas-dev#17503)

3a8deab

alanbato pushed a commit to alanbato/pandas that referenced this pull request Nov 10, 2017

DOC: fixes after pandas-dev#17503 and pandas-dev#17491 (pandas-dev#17541

f72030c

)

No-Stream pushed a commit to No-Stream/pandas that referenced this pull request Nov 28, 2017

COMPAT: Iteration should always yield a python scalar (pandas-dev#17491)

2414864

xref pandas-dev#10904 closes pandas-dev#13236 closes pandas-dev#13256 xref pandas-dev#14216

No-Stream pushed a commit to No-Stream/pandas that referenced this pull request Nov 28, 2017

COMPAT: followup to pandas-dev#17491 (pandas-dev#17503)

dc26db4

No-Stream pushed a commit to No-Stream/pandas that referenced this pull request Nov 28, 2017

DOC: fixes after pandas-dev#17503 and pandas-dev#17491 (pandas-dev#17541

9d78d29

)

jorisvandenbossche mentioned this pull request Apr 23, 2018

Surprising type conversion when iterating #20791

Open

boydgreenfield mentioned this pull request Apr 3, 2019

Series iteration and to_dict methods *sometimes* return underlying storage type vs. Python object #25969

Open

maximz mentioned this pull request Jul 26, 2019

to_dict() on a boolean series sometimes returns numpy types instead of Python types #27616

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

COMPAT: Iteration should always yield a python scalar #17491

COMPAT: Iteration should always yield a python scalar #17491

jreback commented Sep 10, 2017 •

edited

Loading

codecov bot commented Sep 11, 2017

codecov bot commented Sep 11, 2017 •

edited

Loading

jorisvandenbossche left a comment

jorisvandenbossche Sep 12, 2017

jorisvandenbossche Sep 12, 2017

jorisvandenbossche Sep 12, 2017

jorisvandenbossche Sep 12, 2017

jorisvandenbossche Sep 12, 2017

jorisvandenbossche Sep 12, 2017

jreback Sep 12, 2017

jorisvandenbossche Sep 12, 2017

jreback Sep 12, 2017

jorisvandenbossche Sep 12, 2017

jorisvandenbossche Sep 12, 2017

jreback Sep 12, 2017

jorisvandenbossche Sep 15, 2017

jreback Sep 17, 2017

jorisvandenbossche Sep 18, 2017

COMPAT: Iteration should always yield a python scalar #17491

COMPAT: Iteration should always yield a python scalar #17491

Conversation

jreback commented Sep 10, 2017 • edited Loading

codecov bot commented Sep 11, 2017

Codecov Report

codecov bot commented Sep 11, 2017 • edited Loading

Codecov Report

jorisvandenbossche left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jreback commented Sep 10, 2017 •

edited

Loading

codecov bot commented Sep 11, 2017 •

edited

Loading