Performance impact of non-str dict keys #208

Jongy · 2020-06-30T18:20:40Z

CPython has a specialized dictionary lookup function for str-only keys. The first time a dict instance is accessed with a non-str key, it's modified so future lookups use the generic function. Performance imapct is observable:

In [1]: d = {str(i): 1 for i in range(1_000)}                                                                                                                                                              

In [2]: %timeit d["1"]                                                                                                                                                                                     
26.7 ns ± 0.0895 ns per loop (mean ± std. dev. of 7 runs, 10000000 loops each)

In [3]: d[1] = 1                                                                                                                                                                                           

In [4]: %timeit d["1"]                                                                                                                                                                                     
33.2 ns ± 0.117 ns per loop (mean ± std. dev. of 7 runs, 10000000 loops each)

This is non-reversible - the particular dict instance will keep using the generic function forever, even if non-str keys are removed.

In [5]: del d[1]                                                                                                                                                                                           

In [6]: %timeit d["1"]                                                                                                                                                                                     
33.8 ns ± 1.1 ns per loop (mean ± std. dev. of 7 runs, 10000000 loops each)

I'll submit a PR containing adding a section explaining this

The text was updated successfully, but these errors were encountered:

satwikkansal · 2020-07-01T16:35:06Z

Hi @Jongy, thanks.

This will be a good addition to the collection. Looking forward to your PR :)

Closes: satwikkansal#208

satwikkansal added the new snippets label Jul 1, 2020

Jongy added a commit to Jongy/wtfpython that referenced this issue Jul 4, 2020

Add "dict lookup performance" section

0b74f9b

Closes: satwikkansal#208

Jongy mentioned this issue Jul 4, 2020

Add "dict lookup performance" section #209

Merged

satwikkansal closed this as completed in #209 Jul 10, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance impact of non-str dict keys #208

Performance impact of non-str dict keys #208

Jongy commented Jun 30, 2020 •

edited

Loading

satwikkansal commented Jul 1, 2020 •

edited

Loading

Performance impact of non-str dict keys #208

Performance impact of non-str dict keys #208

Comments

Jongy commented Jun 30, 2020 • edited Loading

satwikkansal commented Jul 1, 2020 • edited Loading

Jongy commented Jun 30, 2020 •

edited

Loading

satwikkansal commented Jul 1, 2020 •

edited

Loading