bpo-39028: Performance enhancement in keyword extraction #17576

seberg · 2019-12-12T04:04:25Z

All keywords should first be checked for pointer identity. Only
after that failed for all keywords (unlikely) should unicode
equality be used.
The original code would call unicode equality on any non-matching
keyword argument. Meaning calling it often e.g. when a function
has many kwargs but only the last one is provided.

Unless I am missing something big, this seems like it was the original intention, and simple typo. It may make sense to backport?

I have to admit, I have not actually recompiled and timed this, I may do that tomorrow (but I have never had the need to compile python, so need to look up how).

https://bugs.python.org/issue39028

seberg · 2019-12-12T05:47:48Z

Ah, I missed that this code is used for PyArg_* where it is not clear that strings are almost always interned, so the change would likely be useful for argument clinic functions, but probably not in general.

EDIT: Both can make sense I guess, but for the clinic it is clearly interned I think? So another option is to have two versions here...

serhiy-storchaka · 2019-12-12T08:11:46Z

Python/getargs.c

        if (kwname == key) {
            return kwstack[i];
        }
+    }
+    /* ptr == ptr should normally find a match in since keyword keys


This comment is related to kwname == key.

serhiy-storchaka

This change LGTM, but please move the comment before the pointers comparison or before the first loop.

Although this is almost dead code. It is used in just few sites generated by Argument Clinic and soon will not be used at all.

serhiy-storchaka · 2019-12-12T16:39:41Z

Ah, sorry, the use in _PyArg_UnpackKeywords() is not dead, so this optimization has long term effect.

seberg · 2019-12-12T17:31:39Z

Yeah, although it only will have remotely noticeable effects for >3 kwargs probably, which seems pretty rare in python. Anyway moved and tweaked the comment.

methane · 2019-12-13T12:48:58Z

Python/getargs.c

+    }
+    /* This function assumes the strings should be interned, so that this
+       should only be reached on error, and the loop below will never
+       find a match */


If we keep the second loop, why we need to have both of find_keyword_interned and find_keyword?

Oh man, sorry, I do not think we do. I was trying around with it a bit, and apparently forgot to commit before pushing or something :/. fixed.

All keywords should first be checked for pointer identity. Only after that failed for all keywords (unlikely) should unicode equality be used. The original code would call unicode equality on any non-matching keyword argument. Meaning calling it often e.g. when a function has many kwargs but only the last one is provided.

methane · 2019-12-16T04:41:29Z

LGTM. Would you create an NEWS entry?
https://devguide.python.org/committing/#what-s-new-and-news-entries

) All keywords should first be checked for pointer identity. Only after that failed for all keywords (unlikely) should unicode equality be used. The original code would call unicode equality on any non-matching keyword argument. Meaning calling it often e.g. when a function has many kwargs but only the last one is provided.

the-knights-who-say-ni added the CLA signed label Dec 12, 2019

bedevere-bot added the awaiting review label Dec 12, 2019

seberg changed the title ~~ENH: Fix performance issue in keyword extraction~~ bpo-39028: Fix performance issue in keyword extraction Dec 12, 2019

serhiy-storchaka reviewed Dec 12, 2019

View reviewed changes

seberg force-pushed the kwarg-extract-performance branch from ba65512 to 17561c2 Compare December 12, 2019 17:31

seberg changed the title ~~bpo-39028: Fix performance issue in keyword extraction~~ bpo-39028: Performance enhancement in keyword extraction Dec 12, 2019

methane reviewed Dec 13, 2019

View reviewed changes

seberg force-pushed the kwarg-extract-performance branch from 17561c2 to 52562b9 Compare December 13, 2019 14:19

📜🤖 Added by blurb_it.

435b181

methane merged commit 75bb07e into python:master Dec 18, 2019

bedevere-bot removed the awaiting review label Dec 18, 2019

seberg deleted the kwarg-extract-performance branch January 21, 2020 19:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

bpo-39028: Performance enhancement in keyword extraction #17576

bpo-39028: Performance enhancement in keyword extraction #17576

Uh oh!

seberg commented Dec 12, 2019 •

edited by bedevere-bot

Loading

Uh oh!

seberg commented Dec 12, 2019 •

edited

Loading

Uh oh!

serhiy-storchaka Dec 12, 2019

Uh oh!

serhiy-storchaka left a comment

Uh oh!

serhiy-storchaka commented Dec 12, 2019

Uh oh!

seberg commented Dec 12, 2019

Uh oh!

methane Dec 13, 2019

Uh oh!

seberg Dec 13, 2019

Uh oh!

methane commented Dec 16, 2019

Uh oh!

Uh oh!

Uh oh!

bpo-39028: Performance enhancement in keyword extraction #17576

bpo-39028: Performance enhancement in keyword extraction #17576

Uh oh!

Conversation

seberg commented Dec 12, 2019 • edited by bedevere-bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

seberg commented Dec 12, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

serhiy-storchaka Dec 12, 2019

Choose a reason for hiding this comment

Uh oh!

serhiy-storchaka left a comment

Choose a reason for hiding this comment

Uh oh!

serhiy-storchaka commented Dec 12, 2019

Uh oh!

seberg commented Dec 12, 2019

Uh oh!

methane Dec 13, 2019

Choose a reason for hiding this comment

Uh oh!

seberg Dec 13, 2019

Choose a reason for hiding this comment

Uh oh!

methane commented Dec 16, 2019

Uh oh!

Uh oh!

seberg commented Dec 12, 2019 •

edited by bedevere-bot

Loading

seberg commented Dec 12, 2019 •

edited

Loading