bpo-44525: Specialize `CALL_FUNCTION` for C function calls #26934

Fidget-Spinner · 2021-06-28T14:33:04Z

Initial infrastructure code for specializing CALL_FUNCTION. Also added specialization for calling METH_O PyCFunctions because it's the easiest to implement. Along with METH_FASTCALL.

Measured up to 20% faster calls for METH_O on microbenchmarks.

https://bugs.python.org/issue44525

Fidget-Spinner · 2021-06-28T14:35:03Z

Initially I also planned to specialize things like list() and normal python functions, but the diff was getting rather long, so I'll try those out some other time. This code can also specialize CALL_FUNCTION_KW, and maybe even CALL_METHOD.

bedevere-bot · 2021-06-28T15:20:34Z

🤖 New build scheduled with the buildbot fleet by @Fidget-Spinner for commit 0e0a3a4 🤖

If you want to schedule another build, you need to add the ":hammer: test-with-buildbots" label again.

pablogsal

I think this is unfortunate far too complex for the benefit this is giving. I think at the very least this needs some macro benchmarks but if this already is just 20% on a micro benchmark it doesn't look very promising, unfortunately

bedevere-bot · 2021-06-28T19:33:12Z

A Python core developer has requested some changes be made to your pull request before we can consider merging it. If you could please address their requests along with any other requests in other reviews from core developers that would be appreciated.

Once you have made the requested changes, please leave a comment on this pull request containing the phrase I have made the requested changes; please review again. I will then notify any core developers who have left a review that you're ready for them to take another look at this pull request.

pablogsal · 2021-06-28T19:37:09Z

Oh, I just saw the macro benchmarks in https://gist.github.com/Fidget-Spinner/b2f05e0f6b8851d41e09412f5189ce8f.

Indeed, it shows almost no improvement.

I propose to close this because this is far too complex for no global benefit, and even the micro benchmark is too small in my humble opinion.

Maybe other core Devs have another view on this, thought.

pablogsal · 2021-06-28T19:37:58Z

@markshannon @serhiy-storchaka @vstinner @isidentical @ericsnowcurrently

isidentical · 2021-06-28T19:48:52Z

I agree with @pablogsal that as is, it is too much of complexity with no real gains. Though if this is most of an infrastructure PR, then I think the optimizations (6 of them?) @Fidget-Spinner spoke of should also be implemented in this as well, and if they show promising returns on macro benchmarks we could re-evaluate the complexity/performance pair.

pablogsal · 2021-06-28T19:50:42Z

Notice that this also has a memory cost for caching the pointers, which should also be taken into account when evaluating the benefits.

Fidget-Spinner · 2021-06-28T23:39:12Z

@pablogsal and @isidentical your concerns are valid and shared by me. I was worried about the code maintenance while writing this PR too.

Roughly 1/4th of it is boilerplate infrastructure for adaptive instructions. The entire diff by CALL_FUNCTION_ADAPTIVE and some parts of the specialize functions fall into that. Maybe I can look into refactoring the code elsewhere to reduce the boilerplate and diff.

I'm hesitant to implement the other specializations too as they require a lot more time (and complexity!). So I sent this PR first to get a sanity check on what I'm doing.

Ultimately, I'll let the core devs decide on this and whether y'all find it useful. I don't mind too much if this gets rejected -- I already had a lot of fun writing this :). But I'm interested to hear what the others have to say.

Fidget-Spinner · 2021-06-29T10:32:25Z

Oh I forgot to mention that the pyperformance numbers are out of date. They were taken when I only optimized for __builtins__. I'm also waiting for @isidentical 's PR GH-26677 which will cause more CALL_FUNCTION to be emitted correctly. Once that PR is merged I'll re-bench again.

In the meantime, I'll explore the other specializations and see if there's a difference. Thanks everyone!

markshannon

Specialized instructions need to be quite specialized 🙂

Looking forward to seeing benchmark results once the specialization is more refined and #26677 is merged.

Python/specialize.c

Python/ceval.c

Include/internal/pycore_code.h

markshannon · 2021-06-29T13:47:56Z

One other thing, take a look at #26954

Python/specialize.c

markshannon

Looks good. A few minor tweaks needed.

Python/specialize.c

Python/ceval.c

markshannon · 2021-10-19T10:23:10Z

Currently, just a little bit faster but that's good as this PR takes the hit of the adaptive machinery for all other calls.

Fidget-Spinner · 2021-10-19T10:43:11Z

Please ignore a message I posted earlier, I just realized it was a different bytecode altogether 🤦‍♂️ .

markshannon

A few minor things I missed earlier

Python/ceval.c

bedevere-bot · 2021-10-19T14:11:22Z

🤖 New build scheduled with the buildbot fleet by @markshannon for commit f191720 🤖

If you want to schedule another build, you need to add the ":hammer: test-with-buildbots" label again.

markshannon · 2021-10-19T23:15:31Z

Buildbot failures are the usual trio plus a timeout.

bedevere-bot · 2021-10-19T23:45:32Z

⚠️⚠️⚠️ Buildbot failure ⚠️⚠️⚠️

Hi! The buildbot s390x RHEL7 LTO 3.x has failed when building commit 3163e68.

What do you need to do:

Don't panic.
Check the buildbot page in the devguide if you don't know what the buildbots are or how they work.
Go to the page of the buildbot that failed (https://buildbot.python.org/all/#builders/402/builds/1002) and take a look at the build logs.
Check if the failure is related to this commit (3163e68) or if it is a false positive.
If the failure is related to this commit, please, reflect that on the issue and make a new Pull Request with a fix.

You can take a look at the buildbot page here:

https://buildbot.python.org/all/#builders/402/builds/1002

Failed tests:

test_pickle
test_pickletools

Summary of the results of the build (if available):

==

Click to see traceback logs

Traceback (most recent call last):
  File "/home/dje/cpython-buildarea/3.x.edelsohn-rhel-z.lto/build/Lib/multiprocessing/resource_tracker.py", line 209, in main
    cache[rtype].remove(name)
    ^^^^^^^^^^^^^^^^^^^^^^^^^
KeyError: '/psm_52734b07'


Traceback (most recent call last):
  File "/home/dje/cpython-buildarea/3.x.edelsohn-rhel-z.lto/build/Lib/multiprocessing/resource_tracker.py", line 209, in main
    cache[rtype].remove(name)
    ^^^^^^^^^^^^^^^^^^^^^^^^^
KeyError: '/psm_b71e5584'


Traceback (most recent call last):
  File "/home/dje/cpython-buildarea/3.x.edelsohn-rhel-z.lto/build/Lib/multiprocessing/resource_tracker.py", line 209, in main
    cache[rtype].remove(name)
    ^^^^^^^^^^^^^^^^^^^^^^^^^
KeyError: '/psm_d663bdb0'

pablogsal · 2021-10-27T16:18:44Z

This commit has broken thes 390x RHEL7 LTO 3.x buildbot as you can see before. As per the buildbot maintenance procedures, we will need to revert this PR unless is fixed in 24 hours.

@markshannon @Fidget-Spinner

Fidget-Spinner · 2021-10-27T16:30:30Z

@pablogsal indeed it seems this PR is guilty. I'll send a PR to revert this tomorrow. Unfortunately I don't have the bandwidth to investigate this right now. Sorry.

Fidget-Spinner · 2021-10-28T13:31:00Z

@pablogsal it seems that the buildbot is now green thanks to Dennis' patch #29048.

For clarity, the C function optimizations assume all C functions called already do their own recursion checking. This should already be the case for all vectorcall-abiding functions, but it seems that isinstance did not properly abide by our own rules 😉 .

Unfortunately the rule doesn't apply to tp_call functions. So I will have to update CALL_FUNCTION_BUILTIN_O (METH_O) to be safer too.

pablogsal · 2021-10-28T13:50:01Z

For clarity, the C function optimizations assume all C functions called already do their own recursion checking.

Right, but this is not true in all functions as we saw. Doesn't also need to be done in the function call site? What is the contact exactly here? Because if this means that all the C functions need to do the recursion check on the function body, then the change is backwards incompatible

Fidget-Spinner · 2021-10-28T14:20:17Z

For clarity, the C function optimizations assume all C functions called already do their own recursion checking.

Right, but this is not true in all functions as we saw. Doesn't also need to be done in the function call site? What is the contact exactly here? Because if this means that all the C functions need to do the recursion check on the function body, then the change is backwards incompatible

The contract is that for vectorcall functions, the callee must do their own recursion checking https://docs.python.org/3/c-api/call.html#recursion-control. For tp_call functions, CPython will check for them.

then the change is backwards incompatible

Right. Like I mentioned above, the only incompatible opcode is CALL_FUNCTION_BUILTIN_O, the other opcodes all call vectorcall functions, or builtin functions that we know have vectorcall, so they must already do their own checking. I'll add recursion checking in the caller for just that one opcode.

pablogsal · 2021-10-28T15:34:42Z

. I'll add recursion checking in the caller for just that one opcode.

Fantastic! Thanks for the explanation 👍

Fidget-Spinner added 7 commits June 26, 2021 17:59

WIP: Specialize CALL_FUNCTION for builtins

5e73b74

fix some GCC compilation warnings

1539105

hopefully fix the segfaults

68e5451

Rename to CALL_CFUNCTION and generalize to all c functions

1d841b0

fix formatting, remove redundant check

f41b623

goto fail rather than return -1

de520bd

Create 2021-06-28-22-23-59.bpo-44525.sSvUKG.rst

0e0a3a4

Fidget-Spinner requested a review from markshannon as a code owner June 28, 2021 14:33

the-knights-who-say-ni added the CLA signed label Jun 28, 2021

bedevere-bot added the awaiting review label Jun 28, 2021

Fidget-Spinner closed this Jun 28, 2021

Fidget-Spinner reopened this Jun 28, 2021

Fidget-Spinner added the 🔨 test-with-buildbots Test PR w/ buildbots; report in status section label Jun 28, 2021

bedevere-bot removed the 🔨 test-with-buildbots Test PR w/ buildbots; report in status section label Jun 28, 2021

pablogsal requested changes Jun 28, 2021

View reviewed changes

bedevere-bot removed the awaiting review label Jun 28, 2021

bedevere-bot added the awaiting changes label Jun 28, 2021

Fidget-Spinner added the DO-NOT-MERGE label Jun 29, 2021

markshannon requested changes Jun 29, 2021

View reviewed changes

Python/specialize.c Outdated Show resolved Hide resolved

Python/ceval.c Outdated Show resolved Hide resolved

Python/ceval.c Outdated Show resolved Hide resolved

Include/internal/pycore_code.h Outdated Show resolved Hide resolved

Include/internal/pycore_code.h Outdated Show resolved Hide resolved

Apply easier suggestions from Mark's review

65de42d

Fidget-Spinner added 2 commits October 19, 2021 00:54

remove typo

8b113d1

remove nit

9642df5

Fidget-Spinner commented Oct 18, 2021

View reviewed changes

Python/specialize.c Outdated Show resolved Hide resolved

fix wrong return code

8a74cff

markshannon requested changes Oct 18, 2021

View reviewed changes

Python/specialize.c Outdated Show resolved Hide resolved

Python/specialize.c Outdated Show resolved Hide resolved

Python/ceval.c Outdated Show resolved Hide resolved

Python/ceval.c Outdated Show resolved Hide resolved

Fidget-Spinner added 2 commits October 19, 2021 02:44

partly address code review

907c5cb

Exclude function if not collecting stats

3e09485

Fidget-Spinner added 3 commits October 19, 2021 18:43

check for error first

b28d85c

Record cache hit earlier

617424b

fix isinstance bug

e73b69f

markshannon requested changes Oct 19, 2021

View reviewed changes

Python/ceval.c Show resolved Hide resolved

Python/ceval.c Outdated Show resolved Hide resolved

Python/ceval.c Outdated Show resolved Hide resolved

apply suggestions from review: move up cache hits

f191720

markshannon added the 🔨 test-with-buildbots Test PR w/ buildbots; report in status section label Oct 19, 2021

bedevere-bot removed the 🔨 test-with-buildbots Test PR w/ buildbots; report in status section label Oct 19, 2021

markshannon merged commit 3163e68 into python:main Oct 19, 2021

bedevere-bot removed the awaiting changes label Oct 19, 2021

Fidget-Spinner deleted the call_function_specialize branch October 21, 2021 15:04

Fidget-Spinner mentioned this pull request Oct 28, 2021

bpo-44525: Add recursive checks for CALL_FUNCTION_BUILTIN_O #29271

Merged

Fidget-Spinner mentioned this pull request Sep 9, 2022

Implement CALL_FUNCTION adaptive interpreter optimizations #88691

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bpo-44525: Specialize `CALL_FUNCTION` for C function calls #26934

bpo-44525: Specialize `CALL_FUNCTION` for C function calls #26934

Fidget-Spinner commented Jun 28, 2021 •

edited

Loading

Fidget-Spinner commented Jun 28, 2021

bedevere-bot commented Jun 28, 2021

pablogsal left a comment

bedevere-bot commented Jun 28, 2021

pablogsal commented Jun 28, 2021

pablogsal commented Jun 28, 2021

isidentical commented Jun 28, 2021 •

edited

Loading

pablogsal commented Jun 28, 2021

Fidget-Spinner commented Jun 28, 2021

Fidget-Spinner commented Jun 29, 2021

markshannon left a comment

markshannon commented Jun 29, 2021

markshannon left a comment

markshannon commented Oct 19, 2021

Fidget-Spinner commented Oct 19, 2021

markshannon left a comment

bedevere-bot commented Oct 19, 2021

markshannon commented Oct 19, 2021

bedevere-bot commented Oct 19, 2021

pablogsal commented Oct 27, 2021

Fidget-Spinner commented Oct 27, 2021

Fidget-Spinner commented Oct 28, 2021 •

edited

Loading

pablogsal commented Oct 28, 2021 •

edited

Loading

Fidget-Spinner commented Oct 28, 2021

pablogsal commented Oct 28, 2021

bpo-44525: Specialize CALL_FUNCTION for C function calls #26934

bpo-44525: Specialize CALL_FUNCTION for C function calls #26934

Conversation

Fidget-Spinner commented Jun 28, 2021 • edited Loading

Fidget-Spinner commented Jun 28, 2021

bedevere-bot commented Jun 28, 2021

pablogsal left a comment

Choose a reason for hiding this comment

bedevere-bot commented Jun 28, 2021

pablogsal commented Jun 28, 2021

pablogsal commented Jun 28, 2021

isidentical commented Jun 28, 2021 • edited Loading

pablogsal commented Jun 28, 2021

Fidget-Spinner commented Jun 28, 2021

Fidget-Spinner commented Jun 29, 2021

markshannon left a comment

Choose a reason for hiding this comment

markshannon commented Jun 29, 2021

markshannon left a comment

Choose a reason for hiding this comment

markshannon commented Oct 19, 2021

Fidget-Spinner commented Oct 19, 2021

markshannon left a comment

Choose a reason for hiding this comment

bedevere-bot commented Oct 19, 2021

markshannon commented Oct 19, 2021

bedevere-bot commented Oct 19, 2021

⚠️⚠️⚠️ Buildbot failure ⚠️⚠️⚠️

pablogsal commented Oct 27, 2021

Fidget-Spinner commented Oct 27, 2021

Fidget-Spinner commented Oct 28, 2021 • edited Loading

pablogsal commented Oct 28, 2021 • edited Loading

Fidget-Spinner commented Oct 28, 2021

pablogsal commented Oct 28, 2021

bpo-44525: Specialize `CALL_FUNCTION` for C function calls #26934

bpo-44525: Specialize `CALL_FUNCTION` for C function calls #26934

Fidget-Spinner commented Jun 28, 2021 •

edited

Loading

isidentical commented Jun 28, 2021 •

edited

Loading

Fidget-Spinner commented Oct 28, 2021 •

edited

Loading

pablogsal commented Oct 28, 2021 •

edited

Loading