Skip to content

Conversation

@b-pass
Copy link
Contributor

@b-pass b-pass commented Mar 16, 2025

Description

This PR add the ability for pybind11 modules to support subinterpreters. This support requires 2 things:

  1. Multiphase init (implemented in feat: change PYBIND11_MODULE to use multi-phase init (PEP 489) #5574)
  2. internals and local_internals have to have an instance per-interpreter (can no longer be static singletons), which is the primary subject of this PR.

Multiphase init

The PR adds tags which can be passed as the 3rd argument to the PYBIND11_MODULE definition macro (in addition to the existing mod_gil_not_used). If the tags are not specified, the module is defined without multiple interpreter support (as they are today).

If multiple interpreters is example, then when a module is imported a second time in a sub-interpreter, the module's exec slot is run again. For pybind11 this means the user's the module init function is re-run in the sub-interpreter. That's good, because the sub-interpreter needs it's own type_info for all of the bindings.

internals

This presents the problem that the place that pybind11 stores these is currently a singleton. But sub-interpreters need this state to be per-interpreter. That means the correct instance (for the current active interpreter) needs to be retrieved from the interpreter state dict. Fortunately, the internals pointer-to-pointer is already stored in the state dict.

In order to minimize performance costs, we can detect whether or not multi-subinterpreters are present by counting how many times the module has been imported. If it has only been imported once then it can only possibly have one internals (even if there are other sub-interpreters where it was not imported). When it has been imported more times, then we need to do additional work to make sure the right internals object is used (the one associated with the current interpreter in the current thread). We can switch between these two cases with a single simple branch, thus causing minimal performance overhead for existing code.

In the multi-interpreter case we would still like to minimized the cost of accessing internals, we don't want to have to reach into the interpreter state dict every time. So we cache the value in a thread_local along with the pointer to the PyInterpreterState to which it belongs. This means that the slow path (acquiring the GIL, doing a dict lookup, etc) is only done when the active PyInterpreterState changes (or the first time get_internals is called in an OS thread). So the fast path merely checks that the PyInterpreterState hasn't changed, and then returns the previously looked up value.

local_internals also has to change in a similar way to internals.

Both of these now share some templated code to manage their thread-local cache and the state-dict cache. Some code around these has also been cleaned up (like moving most of the internals initialization into its constructor).

Memory management / Future work

This PR does not add support for creating / deleting / switching between sub-interpreters.

In embed, pybind11 only cleans up the internals and local_interals associated with the main interpreter (when it is finalized). Since it doesn't currently manage any subinterpreters it can't clean up after them.

Suggested changelog entry:

* Support for sub-interpreters (both isolated (with separate GILs) and legacy (with a global GIL). Add the ``py::multiple_interpreters::per_interpreter_gil()`` tag (or, ``py::multiple_interpreters::shared_gil()`` for legacy interpreter support) to ``PYBIND11_MODULE`` calls (as the third parameter) to indicate that a module supports running with sub-interpreters.

The guide needs to add a short mention of py::multiple_interpreters::per_interpreter_gil() and py::multiple_interpreters::shared_gil() tags.

@b-pass b-pass requested a review from henryiii as a code owner March 16, 2025 01:25
@b-pass b-pass force-pushed the subinterpreters branch 2 times, most recently from 59a2076 to 02e9609 Compare March 16, 2025 01:31
@b-pass b-pass marked this pull request as draft March 16, 2025 01:32
@b-pass b-pass force-pushed the subinterpreters branch 5 times, most recently from 236cc25 to 75d55f3 Compare March 16, 2025 02:05
@b-pass b-pass marked this pull request as ready for review March 16, 2025 02:55
@b-pass b-pass force-pushed the subinterpreters branch 2 times, most recently from df2fcc6 to e64d19f Compare March 16, 2025 19:30
@henryiii
Copy link
Collaborator

Very excited to see this! I have a couple of comments/questions:

  • What happens if someone doesn't define PYBIND11_SUBINTERPRETER_SUPPORT but tries to use the new tags? I would think you'd want a compile-time error?
  • Do you know how much of an impact PYBIND11_SUBINTERPRETER_SUPPORT has? Just curious what constitutes "small". Having a compile time opt-in is a little unfortunate, but having it as a default-off option sounds safe for now, though.
  • I believe communication between extensions even with different compile definitions is still supported due to @rwgk's interface work.

We'll need some docs, too. Maybe we should do a full test run with the define on, too.

Copy link
Collaborator

@rwgk rwgk left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a complex PR. I need to find more time later for a full review.

High-level questions:

  • Could it be useful to split this PR: 1. multi-phase init only. 2. multi-interpreter support? — That would make it easier to do the reviews now, and understand the development steps in the future. It might also help us dealing with bugs after this change is released.
  • Are you still working on a new Python tests, to exercise the new multi-interpreter support?
  • Is there a potential for bug or feature interference between free-threading and multi-interpreter functionality? — I think we'll need tests for all combinations of (free-threading on/off) x (multi-interpreter support on/off); not for all platforms, but maybe one each: Linux, macOS, Windows.

@b-pass
Copy link
Contributor Author

b-pass commented Mar 20, 2025

* Could it be useful to split this PR: 1. multi-phase init only. 2. multi-interpreter support? — That would make it easier to do the reviews now, and understand the development steps in the future. It might also help us dealing with bugs after this change is released.

Sure. I have created #5574 for multi-phase init only. I will keep updating this PR for sub-interpreter support, and will remove the multiphase init from this branch shortly.

* Are you still working on a new Python tests, to exercise the new multi-interpreter support?

I'm not currently, but I can add a few more after/along with some additional changes from comments.

* Is there a potential for bug or feature interference between free-threading and multi-interpreter functionality? — I think we'll need tests for all combinations of (free-threading on/off) x (multi-interpreter support on/off); not for all platforms, but maybe one each: Linux, macOS, Windows.

No one would ever say less testing :) I think the potential problems are small. While they have similar goals, free threading and own-gil-sub-interpreters are fairly different and could be used together. Sub-interpreters were originally created (I think) to offer sandboxing features. So even with free threading, the idea that a module is used in two different sandboxes is still valid, and it still needs separation for each instance.

Also I considered whether free-threading is a superset of sub-interpreter support. I think it is not, for the same reason that the two have slightly different implications for a module. With free threading (only) it is perfectly reasonable to have, for example, a global/static atomic variable. With sub-interpreters that is probably incorrect, because each sub-interpreter should have its own separate state. However, if a module is free-threading safe (so, thread safe) and it is multi-interpreter aware (as no global state) then it should also be own-gil-sub-interpreter safe... that is, it doesn't need GIL synchronization across the many multiple subinterpreter states, which must be true or it would not be free-threading safe.

@wjakob
Copy link
Member

wjakob commented Mar 20, 2025

My 2cts: TLS in shared libraries is real disaster (especially C++ thread_local involving nontrivial types). The following writeup is enlightening: https://yosefk.com/blog/cxx-thread-local-storage-performance.html. FWIW I saw significant perf gains in nanobind by basically removing all use of TLS except for a handful of rare/complex cases.

So adding more TLS to the internals data structure in general sounds like a pretty significant performance sink. I would encourage you to thoroughly benchmark function/method calling on Windows/macOS/Linux to see how bad this is, and to what extent these costs can be mitigated.

@b-pass b-pass marked this pull request as draft March 20, 2025 22:57
@henryiii
Copy link
Collaborator

FYI, this is a perfect example of where I'd personally always rebase and force push. ;)

@b-pass
Copy link
Contributor Author

b-pass commented Mar 22, 2025

My 2cts: TLS in shared libraries is real disaster (especially C++ thread_local involving nontrivial types). The following writeup is enlightening: https://yosefk.com/blog/cxx-thread-local-storage-performance.html. FWIW I saw significant perf gains in nanobind by basically removing all use of TLS except for a handful of rare/complex cases.

Luckily, these are all zero-initialized pointers, so no constructors. And I think multiple interpreters in multiple threads a the same time might qualify as a complex case. Still, point taken, on this expert advice I have made a bunch of changes to get the thread_local s completely out of the path of code that isn't running multiple interpreters. It takes just one branch to do so, so that's pretty much the best we could hope for performance wise.

Unfortunately, in the multi-interpreter case there really isn't any choice but to use thread_local. The TLS is there to avoid an even more costly dict lookup in the interpreter state dict.

@b-pass
Copy link
Contributor Author

b-pass commented Mar 22, 2025

I would encourage you to thoroughly benchmark function/method calling on Windows/macOS/Linux to see how bad this is, and to what extent these costs can be mitigated.

Looks like, with the rewrite, the cost is an extra 0.22ns per call to get_details (an increase of about 15%). IMO that is a very small cost, unlikely any real-world usage will notice it.

The cost is much more significant when a subinterpreter is actually created, the cost to access the internals triples. In my opinion, that's just the cost of using subinterpreters.

... I've added this information to the PR description.

@b-pass b-pass marked this pull request as ready for review March 22, 2025 14:17
@XuehaiPan
Copy link
Contributor

XuehaiPan commented Mar 23, 2025

Hi, I wonder if you have read these:

To support sub-interpreters, I think we also need to implement PEP 573 and PEP 630. To clarify:


I think it would be really hard to make PYBIND11_MODULE to both support module isolation while also keeping backward compatibility. Maybe we should add a new API PYBIND11_ISOLATED_MODULE. Or Opt-Out: Limiting to One Module Object per Process.

@b-pass
Copy link
Contributor Author

b-pass commented Mar 23, 2025

To support sub-interpreters, I think we also need to implement PEP 573 and PEP 630.

The goal of these is to get rid of global state, and replace it with state that is tied to the module instance. While implementing this according to the python guides will definitely accomplish that goal -- and that might be the best way to do it --, that is not the only way to accomplish it. I think strict adherence to these would require major rewrites of several parts of pybind11, but I don't think that is necessary to support sub-interpreters/multi-interpreters. Definitely following PEP573 would make a module work with sub-interpreters. It is a sufficient condition but not a necessary condition.

Pybind11's global state is entirely contained within internals and local_internals. This PR makes these into per-interpreter state which is similar to how they behave today (when there is only one interpreter). But not into per-module state (how python suggests that state be managed). The existing implementation with global static state isn't compliant with PEP 573, this PR does not get pybind11 any closer to that either. Pybind11 has some problems that PEP 573 would solve already, and this PR doesn't fix them but it doesn't introduce any new issues AFAIK.

* [Changing Static Types to Heap Types](https://docs.python.org/3.13/howto/isolating-extensions.html#changing-static-types-to-heap-types)

I don't think this is required, since Pybind11's types are managed by its internals structures, they already are not globally static in the strict sense. Converting them to use Type_Spec and Slots is IMO unrelated to this PR.

(Edit: or, maybe pybind11 is already doing heap types? At least, some of them are...)

I think it would be really hard to make PYBIND11_MODULE to both support module isolation while also keeping backward compatibility.

I agree that is probably impossible. My goal here isn't full module isolation, pybind11 already doesn't have module isolation and it can't be added. That doesn't mean it can't support sub-interpreters. Maybe another way to think about it is that this PR adds interpreter isolation without adding module isolation. The examples you linked explain the kinds of problems that non-isolated modules have, which existing pybind11 modules all have, and they would continue to have after this PR.

@henryiii
Copy link
Collaborator

I've run @wjakob's benchmarks for nanobind on this PR. The binary size might be a bit off, since I can't run strip due to needing undefined symbols, but I don't think that has any affect here anyway. I couldn't get nanobind to load, so I had to take it off the runtime plot.

image

image

image

This seems to have a noticeable impact on debug (unoptimized) performance, but not really noticeable on runtime, probably within the uncertainty margins. I'd love for the runtime cost to go down (there's an old PR that did that, but not usable anymore) instead of up, but this looks acceptable to avoid complications building.

@b-pass
Copy link
Contributor Author

b-pass commented May 14, 2025

@rwgk OK, I think it's fixed in 89e9599 . The problem was concurrent threads importing the same module at the same time overwrote the (global, static) module_def and slots, which occasionally would stomp on each other before their init was finished. Fixed (in detail/common.h PYBIND11_MODULE) by wrapping the init with a static so it only happens once and is atomic. (Once it is init'd, it that part does not need to be done again in order for the module to be used in other interpreters.)

I might try to do -fsanitize=address in a future PR (no promises!).....

I'd make module_options a namespace, no need for chained (). And we could use "mod", since that's the CPython term for it. So py::mod::multiple_interpreters() (or py::mod::multiple_interpeters_supported(), to exactly match the CPython API term).

Sound good, but a question for @b-pass: are all combinations of options valid?

gil_not_used can be mixed with the multiple interpreter options. Only one of the multiple_interpreters options can be specified ... per_interpreter_gil is more constrained than shared_gil, so in the presence of both it is safe to report per_interpreter_gil.

So is the suggestion to change the current py::multiple_interpreters::per_interpreter_gil()/py::multiple_interpreters::shared_gil() to py::mod::multiple_interpreters_per_interpreter_gil() / py::mod::multiple_interpreters_shared_gil()? Unfortunately, I don't think the CPython naming is very clear .. the option is named Py_mod_multiple_interpreters but the values are Py_MOD_MULTIPLE_INTERPRETERS_SUPPORTED (old/legacy/shared gil) and Py_MOD_PER_INTERPRETER_GIL_SUPPORTED (new/default/multi gil).

@henryiii
Copy link
Collaborator

henryiii commented May 14, 2025

Personally, I'm fine with the current naming, which makes it clearer that you can pick one multiple_interperters option. Otherwise, it would be py::mod::multiple_interpreters() and py::mod::per_interpreter_gil(), I think. Optionally we could add the _supported.

@b-pass
Copy link
Contributor Author

b-pass commented May 14, 2025

Ok, let's stick with the current implementation then. I added a PR for docs, #5659.

@henryiii
Copy link
Collaborator

@rwgk if the error has gone away and you're OK with the naming, then I think we're good to go.

@rwgk
Copy link
Collaborator

rwgk commented May 14, 2025

@rwgk if the error has gone away and you're OK with the naming, then I think we're good to go.

I still see the error :-(

I tried a couple times from scratch.

I'll click the Update branch button here, which will switch us over to the 24.04 runners. Let's see if the error shows up in the CI, too.

@rwgk
Copy link
Collaborator

rwgk commented May 14, 2025

I ran this:

#!/bin/bash
set -x
for i in {1..100}; do
    (cd /home/rgrossekunst/bld/cmake_build/tests/test_embed && /home/rgrossekunst/bld/cmake_build/tests/test_embed/test_embed)
done

With this outcome:

$ grep '1514 passed | 1 failed' ~/log_run100_2025-05-13+2057.txt | wc -l
100

I.e. it fails every single time.

IIUC, it only fails sometimes for @b-pass?

It's a mystery to me what's different on my machine.

@rwgk
Copy link
Collaborator

rwgk commented May 14, 2025

Again with MinSizeRel all else equal:

rm -rf cmake_build
cmake -S $HOME/forked/pybind11 -B cmake_build -DCMAKE_VERBOSE_MAKEFILE=ON -DDOWNLOAD_CATCH=ON -DPYTHON_EXECUTABLE=$(which python3) -DCMAKE_CXX_STANDARD=17 -DCMAKE_BUILD_TYPE=MinSizeRel -DPYBIND11_PYTEST_ARGS=-vv
0db6015-lcedt.nvidia.com:~/forked/pybind11 $ grep '1514 passed | 1 failed' ~/log_run100_2025-05-13+2107.txt | wc -l
100

Not a single success, again.

@rwgk
Copy link
Collaborator

rwgk commented May 14, 2025

This crossed my mind (PYTHONPATH):

#!/bin/bash
set -x
unset PYTHONPATH
printenv | grep PYTHONPATH
for i in {1..100}; do
    (cd /home/rgrossekunst/bld/cmake_build/tests/test_embed && /home/rgrossekunst/bld/cmake_build/tests/test_embed/test_embed)
done

But still:

$ grep '1514 passed | 1 failed' ~/log_run100_2025-05-13+2154.txt | wc -l
100

(I'm using a python in a venv, I'll try to eliminate that, too.)

@rwgk
Copy link
Collaborator

rwgk commented May 14, 2025

After resetting my PATH to /usr/sbin:/usr/bin:/sbin:/bin the error goes away!

I rebuilt from scratch after resetting PATH.

$ grep 'All tests passed ' ~/log_run100_2025-05-13+2204.txt | wc -l
100

I have to figure out what on my regular PATH makes it upset. Not sure if I can still do that tonight.

@henryiii
Copy link
Collaborator

henryiii commented May 14, 2025

Did that change which version of Python was being picked up? It only affects newer Pythons.

@rwgk
Copy link
Collaborator

rwgk commented May 14, 2025

Did that change which version of Python was being picked up? It only affects newer Pythons.

Yes and no.

Let's call this PathA:

export PATH=/home/rgrossekunst/venvs/0db6015-lcedt/base/bin:/usr/sbin:/usr/bin:/sbin:/bin

This is PathB:

export PATH=/usr/sbin:/usr/bin:/sbin:/bin

For completeness: In what follows, it does NOT matter if I had PathA or PathB while running the cmake and make commands.

But at runtime:

  • test_embed fails if PathA is active.
  • test_embed passes if PathB is active.

I can switch back and forth, the behavior is 100% deterministic.

Interestingly (but as I expected):

$ export PATH=/usr/sbin:/usr/bin:/sbin:/bin
$ which python
/usr/bin/python
$ realpath "$(which python)"
/usr/bin/python3.12
$ export PATH=/home/rgrossekunst/venvs/0db6015-lcedt/base/bin:/usr/sbin:/usr/bin:/sbin:/bin
$ which python
/home/rgrossekunst/venvs/0db6015-lcedt/base/bin/python
$ realpath "$(which python)"
/usr/bin/python3.12

Open question: How can that lead to the test failure?

@rwgk
Copy link
Collaborator

rwgk commented May 14, 2025

I'm getting the same behavior when using the master branch!

Note: Usually I run my tests with a scons-based build. With that, tests pass for master and this PR.

This is a bit twisty: I had to tweak my scons files to make it work for this PR. To avoid that before, I chose to simply use the cmake build instead. I totally didn't expect that to fail on master. I ran the cmake-based tests many times in the past, and it used to work, but it's only fairly recently that I started using a venv in my log-in environment.

But back to the cmake build:

This fails as reported before:

$ echo $PATH
/home/rgrossekunst/venvs/0db6015-lcedt/base/bin:/usr/sbin:/usr/bin:/sbin:/bin
$ (cd /home/rgrossekunst/bld/cmake_build/tests/test_embed && /home/rgrossekunst/bld/cmake_build/tests/test_embed/test_embed)

These work, everything else exactly equal:

$ (cd /home/rgrossekunst/bld/cmake_build/tests/test_embed && PATH=/usr/bin /home/rgrossekunst/bld/cma
ke_build/tests/test_embed/test_embed)
===============================================================================
All tests passed (1581 assertions in 19 test cases)

$ (cd /home/rgrossekunst/bld/cmake_build/tests/test_embed && PATH= /home/rgrossekunst/bld/cmake_build
/tests/test_embed/test_embed)
===============================================================================
All tests passed (1581 assertions in 19 test cases)

I don't know what to do about this, but it's pretty much certain now that the weird error has nothing to do with this PR.

@rwgk
Copy link
Collaborator

rwgk commented May 14, 2025

@b-pass wrote:

gil_not_used can be mixed with the multiple interpreter options.

OK, then my suggestion with the chaining doesn't make sense, and the current naming is fine.

@henryiii wrote:

if the error has gone away and you're OK with the naming, then I think we're good to go.

Awesome, I'll merge this now.

Ideally we should do something about the weird test_embed behavior, but certainly not in this PR.

@rwgk rwgk merged commit 95e8f89 into pybind:master May 14, 2025
65 checks passed
@github-actions github-actions bot added the needs changelog Possibly needs a changelog entry label May 14, 2025
@henryiii
Copy link
Collaborator

There's something wrong with the embed tests in CMake. I'll have to try to work it out, but I can't get them to pass locally, haven't for a while. But they work on many of our CI jobs (a few don't have the requirements present to run them).

@rwgk
Copy link
Collaborator

rwgk commented May 14, 2025

There's something wrong with the embed tests in CMake. I'll have to try to work it out, but I can't get them to pass locally, haven't for a while. But they work on many of our CI jobs (a few don't have the requirements present to run them).

I don't think it's cmake specific (maybe you didn't mean that, but JIC, to clarify). My scons-based test just happens to work interactively because I'm doing this (read that code from the bottom up):

  test_embed_dirpath = os.path.join(tests_dirpath, "test_embed")                
  test_embed, list_of_test_py = build_list_of_tests(tests_dirpath, substrings)  
  env = {"PYTHONPATH": normabspath("lib")}                                      
  if test_embed:                                                                
    bin_test_embed = normabspath("bin/test_embed")                              
    print('(cd "%s" && PYTHONPATH="%s" "%s")' % (                               
        test_embed_dirpath, env["PYTHONPATH"], bin_test_embed))                 
    sys.stdout.flush()                                                          
    subprocess.call([bin_test_embed], cwd=test_embed_dirpath, env=env)          

What I didn't realize in the past: passing in env like this means PATH is not set.

I believe if you change the cmake files to do this

(cd /home/rgrossekunst/bld/cmake_build/tests/test_embed && PATH= /home/rgrossekunst/bld/cmake_build
/tests/test_embed/test_embed)

local testing will pass for you, too.

@henryiii
Copy link
Collaborator

henryiii commented May 15, 2025

I'm now getting

FAILED test_multiple_interpreters.py::test_independent_subinterpreters - ModuleNotFoundError: No module named 'interpreters'
FAILED test_multiple_interpreters.py::test_dependent_subinterpreters - ModuleNotFoundError: No module named 'interpreters'

on the 3.14 PR in #5646. I'll try to look at why later, heading to PyCon.

@henryiii henryiii changed the title Support for sub-interpreters feat: support for sub-interpreters May 17, 2025
@henryiii henryiii removed the needs changelog Possibly needs a changelog entry label May 17, 2025
// Slightly faster code paths are available when PYBIND11_SUBINTERPRETER_SUPPORT is *not* defined,
// so avoid defining it for implementations that do not support subinterpreters.
// However, defining it unnecessarily is not expected to break anything.
#if PY_VERSION_HEX >= 0x030C0000 && !defined(PYPY_VERSION) && !defined(GRAALVM_PYTHON)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@rwgk
Copy link
Collaborator

rwgk commented Jun 14, 2025

@b-pass I just saw the flake below:

https://github.com/pybind/pybind11/actions/runs/15654733200/job/44104243287?pr=5728

A rerun worked.

Does this ring any bells?

Run Build Command(s): /usr/local/bin/ninja -v cpptest
[0/2] /usr/local/bin/cmake -P /home/runner/work/pybind11/pybind11/build/CMakeFiles/VerifyGlobs.cmake
[1/2] cd /home/runner/work/pybind11/pybind11/build/tests/test_embed && /home/runner/work/pybind11/pybind11/build/tests/test_embed/test_embed
FAILED: tests/test_embed/CMakeFiles/cpptest /home/runner/work/pybind11/pybind11/build/tests/test_embed/CMakeFiles/cpptest 
cd /home/runner/work/pybind11/pybind11/build/tests/test_embed && /home/runner/work/pybind11/pybind11/build/tests/test_embed/test_embed
Exception ignored in: <module 'threading' from '/opt/hostedtoolcache/Python/3.12.11/x64/lib/python3.12/threading.py'>
Traceback (most recent call last):
  File "/opt/hostedtoolcache/Python/3.12.11/x64/lib/python3.12/threading.py", line 1602, in _shutdown
    assert tlock.locked()
           ^^^^^^^^^^^^^^
AssertionError: 
mremap_chunk(): invalid pointer

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
test_embed is a Catch v2.13.10 host application.
Run with -? for options

-------------------------------------------------------------------------------
Per-Subinterpreter GIL
-------------------------------------------------------------------------------
/home/runner/work/pybind11/pybind11/tests/test_embed/test_subinterpreter.cpp:293
...............................................................................

/home/runner/work/pybind11/pybind11/tests/test_embed/test_subinterpreter.cpp:293: FAILED:
  {Unknown expression after the reported line}
due to a fatal error condition:
  SIGABRT - Abort (abnormal termination) signal

===============================================================================
test cases:   20 |   19 passed | 1 failed
assertions: 1588 | 1587 passed | 1 failed

Fatal Python error: Aborted

Current thread 0x00007f3ef7fff6c0 (most recent call first):
  File "<frozen importlib._bootstrap_external>", line 757 in _compile_bytecode
  File "<frozen importlib._bootstrap_external>", line 11[28](https://github.com/pybind/pybind11/actions/runs/15654733200/job/44104243287?pr=5728#step:13:29) in get_code
  File "<frozen importlib._bootstrap_external>", line 995 in exec_module
  File "<frozen importlib._bootstrap>", line 935 in _load_unlocked
  File "<frozen importlib._bootstrap>", line 13[31](https://github.com/pybind/pybind11/actions/runs/15654733200/job/44104243287?pr=5728#step:13:32) in _find_and_load_unlocked
  File "<frozen importlib._bootstrap>", line 1360 in _find_and_load
  File "<frozen importlib._bootstrap>", line 488 in _call_with_frames_removed
  File "<frozen importlib._bootstrap>", line 1415 in _handle_fromlist
  File "/opt/hostedtoolcache/Python/3.12.11/x64/lib/python3.12/encodings/__init__.py", line [33](https://github.com/pybind/pybind11/actions/runs/15654733200/job/44104243287?pr=5728#step:13:34) in <module>
  File "<frozen importlib._bootstrap>", line 488 in _call_with_frames_removed
  File "<frozen importlib._bootstrap_external>", line 999 in exec_module
  File "<frozen importlib._bootstrap>", line 9[35](https://github.com/pybind/pybind11/actions/runs/15654733200/job/44104243287?pr=5728#step:13:36) in _load_unlocked
  File "<frozen importlib._bootstrap>", line 1331 in _find_and_load_unlocked
  File "<frozen importlib._bootstrap>", line 1[36](https://github.com/pybind/pybind11/actions/runs/15654733200/job/44104243287?pr=5728#step:13:37)0 in _find_and_load
Aborted (core dumped)
ninja: build stopped: subcommand failed.

@b-pass
Copy link
Contributor Author

b-pass commented Jun 14, 2025

Does this ring any bells?

Not particularly. Based on the tlock I think this happened in another thread while trying to shut down a subinterpreter

mremap_chunk(): invalid pointer

This makes me think something (such as a double free) in that MR corrupted the heap.

@henryiii
Copy link
Collaborator

@b-pass
Copy link
Contributor Author

b-pass commented Jun 16, 2025

#5729 might fix it (but maybe not, I have not been able to reproduce it locally)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants