Ability to replace current Node process with another

**Edit:** If someone can come up with a better shim for `execve` for Windows, that'd be *far* better. The form below is *very* expensive and *very* horrible.

**Edit 2:** Linked relevant [SO question](https://stackoverflow.com/questions/51185115/what-is-the-ideal-way-to-emulate-process-replacement-on-windows).

**Edit 3:** Clarify FS changes

**Edit 4:** Here's the text from that SO question as of July 6, 2018 (so you don't have to search for it), where I asked about how to do the Windows part.

<details>
<summary>Click to show (warning: lots of text)</summary>

So, in a [feature request I filed against Node.js](https://github.com/nodejs/node/issues/21664), I was looking for a way to replace the current Node process with another. In Linux and friends (really, any POSIX-compliant system), this is easy: use [`execve`](http://man7.org/linux/man-pages/man2/execve.2.html) and friends and call it a day. But obviously, that won't work on Windows, since it only has `CreateProcess` (which `execve` and friends delegate to, [complete with async behavior](https://stackoverflow.com/questions/49736973/blocking-version-of-execvp-windows)). And it's not like [people](https://stackoverflow.com/questions/35111313/windows-exec-equivalent) [haven't](https://stackoverflow.com/questions/6743567/replace-current-process-with-invocation-of-subprocess) [wanted](https://stackoverflow.com/questions/7198666/strategies-for-replacing-program-executable-in-windows) [to](https://stackoverflow.com/questions/198122/how-can-i-replace-the-current-java-process-like-a-unix-style-exec) [do](https://stackoverflow.com/questions/5450147/how-to-replace-the-current-java-process-in-windows-using-jna-jni) [similar](https://stackoverflow.com/questions/45607959/restart-windows-process-inplace-preserving-process-id-and-handles), leading to [numerous duplicate questions on this site](https://www.google.com/search?q=windows+replace+current+process+site:stackoverflow.com). (This isn't a duplicate because it's explicitly seeking a workaround given certain constraints, not just asking for direct replacement.)

Process replacement has several facets that have to addressed:

1. All console I/O streams have to be forwarded to the new process.
1. All signals need transparently forwarded to the new process.
1. The data from the old process have to be destroyed, with as many resources reclaimed as possible.
1. All pre-existing threads and child processes should be destroyed.
1. All pre-existing handles should be destroyed apart from open file descriptors and named pipes/etc.
1. Optimally, the old process's memory should be kept to a minimum after the process is created.
1. For my particular use case, retaining the process ID is not important.

And for my particular case, there are a few constraints:

1. I can control the initial process's startup as well as the location of my "process replacement" function.
1. I could load arbitrary native code via add-ons at potentially any stack offset.
    - Implication: I can't even dream of tracking `malloc` calls, handles, thread manipulation, or process manipulation to track and free them all, since DLL rewriting isn't exactly practical.
1. I have no control over *when* my "process replacement" is called. It could be called through an add-on, which could've been called through either interpreted code via FFI or even another add-on recursively. It could even be called during add-on initialization.
    - Implication: I would have no ability to know what's in the stack, even if I perfectly instrumented my side. And rewriting all their `call`s and `push`es is far from practical, and would just be all-around slow for obvious reasons.

So, here's the gist of what I was thinking: use something similar to a pseudo-trampoline.

1. Statically allocate the following:
    1. A single pointer for the stack pointer.
    1. `MAX_PATH + 1` chars for the application path + `'\0'`.
    1. `MAX_PATH + 1` chars for the current working directory path + `'\0'`.
    1. 32768 chars for the arguments + `'\0'`.
    1. 32768 chars for the environment + `'\0'`.
1. On entry, set the global stack pointer reference to the stack pointer.
1. On "replacement":
    1. Do relevant process cleanup and lock/release everything you can.
    1. Set the stack pointer to the stored original global one.
    1. Terminate each child thread.
    1. Kill each child process.
    1. Free [each open handle](https://stackoverflow.com/questions/733384/how-to-enumerate-process-handles).
    1. If possible (i.e. not in a UWP program), [For each heap](https://docs.microsoft.com/en-us/windows/desktop/api/heapapi/nf-heapapi-getprocessheaps), [destroy it](https://docs.microsoft.com/en-us/windows/desktop/api/HeapApi/nf-heapapi-heapdestroy) if it's not the [default heap](https://docs.microsoft.com/en-us/windows/desktop/api/HeapApi/nf-heapapi-getprocessheap) or the temporary heap (if it exists).
    1. If possible, close [each open handle](https://stackoverflow.com/questions/733384/how-to-enumerate-process-handles).
    1. If possible, [walk](https://docs.microsoft.com/en-us/windows/desktop/api/HeapApi/nf-heapapi-heapwalk) the default heap and [free](https://docs.microsoft.com/en-us/windows/desktop/api/HeapApi/nf-heapapi-heapfree) each segment associated with it.
    1. Create a new process with the statically allocated file/arguments/environment/etc. with no new window created.
    1. Proxy all future received signals, exceptions, etc. without modification to this process somehow. [The standard signals are easy](https://docs.microsoft.com/en-us/windows/console/setconsolectrlhandler), but not so much with the exceptions.
    1. Wait for the process to end.
    1. Return with [the process's exit code](https://docs.microsoft.com/en-us/windows/desktop/api/processthreadsapi/nf-processthreadsapi-getexitcodeprocess).

The idea here is to use a process-based trampoline and drop the current process size to an absolute minimum while the newly created one is started.

But where I'm not very familiar with Windows, I probably made quite a few mistakes here. Also, the above seems *extremely* inefficient and to an extent it just feels horribly wrong for something a kernel could just release a few memory pages, deallocate a bunch of memory handles, and move some memory around for the next process.

So, to summarize, what's the ideal way to emulate process replacement on Windows with the fewest limitations?
</details>

-----

I would like a means to "replace" the current Node process with another, keeping the same process ID. It would be something morally similar to [this function](https://github.com/isiahmeadows/thallium/blob/master/lib/cli/util.js#L92-L111), but it wouldn't return. This would be most useful for conditionally replacing Node flags in a startup script - for example, if someone wants to enable modules and your behavior needs to change non-trivially in the presence of them (like if you need to install a default loader), you'll want to respawn the process with `--experimental-modules --loader <file>` so you can install the loader.

This is also for scenarios when you want to run a module as a `main` module. If you want to do logic after the process ends, you should be using `child_process.spawn` regardless - you shouldn't be attempting to "replace" it in any capacity.

Here's what I propose:

- `child_process.replaceSpawn(command [ , args] [ , options ])`
    - `command` is the path to the new command.
    - `args` is the args to replace the arguments with. This defaults to the empty array.
    - `options` is for the various options for replacing the process. This defaults to an empty object.
        - `options.cwd` is the new cwd to use. (Default: `process.cwd()`)
        - `options.env` is the new environment to use. (Default: `process.env`)
        - `options.argv0` is the binary to spawn as. (Default: `command`)

- `child_process.replaceFork(mainPath [ , args] [ , options ])` works similarly to above.
    - `mainPath` is the path to the new `require.main`.
    - `options.execPath` is the new binary to spawn as. (Default: `process.execPath`)
    - `options.execArgv` are the new Node flags to spawn with. (Default: `process.execArgv`)
    - `options.argv0` is the binary to spawn as. (Default: `process.argv0`)
    - The command is the original binary itself.

- Add a `napi_terminating` member for `napi_status` to represent `try_catch.HasTerminated()` and the result of each call after replacement termination.

- Add a `napi_set_terminate_hook(napi_env env, void (*fun)(void*), void* data)` function to register a callback called on termination, to make it easier to clean up resources.

Internally, there are two cases you need to cover, and the simulated part for Windows is where it gets really hairy due to all the edge cases. Here's pseudocode for the basic algorithm (I'm not really familiar with Node internals, so take this as a rough guideline):

1. Stop the main event loop.
1. Go through the standard shutdown routine.
1. Destroy any open libuv handles and cancel any remaining event loop tasks.
1. If we're on a platform that supports process replacement (like Linux or Mac):
    1. Invoke `execve` or equivalent with the new process path, arguments, and environment.
1. Else, if we're on Windows (the only supported OS that doesn't), we have to simulate it entirely:
    1. Terminate execution via `v8::V8::TerminateExecution()`. All N-API callbacks should return `napi_terminated` during this step.
    1. For each loaded native module:
        1. If the native module has a terminate hook, call it.
        1. Unload the native module's DLL.
    1. Close the event loop.
    1. Dispose the isolate.
    1. Do the rest according to whatever happens to [this SO question](https://stackoverflow.com/questions/51185115/what-is-the-ideal-way-to-emulate-process-replacement-on-windows).
    1. Else, on other OSs without a process replacement function, it'd look similar to Windows.

In addition, file system requests will have to generally create each file descriptor with `O_CLOEXEC`.

As for precedent where this could be used immediately:

- [Liftoff](https://www.npmjs.com/package/liftoff) works very similarly, just with a little extra opinionated sugar, and that's used natively in Gulp. This kind of thing would speed that up quite a bit.
- I do [very similar](https://github.com/isiahmeadows/thallium/blob/master/cli.js#L126-L150) to transparently pass through unknown Node flags.
- Babel [attempts to use `kexec`](https://github.com/babel/babel/blob/master/packages/babel-node/src/babel-node.js#L87-L88) where available, which [is a POSIX-only module that replaces the process literally](https://www.npmjs.com/package/kexec). Absent that, it falls back to [its own implementation](https://github.com/babel/babel/blob/master/packages/babel-node/src/babel-node.js#L90-L109) that works like the other two examples.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Ability to replace current Node process with another #21664

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Ability to replace current Node process with another #21664

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions