Add a prebuilt version for Apple Silicon (M1) #4334

d3lm · 2021-11-16T21:01:46Z

wasm-pack uses wasm-opt and it tries to download the binary for it when it gets executed. However, if you run this on an M1 it gives the following error:

Error: no prebuilt wasm-opt binaries are available for this platform: Unrecognized target!

It would be great if there was a prebuilt version for ARM so wasm-opt runs out of the box on an M1 Mac.

The text was updated successfully, but these errors were encountered:

kripken · 2021-11-17T21:28:12Z

cc @sbc100

sbc100 · 2021-11-17T22:54:57Z

@alexcrichton is this something wasm-pack can do or should we try to do it here in the binaryen CI? Do you want to have go at this?

Even if we don't build an arm binary wasm-pack should be able to download and use the x86 version right? That might be the quickest way to get things working.

alexcrichton · 2021-11-18T15:23:12Z

Ah I don't work on wasm-pack any more, so I don't know where this would best be done.

sbc100 · 2021-11-18T15:38:54Z

@d3lm can you open a bug on wasm-pack and suggest they just the x86-64 binary when running on arm? (for now).

d3lm · 2021-11-19T14:16:03Z

@sbc100 Yep I can do that. Thanks.

d3lm · 2021-11-19T14:16:32Z

But just to understand, the x86-64 does not work out of the box on arm right? You'd have to run this via rosetta no?

sbc100 · 2021-11-19T15:17:21Z

My understanding is the M1 macs do support running x86-64 out of the box (i.e. they ship with whatever tooling they need to make this work).

sbc100 · 2021-11-19T15:18:04Z

My understanding is that x86-64 binaries can be executed transparently without any special commands and launch sequences.

d3lm · 2021-11-19T19:25:04Z

Ah ok, and wasm-pack doesn't have a prebuilt x86-64?

sbc100 · 2021-11-19T20:06:13Z

Binaryen does provide and x86-64 prebuilt of wasm-opt. I'm suggesting the wasm-opt look for this binary rather than the aarch64 (arm64) version (which doesn't not exist).

d3lm · 2021-11-20T08:40:22Z

Aha! Got it. Thanks a lot.

d3lm · 2021-12-03T14:39:46Z

Ok, after some discussion I still think it would be best to have a osx_aarch64 release of binaryen because wasm-pack simply downloads it like this

Tool::WasmOpt => {
  Ok(format!(
    "https://github.com/WebAssembly/binaryen/releases/download/{vers}/binaryen-{vers}-{target}.tar.gz",
     vers = "version_90",
     target = target,
   ))
}

Why can't we add a build for macos-12 too? It's quite common and I think it would be right to ship a release for it as part of binaryen.

sbc100 · 2021-12-03T16:05:38Z

So the binaryen filenames are of the form binaryen-version_100-x86_64-macos.tar.gz so why can't wasm-pack simply do if (target == aarch64-macos) target = x86_64-macos right before trying to download?

Of course, if you want to get the last bit of performance benefit from having a native arm64 build, and you would like to contribute the github action recipe for building it that would be most welcome. But I don't see why you can't use the x86_64 version for now.

As for targeting a certain/newer macos version, what would be point of that? I don't know of any benefit of targeting a more recent version, but maybe I'm missing something? I think we we want to be as compatible as can with all versions (within reason) which means setting a low minimum version when build, right?

d3lm · 2021-12-03T16:50:56Z

Because that requires rosetta and it'd be great if we could avoid rosetta and have a native arm64 build. Or am I wrong?

d3lm · 2021-12-03T16:52:31Z

I see if I can contribute and submit an action for building it for arm64.

d3lm · 2021-12-03T17:22:54Z

I actually think that building for Apple Sillicon in a GitHub action is not really possible right now, because there's no environment that runs an M1. Tho, you could pass -DCMAKE_OSX_ARCHITECTURES=arm64 to cmake, but I am not sure if that would be enough. Do you have any experience with this?

sbc100 · 2021-12-03T19:52:37Z

Because that requires rosetta and it'd be great if we could avoid rosetta and have a native arm64 build. Or am I wrong?

Why is avoiding rosetta an important goal here? Don't all M1 macs ship with rosetta builtin? I get that it would be nice to have.. but it doesn't seem particularly urgent or important?

Do you have an M1 mac that doesn't have rosetta installed?

Or are you worried about that overhead of running wasm-opt in emulation mode? (is your wasm-opt phase very slow?)

sbc100 · 2021-12-03T19:53:25Z

Yes, if we wanted to have github build these binaries I imagine it would be cross compile.. i.e. produce an arm64 build on x86_64 hardware.

d3lm · 2021-12-03T19:58:00Z

Don't all M1 macs ship with rosetta builtin?

I think so yes.

Do you have an M1 mac that doesn't have rosetta installed?

No, because I think it's installed by default.

Or are you worried about that overhead of running wasm-opt in emulation mode?

Yes, but that may be unjustified worries, because I have not yet seen any performance issues.

For the time being, I submitted a PR to wasm-pack rustwasm/wasm-pack#1088 that downloads the x86_64 build for aarch64. If issues pop up we can re-evaluate or try to set up a native arm64 build on x86_64 hardware.

sbc100 · 2021-12-03T20:22:42Z

Great! That sounds a good place to be for now.

kripken · 2021-12-03T21:08:46Z

Another option here is to make a wasm build of wasm-opt. With node-pthreads support + exceptions support that should run pretty fast in recent node. Not as fast as a native M1 build I'm sure, but it might be faster than an x86_64 build emulated on M1... which makes this potentially interesting.

d3lm · 2021-12-03T21:44:20Z

@kripken I actually had the same idea! That would work too which would eliminate the interoperability issues across different systems.

kripken · 2021-12-07T21:08:21Z

I did a little testing of that now using this diff:

diff --git a/CMakeLists.txt b/CMakeLists.txt
index e540b1f57..0dc9c204b 100644
--- a/CMakeLists.txt
+++ b/CMakeLists.txt
@@ -265,26 +265,32 @@ else()
     add_nondebug_compile_flag("-UNDEBUG")
   endif()
 endif()
 
 if(EMSCRIPTEN)
   # link with -O3 for metadce and other powerful optimizations. note that we
   # must use add_link_options so that this appears after CMake's default -O2
   add_link_options("-O3")
   add_link_flag("-s SINGLE_FILE")
   add_link_flag("-s ALLOW_MEMORY_GROWTH=1")
-  add_compile_flag("-s DISABLE_EXCEPTION_CATCHING=0")
-  add_link_flag("-s DISABLE_EXCEPTION_CATCHING=0")
+  add_compile_flag("-fwasm-exceptions")
+  add_link_flag("-fwasm-exceptions")
+  add_compile_flag("-pthread")
+  add_link_flag("-pthread")
+  add_link_flag("-s PROXY_TO_PTHREAD")
+  add_link_flag("-s EXIT_RUNTIME")
+  add_link_flag("-Wno-pthreads-mem-growth")
   # make the tools immediately usable on Node.js
   add_link_flag("-s NODERAWFS")
   # in opt builds, LTO helps so much (>20%) it's worth slow compile times
-  add_nondebug_compile_flag("-flto")
+  #add_nondebug_compile_flag("-flto")
 endif()
 
 # clang doesn't print colored diagnostics when invoked from Ninja
 if(UNIX AND CMAKE_GENERATOR STREQUAL "Ninja")
   if(CMAKE_CXX_COMPILER_ID STREQUAL "GNU")
     add_compile_flag("-fdiagnostics-color=always")
   elseif(CMAKE_CXX_COMPILER_ID STREQUAL "Clang")
     add_compile_flag("-fcolor-diagnostics")
   endif()
 endif()
diff --git a/src/support/threads.cpp b/src/support/threads.cpp
index ab9de4175..7aec720ba 100644
--- a/src/support/threads.cpp
+++ b/src/support/threads.cpp
@@ -132,24 +132,25 @@ void ThreadPool::initialize(size_t num) {
       threads.clear();
       return;
     }
   }
   DEBUG_POOL("initialize() waiting\n");
   condition.wait(lock, [this]() { return areThreadsReady(); });
   DEBUG_POOL("initialize() is done\n");
 }
 
 size_t ThreadPool::getNumCores() {
-#ifdef __EMSCRIPTEN__
+#if defined(__EMSCRIPTEN__) && !defined(__EMSCRIPTEN_PTHREADS__)
   return 1;
 #else
   size_t num = std::max(1U, std::thread::hardware_concurrency());
   if (getenv("BINARYEN_CORES")) {
     num = std::stoi(getenv("BINARYEN_CORES"));
   }
   return num;
 #endif
 }
 
 ThreadPool* ThreadPool::get() {
   DEBUG_POOL("::get()\n");
   // lock on the creation

Everything works as expected when I optimize some large files, but it is surprisingly slow. It is using multiple cores, and in fact uses them more than a native build. Native has this:

66.95user 0.67system 0:14.33elapsed 471%CPU (0avgtext+0avgdata 608316maxresident)k

and node 16.5 has this:

768.36user 237.38system 2:37.22elapsed 639%CPU (0avgtext+0avgdata 762028maxresident)k

I'm not sure what's going wrong here. Perhaps the slower wasm atomics are hurting us since they are all sequentially consistent...? Anyhow, sadly using wasm isn't a easy solution for this issue.

d3lm · 2021-12-08T12:56:34Z

Thanks so much for investigating a WASM version of wasm-opt @kripken. Appreciate it a lot!

It's a bummer that it's so much slower 😞 but I can also imagine that Atomics are slowing it down a lot. I can't imagine that using pthreads (which is mapped to web workers or worker threads) would be a problem here.

I'd love to loop @tschneidereit in as well. Maybe you have an idea why it would be significantly slower than native?

I suppose we have to wait for an M1 environment for GitHub actions or build an aarch64 with a cross compiler. WDYT @kripken?

dschuff · 2021-12-16T00:19:58Z

If we have a recent enough SDK on our existing mac builder, It should be very straightforward to cross-compile for aarch64.

tlively · 2025-05-16T19:04:29Z

We now have MacOS Arm releases.

kripken mentioned this issue Dec 7, 2021

Node+pthreads+wasm EH surprisingly slow emscripten-core/emscripten#15727

Closed

dschuff mentioned this issue Dec 16, 2021

Build ARM64 MacOS releases #4397

Merged

FaberVitale mentioned this issue Jan 20, 2022

Wasm opt rust aduros/wasm4#340

Open

4 tasks

tlively closed this as completed May 16, 2025

Add a prebuilt version for Apple Silicon (M1) #4334

Add a prebuilt version for Apple Silicon (M1) #4334

Comments

d3lm commented Nov 16, 2021

kripken commented Nov 17, 2021

Uh oh!

sbc100 commented Nov 17, 2021

Uh oh!

alexcrichton commented Nov 18, 2021

Uh oh!

sbc100 commented Nov 18, 2021

Uh oh!

d3lm commented Nov 19, 2021

Uh oh!

d3lm commented Nov 19, 2021

Uh oh!

sbc100 commented Nov 19, 2021

Uh oh!

sbc100 commented Nov 19, 2021

Uh oh!

d3lm commented Nov 19, 2021

Uh oh!

sbc100 commented Nov 19, 2021

Uh oh!

d3lm commented Nov 20, 2021

Uh oh!

d3lm commented Dec 3, 2021

Uh oh!

sbc100 commented Dec 3, 2021

Uh oh!

d3lm commented Dec 3, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

d3lm commented Dec 3, 2021

Uh oh!

d3lm commented Dec 3, 2021

Uh oh!

sbc100 commented Dec 3, 2021

Uh oh!

sbc100 commented Dec 3, 2021

Uh oh!

d3lm commented Dec 3, 2021

Uh oh!

sbc100 commented Dec 3, 2021

Uh oh!

kripken commented Dec 3, 2021

Uh oh!

d3lm commented Dec 3, 2021

Uh oh!

kripken commented Dec 7, 2021

Uh oh!

d3lm commented Dec 8, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dschuff commented Dec 16, 2021

Uh oh!

tlively commented May 16, 2025

Uh oh!

d3lm commented Dec 3, 2021 •

edited

Loading

d3lm commented Dec 8, 2021 •

edited

Loading