Utilize lowMemoryUnused #1103

dcodeIO · 2020-02-11T15:50:28Z

This PR investigates utilizing the lowMemoryUnused option provided by Binaryen, doing (techically unsafe) optimizations like

(i32.load
 (i32.add
  (local.get $0)
  (i32.const 16)
 )
)

to

(i32.load offset=16
 (local.get $0)
)

In order to do this, the first 1KB of memory must be guaranteed to be invalid, since the 32-bit addition in the first snippet can overflow while the offset= attribute cannot. As a result, pointer additions below the 1024 bytes threshold can be optimized.

On a first glimpse this doesn't help us a lot since the compiler already emits offset= for field accesses, with just a few, yet important cases where it helps, especially within std ArrayBuffer and Array (which is expected), making it more a micro optimization for speed than anything else.

dcodeIO · 2020-02-11T15:59:02Z

From the fixtures it actually seems that the only place where this does something useful is in the memset helper, which we could as well optimize per hand, hmm.

MaxGraey · 2020-02-11T16:01:09Z

I'm wondering could we decrease 1024 bytes to 256 bytes which potentially enough for most of usual cases but more economical in terms of RAM consumption which is pretty important for IoT for example? I saw binaryen hardcoded this values in enum

dcodeIO · 2020-02-11T16:10:20Z

Yeah, the value is currently hardcoded, but I guess it wouldn't be too hard to make it configurable. Might well be that we can get away with a much smaller value, as this really just appears to affect manual unsafe code. Even if we'd have huge classes with lots of fields, these would already use offsets anyway.

Currently thinking that we'd not do this optimization at all if lowMemoryLimit (from the other PR) is present, because every single byte may count there.

MaxGraey · 2020-02-11T17:12:30Z

Yeah, we have a lot of handcrafted optimizations and low-level stuffs but for external user this optimization pass could be useful. Also it close this most oldest issue at last! =)

dcodeIO · 2020-02-11T17:48:50Z

Not so sure about the old issue. Seems to be quite limited to memset currently, while an array access goes through an overload that doesn't appear to benefit from the same effects. Reminds me that I wanted to make bindings for the inlining constants, since tweaking these might help.

dcodeIO · 2020-02-11T18:33:27Z

PR for inlining limits: WebAssembly/binaryen#2655

MaxGraey · 2020-02-11T18:35:48Z

That's inlining PR will be really useful for our needs!

MaxGraey · 2020-03-14T02:41:37Z

@dcodeIO it seems this optimization pass significant slowdown compilation but with pretty small benefits at least for our tests so I suggest apply it only for optimizeLevelHint >= 3. wdyt?

dcodeIO · 2020-03-14T09:34:12Z

Yeah, turned out that our tests are relatively immune to this because stdlib loads and stores are optimized already. More about user code where immOffset is not fully utilized. Fine with upping this to >=3. Do you have any numbers on how much slower it is, or a theory why?

MaxGraey · 2020-03-14T09:35:35Z

I made pr for this: #1169

MaxGraey · 2020-03-14T09:38:44Z

in that tuned PR tests run in 82,124 ms
currently it takes: 111,206 ms

Utilize lowMemoryUnused

f922239

dcodeIO added 2 commits March 13, 2020 18:46

Merge branch 'master' into low-memory-unused

2de33c7

optimize memset by hand

cb97276

dcodeIO requested a review from MaxGraey March 13, 2020 18:19

MaxGraey approved these changes Mar 13, 2020

View reviewed changes

dcodeIO merged commit b7df27c into master Mar 13, 2020

dcodeIO deleted the low-memory-unused branch March 15, 2020 13:35

MaxGraey mentioned this pull request Mar 16, 2020

Propagate constant load/store offsets more efficiently #32

Closed

dcodeIO mentioned this pull request Apr 6, 2021

Fix invalid store offsets in memset polyfill #1787

Merged

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Utilize lowMemoryUnused #1103

Utilize lowMemoryUnused #1103

dcodeIO commented Feb 11, 2020

dcodeIO commented Feb 11, 2020

MaxGraey commented Feb 11, 2020

dcodeIO commented Feb 11, 2020

MaxGraey commented Feb 11, 2020

dcodeIO commented Feb 11, 2020

dcodeIO commented Feb 11, 2020

MaxGraey commented Feb 11, 2020

MaxGraey commented Mar 14, 2020

dcodeIO commented Mar 14, 2020

MaxGraey commented Mar 14, 2020

MaxGraey commented Mar 14, 2020 •

edited

Loading

Utilize lowMemoryUnused #1103

Utilize lowMemoryUnused #1103

Conversation

dcodeIO commented Feb 11, 2020

dcodeIO commented Feb 11, 2020

MaxGraey commented Feb 11, 2020

dcodeIO commented Feb 11, 2020

MaxGraey commented Feb 11, 2020

dcodeIO commented Feb 11, 2020

dcodeIO commented Feb 11, 2020

MaxGraey commented Feb 11, 2020

MaxGraey commented Mar 14, 2020

dcodeIO commented Mar 14, 2020

MaxGraey commented Mar 14, 2020

MaxGraey commented Mar 14, 2020 • edited Loading

MaxGraey commented Mar 14, 2020 •

edited

Loading