Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Function body maximum size #1138

Open
Cellule opened this issue Oct 18, 2017 · 52 comments
Open

Function body maximum size #1138

Cellule opened this issue Oct 18, 2017 · 52 comments

Comments

@Cellule
Copy link

Cellule commented Oct 18, 2017

Among the browser vendors we agreed upon a limit of function body size to allow in webassembly module of 128KB.

However, the benchmark Godot has a function of 151975Bytes or 148KB (function 2187).

For some reason Chrome and Firefox allow it.
Now we end up in Edge where we invalidate this module.

It makes me think that we didn't understand the limit correctly.
According to v8/module-decoder.cc, it looks like it is checking the size in bytes.

Can someone from the other browser vendors help me understand this ?

@lukewagner
Copy link
Member

D'oh, this particular limit check wasn't factored into the shared validation code and thus was only being checked by WebAssembly.validate in FF. That's an awfully big function; I'll see if perhaps Godot isn't enabling optimizations in their build. But if it is just some giant interpreter loop or something, perhaps this is evidence we've set the limit too low.

@lukewagner
Copy link
Member

Also it appears that function index 21555 is even bigger, with 219504 bytes.

@binji
Copy link
Member

binji commented Oct 19, 2017

It makes me think that we didn't understand the limit correctly.
According to v8/module-decoder.cc, it looks like it is checking the size in bytes.

Unfortunately, it looks like DecodeWasmFunction is only used by SyncDecodeWasmFunction which is only used by a unittest. The limit is also used in the streaming decoder, but it's not clear why it's not erroring out there (likely because the streaming decoder is not used here). I filed a v8 bug: https://crbug.com/v8/6959.

@jfbastien
Copy link
Member

jfbastien commented Oct 19, 2017

FWIW we don't apply all the limits because some didn't make sense to our implementation. The ones we apply are here: https://github.com/WebKit/webkit/blob/master/Source/JavaScriptCore/wasm/WasmLimits.h

In particular, I dropped function size because it's a ridiculously low limit which basic wasm programs exceed (e.g. the benchmarks from the PLDI paper trips it).

@Cellule
Copy link
Author

Cellule commented Oct 19, 2017

Do we want to revisit the limits that we use and make them part of the spec (either js or web spec)?
I agree that 128kb is too low.

@littledan
Copy link

In general, if Wasm programs should work consistently across browsers, having any sort of restrictions be written in the specification and with conformance tests seems like a good idea to me. Otherwise, there's a natural "race to the bottom" (or top, here?) where browsers try to be compatible with each other as much as possible and permit what all the other ones permit. This sort of dynamic is responsible for a lot of the mess that we have in JS and the rest of the web platform.

I don't understand the details of this particular limit; is there any past discussion that was written down anywhere about it?

@rossberg
Copy link
Member

@littledan, a race to the top is usually a good thing when it comes to arbitrary limits. Every language implementation has tons of implicit limitations, almost no language even documents their possibility (C is the only exception that comes to mind, and only for some hand-selected cases). Picking concrete numbers on the spec level makes no sense in most cases, because the limits are not static but depend on many complex platform-dependent and implementation-specific variables.

@lukewagner
Copy link
Member

If it doesn't make sense in the spec, documenting the limits somewhere for toolchains to refer to makes sense. Maybe a new .md in webassembly/tool-conventions?

Anyhow yeah, sounds like 128kb is way too low. I think having a chosen limit is better than each engine having its own implicit limit unless we think all engines will be able to handle functions up to the maxModuleSize (1gb). I doubt that, especially given the subtle limits that show up like the 32mb local jump range on ARM. How about 1mb?

@binji
Copy link
Member

binji commented Oct 19, 2017

It might be nice to have some more information about the largest functions we've seen before making this choice.

@lukewagner
Copy link
Member

@kripken What are the biggest you've ever seen? Alon was telling me he was thinking of bringing the "outlining" feature (which splits up crazy-big functions) to wasm. With this, we wouldn't necessarily need to accommodate the biggest possible functions.

@kripken
Copy link
Member

kripken commented Oct 20, 2017

I've been thinking about this, but I don't yet have a good idea for something like outlining in wasm (the asm.js approach was quite hackish and we had a bunch of bugs there). I may end up just emitting a warning in the meantime.

Size-wise, I don't have good data on this, none of the biggest ones were recent, and I measured them as lines in the text format anyhow.

Could we do 64MB? I've never seen a wasm module that big, let alone a single function, so it seems like a safe bound. Or is there a benefit to a lower value? (I'm not sure why we need a bound at all so I'm missing some background I guess.)

@lukewagner
Copy link
Member

Assuming there is a small factor expansion between wasm bytecode and machine code (I've seen 2-4x), and given ARM(32,64)'s 32mb local-jump limit, I'd be surprised if engines didn't fail in practice well before 32mb. It's possible of course to handle >32mb functions if you design for it, but I expect not all engines do. Happy to hear from other engines though.

@RyanLamansky
Copy link

One engine's data point: my .NET-based implementation inherits all of its limitations from the .NET framework itself, which doesn't seem to have any hard limit on function size (at least none that I've heard of or have been able to find after some quick searches).

@titzer
Copy link

titzer commented Oct 23, 2017

I've fixed the V8 bug, and we now enforce the function size limit.

@titzer
Copy link

titzer commented Oct 23, 2017

I still think it is a good idea that engines have a "gentleman's agreement" to enforce a static size limit for functions. I'm OK bumping up the 128k limit, but I think we should probably pick a reasonable one before the race to the top picks up :)

@lukewagner
Copy link
Member

1mb?

@kripken
Copy link
Member

kripken commented Oct 23, 2017

@titzer Perhaps it's best not to apply the 128K limit right now? It will break Godot and likely other things. This will cause confusion and frustration for users.

@flagxor
Copy link
Member

flagxor commented Oct 23, 2017

We've seen as high as 1.5MB, for instance this:
https://bugs.chromium.org/p/chromium/issues/detail?id=769607

@MikeHolman
Copy link
Member

If we're already seeing pathological cases of 1.5mb, then maybe we should bump all the way up to 10mb?

@kripken For now though, do you know how Godot is being built/is there any way to compile it smaller? Edge just shipped, and this isn't a reliability issue for existing production websites so we're going to be stuck with 128kb limit until next Windows release.

@kripken
Copy link
Member

kripken commented Oct 23, 2017

@MikeHolman: When is the next Windows release?

We can try to figure out a solution for Godot, but we may need to do something more radical if we can't expect a browser update soon. Because we've already seen other examples from other codebases like the one @flagxor mentioned, and also there may not be a practical solution in some cases, if the code is from a code generator, or something else that can't be worked around by preventing inlining.

In asm.js we experimented with outlining/function splitting. Overall it was messy and buggy, so I'd rather not repeat that, but maybe we have no choice.

@MikeHolman
Copy link
Member

I expect it will be in about 6 months, so I think a short term targeted workaround will be sufficient.

@lukewagner
Copy link
Member

@flagxor Oh hah, from your comments, looks like that 1.5mb was a data section expressed as one big function.

@MikeHolman I've emailed the Godot author (https://github.com/eska014/godot/tree/wasm-benchmark) asking if all optimizations are enabled and, if so, if the function can be artificially split somehow.

@lukewagner
Copy link
Member

@MikeHolman That being said, I wouldn't be surprised if this broke for various uses in the wild over the next 6 months so, since it's such a trivial fix, it may be worth pushing to uplift the fix.

@lukewagner
Copy link
Member

With a worst-case 4x bytecode-to-machine-code expansion, a 10mb function could hit ARM 32mb limits; so to be a bit conservative, could we do something smaller than 10mb?

@MikeHolman
Copy link
Member

Windows is conservative about servicing. Even though it is a small fix, it needs to represent a significant reliability issue or a security issue. I'm not going to be able to convince them that this super new feature has enough web presence and hits the pathological case often enough to meet that bar.

I'm hesitant to go very much lower than 10mb, since we're still in early days and already have an example less than an order of magnitude away. Is 7mb a reasonable compromise?

@lukewagner
Copy link
Member

Ah, makes sense. 7mb sgtm

@littledan
Copy link

If there's a new limit, should it be included in some specification and shared tests?

@kripken
Copy link
Member

kripken commented Oct 23, 2017

@MikeHolman 6 months sounds like a long time, so I guess we should look into function splitting on the toolchain side even if godot has a manual workaround - thoughts?

Regardless it's interesting to know if they have a workaround. @lukewagner btw, fwiw that wasm looks optimized to me.

@lukewagner
Copy link
Member

@littledan I don't know about whether it belongs in the specifications... maybe so as an appendix. Above I suggested the tool-conventions repo.

@littledan
Copy link

@lukewagner Appendix seems fine to me. The important thing would be that implementations actually converge on something shared (if we want to have a shared limit like this; maybe @rossberg or @jfbastien disagree). This could even be something as rough as shared tests, where a comment in the tests points to this bug thread.

It just seems unfortunate that we're in this position of having had a miscommunication that might lead to incompatibility between browsers; I'm wondering if we can prevent more of these in the future.

chakrabot pushed a commit to chakra-core/ChakraCore that referenced this issue Oct 26, 2017
…21 bytes

Merge pull request #4070 from Cellule:wasm_func_limit

Bumping limit according to discussion on WebAssembly/design#1138

Fixes OS#14291237
@lukewagner
Copy link
Member

Looks like @Cellule agrees :) I'll land a patch in FF too. @jfbastien y tu tambien?

@littledan was talking about adding an engine internal limits section to the wasm-web spec, so perhaps that could be the resolution of this issue.

Also: the Godot demo has been updated to validate with the original 128kb limit.

@MikeHolman
Copy link
Member

Awesome, thanks Luke for following up on that! Glad we were able to get it to meet the original constraints. Unfortunately it looks like Godot requires WebGL 2 support, which Edge is still lacking :(

@lukewagner
Copy link
Member

/o\ I'll ask about that too; maybe it's not actually necessary but just used in some incidental way.

@jfbastien
Copy link
Member

@jfbastien y tu tambien?

Lo que sea.

WebGL 2 is a deal-breaker for us as well.

@titzer
Copy link

titzer commented Oct 27, 2017

@dschuff Suggested on our internal bug to make this a limit of the web embedding. I'm fine with that, although for now we enforce the 7654321 byte limit in the engine itself.

chakrabot added a commit to nodejs/node-chakracore that referenced this issue Oct 27, 2017
…ease the Wasm function size limit to 7654321 bytes

Merge pull request #4070 from Cellule:wasm_func_limit

Bumping limit according to discussion on WebAssembly/design#1138

Fixes OS#14291237
@jfbastien
Copy link
Member

@dschuff Suggested on our internal bug to make this a limit of the web embedding. I'm fine with that, although for now we enforce the 7654321 byte limit in the engine itself.

That's a good point, worth considering. @littledan if you add it to the spec I think that what @dschuff suggests is a good idea, so I'd hold off until we decide where it goes.

In any case, this is probably worth a poll at an upcoming meeting.

hubot pushed a commit to WebKit/WebKit-http that referenced this issue Oct 27, 2017
https://bugs.webkit.org/show_bug.cgi?id=178946
<rdar://problem/34257412>
<rdar://problem/34501154>

Reviewed by Saam Barati.

WebAssembly/design#1138 discusses the
arbitrary function size limit, which it turns out Chrome and
Firefox didn't enforce. We didn't use it because it was
ridiculously low and actual programs ran into that limit (bummer
for Edge which just shipped it...). Now that we agree on a high
arbitrary program limit, let's update it! While I'm doing this
there are a few other spots that I polished to use Checked or
better check limits overall.

* wasm/WasmB3IRGenerator.cpp:
(JSC::Wasm::B3IRGenerator::addLocal):
* wasm/WasmFormat.cpp:
(JSC::Wasm::Segment::create):
* wasm/WasmFunctionParser.h:
(JSC::Wasm::FunctionParser<Context>::parse):
* wasm/WasmInstance.cpp:
* wasm/WasmLimits.h:
* wasm/WasmModuleParser.cpp:
(JSC::Wasm::ModuleParser::parseGlobal):
(JSC::Wasm::ModuleParser::parseCode):
(JSC::Wasm::ModuleParser::parseData):
* wasm/WasmSignature.h:
(JSC::Wasm::Signature::allocatedSize):
* wasm/WasmTable.cpp:
(JSC::Wasm::Table::Table):
* wasm/js/JSWebAssemblyTable.cpp:
(JSC::JSWebAssemblyTable::JSWebAssemblyTable):
(JSC::JSWebAssemblyTable::grow):


git-svn-id: http://svn.webkit.org/repository/webkit/trunk@224122 268f45cc-cd09-0410-ab3c-d52691b4dbfc
@lukewagner
Copy link
Member

As an update, it looks like Godot simply uses OpenGL ES 3.0, so the dependency on WebGL2 isn't superficial. However, they do have plans to add OpenGL ES 2.0 in a few months which would give us a WebGL1 workload.

@kripken
Copy link
Member

kripken commented Nov 3, 2017

The Go language is planning to manage the call stack manually, i.e., without using call or separate wasm functions. Approaches like that might hit imposed function size limits (they might be able to split up the one big function into chunks, but it's not straightforward).

@titzer
Copy link

titzer commented Nov 6, 2017

@kripken Seems like that is just a prototype. It would be truly unfortunate, and probably have extremely bad performance, to generate one large function. There was discussion earlier in that thread about ignoring the stack machine and emulating a register machine. Seems like that work is preliminary and we shouldn't base decisions on it here.

@kripken
Copy link
Member

kripken commented Nov 6, 2017

@titzer Agreed, yeah. I just mentioned it as it seemed an interesting case for non-standard code generation (given their requirement to support things like bidirectional longjmp, a single big function or an interpreter may be their only options, all of which are not that good obviously).

@icefoxen
Copy link

Maybe there should be a specified minimum size instead of a maximum size?

@lars-t-hansen
Copy link
Contributor

The proposal is for a maximum number of functions and a maximum function body size, but the product of those two is a large number of bytecodes (7TB total code section size). Do we want to place an additional limit on the code section size?

(Not sure about this myself. We may end up with a hard limit in SpiderMonkey anyway to accomodate streaming compilation, but this is under discussion, and I'm not sure about the wisdom of legislating it.)

@littledan
Copy link

Have the limits been added to any sort of specification document? I'm having trouble finding them.

If they haven't been added yet: We've talked in the above thread about putting the limits in an appendix. Would the implementation limits appendix be an appropriate place, or should it be some other document? Or just in tests?

@icefoxen
Copy link

icefoxen commented Mar 4, 2018

A way to query the limits imposed by the implementation would also be nice. I am reminded of https://www.khronos.org/registry/vulkan/specs/1.0/man/html/VkPhysicalDeviceLimits.html for example.

@jfbastien
Copy link
Member

Not done, tracked here: WebAssembly/meetings#100

@rossberg
Copy link
Member

rossberg commented Mar 5, 2018

Would the implementation limits appendix be an appropriate place

No, since these are Web-specific limitations. The obvious place would be the Web API spec.

ryanhaddad pushed a commit to WebKit/WebKit that referenced this issue Dec 22, 2020
https://bugs.webkit.org/show_bug.cgi?id=178946
<rdar://problem/34257412>
<rdar://problem/34501154>

Reviewed by Saam Barati.

WebAssembly/design#1138 discusses the
arbitrary function size limit, which it turns out Chrome and
Firefox didn't enforce. We didn't use it because it was
ridiculously low and actual programs ran into that limit (bummer
for Edge which just shipped it...). Now that we agree on a high
arbitrary program limit, let's update it! While I'm doing this
there are a few other spots that I polished to use Checked or
better check limits overall.

* wasm/WasmB3IRGenerator.cpp:
(JSC::Wasm::B3IRGenerator::addLocal):
* wasm/WasmFormat.cpp:
(JSC::Wasm::Segment::create):
* wasm/WasmFunctionParser.h:
(JSC::Wasm::FunctionParser<Context>::parse):
* wasm/WasmInstance.cpp:
* wasm/WasmLimits.h:
* wasm/WasmModuleParser.cpp:
(JSC::Wasm::ModuleParser::parseGlobal):
(JSC::Wasm::ModuleParser::parseCode):
(JSC::Wasm::ModuleParser::parseData):
* wasm/WasmSignature.h:
(JSC::Wasm::Signature::allocatedSize):
* wasm/WasmTable.cpp:
(JSC::Wasm::Table::Table):
* wasm/js/JSWebAssemblyTable.cpp:
(JSC::JSWebAssemblyTable::JSWebAssemblyTable):
(JSC::JSWebAssemblyTable::grow):


Canonical link: https://commits.webkit.org/195093@main
git-svn-id: https://svn.webkit.org/repository/webkit/trunk@224122 268f45cc-cd09-0410-ab3c-d52691b4dbfc
@HarikrishnanBalagopal
Copy link

HarikrishnanBalagopal commented Dec 25, 2023

Golang seems to have hardcoded a limit of 2^16 bytes = 64KB on function size (WASM page size).
Have made an issue documenting the details golang/go#64856

@binji I have gone through the https://github.com/binji/raw-wasm demos repo and implemented a runtime in JS that could run those demos. I haven't found a reason why the function size would have to be limited to just 1 WASM page.

It would be very helpful if someone knowledgeable on the topic could comment on why this limit exists and how to work around it. Thank you!

@titzer
Copy link

titzer commented Dec 26, 2023

The agreed limitation size for function bodies is 7,654,321 bytes. I am not sure why Golang has a 1 page limitation; the 64KiB page size applies to memories, so that seems weird.

We decided to limit function size because in practice all engines have some non-linearity in their execution tiers, and a function size check ends up somewhere (e.g. the frontend of an optimizing tier). We decided to standardize limitations across engines and tools (at a rather generous size) in order to avoid incompatibilities and overall help the ecosystem. There are similar reasons for the other limitations on modules, e.g. the number of parameter/return types for a function. We generally chose powers of 10 to make it clear it's not a limitation of the binary format. IIRC @dschuff proposed 7654321 bytes for function sizes (offline); I posted it above, and we went with it.

@dschuff
Copy link
Member

dschuff commented Jan 2, 2024

I'm not sure how we ended up with powers of 10 for everything except module size, function size, and number of pages. I think I just proposed moving the limit from the core to the embedder spec, not the specific number :)
Which is actually worth bringing up in this case; the limits (specified in https://webassembly.github.io/spec/js-api/index.html#limits) are actually part of the JS embedder specification, not the core specification. So other (non-JS) embeddings could have different limits. It could be that the golang target was initially written to target some other runtime (presumably a smaller one) than JS.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests