forks/binaryen.git -

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[NFC] Avoid re-parsing instruction names (#5171)	Thomas Lively	2022-10-20	1	-88/+88
\| \| \| \| \| \| \| \| \|	Since gen-s-parser.py is essentially a giant table mapping instruction names to the information necessary to construct the corresponding IR nodes, there should be no need to further parse instruction names after the code generated by gen-s-parser.py runs. However, memory instruction parsing still parsed instruction names to get information such as size and default alignment. The new parser does not have the ability to parse that information out of instruction names, so put it in the gen-s-parser.py table instead.
*	[Parser][NFC] Pass instruction locations to `makeXXX` functions (#5133)	Thomas Lively	2022-10-12	1	-2/+2
\| \| \| \| \| \| \| \|	The `makeXXX` functions that are responsible for individual instructions will generally need the locations of those functions to emit useful errors. However, since the instruction names are parsed before the `makeXXX` functions are called, the functions have no good way of getting the location of the beginning of the instruction. Fix this by explicitly passing them the location of the beginning of the instruction.
*	[Parser][NFC] Move `ParseInput` into the parser context (#5132)	Thomas Lively	2022-10-12	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \|	Rather than passing both a `Ctx` and a `ParseInput` to every parsing function, pass only a `Ctx` with a `ParseInput` inside of it. This significantly reduces verbosity in the parser. To handle cases where parsing needs to happen at specific locations, which used to be handled by constructing a new `ParseInput` independent from the ctx, introduce a new RAII utility for temporarily changing the location of the `ParseInput` inside a context. Also add a utility for generating an error at a particular location to avoid having to construct new `ParseInput` objects just for that purpose. This resolves a few TODOs about correcting error locations, but since we don't test those yet, I still consider this NFC.
*	Make `Name` a pointer, length pair (#5122)	Thomas Lively	2022-10-11	1	-3/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	With the goal of supporting null characters (i.e. zero bytes) in strings. Rewrite the underlying interned `IString` to store a `std::string_view` rather than a `const char`, reduce the number of map lookups necessary to intern a string, and present a more immutable interface. Most importantly, replace the `c_str()` method that returned a `const char` with a `toString()` method that returns a `std::string`. This new method can correctly handle strings containing null characters. A `const char` can still be had by calling `data()` on the `std::string_view`, although this usage should be discouraged. This change is NFC in spirit, although not in practice. It does not intend to support any particular new functionality, but it is probably now possible to use strings containing null characters in at least some cases. At least one parser bug is also incidentally fixed. Follow-on PRs will explicitly support and test strings containing nulls for particular use cases. The C API still uses `const char` to represent strings. As strings containing nulls become better supported by the rest of Binaryen, this will no longer be sufficient. Updating the C and JS APIs to use pointer, length pairs is left as future work.
*	Fix fuzzer after #5114 (#5117)	Sam Clegg	2022-10-06	1	-7/+7
\| \| \|	Fixes ./scripts/fuzz_opt.py --auto-initial-contents 7044408155933374954
*	Pin v8 10.8.104 (#5112)	Thomas Lively	2022-10-05	1	-1/+4
\| \| \| \| \| \|	More recent version of V8 include a change from `dataref` to `structref` that prevent WasmGC modules produced by the fuzzer from validating. Use the last compatible v8 version if it is in the path until we can update Binaryen to be compatible with newer v8.
*	[Strings] Add missing String effects + tests (#5057)	Alon Zakai	2022-09-19	1	-0/+1
\| \| \|	Also fix some formatting issue in the file.
*	Allow optimizing with global function effects (#5040)	Alon Zakai	2022-09-16	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds a map of function name => the effects of that function to the PassOptions structure. That lets us compute those effects once and then use them in multiple passes afterwards. For example, that lets us optimize away a call to a function that has no effects: (drop (call $nothing)) [..] (func $nothing ;; .. lots of stuff but no effects, only a returned value .. ) Vacuum will remove that dropped call if we tell it that the called function has no effects. Note that a nice result of adding this to the PassOptions struct is that all passes will use the extra info automatically. This is not enabled by default as the benefits seem rather minor, though it does help in a small but noticeable way on J2Wasm code, where we use call.without.effects and have situations like this: (func $foo (call $bar) ) (func $bar (call.without.effects ..) ) The call to bar looks like it has effects, normally, but with global effect info we know it actually doesn't. To use this, one would do --generate-global-effects [.. some passes that use the effects ..] --discard-global-effects Discarding is not necessary, but if there is a pass later that adds effects, then not discarding could lead to bugs, since we'd think there are fewer effects than there are. (However, normal optimization passes never add effects, only remove them.) It's also possible to call this multiple times: --generate-global-effects -O3 --generate-global-effects -O3 That computes affects after the first -O3, and may find fewer effects than earlier. This doesn't compute the full transitive closure of the effects across functions. That is, when computing a function's effects, we don't look into its own calls. The simple case so far is enough to handle the call.without.effects example from before (though it may take multiple optimization cycles).
*	Remove typed-function-references feature (#5030)	Thomas Lively	2022-09-09	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In practice typed function references will not ship before GC and is not independently useful, so it's not necessary to have a separate feature for it. Roll the functionality previously enabled by --enable-typed-function-references into --enable-gc instead. This also avoids a problem with the ongoing implementation of the new GC bottom heap types. That change will make all ref.null instructions in Binaryen IR refer to one of the bottom heap types. But since those bottom types are introduced in GC, it's not valid to emit them in binaries unless unless GC is enabled. The fix if only reference types is enabled is to emit (ref.null func) instead of (ref.null nofunc), but that doesn't always work if typed function references are enabled because a function type more specific than func may be required. Getting rid of typed function references as a separate feature makes this a nonissue.
*	[Wasm GC] Support non-nullable locals in the "1a" form (#4959)	Alon Zakai	2022-08-31	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	An overview of this is in the README in the diff here (conveniently, it is near the top of the diff). Basically, we fix up nn locals after each pass, by default. This keeps things easy to reason about - what validates is what is valid wasm - but there are some minor nuances as mentioned there, in particular, we ignore nameless blocks (which are commonly added by various passes; ignoring them means we can keep more locals non-nullable). The key addition here is LocalStructuralDominance which checks which local indexes have the "structural dominance" property of 1a, that is, that each get has a set in its block or an outer block that precedes it. I optimized that function quite a lot to reduce the overhead of running that logic after each pass. The overhead is something like 2% on J2Wasm and 0% on Dart (0%, because in this mode we shrink code size, so there is less work actually, and it balances out). Since we run fixups after each pass, this PR removes logic to manually call the fixup code from various places we used to call it (like eh-utils and various passes). Various passes are now marked as requiresNonNullableLocalFixups => false. That lets us skip running the fixups after them, which we normally do automatically. This helps avoid overhead. Most passes still need the fixups, though - any pass that adds a local, or a named block, or moves code around, likely does. This removes a hack in SimplifyLocals that is no longer needed. Before we worked to avoid moving a set into a try, as it might not validate. Now, we just do it and let fixups happen automatically if they need to: in the common code they probably don't, so the extra complexity seems not worth it. Also removes a hack from StackIR. That hack tried to avoid roundtrip adding a nondefaultable local. But we have the logic to fix that up now, and opts will likely keep it non-nullable as well. Various tests end up updated here because now a local can be non-nullable - previous fixups are no longer needed. Note that this doesn't remove the gc-nn-locals feature. That has been useful for testing, and may still be useful in the future - it basically just allows nn locals in all positions (that can't read the null default value at the entry). We can consider removing it separately. Fixes #4824
*	Implement `extern.externalize` and `extern.internalize` (#4975)	Thomas Lively	2022-08-29	2	-1/+5
\| \| \| \|	These new GC instructions infallibly convert between `extern` and `any` references now that those types are not in the same hierarchy.
*	Simplify TrapsNeverHappen fuzzing (#4957)	Alon Zakai	2022-08-24	1	-4/+5
\| \| \| \|	Also fix a small logic error - call lines can be prefixes of each other, so use the full line (including newline) to differentiate.
*	[Fuzzer] Fuzz TrapsNeverHappen mode (#4936)	Alon Zakai	2022-08-22	1	-2/+85
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This mode is tricky to fuzz because the mode is basically "assume traps never happen; if a trap does happen, that is undefined behavior". So if any trap occurs in the random fuzz testcase, we can't optimize with -tnh and assume the results stay to same. To avoid that, we ignore all functions from the first one that traps, that is, we only compare the code that ran without trapping. That often is a small subset of the total functions, sadly, but I do see that this ends up with some useful coverage despite the drawback. This also requires some fixes to comparing of references, specifically, funcrefs are printed with the function name/index, but that can change during opts, so ignore that. This wasn't noticed before because this new fuzzer mode does comparisons of --fuzz-exec-before output, instead of doing --fuzz-exec which runs it before and after and compares it internally in wasm-opt. Here we are comparing the output externally, which we didn't do before.
*	Mutli-Memories Support in IR (#4811)	Ashley Nelson	2022-08-17	1	-0/+8
\| \| \| \| \| \| \|	This PR removes the single memory restriction in IR, adding support for a single module to reference multiple memories. To support this change, a new memory name field was added to 13 memory instructions in order to identify the memory for the instruction. It is a goal of this PR to maintain backwards compatibility with existing text and binary wasm modules, so memory indexes remain optional for memory instructions. Similarly, the JS API makes assumptions about which memory is intended when only one memory is present in the module. Another goal of this PR is that existing tests behavior be unaffected. That said, tests must now explicitly define a memory before invoking memory instructions or exporting a memory, and memory names are now printed for each memory instruction in the text format. There remain quite a few places where a hardcoded reference to the first memory persist (memory flattening, for example, will return early if more than one memory is present in the module). Many of these call-sites, particularly within passes, will require us to rethink how the optimization works in a multi-memories world. Other call-sites may necessitate more invasive code restructuring to fully convert away from relying on a globally available, single memory pointer.
*	Fix require() in wasm2js test suite for newer Node.js (#4913)	Alon Zakai	2022-08-17	1	-0/+2
\| \| \| \| \| \| \|	Without this fix, newer node errors on: "node-esm-loader.mjs 'resolve'" did not call the next hook in its chain and did not explicitly signal a short circuit. If this is intentional, include `shortCircuit: true` in the hook's return. This adds that shortCircuit property.
*	Fuzzer: When typed-function-references is disabled, disable GC too (#4908)	Alon Zakai	2022-08-16	1	-1/+2
\|
*	Fix name of port_passes_tests_to_lit.py script. NFC (#4902)	Sam Clegg	2022-08-12	1	-2/+2
\| \| \|	I was reading these tests and failing to find the names script.
*	Port test/passes/legalize-js-interface* to lit (#4903)	Sam Clegg	2022-08-12	1	-3/+12
\| \| \| \| \|	Also, add support for the `--binaryen-bin` flag to `scripts/port_passes_tests_to_lit.py`. This is needed for folks who don't do in-tree builds.
*	Move `auto_update_tests.py` into scripts directory. NFC (#4879)	Sam Clegg	2022-08-11	2	-2/+202
\| \| \| \|	So it lives alongside `update_lit_checks.py` and `update_help_checks.py`
*	Fuzzer: Emit absolute paths in commands (#4890)	Alon Zakai	2022-08-10	1	-28/+31
\| \| \| \| \| \| \|	* better * fix * undo
*	Fuzzer: Add some docs and error handling" (#4887)	Alon Zakai	2022-08-09	1	-4/+19
\|
*	Remove metadata generation from wasm-emscripten-finalize (#4863)	Sam Clegg	2022-08-07	1	-10/+0
\| \| \| \|	This is no longer needed by emscripten as of: https://github.com/emscripten-core/emscripten/pull/16529
*	Remove RTTs (#4848)	Thomas Lively	2022-08-05	2	-18/+9
\| \| \| \| \| \| \|	RTTs were removed from the GC spec and if they are added back in in the future, they will be heap types rather than value types as in our implementation. Updating our implementation to have RTTs be heap types would have been more work than deleting them for questionable benefit since we don't know how long it will be before they are specced again.
*	Bail out of fuzz_shell.js if instantiation fails (#4873)	Thomas Lively	2022-08-04	1	-2/+9
\| \| \| \| \| \| \| \| \|	Sometimes the fuzzer produces valid modules that trap during instantiation. When that happens, the JS harness used to run the fuzzer output in d8 would previously throw an error, creating spurious fuzzer failures on valid modules. Update fuzz_shell.js to catch and supress errors during instantiation (but not validation) to avoid these spurious failures. Fixes #4865.
*	Re-run scripts/test/generate_lld_tests.py. NFC (#4861)	Sam Clegg	2022-08-02	1	-4/+6
\|
*	Grand Unified Flow Analysis (GUFA) (#4598)	Alon Zakai	2022-07-22	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \|	This tracks the possible contents in the entire program all at once using a single IR. That is in contrast to say DeadArgumentElimination of LocalRefining etc., all of whom look at one particular aspect of the program (function params and returns in DAE, locals in LocalRefining). The cost is to build up an entire new IR, which takes a lot of new code (mostly in the already-landed PossibleContents). Another cost is this new IR is very big and requires a lot of time and memory to process. The benefit is that this can find opportunities that are only obvious when looking at the entire program, and also it can track information that is more specialized than the normal type system in the IR - in particular, this can track an ExactType, which is the case where we know the value is of a particular type exactly and not a subtype.
*	[Strings] GC variants for string.encode (#4817)	Alon Zakai	2022-07-21	1	-0/+2
\|
*	[Strings] Add string.new GC variants (#4813)	Alon Zakai	2022-07-19	1	-0/+2
\|
*	[Strings] stringview_wtf16.length (#4809)	Alon Zakai	2022-07-18	1	-0/+1
\| \| \| \|	This measures the length of a view, so it seems simplest to make it a sub-operation of the existing measure instruction.
*	[Strings] stringview_*.slice (#4805)	Alon Zakai	2022-07-15	1	-0/+3
\| \| \| \| \| \| \|	Unfortunately one slice is the same as python [start:end], using 2 params, and the other slice is one param, [CURR:CURR+num] (where CURR is implied by the current state in the iter). So we can't use a single class here. Perhaps a different name would be good, like slice vs substring (like JS does), but I picked names to match the current spec.
*	[Strings] stringview access operations (#4798)	Alon Zakai	2022-07-13	1	-0/+5
\|
*	[Strings] string.as (#4797)	Alon Zakai	2022-07-12	1	-0/+3
\|
*	[Parser] Start to parse instructions (#4789)	Thomas Lively	2022-07-11	1	-6/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Update gen-s-parser.py to produce a second version of its parsing code that works with the new wat parser. The new version automatically replaces the `s` element argument in the existing parser with the `ctx` and `in` arguments used by the new parser, so adding new instructions will not require any additional work in gen-s-parser.py after this change. Also add stub `make*` functions to the new wat parser, with a few filled out, namely `makeNop`, `makeUnreachable`, `makeConst`, and `makeRefNull`. Update the `global` parser to parse global initializer instructions and update wat-kitchen-sink.wast to demonstrate that the instructions are parsed correctly. Adding new instruction classes will require adding a new `make*` function to wat-parser.cpp in additional to wasm-s-parser.{h,cpp} after this change, but adding a trivial failing implementation is good enough for the time being, so I don't expect this to appreciably increase our maintenance burden in the near term. The infrastructure for parsing folded instructions, instructions with operands, and control flow instructions will be implemented in future PRs.
*	Parse `rec` in update_lit_checks.py (#4784)	Thomas Lively	2022-07-08	1	-1/+1
\|
*	[Strings] string.is_usv_sequence (#4783)	Alon Zakai	2022-07-08	1	-0/+1
\| \| \| \| \| \| \|	This implements it as a StringMeasure opcode. They do have the same number of operands, same trapping behavior, and same return type. They both get a string and do some inspection of it to return an i32. Perhaps the name could be StringInspect or something like that, rather than StringMeasure..? But I think for now this might be good enough, and the spec may change anyhow later.
*	[Strings] string.eq (#4781)	Alon Zakai	2022-07-08	1	-0/+1
\|
*	[Strings] string.concat (#4777)	Alon Zakai	2022-07-08	1	-0/+1
\|
*	[Strings] string.encode (#4776)	Alon Zakai	2022-07-07	1	-0/+2
\|
*	Fuzzer: Fix determinism fuzzing (#4767)	Alon Zakai	2022-07-07	1	-6/+14
\| \| \| \| \| \|	#4758 regressed the determinism fuzzer. First, FEATURE_OPTS is not a list but a string. Second, as a hack another location would add --strip-dwarf to that list, but that only works in wasm-opt. To fix that, remove the hack and instead just ignore DWARF-containing files.
*	[Strings] string.measure (#4775)	Alon Zakai	2022-07-07	1	-0/+2
\|
*	[Strings] Add string.const (#4768)	Alon Zakai	2022-07-06	1	-0/+1
\| \| \| \| \|	This is more work than a typical instruction because it also adds a new section: all the (string.const "foo") strings are put in a new "strings" section in the binary, and the instructions refer to them by index.
*	[Strings] Add string.new* instructions (#4761)	Alon Zakai	2022-06-29	1	-0/+2
\| \| \| \| \| \|	This is the first instruction from the Strings proposal. This includes everything but interpreter support.
*	[Strings] Add string proposal types (#4755)	Alon Zakai	2022-06-29	1	-1/+3
\| \| \| \| \| \| \| \|	This starts to implement the Wasm Strings proposal https://github.com/WebAssembly/stringref/blob/main/proposals/stringref/Overview.md This just adds the types.
*	Disallow --nominal with GC (#4758)	Thomas Lively	2022-06-28	1	-12/+11
\| \| \| \| \| \| \| \| \| \| \|	Nominal types don't make much sense without GC, and in particular trying to emit them with typed function references but not GC enabled can result in invalid binaries because nominal types do not respect the type ordering constraints required by the typed function references proposal. Making this change was mostly straightforward, but required fixing the fuzzer to use --nominal only when GC is enabled and required exiting early from nominal-only optimizations when GC was not enabled. Fixes #4756.
*	[Fuzzer] Pretty print of feature opts and passes (#4740)	Max Graey	2022-06-22	1	-2/+2
\|
*	Disable Wasm2C in fuzz_opt for now (#4743)	Max Graey	2022-06-21	1	-2/+4
\| \| \|	Relates to #4741
*	Update relaxed SIMD instructions	Thomas Lively	2022-06-07	1	-2/+0
\| \| \| \| \|	Update the opcodes for all relaxed SIMD instructions and remove the unsigned dot product instructions that are no longer in the proposal.
*	Global Struct Inference pass: Infer two constants in struct.get (#4659)	Alon Zakai	2022-06-01	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This optimizes constants in the megamorphic case of two: when we know two function references are possible, we could in theory emit this: (select (ref.func A) (ref.func B) (ref.eq (..ref value..) ;; globally, only 2 things are possible here, and one has ;; ref.func A as its value, and the other ref.func B (ref.func A)) That is, compare to one of the values, and emit the two possible values there. Other optimizations can then turn a call_ref on this select into an if over two direct calls, leading to devirtualization. We cannot compare a ref.func directly (since function references are not comparable), and so instead we look at immutable global structs. If we find a struct type that has only two possible values in some field, and the structs are in immutable globals (which happens in the vtable case in j2wasm for example), then we can compare the references of the struct to decide between the two values in the field.
*	Fuzzer: Pass v8 flag for extended-const (#4543)	Alon Zakai	2022-05-27	1	-1/+2
\| \| \| \| \|	This flag is needed now that we have testcases using extended-const in our test suite, as the fuzzer will grab random testcases, modify them, and run them through v8.
*	[Fuzzer] Ignore relaxed-simd test for initial contents (#4683)	Alon Zakai	2022-05-24	1	-0/+8
\| \| \| \|	If we use it as initial contents, we will try to execute it, and hit the TODOs in the interpreter for unimplemented parts.