forks/binaryen.git -

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[ctor-eval] Add --ignore-external-input option (#4428)	Alon Zakai	2022-01-06	2	-11/+73
\| \| \| \| \| \| \| \| \| \| \| \|	This is meant to address one of the main limitations of wasm-ctor-eval in emscripten atm, that libc++ global ctors will read env vars, which means they call an import, which stops us from evalling, emscripten-core/emscripten#15403 (comment) To handle that, this adds an option to ignore external input. When set, we can assume that no env vars will be read, no reading from stdin, no arguments to main(), etc. Perhaps these could each be separate options, but I think keeping it simple for now might be good enough.
*	[ctor-eval] Refactor an applyToModule() method instead of hacks [NFC] (#4425)	Alon Zakai	2022-01-06	1	-19/+38
\| \| \| \| \| \| \|	Previously this would hackishly apply all execution changes to the memory all the time, and then "undo" it by saving the state before and copying that in. Instead, this PR makes execution write into a side buffer, and now there is a clear method for when we want to actually apply the results to the module.
*	[C API] Fix BinaryenTypeCreate argument numTypes type (#4417)	chai2010	2022-01-06	2	-3/+3
\| \| \|	All other numXxxs argument use BinaryenIndex type.
*	[ctor-eval] Remove stack hacks (#4429)	Alon Zakai	2022-01-06	1	-55/+2
\| \| \| \| \| \| \| \| \|	Remove some hackish code for fastcomp's stack handling. The stack pointer arrives in an imported global there. Upstream does not do this, so this code is completely unneeded these days (and, frankly, kind of scary as I read it now... it modeled the stack as separate memory from the heap...). Remove the tests for this as well. I verified that there was nothing else in those tests that we need to keep.
*	Add categories to --help text (#4421)	Alon Zakai	2022-01-05	16	-13/+216
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The general shape of the --help output is now: ======================== wasm-foo Does the foo operation ======================== wasm-foo opts: -------------- --foo-bar .. Tool opts: ---------- .. The options are now in categories, with the more specific ones - most likely to be wanted by the user - first. I think this makes the list a lot less confusing. In particular, in wasm-opt all the opt passes are now in their own category. Also add a script to make it easy to update the help tests.
*	Turn an assertion on not colliding with an internal name into an error (#4422)	Alon Zakai	2022-01-05	2	-3/+6
\| \| \| \| \| \|	Without this, the result in a build without assertions might be quite confusing. See #4410 Also make the internal names more obviously internal names.
*	Add binary format parse check for imported function types (#4423)	Alon Zakai	2022-01-05	1	-1/+7
\| \| \| \| \|	Without this we hit an assertion later, which is less clear. See #4413
*	[EH] Fixup nested pops after reading stacky binary (#4420)	Heejin Ahn	2022-01-04	3	-45/+55
\| \| \| \| \| \|	When reading stacky code in the binary reader, we create `block`s to make it fit into Binaryen AST, within which `pop`s can be nested, making the resulting AST invalid. This PR runs the fixup function after reading each `Try` to fix this.
*	[EH] Enable fuzzer with initial contents (#4409)	Heejin Ahn	2022-01-04	5	-6/+16
\| \| \| \| \| \| \| \| \|	This enables fuzzing EH with initial contents. fuzzing.cpp/h does not yet support generation of EH instructions, but with this we can still fuzz EH based on initial contents. The fuzzer ran successfully for more than 1,900,000 iterations, with my local modification that always enables EH and lets the fuzzer select only EH tests for its initial contents.
*	Factor out a utility for replacing types during canonicalization (#4416)	Thomas Lively	2022-01-04	1	-253/+219
\| \| \| \| \| \| \| \| \| \| \| \| \|	Equirecursive canonicalization generates new minimal type definitions for each distinct type, but then it must replace the old, non-minimial definitions with the new ones. That second part where the types are replaced is not unique to equirecursive canonicalization; the same replacement happens with BasicKind types and in the future the hybrid isorecursive system will also need to do that kind of replacement. To improve code reuse and separation of concerns, factor the type replacement logic out into a separate utility. This change slightly slows down shape canonicalization, but shape canonicalization is still much faster than global canonicalization. Nominal typing performance is not affected.
*	Remove python from CMake build (#4324)	Blaine Bublitz	2022-01-04	2	-6/+20
\| \| \|	Use CMake's configure_file() instead.
*	Compare traps in ExecutionResults (#4405)	Heejin Ahn	2021-12-29	2	-23/+39
\| \| \| \| \| \| \| \|	We used to only compare return values, and in #4369 we started comparing whether an uncaught exception was thrown. This also adds whether a trap occurred to `ExecutionResults`. So in `--fuzz-exec`, if a program with a trap loses the trap or vice versa, it will error out saying the result has changed, unless either of `--ignore-implicit-traps` or `--trans-never-happen` is set.
*	[EH][GC] Fix nested pop after removing ref.cast (#4407)	Heejin Ahn	2021-12-28	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	`ref.cast` can be statically removed when the ref's type is a subtype of the intended RTT type and either of `--ignore-implicit-traps` or `--traps-never-happen` is given: https://github.com/WebAssembly/binaryen/blob/083ab9842ec3d4ca278c95e1a33112ae7cd4d9e5/src/passes/OptimizeInstructions.cpp#L1603-L1624 Some more context: https://github.com/WebAssembly/binaryen/pull/4097#discussion_r694456784 But this can create a block in which a `pop` is nested, which makes the `catch` invalid. The test in this PR is the same as the example given by @kripken in #4237. This calls the fixup function `EHUtils::handleBlockNestedPops` at the end of the pass to fix this. Also, because this pass creates a lot of blocks in other patterns, I think it is possible there can be other patterns to cause this kind of `pop` nesting.
*	[Fuzzer] Allow empty data in --translate-to-fuzz (#4406)	Heejin Ahn	2021-12-28	1	-2/+2
\| \| \| \| \| \| \|	When a parameter and a member variable have the same name within a constructor, to access (and change) the member variable, we need to either use `this->` or change the name of the parameter. The current code ended up changing the parameter and didn't affect the status of the member variable, which remained empty.
*	[EH] Support try-delegate in interpreter (#4408)	Heejin Ahn	2021-12-28	1	-0/+16
\|
*	Remove tableSize from emscripten metadata (#4415)	Sam Clegg	2021-12-28	1	-6/+0
\| \| \|	See https://github.com/emscripten-core/emscripten/pull/15855
*	[EH] Handle nested pops after inlining (#4404)	Heejin Ahn	2021-12-20	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Inlining creates additional `block`s at inlined call sites, which can be inside a `catch`. For example: ```wast (try (do) (catch $tag (call $callee (pop i32) ) ) ) ``` After inlining, this becomes ```wast (try (do) (catch $tag (block $__inlined_func$callee (local.set $0 (pop i32) ;; Invalid!! ) (nop) ) ) ) ``` Now the `pop` is nested in a `block`, which makes this invalid. This PR runs `EHUtils::handleBlockNestedPops` at the end to assign the `pop` to a local right after the `catch`, making the code valid again: ```wast (try (do) (catch $tag (local.set $new ;; New local to store `pop` result (pop i32) ) (block $__inlined_func$callee (local.set $0 (local.get $new) ) (nop) ) ) ) ```
*	Add binary format parse checking for ref.as input type (#4389)	Alon Zakai	2021-12-16	1	-0/+3
\| \| \| \| \| \| \|	If that type is not valid then we cannot even create and finalize the node, which means we'd hit an assertion inside finalize(), before we reach the validator. Fixes #4383
*	Validate LUBs in the type fuzzer (#4396)	Thomas Lively	2021-12-15	3	-82/+150
\| \| \| \| \|	Update the LUB calculation code to use std::optional rather than out params and validate LUBs in the fuzzer to ensure that the change is NFC as intended. Also add HeapType::getLeastUpperBound to the public API as a convenience.
*	[NFC] Reuse `globallyCanonicalize` for nominal types (#4395)	Thomas Lively	2021-12-14	1	-36/+17
\| \| \| \| \| \| \| \| \| \|	Hashing and comparison of nominal HeapTypeInfos previously observed their child Types, so the Types had to be canonicalized before the HeapTypes. Unfortunately equirecursive canonicalization requires that the HeapTypes be canonicalized before the Types, so this was a point of divergence between the two systems. However, #4394 updated hashing and comparison of nominal types to not depend on child Types, so now we can harmonize the two systems by having them use the same `globallyCanonicalize` function to canonicalize their HeapTypes followed by their Types.
*	[Wasm GC] Refine results in SignatureRefining (#4380)	Alon Zakai	2021-12-14	1	-17/+70
\| \| \| \| \| \|	Similar to what DeadArgumentElimination does for individual functions, this can refine the results of a set of functions all using the same heap type, when they all return something more specific. After this PR SignatureRefining can refine both params and results and is basically complete.
*	[NFC] Simplify HeapTypeInfo hashing and comparison (#4394)	Thomas Lively	2021-12-14	1	-47/+4
\| \| \| \| \| \| \|	Now that caching of "canonical" nominal signatures is handled at a separate layer, we can remove the separate code paths for hashing and comparing HeapTypeInfos based on their structure even in nominal mode. Now hashing and comparing of HeapTypeInfos is uniformly handled by FiniteShapeHasher and FiniteShapeEquator.
*	[OptimizeInstructions] Combine some relational ops joined Or/And (Part 4) ↵	Max Graey	2021-12-14	1	-4/+53
\| \| \| \| \| \|	(#4339) (i32(x) < 0) & (i32(y) < 0) ==> i32(x & y) < 0 (i64(x) < 0) & (i64(y) < 0) ==> i64(x & y) < 0
*	Add requireFunctionContext in necessary places (#4388)	Alon Zakai	2021-12-14	1	-0/+4
\| \| \|	Fixes #4384
*	Allow fractional timeouts in wasm2js Atomics.wait. Followup to #4385 (#4387)	Alon Zakai	2021-12-14	1	-1/+1
\|
*	Fix a DeadArgumentElimination determinism bug (#4386)	Alon Zakai	2021-12-13	1	-1/+1
\| \| \| \|	We iterate on that data structure in two loops, and the fuzzer found a case where the difference in ordering actually ended up mattering in the output.
*	[Precompute][SIMD] Enable constant folding for simd (#4381)	Max Graey	2021-12-13	1	-9/+0
\|
*	Implement timeout argument in wasm2js_atomic_wait_i32 (#4385)	Sam Clegg	2021-12-11	1	-2/+7
\| \| \| \| \| \|	Also, fix bug where pointer was being used direcltly to index into Int32Array. I suppose this code had basically zero users until I tried to land this change in emscripten: https://github.com/emscripten-core/emscripten/pull/15742
*	[NFC] Refactor result type LUB computation into a helper function (#4379)	Alon Zakai	2021-12-09	4	-82/+123
\|
*	SimplifyGlobals: Handle nested read-only-to-write patterns (#4365)	Alon Zakai	2021-12-08	1	-5/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The general pattern is if (!global) { global = 1 } This PR generalizes that to handle nested appearances, if ({ if (!global) { global = 1 } !global }) { global = 1 } With this I can finally see no more "once" global operations on the hottest function in the currently slowest j2wasm benchmark ("filter"). Also added a failing testcase for something we do not handle yet.
*	[NFC] Add a separate cache for nominal signature types (#4375)	Thomas Lively	2021-12-08	1	-18/+49
\| \| \| \| \| \| \|	We have always cached nominal signature types keyed on their signatures to avoid creating extra nominal types through the `HeapType::HeapType(Signature)` constructor. However, that logic was previously built into the HeapTypeInfo canonicalization system. To allow that system to be simplified in future PRs, separate the caching into its own explicit layer.
*	Do not track effects of immutable things (#4376)	Alon Zakai	2021-12-08	1	-18/+2
\| \| \| \|	We don't use those effects now in any way, and if we need them some day we can add them back. For now they just add overhead and complexity.
*	[NFC] Deduplicate Store insertion logic (#4374)	Thomas Lively	2021-12-07	1	-50/+51
\| \| \| \| \| \| \|	Types and HeapTypes are inserted into their respective stores either by copying a reference to a `TypeInfo` or `HeapTypeInfo` or by moving a `std::unique_ptr<TypeInfo>` or `std::unique_ptr<HeapTypeInfo>`. Previously these two code paths had separate, similar logic. To reduce deduplication, combine both code paths into a single method.
*	[NFC] Use std::optional for `getCanonical` in wasm-type.cpp (#4373)	Thomas Lively	2021-12-07	1	-42/+37
\|
*	[EH] Make interpreter handle uncaught exceptions (#4369)	Heejin Ahn	2021-12-06	1	-24/+28
\| \| \| \| \| \| \| \| \|	When a wasm exception is thrown and uncaught in the interpreter, it caused the whole interpreter to crash, rather than gracefully reporting it. This fixes the problem, and also compares whether an uncaught exception happened when comparing the results before and after optimizations in `--fuzz-exec`. To do that, when `--fuzz-exec` is given, we now compare results even when the function does not have return values. Logs for some existing test have changed because of this.
*	[EH] Fix binary parsing for catchless try + inner delegate (#4370)	Heejin Ahn	2021-12-06	1	-7/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We do some postprocessing after parsing `Try` to make sure `delegate` only targets `try`s and not `block`s: https://github.com/WebAssembly/binaryen/blob/9659f9b07c1196447edee68fe04c8d7dd2480652/src/wasm/wasm-binary.cpp#L6404-L6426 But in case the outer `try` has neither of `catch` nor `delegate`, the previous code just return prematurely, skipping the postprocessing part, resulting in a binary parsing error. This PR removes that early-exiting code. Some test outputs have changed because `try`s are assigned labels after the early exit. But those labels can be removed by other optimization passes when there is no inner `rethrow` or `delegate` that targets them. (On a side note, the restriction that `delegate` cannot target a `block` has been removed a few months ago in the spec, so if a `delegate` targets a `block`, it means it is just rethrown from that block. But I still think this is a convenient invariant to hold at least within the binaryen IR. I'm planning to allow parsing of `delegate` targeting `block`s later, but I will make them point to `try` when read in the IR. At the moment the LLVM toolchain does not generate such code.)
*	[EH] Support try-delegate in EffectAnalyzer (#4368)	Heejin Ahn	2021-12-06	6	-19/+43
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds support for try-delegate in `EffectAnalyzer`. Without this support, the expresion below has been incorrectly classified as "cannot throw", because the previous code considered everything inside `try`-`catch_all` as "cannot throw". This is not the case when there is a `delegate` that can bypass the `catch_all`. ```wasm try $l0 try try throw $e delegate $l0 catch_all end end
*	[OptimizeInstructions] Combine some relational ops joined Or/And (Part 3) ↵	Max Graey	2021-12-04	1	-9/+47
\| \| \| \| \| \| \| \|	(#4338) (i32(x) < 0) \| (i32(y) < 0) ==> i32(x \| y) < 0 (i32(x) != 0) \| (i32(y) != 0) ==> i32(x \| y) != 0 Likewise for i64.
*	SimplifyGlobals: Ignore irrelevant effects in read-only-to-write (#4363)	Alon Zakai	2021-12-02	1	-41/+105
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously this pass would see something like this and fail: if (foo() + global) { global = 1; } The call to foo() has side effects, so we did not optimize. However, in such a case the side effects are safe: they happen anyhow, regardless of the global that we are optimizing. That is, "global" is read only to be written, even though other things also influence the decision to write it. But "global" is not used in a way that is observable: we can remove it, and nothing will notice (except for things getting smaller/faster). In other words, this PR will let us optimize the above example, while it also needs to avoid optimizing the dangerous cases, like this: if (foo(global)) { global = 1; } Here "global" flows into a place that notices its value and may use it aside from deciding to write that global. A common case where we want to optimize is combined ifs, if (foo()) { if (global) { global = 1; } } which the optimizer turns into if (foo() & global) { global = 1; } With this PR we can handle those things too. This lets us optimize out some important globals in j2wasm like the initializer boolean for the Math object, reducing some total 0.5% of code size.
*	[NFC] Avoid some unnecessary copies of PassOptions (#4361)	Alon Zakai	2021-12-01	2	-10/+10
\| \| \| \| \| \|	PassOptions is a fairly large structure and even includes a std::map. I also have plans to add further fields there to make it even larger. Before doing that I noticed that in some places we copy it instead of being consistent and taking it by reference, which this PR fixes.
*	Handle try in Flatten pass (#2567)	Heejin Ahn	2021-11-29	2	-0/+40
\| \| \|	This adds handling of try in the Flatten pass.
*	CoalesceLocals: Use ValueNumbering (#4355)	Alon Zakai	2021-11-24	1	-14/+20
\| \| \| \| \| \| \| \| \| \| \| \|	This removes the old hardcoded value numbering in that pass and makes it use the new code that was split into helper code. The immediate benefit of this is to make the code aware of identical constants: if two locals have the same constant then they do not interfere. Future improvements to numbering will also automatically help here. This changes some constants in existing tests so that they keep testing what they were testing before, and adds new tests for the new benefit here. This implements a proposed TODO from #4314
*	wasm2js: Don't assume the existence of js assert function (#4357)	Sam Clegg	2021-11-24	1	-1/+1
\| \| \| \| \| \|	Its seems that with this emscripten change DCE is able to remove the `assert` JS runtime function making this call to assert fail with `ReferenceError: assert is not defined`.
*	SimplifyGlobals: If all writes write the initial value, they are unneeded ↵	Alon Zakai	2021-11-23	1	-10/+32
\| \| \| \|	(#4356)
*	Modernize code to C++17 (#3104)	Max Graey	2021-11-22	82	-534/+268
\|
*	Change from storing Signature to HeapType on CallIndirect (#4352)	Thomas Lively	2021-11-22	30	-107/+65
\| \| \| \| \| \| \| \| \| \| \| \|	With nominal function types, this change makes it so that we preserve the identity of the function type used with call_indirect instructions rather than recreating a function heap type, which may or may not be the same as the originally parsed heap type, from the function signature during module writing. This will simplify the type system implementation by removing the need to store a "canonical" nominal heap type for each unique signature. We previously depended on those canonical types to avoid creating multiple duplicate function types during module writing, but now we aren't creating any new function types at all.
*	Add missing include in numbering.h (#4354)	Alon Zakai	2021-11-22	1	-0/+1
\|
*	Add fixup function for nested pops in catch (#4348)	Heejin Ahn	2021-11-22	10	-66/+246
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds `EHUtils::handleBlockNestedPops`, which can be called at the end of passes that has a possibility to put `pop`s inside `block`s. This method assumes there exists a `pop` in a first-descendant line, even though it can be nested within a block. This allows a `pop` to be nested within a `block` or a `try`, but not a `loop`, since that means the `pop` can run multile times. In case of `if`, `pop` can exist only in its condition; if a `pop` is in its true or false body, that's not in the first-descendant line. This can be useful when optimization passes create blocks to do transformations. Wrapping expressions wiith a block does not change semantics most of the time, but if pops happen to be inside a block generated by those passes, they can result in invalid binaries. To test this, this adds `passes/test_passes.cpp`, which is intended to contain multiple test passes that test a single (or more) utility functions separately. Without this kind of pass, it is hard to test various cases in which nested `pop`s can be generated in existing passes. This PR also adds `PassRegistry::registerTestPass`, which registers a pass that's intended only for internal testing and does not show up in `wasm-opt --help`. Fixes #4237.
*	Remove 'using namespace std' (NFC) (#4349)	Heejin Ahn	2021-11-22	8	-19/+9
\|
*	Check for correct subtyping in the type fuzzer (#4350)	Thomas Lively	2021-11-20	5	-91/+126
\| \| \| \| \|	Check that types that were meant to have a subtype relationship actually do. To expose the intended subtyping to the fuzzer, expose `subtypeIndices` in the return value of the type generation function.