forks/binaryen.git -

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[Outlining] HashStringifyWalker (#5810)	Ashley Nelson	2023-07-13	3	-1/+97
\| \| \| \| \|	This PR is part of the effort to bring Outlining to Binaryen. HashStringifyWalker is a subclass of StringifyWalker #5772, and used to encode a wasm module as a "string". Our "string" is a vector and each symbol is a uint64_t, providing us with a capacity of 2^64 symbols.
*	Small fixes to lattice and analyzer (#5815)	Bruce He	2023-07-13	2	-9/+9
\| \| \|	Updates comments. Moves wasm-traversal.h to transfer function classes.
*	Add a pass to sort functions by name (#5811)	Alon Zakai	2023-07-12	3	-0/+22
\|
*	Fuzzer: Emit more variations of If (#5806)	Alon Zakai	2023-07-10	1	-2/+24
\| \| \| \| \| \|	Before we always created if-elses. Now we also create an If with one arm some of the time, when we can. Also, sometimes make one if arm unreachable, if we have two arms.
*	GUFA: Refine casts (#5805)	Alon Zakai	2023-07-07	2	-1/+22
\| \| \| \| \| \| \|	If we see (ref.cast $A) but we have inferred that a more refined type will be present there at runtime $B then we can refine the cast to (ref.cast $B). We could do the same even when a cast is not present, but that would increase code size. This optimization keeps code size constant.
*	Generalize Finite Powerset Lattice to Any Given Type (#5800)	Bruce He	2023-07-06	3	-21/+72
\| \| \| \| \| \| \| \| \| \|	This change creates a lattice, FinitePowersetLattice, to represent finite powerset lattices constructed from sets containing members of arbitrary type The bitvector finite powerset lattice class is renamed FiniteIntPowersetLattice. The templated FinitePowersetLattice class contains a FiniteIntPowersetLattice object, and over that provides functionality to map lattice element members to bitvector indices. Methods are also provided by the lattice to add or remove members of the given type from lattice members as an abstraction over flipping bits in the bitvector.
*	Make the Transfer Function a Generic, Interchangeable Type for the Monotone ↵	Bruce He	2023-07-06	4	-136/+218
\| \| \| \| \| \| \|	Analyzer (#5794) This change makes the transfer function an object separate from the monotone analyzer. The transfer function class type is a generic template of the monotone analyzer, and an instance of the transfer function is instantiated and then passed into the monotone analyzer via its constructor. The API of the transfer function is enforced by a static assertion. This change also introduces LivenessTransferFunction as a transfer function for liveness analysis as an example.
*	Initial support for `final` types (#5803)	Thomas Lively	2023-07-06	11	-64/+152
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Implement support in the type system for final types, which are not allowed to have any subtypes. Final types are syntactically different from similar non-final types, so type canonicalization is made aware of finality. Similarly, TypeMerging and TypeSSA are updated to work correctly in the presence of final types as well. Implement binary and text parsing and emitting of final types. Use the standard text format to represent final types and interpret the non-standard "struct_subtype" and friends as non-final. This allows a graceful upgrade path for users currently using the non-standard text format, where they can update their code to use final types correctly at the point when they update to use the standard format. Once users have migrated to using the fully expanded standard text format, we can update update Binaryen's parsers to interpret the MVP shorthands as final types to match the spec without breaking those users. To make it safe for V8 to independently start interpreting types declared without `sub` as final, also reserve that shorthand encoding only for types that have no strict subtypes.
*	[NFC] Fix the use of "strict" in subtypes.h (#5804)	Thomas Lively	2023-07-06	9	-24/+24
\| \| \| \| \| \| \| \|	Previously we incorrectly used "strict" to mean the immediate subtypes of a type, when in fact a strict subtype of a type is any subtype excluding the type itself. Rename the incorrect `getStrictSubTypes` to `getImmediateSubTypes`, rename the redundant `getAllStrictSubTypes` to `getStrictSubTypes`, and rename the redundant `getAllSubTypes` to `getSubTypes`. Fixing the capitalization of "SubType" to "Subtype" is left as future work.
*	[DeadArgumentElimination] Optimize function body after parameter refinement ↵	Jérôme Vouillon	2023-07-06	1	-7/+11
\| \| \| \| \| \|	(#5799) It can be useful to optimize a function body after its parameters are refined, like we do for other parameter changes.
*	Print supertype declarations using the standard format (#5801)	Thomas Lively	2023-07-06	1	-30/+14
\| \| \| \| \| \|	Use the standard "(sub $super ...)" format instead of the non-standard "XXX_supertype ... $super" format. In a follow-on PR implementing final types, this will allow us to print and parse the standard text format for final types right away with a smaller diff.
*	OptimizeInstructions: Loop on fallthrough values in RefTest (#5797)	Alon Zakai	2023-07-05	1	-31/+43
\| \| \| \| \|	This parallels the code in RefCast. Previously we only looked at the type reaching us, but intermediate fallthrough values can let us optimize too. In particular, we were not optimizing (ref.test (local.tee ..)) if the tee was to a less-refined type.
*	[Outlining] StringifyWalker (#5772)	Ashley Nelson	2023-06-30	2	-0/+207
\| \| \|	StringifyWalker is a new Walker with UnifiedExpressionVisitor. This walker performs a shallow visit of control-flow (try, if, block, loop) expressions and their simple expression siblings before then visiting the children of each control-flow expression in postorder. As a result, this walker un-nests nested control flow structures, so the expression visit order does not correspond to a normal postorder traversal of the function.
*	[NFC] Add a helper to get function DCE names in wasm-metadce (#5793)	Alon Zakai	2023-06-30	1	-30/+15
\|
*	[wasm-metadce] Note ref.func connections + fix rooting of segment offsets ↵	Jérôme Vouillon	2023-06-29	1	-13/+28
\| \| \| \|	(#5791)
*	Dynamic Sized Bitvector Powerset Lattice for Static Analysis (#5784)	Bruce He	2023-06-29	5	-171/+234
\| \| \|	This change introduces a finite-height powerset lattice FinitePowersetLattice where each element's height is determined when constructed in runtime. This is implemented using a vector of `bools. This change also modifies BlockState and MonotoneCFGAnalyzer to be templated on a generic lattice type instead of using a hardcoded lattice. It additionally fixes an iterator bug in MonotoneCFGAnalyzer::fromCFG which assigned a temporary iterator object as predecessor and successor pointers to BlockStates instead of pointers to actual BlockState objects.
*	Limit printing of Literal[s] in a general way (#5792)	Alon Zakai	2023-06-28	1	-16/+49
\| \| \| \| \| \| \| \|	Previously we limited printing in a single Literals. But we can have infinitely recursive GC literals, or just huge graphs even without infinite recursion where no single Literals is that big (but we still get exponential blowup). This PR adds a general limit on how much we print once we start to print a Literal or Literals.
*	Fix opt/shrink levels when running the optimizer multiple times, Part 2 (#5787)	Alon Zakai	2023-06-27	3	-41/+39
\| \| \| \| \| \| \| \| \| \| \|	This is a followup to #5333 . That fixed the selection of which passes to run, but forgot to also fix the global state of the current optimize/shrink levels. This PR fixes that. As a result, running -O3 -Oz will now work as expected: the first -O3 will run the right passes (as #5333 fixed) and while running them, the global optimize/shrinkLevels will be -O3 (and not -Oz), which this PR fixes. A specific result of this is that -O3 -Oz used to inline less, since the invocation of inlining during -O3 thought we were optimizing for size. The new test verifies that we do fully inline in the first -O3 now.
*	Liveness Analysis Proof of Concept (#5771)	Bruce He	2023-06-23	7	-0/+410
\| \| \|	This introduces a limited monotone flow-sensitive liveness analysis on local indices as an initial proof of concept for the creation of a monotone flow-sensitive static analysis framework. Tests are included in test/gtest/cfg.cpp.
*	PostEmscripten: Preserve __em_js__ exports in side modules (#5780)	Sam Clegg	2023-06-23	1	-4/+10
\|
*	Fuzzing for Try and Throw (#5776)	Alon Zakai	2023-06-21	3	-3/+79
\|
*	Limit literal printing to a reasonable limit (#5779)	Alon Zakai	2023-06-21	1	-0/+8
\| \| \| \| \| \| \| \| \|	Without this, a massive GC array for example can end up being printed as a huge amount of output, which is a problem in the fuzzer. I noticed this because a testcase was taking minutes to run. Perhaps we should add an option (BINARYEN_PRINT_FULL=1?) to disable this limit, but let's see if we ever need that.
*	Fix pop assertion (#5777)	Alon Zakai	2023-06-20	1	-1/+1
\| \| \|	Subtypes are allowed as well, not just exact matches, in the pop value's type.
*	[NFC] Simplify `Tuple` by making it an alias of `TypeList` (#5775)	Thomas Lively	2023-06-20	6	-65/+41
\| \| \| \| \| \| \|	Rather than wrap a `TypeList`, make `Tuple` an alias of `TypeList`. This means removing `Tuple::toString`, but that had no callers and was of limited use for debugging anyway. In return, the use of tuples becomes much less verbose. In the future, it may make sense to remove one of `Tuple` and `TypeList`.
*	[EH] Add pass to remove EH instructions (#5770)	Heejin Ahn	2023-06-15	4	-0/+81
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This pass strips all EH stuff, including EH instructions and tags, from the input module and disables the EH feature from the features section. 1. This removes `catch` and `catch_all` blocks from the code. So ```wast (try (do (some code) ) (catch ... ) ) ``` becomes just `(some code)`. Note that all `rethrow`s will be removed with `catch`es. Note that all `rethrow`s will be removed with `catch`es. 2. This converts 'throw (...)` into `unreachable`. Note that `rethrows 3. This removes all tags from the module, which are unused anyway after 1 and 2. 4. This removes exception handling feature from the features section. You can use the pass with ```console $ wasm-opt --enable-exception-handling --strip-eh INPUT -o OUTPUT ``` This is not an optimization pass, so it is not run unless you specify the pass explicitly. This is in effect similar to Clang's `-fignore-exceptions`, in which you can throw but it will result in a crash and we compile away all landing pads. This can be used for people who don't (or can't) use `-fignore-exceptions` in their build settings or who want to compile away `catch` blocks later. Closes emscripten-core/emscripten#19585.
*	[NFC] Rewrite isorecursive canonicalization (#5774)	Thomas Lively	2023-06-15	2	-341/+159
\| \| \| \| \| \| \|	Rewrite the type canonicalization algorithm to fully canonicalize a single rec group at a time rather than canonicalizing multiple rec groups at once in multiple stages. The previous code was useful when it had to be shared with equirecursive and nominal canonicalization, but was much more complicated than necessary for just isorecursive canonicalization, which is all we support today.
*	EffectAnalyzer: Assume we execute the two things whose effects we compare ↵	Alon Zakai	2023-06-13	1	-6/+45
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(#5764) EffectAnalyzer::canReorder/invalidate now assume that the things from whom we generated the effects both execute (or, rather, that if the first of them doesn't transfer control flow then they execute). If they both execute then we can do more work in TrapsNeverHappen mode, since we can then reorder this for example: (global.set ..) (i32.load ..) The load may trap, but in TNH mode we assume it won't. So we can reorder those two. However, if they did not both execute then we could be in this situation: (global.set ..) (br_if ..) (i32.load) Reordering the load and the set here would be invalid, because we could make the load execute when it didn't execute before, and it could now start to actually trap at runtime. This new assumption seems obvious, since we don't compare the effects of things unless they are adjacent and with no control flow between them - otherwise, why compare them? To be sure, I manually reviewed every single use of EffectAnalyzer::canReorder/invalidate in the entire codebase. I've also been fuzzing this for several days (hundreds of thousands of iterations), and have not seen any problem. This was motivated by seeing that #5744 should be able to do more work in TNH mode, but it wasn't. New tests show the benefits there in OptimizeCasts as well as in SimplifyLocals.
*	[NFC] Simplify the HeapTypeStore (#5765)	Thomas Lively	2023-06-13	1	-107/+38
\| \| \| \| \| \| \| \| \| \| \| \| \|	Now that we support only isorecursive typing, which canonicalizes heap types by the rec group rather than individually, we no longer need a canonicalizing store for heap types. Simplify the `Store` implementation, which was previously generic over both `HeapType`s and `Type`s, to instead work only for `Type`s. Replace `globalHeapTypeStore` with a simple vector to keep canonicalized `HeapTypeInfo`s alive. Also simplify global canonicalization to not replace heap type uses, since that is already done separately as part of isorecursive rec group canonicalization. This simplification does require adding new information to `CanonicalizationState`, but further work will simplify that away as well.
*	Remove BinaryenIRToBinaryWriter::visit (#5766)	Heejin Ahn	2023-06-13	1	-4/+0
\| \| \| \|	This is not used. We use the parent's `visit` method instead: https://github.com/WebAssembly/binaryen/blob/585af93ec6a22feb8954bc118f0bff997d1fc165/src/wasm-stack.h#L233-L262
*	Rename WasmBinaryBuilder to WasmBinaryReader (NFC) (#5767)	Heejin Ahn	2023-06-13	7	-212/+204
\| \| \| \| \| \|	We have `WasmBinaryBuilder` that read binary into Binaryen IR and `WasmBinaryWriter` that writes Binaryen IR to binary. To me `WasmBinaryBuilder` sounds similar to `WasmBinaryWriter`, which builds binary. How about renaming it to `WasmBinaryReader`?
*	DeadArgumentElimination: Do not error on bottom types in result refining (#5763)	Alon Zakai	2023-06-12	1	-1/+6
\| \| \| \|	More generally, the LUB computation that code relies on did not handle bottom types properly.
*	ConstantFieldPropagation: Track copied values properly (#5761)	Alon Zakai	2023-06-12	1	-24/+56
\| \| \| \|	The logic ignored copied values, which was fine for struct.get operations but not for struct.new.
*	Update br_on_cast binary and text format (#5762)	Thomas Lively	2023-06-12	6	-46/+61
\| \| \| \| \| \| \| \| \| \| \| \|	The final versions of the br_on_cast and br_on_cast_fail instructions have two reference type annotations: one for the input type and one for the cast target type. In the binary format, this is represented as a flags byte followed by two encoded heap types. Upgrade all of the tests at once to use the new versions of the instructions and drop support for the old instructions from the text parser. Keep support in the binary parser to avoid breaking users, though. Drop some binary tests of deprecated instruction encodings that would be more effort to update than they're worth. Re-land with fixes of #5734
*	Remove handleBranchForVisitBlock (#5760)	Heejin Ahn	2023-06-09	1	-15/+0
\| \| \| \|	This method doesn't seem to be used anymore. It was added in #1771 and its uses were removed in #2492.
*	Validate tag param types (#5759)	Alon Zakai	2023-06-08	1	-0/+5
\| \| \| \| \|	We already validated function params, but were missing tags. Without this the fuzzer can get confused if a type is only used in a tag.
*	Strings: Add initial validation checks (#5758)	Alon Zakai	2023-06-08	1	-0/+91
\| \| \| \| \| \| \| \| \|	This is far from comprehensive, but it checks strings being enabled for all the instructions. Without this, the fuzzer can get confused because it checks if code validates and then proceeds under that assumption, so any missing validation checks can cause problems (specifically, if we have a string.const without strings enabled then we error during writing of the string, since we don't do the initial pass to find all strings to deduplicate them).
*	TypeRefining: Fix a bug with chains of StructGets (#5757)	Alon Zakai	2023-06-08	1	-1/+23
\| \| \| \| \| \| \| \| \| \| \| \| \|	If we have (struct.get $A (struct.get $B then if both types end up refined we may have a problem. If the inner one is refined to emit nullref then the outer one no longer knows what type it is, since it depends on the type of the ref child for that in our IR. We can't just skip updating it, as the outside may depend on its new refined type to validate. To avoid errors here, just make this code that is effectively unreachable also actually unreachable.
*	[Strings] Fix non-nullable string emitting in the binary format (#5756)	Alon Zakai	2023-06-07	1	-3/+9
\| \| \|	Related to #5737 which did something similar for other types.
*	[NFC] Remove redundant code from EffectAnalyzer (#5754)	Alon Zakai	2023-06-06	1	-5/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This PR removes a check for transfersControlFlow() && other.trap which is already checked higher up in the code, here: transfersControlFlow() && other.hasSideEffects() In binaryen/src/ir/effects.h , lines 223 to 224 That last code handles the first code because trapping is part of hasSideEffects().
*	Move casts which are immediate children of local.gets to earlier local.gets ↵	Bruce He	2023-06-06	1	-16/+324
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(#5744) In the OptimizeCasts pass, it is useful to move more refined casts as early as possible without causing side-effects. This will allow such casts to potentially trap earlier, and will allow the OptimizeCasts pass to use more refined casts earlier. This change allows a more refined cast to be duplicated at an earlier local.get expression. The later instance of the cast will then be eliminated in a later optimization pass. For example, if we have the following instructions: (drop (local.get $x) ) (drop (ref.cast $A (local.get $x) ) (drop (ref.cast $B (local.get $x) ) ) Where $B is a sublcass of $A, we can convert this to: (drop (ref.cast $B (local.get $x) ) ) (drop (ref.cast $A (local.get $x) ) (drop (ref.cast $B (local.get $x) ) ) Concretely we will save the first cast to a local and use it in the other local.gets.
*	Fix emitting of function reference types without GC (#5737)	Thomas Lively	2023-06-05	1	-14/+17
\| \| \| \| \| \| \| \| \|	We previously had logic to emit GC types used in the IR as their corresponding top types when GC was not enabled (so e.g. nullfuncref would be emitted as funcref), but the logic was not robust enough and non-null function references were not properly emitted as funcref. Refactor the relevant code to be more robust and future-proof, and add a test demonstrating that the lowering works as intended.
*	StackIR: Remove nops (#5746)	Alon Zakai	2023-05-30	1	-0/+17
\| \| \| \| \| \| \|	No nop instruction is necessary in wasm, so in StackIR we can simply remove them all. Fixes #5745
*	wasm-merge: Preserve imports when copying module items (#5743)	Jérôme Vouillon	2023-05-26	1	-0/+4
\| \| \| \|	The import information of Tags and Memories was not preserved.
*	Revert "Update br_on_cast binary and text format (#5734)" (#5740)	Alon Zakai	2023-05-23	6	-40/+46
\| \| \| \| \| \| \|	This reverts commit b7b1d0df29df14634d2c680d1d2c351b624b4fbb. See comment at the end of #5734: It turns out that dropping the old opcodes causes problems for current users, so let's revert this for now, and later we can figure out how best to do the update.
*	Fuzzer: Limit ArrayNew sizes most of the time (#5738)	Alon Zakai	2023-05-22	1	-2/+11
\|
*	TypeSSA: Handle collisions by adding a hash to ensure a fresh rec group (#5724)	Alon Zakai	2023-05-19	2	-22/+100
\| \| \|	Fixes #5720
*	Update br_on_cast binary and text format (#5734)	Thomas Lively	2023-05-19	6	-46/+40
\| \| \| \| \| \| \| \| \| \|	The final versions of the br_on_cast and br_on_cast_fail instructions have two reference type annotations: one for the input type and one for the cast target type. In the binary format, this is represented as a flags byte followed by two encoded heap types. Since these instructions have been in flux for a while, do not attempt to maintain backward compatibility with older versions of the instructions. Instead, upgrade all of the tests at once to use the new versions of the instructions. Drop some binary tests of deprecated instruction encodings that would be more effort to update than they're worth.
*	Improve TODO docs in OptimizeCasts (#5732)	Alon Zakai	2023-05-18	1	-7/+17
\|
*	Vacuum code leading up to a trap in TrapsNeverHappen mode (#5228)	Alon Zakai	2023-05-17	1	-1/+79
\| \| \| \| \| \| \| \| \| \| \| \|	This adds two rules to vacuum in TNH mode: if (..) trap() => if (..) {} { stuff, trap() } => {} That is, we assume traps never happen so an if will not branch to one, and code right before a trap can be assumed to not execute. Together, we should be removing practically all possible code in TNH mode (though we could also add support for br_if etc.).
*	avoid incomplete type in a vector (#5730)	walkingeyerobot	2023-05-17	1	-17/+17
\| \| \| \| \|	Swap the order of struct declarations to avoid using an incomplete type in a vector. In C++20, using std::vector with an incomplete type often becomes a build failure due to increased usage of constexpr for vector members.