forks/binaryen.git -

	Commit message	Author	Age	Files	Lines
...
*	Implement `array.new_data` and `array.new_elem` (#5214)	Thomas Lively	2022-11-07	4	-25/+212
	In order to test them, fix the binary and text parsers to accept passive data segments even if a module has no memory. In addition to parsing and emitting the new instructions, also implement their validation and interpretation. Test the interpretation directly with wasm-shell tests adapted from the upstream spec tests. Running the upstream spec tests directly would require fixing too many bugs in the legacy text parser, so it will have to wait for the new text parser to be ready.
*	Multi-Memories Asyncify (#5222)	Ashley Nelson	2022-11-07	2	-0/+921
	Adds support for the Asyncify pass to use Multi-Memories. This is specified by passing the flag --asyncify-in-secondary-memory. Another flag, --asyncify-secondary-memory-size, is used to specify the initial and max size of the secondary memory.
*	[Wasm GC] RSE: Switch local.get to use a more refined type when possible (#5216)	Alon Zakai	2022-11-04	1	-1/+200
	Similar to #5194 but for RedundantSetElimination. This has similar benefits in terms of using a more refined local in hopes of avoiding casts in followup opts, but unlike SimplifyLocals this will operate across basic blocks. To do this, we need to track not just local.set but also local.get in that pass. Then in each basic block we can track the equivalent locals and pick from them. I see a few dozen casts removed in the J2Wasm binary. Often stuff like this happens:
	  y = cast(x);
	  if (..) {
	    foo(x); // this could use y
	  }
*	Fix SmallSet ordering (#5218)	Alon Zakai	2022-11-04	1	-0/+60
	We did not preserve the ordering of the fixed-size storage there.
*	[Wasm GC] Fix GUFA on externalize/internalize (#5220)	Alon Zakai	2022-11-04	1	-0/+70
	These operations emit a completely different type than their input, so they must be marked as roots, and not as things that flow values through them (because then we filter everything out as the types are not compatible). Fixes #5219
*	Update default features to match new llvm defaults (#5212)	Sam Clegg	2022-11-03	9	-74/+48
	See: https://reviews.llvm.org/D125728
*	Fix binary parsing of data segment memory (#5208)	Thomas Lively	2022-11-03	1	-0/+14
	The binary parser was eagerly getting the name of memories to set the `memory` field of data segments, but that meant that when the memory names were updated later while parsing the names section, the data segment memory fields would become out of date. Fix the issue by deferring setting the `memory` fields like we do for other parts of IR that reference memories. Also fix a segfault in the validator that was triggered by the reproducer for this bug before the bug was fixed. Fixes #5204.
*	RedundantSetElimination: Look at fallthrough values (#5213)	Alon Zakai	2022-11-03	1	-3/+0
	This can help in rare cases in MVP wasm, say for the return value of a block. But for wasm GC it is very important due to casts. Similar logic was added as part of #5194 for SimplifyLocals. It should probably have been in a separate PR then. This does the right thing for RedundantSetElimination, as a separate PR. Full tests will appear in that later PR (it is not really possible to test the GC side yet - we need the logic in the later PR that actually switches to a more refined local index when available).
*	[C API] Add APIs to inspect compound heap types (#5195)	dcode	2022-11-03	2	-3/+33
	Adds C APIs to inspect compound struct, array and signature heap types:
	Obtain field types, field packed types and field mutabilities of struct types:
	  BinaryenStructTypeGetNumFields (to iterate)
	  BinaryenStructTypeGetFieldType
	  BinaryenStructTypeGetFieldPackedType
	  BinaryenStructTypeIsFieldMutable
	Obtain element type, element packed type and element mutability of array types:
	  BinaryenArrayTypeGetElementType
	  BinaryenArrayTypeGetElementPackedType
	  BinaryenArrayTypeIsElementMutable
	Obtain parameter and result types of signature types:
	  BinaryenSignatureTypeGetParams
	  BinaryenSignatureTypeGetResults
*	SimplifyLocals: Fix handling of subtyping (#5210)	Alon Zakai	2022-11-02	1	-5/+98
	We just checked if the new type we prefer (when switching a local to a more refined one in #5194) is different than the old type. But that check at the end must check it is a subtype as well. Diff without whitespace is smaller.
*	ReorderGlobals pass (#4904)	Alon Zakai	2022-11-02	3	-0/+300
	This sorts globals by their usage (and respecting dependencies). If the module has very many globals then using smaller LEBs can matter. If there are fewer than 128 globals then we cannot reduce size, and the pass exits early (so this pass will not slow down MVP builds, which usually have just 1 global, the stack pointer). But with wasm GC it is common to use globals for vtables etc., and often there is a very large number of them.
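The 128 threshold above follows from how unsigned LEB128 stores 7 payload bits per byte. A minimal sketch of that arithmetic, where `uleb128Size` and `checkLebBoundary` are illustrative names and not functions from Binaryen:

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Number of bytes needed to encode `value` as unsigned LEB128,
// which stores 7 payload bits per byte.
std::size_t uleb128Size(std::uint64_t value) {
  std::size_t bytes = 1;
  while (value >= 0x80) {
    value >>= 7;
    ++bytes;
  }
  return bytes;
}

// Illustrative self-check: indices 0..127 fit in one byte, so a module
// with fewer than 128 globals cannot gain anything from reordering.
void checkLebBoundary() {
  assert(uleb128Size(127) == 1);
  assert(uleb128Size(128) == 2);
}
```

With more than 128 globals, giving the most frequently referenced ones the lowest indices lets each of those references use the one-byte encoding.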
*	[Wasm GC] SimplifyLocals: Switch local.get to use a more refined type when possible (#5194)	Alon Zakai	2022-11-01	6	-53/+194
	  (local.set $refined (cast (local.get $plain)))
	  ..
	  .. (local.get $plain) .. ;; we can change this to read from $refined
	By using the more refined type we may be able to eliminate casts later. To do this, look at the fallthrough value (so we can look through a cast or a block value - this is the reason for the small wasm2js improvements in tests), and also extend the code that picks which local index to read to look at types (previously we just ignored any pairs of locals with different types).
*	[NFC] Mention relevant flags in validator errors (#5203)	Alon Zakai	2022-11-01	2	-13/+10
	E.g.
	  Atomic operation (atomics are disabled)
	  =>
	  Atomic operations require threads [--enable-threads]
*	Multi-Memories Lowering Pass (#5107)	Ashley Nelson	2022-11-01	3	-0/+253
	Adds a multi-memories lowering pass that will create a single combined memory from the memories added to the module. This pass assumes that each memory is configured the same (type, shared). This pass also:
	- replaces existing memory.size instructions with a custom function that returns the size of each memory as if they existed independently
	- replaces existing memory.grow instructions with a custom function, using global offsets to track the page size of each memory so data doesn't overlap in the single combined memory
	- adjusts the offsets of active data segments
	- adjusts the offsets of Loads/Stores
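The offset bookkeeping described above can be sketched roughly as follows. This is an assumption-laden illustration, not the pass's code: `CombinedMemory` and its members are hypothetical names, the memories are assumed to be laid out back to back, and the sketch only tracks sizes (an actual lowering would also have to move the data of later memories when an earlier one grows).

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

constexpr std::uint64_t kPageSize = 65536; // wasm page size in bytes

// Hypothetical model of several memories packed into one combined memory.
struct CombinedMemory {
  // Current size of each original memory, in pages.
  std::vector<std::uint64_t> pages;

  // Byte offset in the combined memory where memory `index` begins.
  std::uint64_t baseOffset(std::size_t index) const {
    assert(index < pages.size());
    std::uint64_t offset = 0;
    for (std::size_t i = 0; i < index; ++i) {
      offset += pages[i] * kPageSize;
    }
    return offset;
  }

  // Stand-in for memory.size: each memory reports its own page count.
  std::uint64_t size(std::size_t index) const {
    assert(index < pages.size());
    return pages[index];
  }

  // Adjust a load/store address from memory `index` to the combined memory.
  std::uint64_t translate(std::size_t index, std::uint64_t address) const {
    return baseOffset(index) + address;
  }

  // Stand-in for memory.grow: returns the old size in pages. Later
  // memories implicitly start at a higher base offset afterwards.
  std::uint64_t grow(std::size_t index, std::uint64_t deltaPages) {
    assert(index < pages.size());
    std::uint64_t oldPages = pages[index];
    pages[index] += deltaPages;
    return oldPages;
  }
};
```

For example, with memories of 1, 2 and 3 pages, address 16 in the second memory maps to byte 65552 of the combined memory, and growing the first memory by one page shifts the second memory's base to 131072.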
*	CodePushing: Push into If arms (#5191)	Alon Zakai	2022-11-01	2	-75/+724
	Previously the pass only pushed past an if or a br_if. This does the same but into an if arm. On Wasm GC for example this can perform allocation sinking:
	  function foo() {
	    x = new A();
	    if (..) {
	      use(x);
	    }
	  }
	  =>
	  function foo() {
	    if (..) {
	      x = new A(); // this moved
	      use(x);
	    }
	  }
	The allocation won't happen if we never enter the if. This helps wasm MVP too, and in fact some existing tests benefit.
*	Fix a fuzz issue with scanning heap read types (#5184)	Alon Zakai	2022-11-01	1	-0/+45
	If a heap type only ever appears as the result of a read, we must include it in the analysis in ModuleUtils, even though it isn't written in the binary format. Otherwise analyses using ModuleUtils can error on not finding all types in the list of types. Fixes #5180
*	Fix br_if fallthrough value (#5200)	Alon Zakai	2022-10-31	1	-0/+47
	The fallthrough there is trickier because the value is evaluated before the condition. Unlike other fallthroughs, the value is not last, so we need to check if the condition (which is after it) interferes with it.
*	[NFC] Rewrite PossibleContents::combine to be static (#5192)	Alon Zakai	2022-10-28	1	-4/+11
	This makes the logic symmetric and easier to read. Measuring speed, this seems identical to before, so that concern seems fine.
*	[Wasm GC] Fix the depth of the new array heap type (#5186)	Alon Zakai	2022-10-28	2	-1/+62
*	[C API] Add essential heap type utilities (#5160)	dcode	2022-10-26	1	-21/+54
	Adds heap type utilities to the C API:
	  BinaryenHeapTypeIsBasic
	  BinaryenHeapTypeIsSignature
	  BinaryenHeapTypeIsStruct
	  BinaryenHeapTypeIsArray
	  BinaryenHeapTypeIsSubType
*	Move removable code in CodePushing (#5187)	Alon Zakai	2022-10-25	1	-0/+48
	This is safe since we "partially remove" it: we don't move it to a place it might execute more, but make it possibly execute less. See the new comment for more details. Motivated by wasm GC but this can help wasm MVP as well. In both cases loads from memory can trap, which limits what the VM can do to optimize them past conditions, but in trapsNeverHappens we can do that at the toolchain level:
	  x = read();
	  if () {
	    ..
	  }
	  use(x);
	  =>
	  if () {
	    ..
	  }
	  x = read(); // moved to here, and might not execute if the if did a break/return
	  use(x);
*	[Parser] Parse `return` (#5181)	Thomas Lively	2022-10-25	1	-20/+77
	Unlike in the legacy parser, we cannot depend on the folded text format to determine how many values to return, so we determine that solely based on the current function context. To handle multivalue return correctly, fix a bug in which we could synthesize new `unreachable`s and place them before existing unreachable instructions (like returns) at the end of instruction sequences.
*	[Parser] Parse `memory.copy` and `memory.fill` (#5178)	Thomas Lively	2022-10-21	1	-0/+66
*	[Wasm GC] Support BrOn* in CodePushing (#5177)	Alon Zakai	2022-10-21	1	-0/+80
*	Fix fuzzer to ignore externalize/internalize (#5176)	Alon Zakai	2022-10-21	2	-54/+59
	The fuzzer started to fail on the recent externalize/internalize test that was added in #5175 as we lack interpreter support. Move that to a separate file and ignore it in the fuzzer for now.
*	[Parser] Parse loads and stores (#5174)	Thomas Lively	2022-10-21	1	-3/+218
	Add parsing functions for `memarg`s, the offset and align fields of load and store instructions. These fields are interesting because they are lexically reserved words that need to be further parsed to extract their actual values. On top of that, add support for parsing all of the load and store instructions. This required fixing a buffer overflow problem in the generated parser code and adding more information to the signatures of the SIMD load and store instructions. `SIMDLoadStoreLane` instructions are particularly interesting because they may require backtracking to parse correctly.
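To illustrate why `memarg` fields need a second parsing step, here is a minimal sketch that splits a token such as `offset=16` into its key and numeric value. `parseMemargField` is a hypothetical helper, not the parser's actual function; it assumes plain decimal or `0x`-prefixed hex digits and ignores both the `_` digit separators the text format allows and overflow.

```cpp
#include <cassert>
#include <cstdint>
#include <string>

// If `token` has the form `<key>=<number>`, store the number in `out`
// and return true. Otherwise return false and leave `out` untouched.
bool parseMemargField(const std::string& token,
                      const std::string& key,
                      std::uint64_t& out) {
  assert(!key.empty());
  // Need the key, the '=' sign, and at least one digit.
  if (token.size() <= key.size() + 1) {
    return false;
  }
  if (token.compare(0, key.size(), key) != 0 || token[key.size()] != '=') {
    return false;
  }
  std::string digits = token.substr(key.size() + 1);
  unsigned base = 10;
  if (digits.size() > 2 && digits[0] == '0' && digits[1] == 'x') {
    base = 16;
    digits = digits.substr(2);
  }
  std::uint64_t value = 0;
  for (char c : digits) {
    unsigned digit;
    if (c >= '0' && c <= '9') {
      digit = c - '0';
    } else if (base == 16 && c >= 'a' && c <= 'f') {
      digit = c - 'a' + 10;
    } else if (base == 16 && c >= 'A' && c <= 'F') {
      digit = c - 'A' + 10;
    } else {
      return false; // not a number in the chosen base
    }
    value = value * base + digit;
  }
  out = value;
  return true;
}
```

A lexer that treats `offset=16` as a single reserved word can hand the whole token to a helper like this, once with the key `offset` and once with `align`.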
*	[Wasm GC] Externalize/Internalize allow nulls (#5175)	Alon Zakai	2022-10-21	1	-0/+54
	These are encoded as RefAs operations, and we have optimizations that assume those trap on null, but Externalize/Internalize do not. Skip them there to avoid an error on the type being incorrect later.
*	[Parser] Parse shared memory declarations (#5173)	Thomas Lively	2022-10-21	1	-4/+4
*	[Parser] Parse `memory.size` and `memory.grow` (#5165)	Thomas Lively	2022-10-20	1	-0/+51
	Also add the ability to parse memory indexes to correctly handle the multi-memory versions of these instructions. Add and use a conversion from `Result` to `MaybeResult` as well.
*	[Parser] Parse memories (#5164)	Thomas Lively	2022-10-19	1	-0/+20
	Parse 32-bit and 64-bit memories, including their initial and max sizes. Shared memories are left to a follow-up PR. The memory abbreviation that includes inline data is parsed, but the associated data segment is not yet created. Also do some minor simplifications in neighboring helper functions for other kinds of module elements.
*	[Parser] Parse SIMD ternary expressions and shifts (#5158)	Thomas Lively	2022-10-19	1	-2/+30
*	[Wasm GC] Use Cones in GUFA data reads and writes (#5157)	Alon Zakai	2022-10-19	1	-14/+4
	When we read from a struct/array using a cone type, read from the types in the cone and nothing else. Previously we used the declared type in the wasm, which might be larger (both in the base type and the depth). Likewise, in a write. To do this, this extends ConeReadLocation with a depth (previously the depth there was assumed to be infinite, and now it is to a potentially limited depth). After this we are fully utilizing cone types in GUFA, as the test changes show (or at least I can't think of any other uses of cones).
*	[C API] Align I31ref and Dataref to be nullable (#5153)	dcode	2022-10-19	2	-6/+6
	The C API still returned non-nullable types for `dataref` (`ref data` instead of `ref null data`) and `i31ref` (`ref i31` instead of `ref null i31`). This PR aligns with the current state of the GC proposal, making them nullable when obtained via the C API.
*	[Parser] Parse SIMD lane manipulation instructions (#5156)	Thomas Lively	2022-10-19	1	-0/+40
	Including all `SIMDExtract`, `SIMDReplace`, `SIMDShuffle` expressions.
*	[Parser] Parse global.get and global.set (#5155)	Thomas Lively	2022-10-19	1	-2/+12
	Also add some missing error checking for the similar local instructions and make some neighboring styling more consistent.
*	[Wasm GC] Filter GUFA expression locations by their type (#5149)	Alon Zakai	2022-10-18	2	-2/+107
	Now that we have a cone type, we are able to represent in PossibleContents the natural content of a wasm location: a type or any of its subtypes. This allows us to enforce the wasm typing rules, that is, to filter the data arriving at a location by the wasm type of the location. Technically this could be unnecessary if we had full implementations of flowFoo and so forth, that is, tailored code for each wasm expression that makes sure we only contain and flow content that fits in the wasm type. Atm we don't have that, and until the wasm spec stabilizes it's probably not worth the effort. Instead, simply filter based on the type, which gives the same result (though it does take a little more work; I measured it at 3% or so of runtime). While doing so normalize cones to their actual maximum depth, which simplifies things and will help more later as well.
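As a rough illustration of filtering by a cone, the sketch below models a type hierarchy as a parent array and a cone as a base type plus a depth limit. All names here (`Hierarchy`, `Cone`, `inCone`, `filterByCone`) are hypothetical and far simpler than PossibleContents; the sketch only shows the membership test the message alludes to.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// parent[i] is the index of type i's direct supertype, or -1 for a root.
using Hierarchy = std::vector<int>;

// A cone: `base` itself plus its subtypes up to `depth` levels below it.
struct Cone {
  int base;
  int depth;
};

// True if `type` is `cone.base` or a subtype at most `cone.depth` levels
// below it.
bool inCone(const Hierarchy& parent, int type, Cone cone) {
  int steps = 0;
  while (type != -1) {
    assert(static_cast<std::size_t>(type) < parent.size());
    if (type == cone.base) {
      return steps <= cone.depth;
    }
    type = parent[type];
    ++steps;
  }
  return false;
}

// Keep only the incoming types that fit the cone of the location.
std::vector<int> filterByCone(const Hierarchy& parent,
                              const std::vector<int>& incoming,
                              Cone cone) {
  std::vector<int> kept;
  for (int type : incoming) {
    if (inCone(parent, type, cone)) {
      kept.push_back(type);
    }
  }
  return kept;
}
```

In this model, a location whose declared type is type 1 with depth 1 accepts type 1 and its direct subtypes and drops everything else that arrives.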
*	[C API] Add bottom heap types and array heap type (#5150)	dcode	2022-10-18	2	-0/+42
	Adds `BinaryenHeapTypeNone`, `BinaryenHeapTypeNoext` and `BinaryenHeapTypeNofunc` to obtain the bottom heap types. Also adds `BinaryenHeapTypeIsBottom` to test whether a given heap type is a bottom type, and `BinaryenHeapTypeGetBottom` to obtain the respective bottom type given a heap type.
*	Parse and emit `array.len` without a type annotation (#5151)	Thomas Lively	2022-10-18	10	-19/+53
	Test that we can still parse the old annotated form as well.
*	Implement `array` basic heap type (#5148)	Thomas Lively	2022-10-18	5	-43/+136
	`array` is the supertype of all defined array types and for now is a subtype of `data`. (Once `data` becomes `struct` this will no longer be true.) Update the binary and text parsing of `array.len` to ignore the obsolete type annotation and update the binary emitting to emit a zero in place of the old type annotation and the text printing to print an arbitrary heap type for the annotation. A follow-on PR will add support for the newer unannotated version of `array.len`.
*	Exhaustively test basic heap type relationships (#5147)	Thomas Lively	2022-10-17	1	-0/+199
	As the number of basic heap types has grown, the complexity of the subtype and LUB calculations has grown as well. To ensure that they are correct, test the complete matrix of basic types and trivial user-defined types. Fix the subtype calculation to make string types subtypes of `any` to make the test pass.
*	[GUFA] Add some tests for #5142 (#5146)	Alon Zakai	2022-10-17	1	-0/+130
*	Binary format: Don't emit empty Memory sections (#5145)	Alon Zakai	2022-10-17	3	-2/+12
	If the only memories are imported, we don't need the section. We were already doing that for tables, functions, etc.
*	[Wasm GC][GUFA] Avoid Many in roots (#5142)	Alon Zakai	2022-10-13	1	-13/+13
	Instead of Many, use a proper Cone Type for the data, as appropriate.
*	[Parser] Validate type annotations on `select` (#5139)	Thomas Lively	2022-10-13	1	-1/+1
	Since the type annotations are not stored explicitly in Binaryen IR, we have to validate them in the parser. Implement this and fix a newly-caught incorrect annotation in the tests.
*	Add "struct" and "structref" as an alias for "data" and "dataref" (#5141)	Thomas Lively	2022-10-13	1	-0/+13
	In the upstream spec, `data` has been replaced with a type called `struct`. To allow for a graceful update in Binaryen, start by introducing "struct" as an alias for "data". Once users have stopped emitting `data` directly, future PRs will remove `data` and update the subtyping so that arrays are no longer subtypes of `struct`.
*	[Parser] Parse `local.set` and `local.tee` (#5138)	Thomas Lively	2022-10-13	1	-3/+6
*	[Wasm GC] Add a getMaxDepths() helper for heap types (#5134)	Alon Zakai	2022-10-13	1	-0/+29
	This computes how deep the children of a heap type are. This will be useful in cone type optimizations, since we want to "normalize" cones: a cone of depth infinity can just be a cone of the actual maximum depth of existing children, etc., and it's simpler to have a single canonical representation to avoid extra work.
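One way to picture the computation is the sketch below, which derives the maximum subtype depth of every type from a parent array and then clamps a requested cone depth to it. `maxDepths` and `normalizeDepth` are illustrative names, and the parent-array representation is an assumption made for the example rather than how Binaryen stores heap types.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// parent[i] is the index of type i's direct supertype, or -1 for a root.
// Returns, for each type, the depth of its deepest subtype (0 if none).
std::vector<int> maxDepths(const std::vector<int>& parent) {
  std::vector<int> depths(parent.size(), 0);
  for (std::size_t i = 0; i < parent.size(); ++i) {
    int steps = 0;
    // Walk up from type i; each ancestor has a subtype `steps` levels down.
    for (int t = parent[i]; t != -1; t = parent[t]) {
      assert(static_cast<std::size_t>(t) < parent.size());
      ++steps;
      depths[t] = std::max(depths[t], steps);
    }
  }
  return depths;
}

// Clamp a requested cone depth (possibly a very large "infinite" value)
// to the deepest subtype that actually exists below `base`.
int normalizeDepth(const std::vector<int>& depths, int base, int requested) {
  return std::min(requested, depths[base]);
}
```

Two cones over the same base whose requested depths both exceed the real maximum then normalize to the same value, which gives the single canonical representation the message mentions.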
*	[Parser] Parse `local.get` (#5137)	Thomas Lively	2022-10-13	1	-44/+100
	This requires parsing local indices and fixing a bug in `Function::setLocalName` where it only set up the mapping from index to name and not the mapping from name to index.
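The kind of bug described here, where only one direction of a two-way mapping is updated, can be illustrated with a small stand-alone sketch. `LocalNames` is a hypothetical type written for this example and is not the code in `Function::setLocalName`.

```cpp
#include <cassert>
#include <map>
#include <string>

// Hypothetical two-way mapping between local indices and names.
struct LocalNames {
  std::map<unsigned, std::string> indexToName;
  std::map<std::string, unsigned> nameToIndex;

  // Update both directions together so lookups by name and by index
  // can never disagree.
  void setName(unsigned index, const std::string& name) {
    // Drop the stale reverse entry if this index already had a name.
    auto byIndex = indexToName.find(index);
    if (byIndex != indexToName.end()) {
      nameToIndex.erase(byIndex->second);
    }
    // Drop the stale forward entry if this name belonged to another index.
    auto byName = nameToIndex.find(name);
    if (byName != nameToIndex.end()) {
      indexToName.erase(byName->second);
    }
    indexToName[index] = name;
    nameToIndex[name] = index;
    assert(indexToName.size() == nameToIndex.size());
  }
};
```

Setting only `indexToName` would let printing by index work while a name-based lookup, such as the one a `local.get $x` parser needs, fails to find the local.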
*	[Wasm GC] Add GUFA tests for struct reads from cones (#5135)	Alon Zakai	2022-10-13	1	-0/+631
	We had cone tests for ref.eq and ref.cast etc. but not for struct.get.
*	[Parser] Parse instructions with children (#5129)	Thomas Lively	2022-10-12	1	-6/+459
	Parse unary, binary, drop, and select instructions, properly fixing up stacky code, unreachable code, and multivalue code so it can be represented in Binaryen IR.