forks/binaryen.git -

	Commit message (Collapse)	Author	Age	Files	Lines
*	wasm-metadce all the things (#6142)	Alon Zakai	2023-11-30	1	-0/+47
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Remove hardcoded paths for globals/functions/etc. in favor of general code paths that support all the module elements uniformly. As a result of that, we now support all parts of wasm, such as tables and element segments, that we didn't before. This refactoring is NFC aside from adding functionality. Note that this reduces the size of wasm-metadce by 10% while increasing its functionality - the benefits of writing generic code. To support this, add some trivial generic helpers to get or iterate over module elements using their kind in a dynamic manner. Using them might make wasm-metadce slightly slower, but I can't measure any difference.
*	Implement table.copy (#6078)	Alon Zakai	2023-11-06	1	-0/+8
\| \| \|	Helps #5951
*	Typed Continuations: Add cont type (#5998)	Frank Emrich	2023-10-24	1	-0/+1
\| \| \| \| \| \| \| \| \|	This PR is part of a series that adds basic support for the [typed continuations proposal](https://github.com/wasmfx/specfx). This PR adds continuation types, of the form `(cont $foo)` for some function type `$foo`. The only notable changes affecting existing code are the following: - This is the first `HeapType` which has another `HeapType` (rather than, say, a `Type`) as its immediate child. This required fixes to certain traversals that have a flag for being at the toplevel of a type. - Some shared logic for parsing `HeapType`s has been factored out.
*	[typed-cont] Add feature flag (#5996)	Frank Emrich	2023-10-05	1	-0/+1
\| \| \| \| \| \| \|	This PR is part of a series that adds basic support for the [typed continuations proposal](https://github.com/wasmfx/specfx). This particular PR simply extends `FeatureSet` with a corresponding entry for this proposal.
*	Refine ref.test's castType during refinalization (#5985)	Thomas Lively	2023-10-02	1	-0/+2
\| \| \| \| \| \|	Just like we do with other casts, refine the cast type to be the greatest lower bound of its previous cast type and its input type. The difference is that the output type of ref.test remains i32, but it's still useful to retain more precise type information.
*	Implement table.fill (#5949)	Thomas Lively	2023-09-18	1	-0/+9
\| \| \| \| \| \| \| \|	This instruction was standardized as part of the bulk memory proposal, but we never implemented it until now. Leave similar instructions like table.copy as future work. Fixes #5939.
*	Replace I31New with RefI31 everywhere (#5930)	Thomas Lively	2023-09-13	1	-1/+1
\| \| \| \| \| \| \| \|	Globally replace the source string "I31New" with "RefI31" in preparation for renaming the instruction from "i31.new" to "ref.i31", as implemented in the spec in https://github.com/WebAssembly/gc/pull/422. This would be NFC, except that it also changes the string in the external-facing C APIs. A follow-up PR will make the corresponding behavioral change.
*	Rename multimemory flag (#5890)	Ashley Nelson	2023-08-21	1	-1/+1
\| \| \|	Renaming the multimemory flag in Binaryen to match its naming in LLVM.
*	Fix finalization of call_ref to handle refined target types (#5883)	Thomas Lively	2023-08-21	1	-6/+11
\| \| \| \| \| \| \| \| \| \|	Previously CallRef::finalize() would never update the type of the CallRef, even if the type of the call target had been refined to give a more precise result type. Besides unnecessarily losing type information, this could also lead to validation errors, since the validator checks that the type of CallRef matches the result type of the target signature. Fix the bug by updating CallRef's type based on its target signature in CallRef::finalize() and add a test that depends on this refinalization.
*	Further improve ref.cast during finalization (#5882)	Thomas Lively	2023-08-17	1	-16/+11
\| \| \| \| \| \|	We previously improved the nullability and heap type of the ref.cast target type in RefCast::finalize() based on what we knew about its input type. Simplify the code and make this improvement more powerful by using the greatest lower bound of the original cast target and input type.
*	Ensure br_on_cast* target type is subtype of input type (#5881)	Thomas Lively	2023-08-17	1	-0/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The WasmGC spec will require that the target cast type of br_on_cast and br_on_cast_fail be a subtype of the input type, but so far Binaryen has not enforced this constraint, so it could produce invalid modules when optimizations refined the input to a br_on_cast* such that it was no longer a supertype of the cast target type. Fix this problem by setting the cast target type to be the greatest lower bound of the original cast target type and the current input type in `BrOn::finalize()`. This maintains the invariant that the cast target type should be a subtype of the input type and it also does not change cast behavior; any value that could make the original cast succeed at runtime necessarily inhabits both the original cast target type and the input type, so it also must inhabit their greatest lower bound and will make the updated cast succeed as well.
*	[Wasm GC] Automatically make RefCast heap types more precise (#5704)	Alon Zakai	2023-05-05	1	-1/+15
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	We already did this for nullablilty, and so for the same reasons we should do it for heap types as well. Also, I realized that doing so would solve #5703, which is the new test added for TypeRefining here. The fuzz bug solved here is that our analysis of struct gets/sets will skip copy operations - a read from a field that is written into it. And we skip fallthrough values while doing so, since it doesn't matter if the read goes through an if arm or a cast. An if would automatically get a more precise type during refinalize, so this PR does the same for a cast basically. Fixes #5703
*	[NFC] Refactor each of ArrayNewSeg and ArrayInit into subclasses for ↵	Alon Zakai	2023-05-04	1	-2/+17
\| \| \| \| \| \| \| \| \| \| \|	Data/Elem (#5692) ArrayNewSeg => ArrayNewSegData, ArrayNewSegElem ArrayInit => ArrayInitData, ArrayInitElem Basically we remove the opcode and use the class type to differentiate them. This adds some code but it makes the representation simpler and more compact in memory, and it will help with #5690
*	[NFC] Assert that module maps are the right size (#5687)	Alon Zakai	2023-04-25	1	-0/+8
\| \| \| \|	If the names are not unique then the map would be smaller than the vector it is built from.
*	Implement array.fill, array.init_data, and array.init_elem (#5637)	Thomas Lively	2023-04-06	1	-0/+18
\| \| \| \| \|	These complement array.copy, which we already supported, as an initial complete set of bulk array operations. Replace the WIP spec tests with the upstream spec tests, lightly edited for compatibility with Binaryen.
*	Only update functions in optimizeAfterInlining() (#5624)	Alon Zakai	2023-04-05	1	-4/+8
\| \| \| \| \|	This saves the work of freeing and allocating for all the other maps. This is a code path that is used by several passes so it showed up in profiling for #5561
*	[NFC] Internally rename `ArrayInit` to `ArrayNewFixed` (#5526)	Thomas Lively	2023-02-28	1	-1/+1
\| \| \| \| \| \| \| \|	To match the standard instruction name, rename the expression class without changing any parsing or printing behavior. A follow-on PR will take care of the functional side of this change while keeping support for parsing the old name. This change will allow `ArrayInit` to be used as the expression class for the upcoming `array.init_data` and `array.init_elem` instructions.
*	[Strings] Add experimental StringNew variants (#5459)	Alon Zakai	2023-01-26	1	-1/+1
\| \| \| \| \| \|	string.from_code_point makes a string from an int code point. string.new_utf8*_try makes a utf8 string and returns null on a UTF8 encoding error rather than trap.
*	[Wasm GC] Handle an unreachable br_on_cast_fail in DCE (#5418)	Alon Zakai	2023-01-11	1	-1/+4
\| \| \|	Without this we hit an assertion on unreachable not being a heap type.
*	Represent ref.as_{func,data,i31} with RefCast (#5413)	Thomas Lively	2023-01-10	1	-9/+0
\| \| \| \| \| \| \| \| \| \| \| \| \|	These operations are deprecated and directly representable as casts, so remove their opcodes in the internal IR and parse them as casts instead. For now, add logic to the printing and binary writing of RefCast to continue emitting the legacy instructions to minimize test changes. The few test changes necessary are because it is no longer valid to perform a ref.as_func on values outside the func type hierarchy now that ref.as_func is subject to the ref.cast validation rules. RefAsExternInternalize, RefAsExternExternalize, and RefAsNonNull are left unmodified. A future PR may remove RefAsNonNull as well, since it is also expressible with casts.
*	Replace `RefIs` with `RefIsNull` (#5401)	Thomas Lively	2023-01-09	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* Replace `RefIs` with `RefIsNull` The other `ref.is` instructions are deprecated and expressible in terms of `ref.test`. Update binary and text parsing to parse those instructions as `RefTest` expressions. Also update the printing and emitting of `RefTest` expressions to emit the legacy instructions for now to minimize test changes and make this a mostly non-functional change. Since `ref.is_null` is the only `RefIs` instruction left, remove the `RefIsOp` field and rename the expression class to `RefIsNull`. The few test changes are due to the fact that `ref.is` instructions are now subject to `ref.test` validation, and in particular it is no longer valid to perform a `ref.is_func` on a value outside of the `func` type hierarchy.
*	Consolidate br_on* operations (#5399)	Thomas Lively	2023-01-06	1	-25/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The `br_on{_non}_{data,i31,func}` operations are deprecated and directly representable in terms of the new `br_on_cast` and `br_on_cast_fail` instructions, so remove their dedicated IR opcodes in favor of representing them as casts. `br_on_null` and `br_on_non_null` cannot be consolidated the same way because their behavior is not directly representable in terms of `br_on_cast` and `br_on_cast_fail`; when the cast to null bottom type succeeds, the null check instructions implicitly drop the null value whereas the cast instructions would propagate it. Add special logic to the binary writer and printer to continue emitting the deprecated instructions for now. This will allow us to update the test suite in a separate future PR with no additional functional changes. Some tests are updated because the validator no longer allows passing non-func data to `br_on_func`. Doing so has not made sense since we separated the three reference type hierarchies.
*	Support br_on_cast null (#5397)	Thomas Lively	2023-01-05	1	-12/+34
\| \| \| \| \| \| \| \| \|	As well as br_on_cast_fail null. Unlike the existing br_on_cast* instructions, these new instructions treat the cast as succeeding when the input is a null. Update the internal representation of the cast type in `BrOn` expressions to be a `Type` rather than a `HeapType` so it will include nullability information. Also update and improve `RemoveUnusedBrs` to handle the new instructions correctly and optimize in more cases.
*	Allow non-nullable ref.cast of nullable references (#5386)	Thomas Lively	2023-01-04	1	-0/+1
\| \| \| \| \| \| \|	This new cast configuration was not expressible with the legacy cast instructions. Although it is valid in Wasm, do not allow nullable casts of non-nullable references, since those would unnecessarily lose type information. Convert such casts to be non-nullable during expression finalization.
*	Update RefCast representation to drop extra HeapType (#5350)	Thomas Lively	2022-12-20	1	-4/+4
\| \| \| \| \| \| \| \| \|	The latest upstream version of ref.cast is parameterized with a target reference type, not just a heap type, because the nullability of the result is parameterizable. As a first step toward implementing these new, more flexible ref.cast instructions, change the internal representation of ref.cast to use the expression type as the cast target rather than storing a separate heap type field. For now require that the encoded semantics match the previously allowed semantics, though, so that none of the optimization passes need to be updated.
*	Rename UserSection -> CustomSection. NFC (#5288)	Sam Clegg	2022-11-22	1	-2/+2
\| \| \|	This reflects that naming used in the spec.
*	Implement `array.new_data` and `array.new_elem` (#5214)	Thomas Lively	2022-11-07	1	-1/+6
\| \| \| \| \| \| \| \| \|	In order to test them, fix the binary and text parsers to accept passive data segments even if a module has no memory. In addition to parsing and emitting the new instructions, also implement their validation and interpretation. Test the interpretation directly with wasm-shell tests adapted from the upstream spec tests. Running the upstream spec tests directly would require fixing too many bugs in the legacy text parser, so it will have to wait for the new text parser to be ready.
*	[Parser] Parse `local.get` (#5137)	Thomas Lively	2022-10-13	1	-0/+1
\| \| \| \| \|	This requires parsing local indices and fixing a bug in `Function::setLocalName` where it only set up the mapping from index to name and not the mapping from name to index.
*	Implement bottom heap types (#5115)	Thomas Lively	2022-10-07	1	-3/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	These types, `none`, `nofunc`, and `noextern` are uninhabited, so references to them can only possibly be null. To simplify the IR and increase type precision, introduce new invariants that all `ref.null` instructions must be typed with one of these new bottom types and that `Literals` have a bottom type iff they represent null values. These new invariants requires several additional changes. First, it is now possible that the `ref` or `target` child of a `StructGet`, `StructSet`, `ArrayGet`, `ArraySet`, or `CallRef` instruction has a bottom reference type, so it is not possible to determine what heap type annotation to emit in the binary or text formats. (The bottom types are not valid type annotations since they do not have indices in the type section.) To fix that problem, update the printer and binary emitter to emit unreachables instead of the instruction with undetermined type annotation. This is a valid transformation because the only possible value that could flow into those instructions in that case is null, and all of those instructions trap on nulls. That fix uncovered a latent bug in the binary parser in which new unreachables within unreachable code were handled incorrectly. This bug was not previously found by the fuzzer because we generally stop emitting code once we encounter an instruction with type `unreachable`. Now, however, it is possible to emit an `unreachable` for instructions that do not have type `unreachable` (but are known to trap at runtime), so we will continue emitting code. See the new test/lit/parse-double-unreachable.wast for details. Update other miscellaneous code that creates `RefNull` expressions and null `Literals` to maintain the new invariants as well.
*	Remove some unused constants. NFC (#5072)	Sam Clegg	2022-09-22	1	-3/+0
\| \| \| \| \|	TABLE_BASE usage was removed in #3211. MEMORY_BASE usage was removed in #3089. NEW_SIZE usage was removed in #3180.
*	Remove typed-function-references feature (#5030)	Thomas Lively	2022-09-09	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	In practice typed function references will not ship before GC and is not independently useful, so it's not necessary to have a separate feature for it. Roll the functionality previously enabled by --enable-typed-function-references into --enable-gc instead. This also avoids a problem with the ongoing implementation of the new GC bottom heap types. That change will make all ref.null instructions in Binaryen IR refer to one of the bottom heap types. But since those bottom types are introduced in GC, it's not valid to emit them in binaries unless unless GC is enabled. The fix if only reference types is enabled is to emit (ref.null func) instead of (ref.null nofunc), but that doesn't always work if typed function references are enabled because a function type more specific than func may be required. Getting rid of typed function references as a separate feature makes this a nonissue.
*	[Wasm GC] Fix GlobalTypeOptimization fuzz bug on replacing unreachable ↵	Alon Zakai	2022-09-06	1	-1/+1
\| \| \| \| \| \|	struct.set (#5021) We replaced an unreachable struct.set with something reachable, which can break validation in corner cases.
*	Add JavaScript promise integration (JSPI) pass. (#4961)	Brendan Dahl	2022-09-02	1	-0/+4
\| \| \| \| \| \| \|	Add a pass that wraps all imports and exports with functions that handle storing and passing along the suspender externref needed for JSPI. https://github.com/WebAssembly/js-promise-integration/blob/main/proposals/js-promise-integration/Overview.md
*	Implement `extern.externalize` and `extern.internalize` (#4975)	Thomas Lively	2022-08-29	1	-0/+6
\| \| \| \|	These new GC instructions infallibly convert between `extern` and `any` references now that those types are not in the same hierarchy.
*	Adding Multi-Memories Wasm Feature (#4968)	Ashley Nelson	2022-08-25	1	-0/+1
\| \| \|	Adding multi-memories to the the list of wasm-features.
*	[Wasm GC] Fix TypeRefining on fallthrough values via tee (#4900)	Alon Zakai	2022-08-18	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	A rather tricky corner case: we normally look at fallthrough values for copies of fields, so when we try to refine a field, we ignore stuff like this: a.x = b.x; That copies the same field on the same type to itself, so refining is not limited by it. But if we have something else in the middle, and that thing cannot change type, then it is a problem, like this: (struct.set (..ref..) (local.tee $temp (struct.get))) tee has the type of the local, which does not change in this pass. So we can't look at just the fallthrough here and skip the tee: after refining the field, the tee's old type might not fit in the field's new type. We could perhaps add casts to fix things up, but those may have too big a cost. For now, just ignore the fallthrough.
*	Mutli-Memories Support in IR (#4811)	Ashley Nelson	2022-08-17	1	-3/+29
\| \| \| \| \| \| \|	This PR removes the single memory restriction in IR, adding support for a single module to reference multiple memories. To support this change, a new memory name field was added to 13 memory instructions in order to identify the memory for the instruction. It is a goal of this PR to maintain backwards compatibility with existing text and binary wasm modules, so memory indexes remain optional for memory instructions. Similarly, the JS API makes assumptions about which memory is intended when only one memory is present in the module. Another goal of this PR is that existing tests behavior be unaffected. That said, tests must now explicitly define a memory before invoking memory instructions or exporting a memory, and memory names are now printed for each memory instruction in the text format. There remain quite a few places where a hardcoded reference to the first memory persist (memory flattening, for example, will return early if more than one memory is present in the module). Many of these call-sites, particularly within passes, will require us to rethink how the optimization works in a multi-memories world. Other call-sites may necessitate more invasive code restructuring to fully convert away from relying on a globally available, single memory pointer.
*	LegalizeJSInterface: Look for get/setTempRet0 as exports (#4881)	Sam Clegg	2022-08-15	1	-2/+0
\| \| \| \| \| \|	This allows emscripten to move these helper functions from JS library imports to native wasm exports. See https://github.com/emscripten-core/emscripten/issues/7273
*	Remove RTTs (#4848)	Thomas Lively	2022-08-05	1	-59/+7
\| \| \| \| \| \| \|	RTTs were removed from the GC spec and if they are added back in in the future, they will be heap types rather than value types as in our implementation. Updating our implementation to have RTTs be heap types would have been more work than deleting them for questionable benefit since we don't know how long it will be before they are specced again.
*	[Strings] GC variants for string.encode (#4817)	Alon Zakai	2022-07-21	1	-1/+2
\|
*	Remove basic reference types (#4802)	Thomas Lively	2022-07-20	1	-6/+6
\| \| \| \| \| \| \| \| \|	Basic reference types like `Type::funcref`, `Type::anyref`, etc. made it easy to accidentally forget to handle reference types with the same basic HeapTypes but the opposite nullability. In principle there is nothing special about the types with shorthands except in the binary and text formats. Removing these shorthands from the internal type representation by removing all basic reference types makes some code more complicated locally, but simplifies code globally and encourages properly handling both nullable and non-nullable reference types.
*	[Strings] Add string.new GC variants (#4813)	Alon Zakai	2022-07-19	1	-1/+2
\|
*	[Strings] stringview_*.slice (#4805)	Alon Zakai	2022-07-15	1	-0/+17
\| \| \| \| \| \| \|	Unfortunately one slice is the same as python [start:end], using 2 params, and the other slice is one param, [CURR:CURR+num] (where CURR is implied by the current state in the iter). So we can't use a single class here. Perhaps a different name would be good, like slice vs substring (like JS does), but I picked names to match the current spec.
*	[Strings] stringview access operations (#4798)	Alon Zakai	2022-07-13	1	-0/+33
\|
*	[Strings] string.as (#4797)	Alon Zakai	2022-07-12	1	-0/+20
\|
*	[Strings] string.eq (#4781)	Alon Zakai	2022-07-08	1	-0/+8
\|
*	[Strings] string.concat (#4777)	Alon Zakai	2022-07-08	1	-0/+8
\|
*	[Strings] string.encode (#4776)	Alon Zakai	2022-07-07	1	-0/+8
\|
*	[Strings] string.measure (#4775)	Alon Zakai	2022-07-07	1	-0/+8
\|
*	[Strings] Add string.const (#4768)	Alon Zakai	2022-07-06	1	-0/+2
\| \| \| \| \|	This is more work than a typical instruction because it also adds a new section: all the (string.const "foo") strings are put in a new "strings" section in the binary, and the instructions refer to them by index.