forks/binaryen.git -

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Support br_on_cast null (#5397)	Thomas Lively	2023-01-05	1	-3/+2
\| \| \| \| \| \| \| \| \|	As well as br_on_cast_fail null. Unlike the existing br_on_cast* instructions, these new instructions treat the cast as succeeding when the input is a null. Update the internal representation of the cast type in `BrOn` expressions to be a `Type` rather than a `HeapType` so it will include nullability information. Also update and improve `RemoveUnusedBrs` to handle the new instructions correctly and optimize in more cases.
*	Support `ref.test null` (#5368)	Thomas Lively	2022-12-21	1	-3/+2
\| \| \|	This new variant of ref.test returns 1 if the input is null.
*	Update RefCast representation to drop extra HeapType (#5350)	Thomas Lively	2022-12-20	1	-2/+8
\| \| \| \| \| \| \| \| \|	The latest upstream version of ref.cast is parameterized with a target reference type, not just a heap type, because the nullability of the result is parameterizable. As a first step toward implementing these new, more flexible ref.cast instructions, change the internal representation of ref.cast to use the expression type as the cast target rather than storing a separate heap type field. For now require that the encoded semantics match the previously allowed semantics, though, so that none of the optimization passes need to be updated.
*	Rename UserSection -> CustomSection. NFC (#5288)	Sam Clegg	2022-11-22	1	-2/+2
\| \| \|	This reflects that naming used in the spec.
*	Switch from `typedef` to `using` in C++ code. NFC (#5258)	Sam Clegg	2022-11-15	1	-4/+4
\| \| \| \|	This is more modern and (IMHO) easier to read than that old C typedef syntax.
*	Implement `array.new_data` and `array.new_elem` (#5214)	Thomas Lively	2022-11-07	1	-0/+18
\| \| \| \| \| \| \| \| \|	In order to test them, fix the binary and text parsers to accept passive data segments even if a module has no memory. In addition to parsing and emitting the new instructions, also implement their validation and interpretation. Test the interpretation directly with wasm-shell tests adapted from the upstream spec tests. Running the upstream spec tests directly would require fixing too many bugs in the legacy text parser, so it will have to wait for the new text parser to be ready.
*	Multi-Memories Lowering Pass (#5107)	Ashley Nelson	2022-11-01	1	-0/+2
\| \| \| \| \| \| \| \| \| \|	Adds a multi-memories lowering pass that will create a single combined memory from the memories added to the module. This pass assumes that each memory is configured the same (type, shared). This pass also: - replaces existing memory.size instructions with a custom function that returns the size of each memory as if they existed independently - replaces existing memory.grow instructions with a custom function, using global offsets to track the page size of each memory so data doesn't overlap in the singled combined memory - adjusts the offsets of active data segments - adjusts the offsets of Loads/Stores
*	[NFC] Add nullptr init for ElementSegment offset (#5168)	Alon Zakai	2022-10-20	1	-1/+1
\| \| \| \| \|	I believe all locations that create one already set it (or else we'd see errors), but it's not easy to see that when reading the code. And other similar locations (like DataSegment) do initialize to null, so do so for consistency.
*	Make `Name` a pointer, length pair (#5122)	Thomas Lively	2022-10-11	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	With the goal of supporting null characters (i.e. zero bytes) in strings. Rewrite the underlying interned `IString` to store a `std::string_view` rather than a `const char`, reduce the number of map lookups necessary to intern a string, and present a more immutable interface. Most importantly, replace the `c_str()` method that returned a `const char` with a `toString()` method that returns a `std::string`. This new method can correctly handle strings containing null characters. A `const char` can still be had by calling `data()` on the `std::string_view`, although this usage should be discouraged. This change is NFC in spirit, although not in practice. It does not intend to support any particular new functionality, but it is probably now possible to use strings containing null characters in at least some cases. At least one parser bug is also incidentally fixed. Follow-on PRs will explicitly support and test strings containing nulls for particular use cases. The C API still uses `const char` to represent strings. As strings containing nulls become better supported by the rest of Binaryen, this will no longer be sufficient. Updating the C and JS APIs to use pointer, length pairs is left as future work.
*	Fix bugs with copying expressions (#5099)	Thomas Lively	2022-09-30	1	-0/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	It does not make sense to construct an `Expression` directly because all expressions must be specific expressions. However, we previously allowed constructing Expressions, and in particular we allowed them to be copy constructed. Unrelatedly, `Fatal::operator<<` took its argument by value. Together, these two facts produced UB when printing Expressions in fatal error messages because a new Expression would be copy constructed with the original expression ID but without any of the actual data from the original specific expression. For example, when trying to print a Block, the printing code would try to look at the expression list, but the expression list would be junk stack data because the copied Expression does not contain an expression list. Fix the problem by making Expression's constructors visible only to its subclasses and making `Fatal::operator<<` take its argument by forwarding reference instead of by value.
*	Add JavaScript promise integration (JSPI) pass. (#4961)	Brendan Dahl	2022-09-02	1	-0/+1
\| \| \| \| \| \| \|	Add a pass that wraps all imports and exports with functions that handle storing and passing along the suspender externref needed for JSPI. https://github.com/WebAssembly/js-promise-integration/blob/main/proposals/js-promise-integration/Overview.md
*	Implement `extern.externalize` and `extern.internalize` (#4975)	Thomas Lively	2022-08-29	1	-0/+2
\| \| \| \|	These new GC instructions infallibly convert between `extern` and `any` references now that those types are not in the same hierarchy.
*	Mutli-Memories Support in IR (#4811)	Ashley Nelson	2022-08-17	1	-4/+23
\| \| \| \| \| \| \|	This PR removes the single memory restriction in IR, adding support for a single module to reference multiple memories. To support this change, a new memory name field was added to 13 memory instructions in order to identify the memory for the instruction. It is a goal of this PR to maintain backwards compatibility with existing text and binary wasm modules, so memory indexes remain optional for memory instructions. Similarly, the JS API makes assumptions about which memory is intended when only one memory is present in the module. Another goal of this PR is that existing tests behavior be unaffected. That said, tests must now explicitly define a memory before invoking memory instructions or exporting a memory, and memory names are now printed for each memory instruction in the text format. There remain quite a few places where a hardcoded reference to the first memory persist (memory flattening, for example, will return early if more than one memory is present in the module). Many of these call-sites, particularly within passes, will require us to rethink how the optimization works in a multi-memories world. Other call-sites may necessitate more invasive code restructuring to fully convert away from relying on a globally available, single memory pointer.
*	[Strings] string.new.array methods have start:end arguments (#4888)	Alon Zakai	2022-08-09	1	-0/+4
\|
*	Remove RTTs (#4848)	Thomas Lively	2022-08-05	1	-59/+0
\| \| \| \| \| \| \|	RTTs were removed from the GC spec and if they are added back in in the future, they will be heap types rather than value types as in our implementation. Updating our implementation to have RTTs be heap types would have been more work than deleting them for questionable benefit since we don't know how long it will be before they are specced again.
*	[Strings] GC variants for string.encode (#4817)	Alon Zakai	2022-07-21	1	-0/+10
\|
*	Remove basic reference types (#4802)	Thomas Lively	2022-07-20	1	-3/+5
\| \| \| \| \| \| \| \| \|	Basic reference types like `Type::funcref`, `Type::anyref`, etc. made it easy to accidentally forget to handle reference types with the same basic HeapTypes but the opposite nullability. In principle there is nothing special about the types with shorthands except in the binary and text formats. Removing these shorthands from the internal type representation by removing all basic reference types makes some code more complicated locally, but simplifies code globally and encourages properly handling both nullable and non-nullable reference types.
*	[Strings] Add string.new GC variants (#4813)	Alon Zakai	2022-07-19	1	-2/+12
\|
*	[Strings] stringview_wtf16.length (#4809)	Alon Zakai	2022-07-18	1	-0/+1
\| \| \| \|	This measures the length of a view, so it seems simplest to make it a sub-operation of the existing measure instruction.
*	[Strings] stringview_*.slice (#4805)	Alon Zakai	2022-07-15	1	-0/+31
\| \| \| \| \| \| \|	Unfortunately one slice is the same as python [start:end], using 2 params, and the other slice is one param, [CURR:CURR+num] (where CURR is implied by the current state in the iter). So we can't use a single class here. Perhaps a different name would be good, like slice vs substring (like JS does), but I picked names to match the current spec.
*	[Strings] stringview access operations (#4798)	Alon Zakai	2022-07-13	1	-0/+55
\|
*	[Strings] string.as (#4797)	Alon Zakai	2022-07-12	1	-0/+18
\|
*	[Strings] string.is_usv_sequence (#4783)	Alon Zakai	2022-07-08	1	-0/+1
\| \| \| \| \| \| \|	This implements it as a StringMeasure opcode. They do have the same number of operands, same trapping behavior, and same return type. They both get a string and do some inspection of it to return an i32. Perhaps the name could be StringInspect or something like that, rather than StringMeasure..? But I think for now this might be good enough, and the spec may change anyhow later.
*	[Strings] string.eq (#4781)	Alon Zakai	2022-07-08	1	-0/+11
\|
*	[Strings] string.concat (#4777)	Alon Zakai	2022-07-08	1	-0/+11
\|
*	[Strings] string.encode (#4776)	Alon Zakai	2022-07-07	1	-0/+19
\|
*	[Strings] string.measure (#4775)	Alon Zakai	2022-07-07	1	-0/+18
\|
*	[Strings] Add string.const (#4768)	Alon Zakai	2022-07-06	1	-0/+13
\| \| \| \| \|	This is more work than a typical instruction because it also adds a new section: all the (string.const "foo") strings are put in a new "strings" section in the binary, and the instructions refer to them by index.
*	[Strings] Add string.new* instructions (#4761)	Alon Zakai	2022-06-29	1	-0/+20
\| \| \| \| \| \|	This is the first instruction from the Strings proposal. This includes everything but interpreter support.
*	First class Data Segments (#4733)	Ashley Nelson	2022-06-21	1	-29/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* Updating wasm.h/cpp for DataSegments * Updating wasm-binary.h/cpp for DataSegments * Removed link from Memory to DataSegments and updated module-utils, Metrics and wasm-traversal * checking isPassive when copying data segments to know whether to construct the data segment with an offset or not * Removing memory member var from DataSegment class as there is only one memory rn. Updated wasm-validator.cpp * Updated wasm-interpreter * First look at updating Passes * Updated wasm-s-parser * Updated files in src/ir * Updating tools files * Last pass on src files before building * added visitDataSegment * Fixing build errors * Data segments need a name * fixing var name * ran clang-format * Ensuring a name on DataSegment * Ensuring more datasegments have names * Adding explicit name support * Fix fuzzing name * Outputting data name in wasm binary only if explicit * Checking temp dataSegments vector to validateBinary because it's the one with the segments before we processNames * Pass on when data segment names are explicitly set * Ran auto_update_tests.py and check.py, success all around * Removed an errant semi-colon and corrected a counter. Everything still passes * Linting * Fixing processing memory names after parsed from binary * Updating the test from the last fix * Correcting error comment * Impl kripken@ comments * Impl tlively@ comments * Updated tests that remove data print when == 0 * Ran clang format * Impl tlively@ comments * Ran clang-format
*	Update relaxed SIMD instructions	Thomas Lively	2022-06-07	1	-2/+0
\| \| \| \| \|	Update the opcodes for all relaxed SIMD instructions and remove the unsigned dot product instructions that are no longer in the proposal.
*	Make RefCast safe by default (#4663)	Thomas Lively	2022-05-12	1	-1/+1
\| \| \| \|	This prevents new `RefCast` expressions that don't explicitly have their safety set from getting an unitialized safety value.
*	Add ref.cast_nop_static (#4656)	Thomas Lively	2022-05-11	1	-0/+5
\| \| \| \| \| \|	This unsafe experimental instruction is semantically equivalent to ref.cast_static, but V8 will unsafely turn it into a nop. This is meant to help us measure cast overhead more precisely than we can by globally turning all casts into nops.
*	Implement relaxed SIMD dot product instructions (#4586)	Thomas Lively	2022-04-11	1	-0/+4
\| \| \|	As proposed in https://github.com/WebAssembly/relaxed-simd/issues/52.
*	[SIMD] Make swizzle's opcode name consistent (NFC) (#4585)	Heejin Ahn	2022-04-09	1	-2/+2
\| \| \| \|	Other opcode ends with `Inxm` or `Fnxm` (where n and m are integers), while `i8x16.swizzle`'s opcode name doesn't have an `I` in there.
*	Implement i16x8.relaxed_q15mulr_s (#4583)	Thomas Lively	2022-04-07	1	-0/+1
\| \| \|	As proposed in https://github.com/WebAssembly/relaxed-simd/issues/40.
*	Generate heap type names when printing types (#4503)	Thomas Lively	2022-02-07	1	-8/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The previous printing system in the Types API would print the full recursive structure of a Type or HeapType with special markers using de Bruijn indices to avoid infinite recursion and a separate special marker for when the size exceeded an arbitrary upper limit. In practice, the types printed by that system were not human readable, so all that complexity was not useful. Replace that system with a new system that always emits a HeapType name rather than recursing into the structure of inner HeapTypes. Add methods for printing Types and HeapTypes with custom HeapType name generators. Also add a new wasm-type-printing.h header with off-the-shelf type name generators that implement simple naming schemes sufficient for tests and the type fuzzer. Note that these new printing methods and the old printing methods they augment are not used for emitting text modules. Printing types as part of expressions and modules is handled by separate code in Print.cpp and the printing API modified in this PR is mostly used for debugging. However, the new printing methods are general enough that Print.cpp should be able to use them as well, so update the format used to print types in the modified printing system to match the text format in anticipation of making that change in a follow-up PR.
*	Change from storing Signature to HeapType on CallIndirect (#4352)	Thomas Lively	2021-11-22	1	-13/+1
\| \| \| \| \| \| \| \| \| \| \| \|	With nominal function types, this change makes it so that we preserve the identity of the function type used with call_indirect instructions rather than recreating a function heap type, which may or may not be the same as the originally parsed heap type, from the function signature during module writing. This will simplify the type system implementation by removing the need to store a "canonical" nominal heap type for each unique signature. We previously depended on those canonical types to avoid creating multiple duplicate function types during module writing, but now we aren't creating any new function types at all.
*	Add support for relaxed-simd instructions (#4320)	Ng Zhi An	2021-11-15	1	-0/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds relaxed-simd instructions based on the current status of the proposal https://github.com/WebAssembly/relaxed-simd/blob/main/proposals/relaxed-simd/Overview.md. Binary opcodes are based on what is listed in https://github.com/WebAssembly/relaxed-simd/blob/main/proposals/relaxed-simd/Overview.md#binary-format. Text names are not fixed yet, and some sort sort of names that maps to the non-relaxed versions are chosen for this prototype. Support for these instructions have been added to LLVM via builtins, adding support here will allow Emscripten to successfully compile files that use those builtins. Interpreter support has also been added, and they delegate to the non-relaxed versions of the instructions. Most instructions are implemented in the interpreter the same way as the non-relaxed simd128 instructions, except for fma/fms, which is always fused.
*	[NFC] Create a .cpp file for fuzzer implementation (#4279)	Thomas Lively	2021-10-26	1	-2/+2
\| \| \| \| \| \|	Having a monolithic header file containing all the implementation meant there was no good way to split up the code or introduce new files. The new implementation file and source directory will make it much easier to add new fuzzing functionality in new files.
*	Add table.grow operation (#4245)	Max Graey	2021-10-18	1	-0/+13
\|
*	Add table.size operation (#4224)	Max Graey	2021-10-08	1	-0/+11
\|
*	Emit heap types for call_indirect that match the table (#4221)	Alon Zakai	2021-10-08	1	-0/+12
\| \| \| \| \| \| \| \|	See #4220 - this lets us handle the common case for now of simply having an identical heap type to the table when the signature is identical. With this PR, #4207's optimization of call_ref + table.get into call_indirect now leads to a binary that works in V8 in nominal mode.
*	Add table.set operation (#4215)	Max Graey	2021-10-07	1	-0/+13
\|
*	Implement table.get (#4195)	Alon Zakai	2021-09-30	1	-0/+12
\| \| \| \|	Adds the part of the spec test suite that this passes (without table.set we can't do it all).
*	[Wasm GC] Implement static (rtt-free) StructNew, ArrayNew, ArrayInit (#4172)	Alon Zakai	2021-09-23	1	-3/+12
\| \| \| \| \| \| \| \| \|	See #4149 This modifies the test added in #4163 which used static casts on dynamically-created structs and arrays. That was technically not valid (as we won't want users to "mix" the two forms). This makes that test 100% static, which both fixes the test and gives test coverage to the new instructions added here.
*	[Wasm GC] Add static variants of ref.test, ref.cast, and br_on_cast* (#4163)	Alon Zakai	2021-09-20	1	-4/+23
\| \| \| \| \| \| \| \| \| \| \| \|	These variants take a HeapType that is the type we intend to cast to, and do not take an RTT. These are intended to be more statically optimizable. For now though this PR just implements the minimum to get them parsing and to get through the optimizer without crashing. Spec: https://docs.google.com/document/d/1afthjsL_B9UaMqCA5ekgVmOm75BVFu6duHNsN9-gnXw/edit# See #4149
*	Support new dylink.0 custom section format (#4141)	Sam Clegg	2021-09-11	1	-0/+1
\| \| \| \| \| \| \|	See also: spec change: https://github.com/WebAssembly/tool-conventions/pull/170 llvm change: https://reviews.llvm.org/D109595 wabt change: https://github.com/WebAssembly/wabt/pull/1707 emscripten change: https://github.com/emscripten-core/emscripten/pull/15019
*	[Wasm GC] ArrayInit support (#4138)	Alon Zakai	2021-09-10	1	-0/+11
\| \| \| \| \| \| \|	array.init is like array.new_with_rtt except that it takes as arguments the values to initialize the array with (as opposed to a size and an optional initial value). Spec: https://docs.google.com/document/d/1afthjsL_B9UaMqCA5ekgVmOm75BVFu6duHNsN9-gnXw/edit#
*	[Refactoring] Cleanup asm2wasm. Use JS instead ASM prefix where possible. ↵	Max Graey	2021-09-01	1	-4/+3
\| \| \| \|	NFC (#4090)