forks/binaryen.git -

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	[Strings] string.measure (#4775)	Alon Zakai	2022-07-07	1	-1/+4
\|
*	[Strings] Add string.const (#4768)	Alon Zakai	2022-07-06	1	-0/+3
\| \| \| \| \|	This is more work than a typical instruction because it also adds a new section: all the (string.const "foo") strings are put in a new "strings" section in the binary, and the instructions refer to them by index.
*	[Strings] Add string.new* instructions (#4761)	Alon Zakai	2022-06-29	1	-0/+3
\| \| \| \| \| \|	This is the first instruction from the Strings proposal. This includes everything but interpreter support.
*	First class Data Segments (#4733)	Ashley Nelson	2022-06-21	1	-10/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* Updating wasm.h/cpp for DataSegments * Updating wasm-binary.h/cpp for DataSegments * Removed link from Memory to DataSegments and updated module-utils, Metrics and wasm-traversal * checking isPassive when copying data segments to know whether to construct the data segment with an offset or not * Removing memory member var from DataSegment class as there is only one memory rn. Updated wasm-validator.cpp * Updated wasm-interpreter * First look at updating Passes * Updated wasm-s-parser * Updated files in src/ir * Updating tools files * Last pass on src files before building * added visitDataSegment * Fixing build errors * Data segments need a name * fixing var name * ran clang-format * Ensuring a name on DataSegment * Ensuring more datasegments have names * Adding explicit name support * Fix fuzzing name * Outputting data name in wasm binary only if explicit * Checking temp dataSegments vector to validateBinary because it's the one with the segments before we processNames * Pass on when data segment names are explicitly set * Ran auto_update_tests.py and check.py, success all around * Removed an errant semi-colon and corrected a counter. Everything still passes * Linting * Fixing processing memory names after parsed from binary * Updating the test from the last fix * Correcting error comment * Impl kripken@ comments * Impl tlively@ comments * Updated tests that remove data print when == 0 * Ran clang format * Impl tlively@ comments * Ran clang-format
*	Update relaxed SIMD instructions	Thomas Lively	2022-06-07	1	-2/+0
\| \| \| \| \|	Update the opcodes for all relaxed SIMD instructions and remove the unsigned dot product instructions that are no longer in the proposal.
*	[NFC] Make Literal::makeNull take a HeapType (#4664)	Alon Zakai	2022-05-13	1	-3/+3
\| \| \| \| \| \| \| \|	Taking a Type is redundant as we only care about the heap type - the nullability must be Nullable. This avoids needing an assertion in the function, that is, it makes the API more type-safe.
*	Remove externref (#4633)	Thomas Lively	2022-05-04	1	-3/+1
\| \| \| \| \| \|	Remove `Type::externref` and `HeapType::ext` and replace them with uses of anyref and any, respectively, now that we have unified these types in the GC proposal. For backwards compatibility, continue to parse `extern` and `externref` and maintain their relevant C API functions.
*	Implement relaxed SIMD dot product instructions (#4586)	Thomas Lively	2022-04-11	1	-1/+6
\| \| \|	As proposed in https://github.com/WebAssembly/relaxed-simd/issues/52.
*	[SIMD] Make swizzle's opcode name consistent (NFC) (#4585)	Heejin Ahn	2022-04-09	1	-2/+2
\| \| \| \|	Other opcode ends with `Inxm` or `Fnxm` (where n and m are integers), while `i8x16.swizzle`'s opcode name doesn't have an `I` in there.
*	Implement i16x8.relaxed_q15mulr_s (#4583)	Thomas Lively	2022-04-07	1	-0/+1
\| \| \|	As proposed in https://github.com/WebAssembly/relaxed-simd/issues/40.
*	Interpreter: Remove GlobalManager (#4486)	Alon Zakai	2022-01-31	1	-26/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	GlobalManager is another class that added complexity in the interpreter logic, and did not help. In fact it hurts extensibility, as when one wants to extend the interpreter one has another class to customize, and it is templated on the main runner, so again as #4479 we end up with annoying template cycles. This simply removes that class. That makes the interpreter code strictly simpler. Applying that change to wasm-ctor-eval also ends up fixing a pre-existing bug, so this PR gets testing through that. The ctor-eval issue was that we did not extend the GlobalManager properly in the past: we checked for accesses on imported globals there, but not in the main class, i.e., not on global.get operations. Needing to do things in two places is an example of the previous complexity. The fix is simply to implement visitGlobalGet in one place, and remove all the GlobalManager logic added in ctor-eval, which then gets a lot simpler as well. The new imported-global-2.wast checks for that bug (a global.get of an import should stop us from evalling). Existing tests cover the other cases, like it being ok to read a non-imported global, etc. The existing test indirect-call3.wast required a slight change: There was a global.get of an imported global, which was ignored in the place it happened (an init of an elem segment); the new code checks all global.gets, so it now catches that.
*	[NFC] Refactor ModuleInstanceBase+RuntimeExpressionRunner into a single ↵	Alon Zakai	2022-01-28	1	-877/+858
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	class (#4479) As recently discussed, the interpreter code is way too complex. Trying to add ctor-eval stuff I need, I got stuck and ended up spending some time to get rid of some of the complexity. We had a ModuleInstanceBase class which was basically an instance of a module, that is, an execution of it. And internally we have RuntimeExpressionRunner which is a runner that integrates with the ModuleInstanceBase - basically, it uses the runtime info to execute code. For example, the MIB has globals info, and the RER would read it from there. But these two classes are really just one functionality - an execution of a module. We get rid of some complexity by removing the separation between them, ending up with a class that can run a module. One set of problems we avoid is that we can now extend the single class in a simple way. Before, we would need to extend both - and inform each other of those changes. That gets "fun" with CRTP which we use everywhere. In other words, each of the two classes depended on the other / would need to be templated on the other. Specifically, MIB.callFunction would need to be given the RER to run with, and so that would need to be templated on it. This ends up leading to a bunch more templating all around - all complexity that we just don't need. See the simplification to the wasm-ctor-eval for some of that (and even worse complexity would have been needed without this PR in the next steps for that tool to eval GC stuff). The final single class is now called ModuleRunner. Also fixes a pre-existing issue uncovered by this PR. We had the delegate target on the runner, but it should be tied to a function scope. This happened to not be a problem if one always created a new runner for each scope, but this PR makes the runner longer-lived, so the stale data ended up mattering. The PR moves that data to the proper place. Note: Diff without whitespace is far, far smaller.
*	[NFC] Templatize/generalize RuntimeExpressionRunner (#4477)	Alon Zakai	2022-01-26	1	-9/+20
\| \| \| \| \| \| \| \| \|	Add a base class for it, that is templated and can be extended in general ways, and make callFunction templated on the runner to use as well. This allows the interpreter's behavior to be customized in a way that we couldn't so far. wasm-ctor-eval wants to use a special Runner when it evals a function, so that it can track certain operations, which this will enable.
*	LiteralList => Literals (#4451)	Alon Zakai	2022-01-13	1	-18/+14
\| \| \| \| \| \| \|	LiteralList overlaps with Literals, but is less efficient as it is not a SmallVector. Add reserve/capacity methods to SmallVector which are now necessary to compile.
*	[ctor-eval] Partial evaluation (#4438)	Alon Zakai	2022-01-11	1	-3/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This lets us eval part of a function but not all, which is necessary to handle real-world things like __wasm_call_ctors in LLVM output, as that is the single ctor that is exported and it has calls to the actual ctors. To do so, we look for a toplevel block and execute its items one by one, in a FunctionScope. If we stop in the middle, then we are performing a partial eval. In that case, we only remove the parts of the function that we removed, and we also serialize the locals whose values we read from the FunctionScope. For example, consider this: function foo() { return 10; } function __wasm_call_ctors() { var x; x = foo(); x++; // We stop evalling here. import1(); import2(x); } We can eval x = foo() and x++, but we must stop evalling when we reach the first of those imports. The partially-evalled function then looks like this: function __wasm_call_ctors() { var x; x = 11; import1(); import2(x); } That is, we evalled two lines of executing code and simply removed them, and then we wrote out the value of the local at that point, and then the rest of the code in the function is as it used to be.
*	[ctor-eval] Add --ignore-external-input option (#4428)	Alon Zakai	2022-01-06	1	-4/+0
\| \| \| \| \| \| \| \| \| \| \| \|	This is meant to address one of the main limitations of wasm-ctor-eval in emscripten atm, that libc++ global ctors will read env vars, which means they call an import, which stops us from evalling, emscripten-core/emscripten#15403 (comment) To handle that, this adds an option to ignore external input. When set, we can assume that no env vars will be read, no reading from stdin, no arguments to main(), etc. Perhaps these could each be separate options, but I think keeping it simple for now might be good enough.
*	[EH] Support try-delegate in interpreter (#4408)	Heejin Ahn	2021-12-28	1	-0/+16
\|
*	Change from storing Signature to HeapType on CallIndirect (#4352)	Thomas Lively	2021-11-22	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \|	With nominal function types, this change makes it so that we preserve the identity of the function type used with call_indirect instructions rather than recreating a function heap type, which may or may not be the same as the originally parsed heap type, from the function signature during module writing. This will simplify the type system implementation by removing the need to store a "canonical" nominal heap type for each unique signature. We previously depended on those canonical types to avoid creating multiple duplicate function types during module writing, but now we aren't creating any new function types at all.
*	Add support for relaxed-simd instructions (#4320)	Ng Zhi An	2021-11-15	1	-1/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds relaxed-simd instructions based on the current status of the proposal https://github.com/WebAssembly/relaxed-simd/blob/main/proposals/relaxed-simd/Overview.md. Binary opcodes are based on what is listed in https://github.com/WebAssembly/relaxed-simd/blob/main/proposals/relaxed-simd/Overview.md#binary-format. Text names are not fixed yet, and some sort sort of names that maps to the non-relaxed versions are chosen for this prototype. Support for these instructions have been added to LLVM via builtins, adding support here will allow Emscripten to successfully compile files that use those builtins. Interpreter support has also been added, and they delegate to the non-relaxed versions of the instructions. Most instructions are implemented in the interpreter the same way as the non-relaxed simd128 instructions, except for fma/fms, which is always fused.
*	Return the correct flow when an RTT is breaking (#4310)	Thomas Lively	2021-11-05	1	-1/+1
\| \| \|	Fixes #4308.
*	Fix RTTs for RTT-less instructions (#4294)	Thomas Lively	2021-11-03	1	-133/+117
\| \| \| \| \| \| \| \| \| \| \| \|	Allocation and cast instructions without explicit RTTs should use the canonical RTTs for the given types. Furthermore, the RTTs for nominal types should reflect the static type hierarchy. Previously, however, we implemented allocations and casts without RTTs using an alternative system that only used static types rather than RTT values. This alternative system would work fine in a world without first-class RTTs, but it did not properly allow mixing instructions that use RTTs and instructions that do not use RTTs as intended by the M4 GC spec. This PR fixes the issue by using canonical RTTs where appropriate and cleans up the relevant casting code using std::variant.
*	[NFC] Use std::variant in GCData (#4289)	Thomas Lively	2021-10-28	1	-2/+3
\| \| \| \|	This helps prevent bugs where we assume that the GCData has either a HeapType or Rtt without checking. Indeed, one such bug is found and fixed.
*	Add table.grow operation (#4245)	Max Graey	2021-10-18	1	-15/+60
\|
*	Add table.size operation (#4224)	Max Graey	2021-10-08	1	-1/+12
\|
*	Add table.set operation (#4215)	Max Graey	2021-10-07	1	-0/+21
\|
*	Implement table.get (#4195)	Alon Zakai	2021-09-30	1	-15/+54
\| \| \| \|	Adds the part of the spec test suite that this passes (without table.set we can't do it all).
*	[Wasm GC] Implement static (rtt-free) StructNew, ArrayNew, ArrayInit (#4172)	Alon Zakai	2021-09-23	1	-20/+75
\| \| \| \| \| \| \| \| \|	See #4149 This modifies the test added in #4163 which used static casts on dynamically-created structs and arrays. That was technically not valid (as we won't want users to "mix" the two forms). This makes that test 100% static, which both fixes the test and gives test coverage to the new instructions added here.
*	[Wasm GC] Add static variants of ref.test, ref.cast, and br_on_cast* (#4163)	Alon Zakai	2021-09-20	1	-19/+40
\| \| \| \| \| \| \| \| \| \| \| \|	These variants take a HeapType that is the type we intend to cast to, and do not take an RTT. These are intended to be more statically optimizable. For now though this PR just implements the minimum to get them parsing and to get through the optimizer without crashing. Spec: https://docs.google.com/document/d/1afthjsL_B9UaMqCA5ekgVmOm75BVFu6duHNsN9-gnXw/edit# See #4149
*	Fix interpreting of ref.as_func\|data (#4164)	Alon Zakai	2021-09-20	1	-2/+2
\|
*	[Wasm GC] Fix lack of packing in array.init (#4153)	Alon Zakai	2021-09-14	1	-1/+2
\|
*	[Wasm GC] ArrayInit support (#4138)	Alon Zakai	2021-09-10	1	-0/+21
\| \| \| \| \| \| \|	array.init is like array.new_with_rtt except that it takes as arguments the values to initialize the array with (as opposed to a size and an optional initial value). Spec: https://docs.google.com/document/d/1afthjsL_B9UaMqCA5ekgVmOm75BVFu6duHNsN9-gnXw/edit#
*	[Simd] Implement extra convert, trunc, demote and promote ops for ↵	Max Graey	2021-07-28	1	-2/+6
\| \| \| \|	interpreter (#4023)
*	[Simd] Refactoring. Remove middle Vec from some simd ops for consistency ↵	Max Graey	2021-07-27	1	-17/+17
\| \| \| \|	(#4027)
*	[Simd] Add extending pairwise adds to interpreter (#4022)	Max Graey	2021-07-26	1	-4/+4
\|
*	[Simd] Add extension from i32x4 to i64x2 ops to interpreter (#4016)	Max Graey	2021-07-26	1	-0/+4
\|
*	Fix tiny typo (#3995)	Alon Zakai	2021-07-19	1	-1/+1
\|
*	Implement interpretation of i64x2.bitmask (#3982)	Thomas Lively	2021-07-13	1	-1/+1
\| \| \| \| \| \| \|	Like a few other SIMD operations, this i64x2.bitmask had not been implemented in the interpreter yet. Unlike the others, i64x2.bitmask has type i32 rather than type v128, so Precompute was not skipping it, leading to a crash, as in https://github.com/emscripten-core/emscripten/issues/14629. Fix the problem by implementing i64x2.bitmask in the interpreter.
*	Preserve Function HeapTypes (#3952)	Thomas Lively	2021-06-30	1	-10/+13
\| \| \| \| \| \| \| \| \|	When using nominal types, func.ref of two functions with identical signatures but different HeapTypes will yield different types. To preserve these semantics, Functions need to track their HeapTypes, not just their Signatures. This PR replaces the Signature field in Function with a HeapType field and adds new utility methods to make it almost as simple to update and query the function HeapType as it was to update and query the Function Signature.
*	[EH] Replace event with tag (#3937)	Heejin Ahn	2021-06-18	1	-5/+5
\| \| \| \| \| \| \| \| \| \| \|	We recently decided to change 'event' to 'tag', and to 'event section' to 'tag section', out of the rationale that the section contains a generalized tag that references a type, which may be used for something other than exceptions, and the name 'event' can be confusing in the web context. See - https://github.com/WebAssembly/exception-handling/issues/159#issuecomment-857910130 - https://github.com/WebAssembly/exception-handling/pull/161
*	[Wasm GC] rtt.fresh_sub (#3936)	Alon Zakai	2021-06-17	1	-0/+3
\| \| \| \| \| \| \| \| \| \|	This is the same as rtt.sub, but creates a "new" rtt each time. See https://docs.google.com/document/d/1DklC3qVuOdLHSXB5UXghM_syCh-4cMinQ50ICiXnK3Q/edit# The old Literal implementation of rtts becomes a little more complex here, as it was designed for the original spec where only structure matters. It may be worth a complete redesign there, but for now as the spec is in flux I think the approach here is good enough.
*	[Wasm GC] Add negated BrOn* operations (#3913)	Alon Zakai	2021-06-02	1	-23/+55
\| \| \| \| \| \|	They are basically the flip versions. The only interesting part in the impl is that their returned typed and sent types are different. Spec: https://docs.google.com/document/d/1DklC3qVuOdLHSXB5UXghM_syCh-4cMinQ50ICiXnK3Q/edit
*	[Wasm GC] Add experimental array.copy (#3911)	Alon Zakai	2021-05-27	1	-5/+60
\| \| \| \| \| \| \| \|	Spec for it is here: https://docs.google.com/document/d/1DklC3qVuOdLHSXB5UXghM_syCh-4cMinQ50ICiXnK3Q/edit# Also reorder some things in wasm.h that were not in the canonical order (that has no effect, but it is confusing to read).
*	Heap2Local: Use escape analysis to turn heap allocations into local data (#3866)	Alon Zakai	2021-05-12	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If we allocate some GC data, and do not let the reference escape, then we can replace the allocation with locals, one local for each field in the allocation basically. This avoids the allocation, and also allows us to optimize the locals further. On the Dart DeltaBlue benchmark, this is a 24% speedup (making it faster than the JS version, incidentially), and also a 6% reduction in code size. The tests are not the best way to show what this does, as the pass assumes other passes will clean up after. Here is an example to clarify. First, in pseudocode: ref = new Int(42) do { ref.set(ref.get() + 1) } while (import(ref.get()) That is, we allocate an int on the heap and use it as a counter. Unnecessarily, as it could be a normal int on the stack. Wat: (module ;; A boxed integer: an entire struct just to hold an int. (type $boxed-int (struct (field (mut i32)))) (import "env" "import" (func $import (param i32) (result i32))) (func "example" (local $ref (ref null $boxed-int)) ;; Allocate a boxed integer of 42 and save the reference to it. (local.set $ref (struct.new_with_rtt $boxed-int (i32.const 42) (rtt.canon $boxed-int) ) ) ;; Increment the integer in a loop, looking for some condition. (loop $loop (struct.set $boxed-int 0 (local.get $ref) (i32.add (struct.get $boxed-int 0 (local.get $ref) ) (i32.const 1) ) ) (br_if $loop (call $import (struct.get $boxed-int 0 (local.get $ref) ) ) ) ) ) ) Before this pass, the optimizer could do essentially nothing with this. Even with this pass, running -O1 has no effect, as the pass is only used in -O2+. However, running --heap2local -O1 leads to this: (func $0 (local $0 i32) (local.set $0 (i32.const 42) ) (loop $loop (br_if $loop (call $import (local.tee $0 (i32.add (local.get $0) (i32.const 1) ) ) ) ) ) ) All the GC heap operations have been removed, and we just have a plain int now, allowing a bunch of other opts to run. That output is basically the optimal code, I think.
*	[Wasm GC] Fix casting code in interpreter (#3873)	Alon Zakai	2021-05-10	1	-5/+9
\| \| \| \| \| \| \| \| \| \|	The logic there would construct the cast value separately for functions and data (as we must), and then in an attempt to share code, would then check if the cast succeed or not (and if not, do nothing with the cast value). But this was wrong, as in some weird casts (like a struct to a function) we cannot construct a valid cast value, and we error there. Instead, check if the cast works first, once we know enough to do so, and only then construct the cast value if so.
*	[Wasm GC] Fix Array initialization of a packed value (#3868)	Alon Zakai	2021-05-07	1	-1/+2
\| \| \| \| \| \|	We truncated and extended packed values in get and set, but not during initialization. Found by the fuzzer.
*	Fix interpreting of a ref.cast of a function that is not on the module (#3863)	Alon Zakai	2021-05-06	1	-3/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Binaryen allows optimizing functions in function-parallel passes while the module is still being built, that is, while not all the other functions have even been added to the module yet. Since the removal of asm2wasm that has not been heavily tested, but the fuzzer found a closely related bug: in passes like inlining-optimizing, that inline and then optimize the functions we inlined into, the mechanism for optimizing only the relevant functions is to create a module with only some of them. (We only want to optimize the relevant ones, that we inlined into, because this happens after the main optimization pipeline - we don't want to re-optimize all the functions if we just inlined into one of them.) The specific bug here is that ref.cast of a funcref looked up the target function on the module (in order to get its signature, to see if the cast has the right RTT for it). The fix is to return a nonconstant flow in that case, as it is something we cannot precompute. (This does mean we may miss some optimization opportunities, but as in the case of where we optimize functions before the module is fully built up, we do still get 99% of function-local optimizations that way, and a subsequent round of full optimizations can be done later if necessary.)
*	Run spec test all at once after binary transform (#3817)	Abbas Mashayekh	2021-04-20	1	-10/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	#3792 added support for module linking and (register command to wasm-shell, but forgot about three problems: - Splitting spec tests prevents linking test modules together. - Registered modules may still be used in assertions or an invoke - Modules may re-export imported objects This PR appends transformed modules after binary checks to a spec.wast file, plus assertion tests and register commands. Then runs wasm-shell on the whole file. It also keeps both the module name and its registered name available in wasm-shell for use in shell commands and linked modules. Furthermore, it correctly finds the module where an object is defined even if it is imported and re-exported several times. The updated version of imports.wast spec test is enabled to verify the fixes.
*	Very simple module linking in wasm-shell (#3792)	Abbas Mashayekh	2021-04-16	1	-86/+146
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a rewrite of the wasm-shell tool, with the goal of improved compatibility with the reference interpreter and the spec test suite. To facilitate that, module instances are provided with a list of linked instances, and imported objects are looked up in the correct instance. The new shell can: - register and link modules using the (register ...) command. - parse binary modules with the syntax (module binary ...). - provide the "spectest" module defined in the reference interpreter - assert instantiation traps with assert_trap - better check linkability by looking up the linked instances in - assert_unlinkable It cannot call external function references that are not direct imports. That would require bigger changes.
*	Fuzzer: Distinguish traps from host limitations (#3801)	Alon Zakai	2021-04-12	1	-2/+11
\| \| \| \| \| \| \| \| \|	Host limitations are arbitrary and can be modified by optimizations, so ignore them. For example, if the optimizer removes allocations then a host limit on an allocation error may vanish. Or, an optimization that removes recursion and replaces it with a loop may avoid a host limit on call depth (that is not done currently, but might some day). This removes a class of annoying false positives in the fuzzer.
*	Rename SIMD extending load instructions (#3798)	Daniel Wirtz	2021-04-12	1	-18/+18
\| \| \| \| \| \| \| \| \|	Renames the SIMD instructions * LoadExtSVec8x8ToVecI16x8 -> Load8x8SVec128 * LoadExtUVec8x8ToVecI16x8 -> Load8x8UVec128 * LoadExtSVec16x4ToVecI32x4 -> Load16x4SVec128 * LoadExtUVec16x4ToVecI32x4 -> Load16x4UVec128 * LoadExtSVec32x2ToVecI64x2 -> Load32x2SVec128 * LoadExtUVec32x2ToVecI64x2 -> Load32x2UVec128