forks/binaryen.git -

	Commit message	Author	Age	Files	Lines
...
*	[wasm-split] Use a fresh table when reference types are enabled (#6726)	Thomas Lively	2024-07-11	1	-13/+18
\| \| \| \| \| \| \|	Rather than trying to trampoline primary-to-secondary calls through an existing table, just create a fresh table for this purpose. This ensures that modifications to the existing tables cannot interfere with primary-to-secondary calls and conversely that loading the secondary module cannot overwrite modifications to the tables.
*	[WasmGC] Heap2Local: Optimize RefCast failures (#6727)	Alon Zakai	2024-07-11	1	-22/+22
\| \| \| \| \| \| \|	Previously we just did not optimize cases where our escape analysis showed an allocation flowed into a cast that failed. However, after inlining there can be real-world cases where that happens, even in traps-never-happen mode (if the cast is behind a conditional branch), so it seems worth optimizing.
*	Heap2Local: Drop RefEq's two arms (#6729)	Alon Zakai	2024-07-11	1	-3/+3
\| \| \| \| \|	This is a tiny bit more code but it is more consistent with other operations, and it saves work later.
*	Monomorphization: Optimize constants (#6711)	Alon Zakai	2024-07-11	4	-83/+456
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously the pass would monomorphize a call when we were sending more refined types than the target expects. This generalizes the pass to also consider the case where we send a constant in a parameter. To achieve that, this refactors the pass to explicitly define the "call context", which is the code around the call (inputs and outputs) that may end up leading to optimization opportunities when combined with the target function. Also add comments about the overall design + roadmap. The existing test is mostly unmodified, and the diff there is smaller when ignoring whitespace. We do "regress" those tests by adding more local.set operations: the refactoring makes things a lot simpler by handling the general case of an operand either having a refined type or being a constant in a single way, namely by copying it inside the function, which works either way. This "regression" is only in the testing version of the pass (the normal version runs optimizations, which would remove that extra code). This also enables the pass when GC is disabled. Previously we only handled refined types, so only GC could benefit. Add a test for MVP content specifically to show we operate there as well.
*	[WasmGC] Heap2Local: Optimize RefIs and RefTest (#6705)	Alon Zakai	2024-07-11	1	-1/+62
\|
*	[StackIR] Allow StackIR to be disabled from the commandline (#6725)	Alon Zakai	2024-07-10	2	-3/+18
\| \| \| \| \| \| \| \| \|	Normally we use it when optimizing (above a certain level). This lets the user prevent it from being used even then. Also add optimization options to wasm-metadce so that this is possible there as well and not just in wasm-opt (this also opens the door to running more passes in metadce, which may be useful later).
*	Allow --keepfuncs and --splitfuncs to be use alongside a profile data (#6322)	Benjamin Ling	2024-07-10	2	-25/+38
\| \| \| \| \| \| \| \| \|	There are times when, after collecting a profile, we wish to manually include specific functions in the primary module. This could be due to non-deterministic profiling or to functions for error scenarios (e.g. _trap). This PR unlocks this workflow by honoring both the `--keep-funcs` flag and the `--profile` flag.
*	[C API] Add APIs for getting/setting function types, and a CallRef example (#6721)	Alon Zakai	2024-07-09	2	-0/+11
\| \| \| \| \|	Fixes #6718
*	StackIR: Optimize away a drop before an unreachable (#6719)	Alon Zakai	2024-07-08	1	-1/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Anything else right before an unreachable is removed by the main DCE pass anyhow, but because of the structured form of BinaryenIR we can't remove a drop. That is, this is the difference between `(i32.eqz (i32.const 42) (unreachable))` and `(drop (call $foo)) (unreachable)`. In both cases the unreachable is preceded by something we don't need, but in the latter case it must remain in BinaryenIR for validation. To optimize this, add a rule in StackIR. Fixes #6715
*	Rename external conversion instructions (#6716)	Jérôme Vouillon	2024-07-08	22	-50/+68
\| \| \| \| \| \| \| \| \|	Rename instructions `extern.internalize` into `any.convert_extern` and `extern.externalize` into `extern.convert_any` to follow more closely the spec. This was changed in https://github.com/WebAssembly/gc/issues/432. The legacy name is still accepted in text inputs and in the C and JS APIs.
*	[DebugInfo] Add debug info to the values emitted in GlobalStructInference (#6709)	Alon Zakai	2024-07-02	1	-15/+23
\| \| \| \| \| \| \| \| \|	Previously the replacement select got the debug info, but we should also copy it to the values, as often optimizations lead to one of those values remaining by itself. Similar to #6652 in general form.
*	ConstantFieldPropagation: Add a variation that picks between 2 values using RefTest (#6692)	Alon Zakai	2024-06-27	3	-16/+255
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	CFP focuses on finding when a field always contains a constant, and then replaces a struct.get with that constant. If we find there are two constant values, then in some cases we can still optimize, if we have a way to pick between them. All we have is the struct.get and its reference, so we must use a ref.test: `(struct.get $T x (..ref..))` => `(select (..constant1..) (..constant2..) (ref.test $U (..ref..)))`. This is valid if, of all the subtypes of $T, those that pass the test have constant1 in that field, and those that fail the test have constant2. For example, a simple case is where $T has two subtypes, $T is never created itself, and each of the two subtypes has a different constant value. This is a somewhat risky operation, as ref.test is not necessarily cheap. To mitigate that, this is a new pass, --cfp-reftest, that is not run by default, and also we only optimize when we can use a ref.test on what we think will be a final type (because ref.test on a final type can be faster in VMs).
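
The validity condition described in the message above can be sketched roughly as follows. This is an illustrative model only, not the pass's actual code: the `children` and `values` dictionaries and the function names are hypothetical, and the sketch ignores the "final type" heuristic and nullability.

```python
def descendants(children, t):
    """Return t followed by all of its (transitive) subtypes."""
    out = [t]
    for child in children.get(t, []):
        out.extend(descendants(children, child))
    return out


def find_ref_test(children, values, root):
    """Look for a type U such that `ref.test U` separates the two constants.

    children: hypothetical map from a type name to its direct subtypes.
    values:   hypothetical map from each *created* type to the constant
              stored in the field (types never created are absent).
    Returns (U, constant_if_test_passes, constant_if_test_fails) or None.
    """
    all_types = descendants(children, root)
    created = [t for t in all_types if t in values]
    if len({values[t] for t in created}) != 2:
        return None  # this variation only handles exactly two constants
    for u in all_types[1:]:  # strict subtypes of the root
        passing = set(descendants(children, u))
        pass_vals = {values[t] for t in created if t in passing}
        fail_vals = {values[t] for t in created if t not in passing}
        if len(pass_vals) == 1 and len(fail_vals) == 1:
            return u, pass_vals.pop(), fail_vals.pop()
    return None


# $T has two subtypes, $T itself is never created, each subtype has its own constant.
print(find_ref_test({"T": ["A", "B"]}, {"A": 10, "B": 20}, "T"))
```
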
*	[NFC] Add HeapType::getFeatures() (#6707)	Alon Zakai	2024-06-27	4	-89/+95
\|
*	[threads] Fuzz shared types in type fuzzer (#6704)	Thomas Lively	2024-06-26	4	-101/+139
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Give the type fuzzer the ability to generate shared heap types when the shared-everything feature is enabled. It correctly ensures that shared structs and arrays cannot reference unshared heap types, but that unshared heap types can reference any heap type. Update the main fuzzer so that for the time being it never uses the shared-everything feature when generating additional heap types, so it never generates shared types. We can lift this restriction once the main fuzzer has been updated to properly handle shared types. As a drive-by, fix some logic for subtracting feature sets from each other that is used in this commit.
*	[WasmGC] Heap2Local: Optimize RefEq (#6703)	Alon Zakai	2024-06-26	1	-2/+42
\| \| \| \| \|	If an allocation does not escape, then we can compute ref.eq for it: when compared to itself the result is 1, and when compared to anything else it is 0 (since it did not escape, anything else must be different).
*	[threads] Validate shared-polymorphic instructions (#6702)	Thomas Lively	2024-06-25	2	-72/+68
\| \| \| \|	Such as `ref.eq`, `i31.get_{s,u}`, and `array.len`. Also validate that struct and array operations work on shared structs and arrays.
*	[threads] Validate shared-to-unshared edges in heap types (#6698)	Thomas Lively	2024-06-25	2	-18/+53
\| \| \|	Add spec tests checking validation for structs and arrays.
*	Add missing struct/array type checks in binary reader (#6701)	Alon Zakai	2024-06-25	1	-0/+30
\| \| \|	Fixes #6695
*	Add a missing binary reading check for BrOn's reference child's type (#6700)	Alon Zakai	2024-06-25	1	-0/+3
\| \| \| \| \| \|	That child must be a reference, as `finalize()` assumes so. To avoid an assertion, error early. Fixes #6696
*	[NFC] Remove a minor compile-time optimization in Heap2Local (#6699)	Alon Zakai	2024-06-25	1	-43/+6
\| \| \| \| \| \| \| \|	We tracked which expressions we saw an allocated struct/array reach, and then quickly exited when another one did (as when two allocations mix, we can optimize neither). It turns out that this helps very little in actual measurements (looks like within noise - likely we are ruling out the un-optimizable cases early otherwise anyhow). Also the complexity it adds is a problem for an improvement I want to make to the pass, so remove it.
*	[WasmGC] Add missing ArrayNew variants to Properties::isGenerative (#6691)	Alon Zakai	2024-06-24	1	-0/+2
\| \| \|	Fixes #6690
*	Add a proper error for bad select results (#6697)	Alon Zakai	2024-06-24	1	-1/+5
\| \| \| \| \|	The result cannot be `none` or `unreachable` etc. Fixes #6694
*	Add TraceCalls pass (#6619)	Marcin Kolny	2024-06-21	4	-0/+224
\| \| \| \| \| \| \|	This pass receives a list of functions to trace, and then wraps them in calls to imports. This can be useful for tracing malloc/free calls, for example, but is generic. Fixes #6548
*	wasm2js: Fix the names of temp vars with named reference types (#6689)	Alon Zakai	2024-06-21	1	-0/+1
\| \| \| \| \| \|	We were missing code to mangle such names for JS. Without that, the name of a temp var for the type `(ref $foo)` would end up with `(`, `)` in the name, which is not valid in JS.
*	GlobalStructInference: Un-nest struct.news in globals when that is helpful (#6688)	Alon Zakai	2024-06-20	1	-62/+220
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If we have `(global $g (struct.new $S (i32.const 1) (struct.new $T ..) (ref.func $f)))` then before this PR, if we wanted to read the middle field we'd stop, as it is non-constant. However, we can un-nest it, making it constant: `(global $g.unnested (struct.new $T ..))` followed by `(global $g (struct.new $S (i32.const 1) (global.get $g.unnested) (ref.func $f)))`. Now the field is a global.get of an immutable global, which is constant. Using this technique we can handle anything in a struct field, constant or not. The cost of adding a global is likely offset by the benefit of being able to refer to it directly, as that opens up more opportunities later. Concretely, this replaces the constant values we look for in GSI with a variant over constants or expressions (we do still want to group constants, as multiple globals with the same constant field can be treated as a whole). And we note cases where we need to un-nest, and handle those at the end.
*	Validate memarg offsets (#6683)	Thomas Lively	2024-06-20	1	-0/+13
\| \| \| \| \|	For 32-bit memories, the offset value must be in the u32 range. Update the address.wast spec test to assert that a module with an overlarge offset value is invalid rather than malformed.
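
As a rough illustration of the range check described above: the function name is hypothetical, and the 64-bit branch is an assumption added for contrast, since the message only states the rule for 32-bit memories.

```python
def memarg_offset_is_valid(offset: int, memory_is_64bit: bool = False) -> bool:
    """Check that a memarg offset fits the address width of its memory."""
    # Stated in the commit: for 32-bit memories the offset must fit in a u32.
    # Assumed here: a 64-bit memory allows the full u64 range.
    limit = 1 << 64 if memory_is_64bit else 1 << 32
    return 0 <= offset < limit


print(memarg_offset_is_valid(0xFFFFFFFF))   # largest u32 offset
print(memarg_offset_is_valid(0x100000000))  # one past the u32 range
```
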
*	[NFC] Add pragma to ignore maybe-uninitialized in strings.cpp (#6686)	Thomas Lively	2024-06-20	1	-0/+5
\| \| \|	This will hopefully fix the build on the coverage builder.
*	Validate that names are valid UTF-8 (#6682)	Thomas Lively	2024-06-19	5	-8/+36
\| \| \| \| \| \|	Add an `isUTF8` utility and use it in both the text and binary parsers. Add missing checks for overlong encodings and overlarge code points in our WTF8 reader, which the new utility uses. Re-enable the spec tests that test UTF-8 validation.
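
The checks mentioned above (overlong encodings and overlarge code points) can be illustrated with a small standalone validator. This is a sketch of the general technique, not Binaryen's `isUTF8` utility; the name `is_utf8` and the surrogate rejection are assumptions based on the UTF-8 definition rather than on the commit.

```python
def is_utf8(data: bytes) -> bool:
    """Return True if data is well-formed UTF-8."""
    i, n = 0, len(data)
    while i < n:
        lead = data[i]
        if lead < 0x80:  # single-byte (ASCII) sequence
            i += 1
            continue
        if 0xC0 <= lead < 0xE0:
            length, min_cp, cp = 2, 0x80, lead & 0x1F
        elif 0xE0 <= lead < 0xF0:
            length, min_cp, cp = 3, 0x800, lead & 0x0F
        elif 0xF0 <= lead < 0xF8:
            length, min_cp, cp = 4, 0x10000, lead & 0x07
        else:
            return False  # stray continuation byte or invalid lead byte
        if i + length > n:
            return False  # sequence is truncated
        for b in data[i + 1 : i + length]:
            if b & 0xC0 != 0x80:
                return False  # not a continuation byte
            cp = (cp << 6) | (b & 0x3F)
        if cp < min_cp:
            return False  # overlong encoding
        if cp > 0x10FFFF:
            return False  # overlarge code point
        if 0xD800 <= cp <= 0xDFFF:
            return False  # surrogates are not valid in UTF-8 (assumed check)
        i += length
    return True
```
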
*	Fix validation of unused LEB128 bits (#6680)	Thomas Lively	2024-06-19	1	-11/+16
\| \| \| \| \|	The unused bits must be a sign extension of the significant value, but we were previously only validating that unsigned LEBs had their unused bytes set to zero. Re-enable the spec test that checks for proper validation.
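
A minimal sketch of the rule described above, assuming a fixed-width decoder: in the final permitted byte, the bits beyond the value's width must be zero for unsigned values and must repeat the sign bit for signed values. The function names are hypothetical and this is not Binaryen's reader.

```python
def decode_leb128(data: bytes, bits: int, signed: bool) -> int:
    """Decode one LEB128 integer of the given bit width, validating unused bits."""
    max_bytes = (bits + 6) // 7
    result = 0
    shift = 0
    for i, byte in enumerate(data):
        if i >= max_bytes:
            raise ValueError("LEB128 value is too long")
        payload = byte & 0x7F
        is_last = (byte & 0x80) == 0
        used = min(7, bits - shift)  # significant payload bits in this byte
        if used < 7:
            # This is the final byte the width allows.
            if not is_last:
                raise ValueError("LEB128 value is too long")
            unused = payload >> used
            sign = (payload >> (used - 1)) & 1
            # Signed: unused bits must copy the sign bit. Unsigned: must be zero.
            expected = (1 << (7 - used)) - 1 if (signed and sign) else 0
            if unused != expected:
                raise ValueError("unused LEB128 bits are not a proper extension")
        result |= (payload & ((1 << used) - 1)) << shift
        shift += 7
        if is_last:
            total_bits = min(shift, bits)
            if signed and result & (1 << (total_bits - 1)):
                result -= 1 << total_bits  # sign-extend
            return result
    raise ValueError("LEB128 value is truncated")


def is_valid_leb128(data: bytes, bits: int, signed: bool) -> bool:
    """Convenience wrapper that reports validity instead of raising."""
    try:
        decode_leb128(data, bits, signed)
        return True
    except ValueError:
        return False
```

For example, the five bytes `ff ff ff ff 0f` decode as the unsigned 32-bit value `0xFFFFFFFF`, but are rejected as a signed 32-bit value because the unused bits are zero while the sign bit is set.
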
*	Check malformed mutability on imported globals (#6679)	Thomas Lively	2024-06-18	1	-0/+3
\| \| \|	And re-enable the globals.wast spec test, which checks this.
*	Re-enable binary.wast spec test (#6677)	Thomas Lively	2024-06-18	3	-11/+26
\| \| \| \| \| \|	Fix the wast parser to accept IDs on quoted modules, remove tests that are invalidated by the multimemory proposal, and add validation that the total number of variables in a function is less than 2^32 and that the code section is present if there is a non-empty function section.
*	[threads] Shared basic heap types (#6667)	Thomas Lively	2024-06-19	13	-209/+322
\| \| \| \| \| \| \| \| \| \| \|	Implement binary and text parsing and printing of shared basic heap types and incorporate them into the type hierarchy. To avoid the massive amount of code duplication that would be necessary if we were to add separate enum variants for each of the shared basic heap types, use bit 0 to indicate whether the type is shared and replace `getBasic()` with `getBasic(Unshared)`, which clears that bit. Update all the use sites to record whether the original type was shared and produce shared or unshared output without code duplication.
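
The bit-0 encoding described above can be modeled in a few lines. The enum members, their numeric values, and the helper names below are hypothetical and chosen only to show the idea; they do not reflect Binaryen's actual `HeapType` layout.

```python
from enum import IntEnum


class Basic(IntEnum):
    # Hypothetical IDs, each shifted left by one so that bit 0 stays free.
    EXT = 0 << 1
    FUNC = 1 << 1
    ANY = 2 << 1
    EQ = 3 << 1


SHARED_BIT = 1  # bit 0 marks a shared basic heap type


def make_basic(kind: Basic, shared: bool) -> int:
    """Encode a basic heap type together with its sharedness."""
    return int(kind) | (SHARED_BIT if shared else 0)


def is_shared(type_id: int) -> bool:
    return bool(type_id & SHARED_BIT)


def get_basic_unshared(type_id: int) -> Basic:
    """Clear the shared bit, so one switch over Basic covers both variants."""
    return Basic(type_id & ~SHARED_BIT)


shared_func = make_basic(Basic.FUNC, shared=True)
print(is_shared(shared_func), get_basic_unshared(shared_func).name)
```

The point of the design, as the message explains, is that callers can record sharedness once and then handle the unshared variant only, instead of duplicating every case.
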
*	[Parser] Fix bug in unreachable fallback logic (#6676)	Thomas Lively	2024-06-18	1	-13/+18
\| \| \| \| \| \| \| \| \|	When popping past an unreachable instruction would lead to popping from an empty stack or popping an incorrect type, we need to avoid popping and produce new Unreachable instructions instead to ensure we parse valid IR. The logic for this was flawed and made the synthetic Unreachable come before the popped unreachable child, which was not correct in the case that that popped unreachable was a branch or other non-trapping instruction. Fix and simplify the logic and re-enable the spec test that uncovered the bug.
*	fix(#6671): fix possible stack buffer overflow in gen-s-parser.inc (#6678)	mtb	2024-06-18	1	-0/+8
\| \| \| \| \| \|	The stack buffer overflow is occurring because memcpy(buf, op.data(), op.size()); can write up to op.size() bytes into buf, but buf is only 33 bytes long. If op.size() is greater than 33, this will result in a buffer overflow.
*	Reject invalid section IDs (#6675)	Thomas Lively	2024-06-18	2	-7/+8
\| \| \| \| \| \|	Rather than treating them as custom sections. Also fix UB where invalid `Section` enum values could be used as keys in a map. Use the raw `uint8_t` section IDs as keys instead. Re-enable a disabled spec test that was failing because of this bug and UB.
*	GlobalStructInference: Optimize globals too (#6674)	Alon Zakai	2024-06-17	1	-11/+10
\| \| \| \|	This is achieved by simply replacing the Literal with PossibleConstantValues, which supports both Literals and Globals.
*	Fix DataSegment name handling (#6673)	Alon Zakai	2024-06-17	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The code used i instead of index, as in this pseudocode:

	    for i in range(num_names):
	        index = readU32LEB()  # index of the data segment to name
	        name = readName()     # name to give that segment
	        data[i] = name        # XXX 'i' should be 'index'

	That (funnily enough) happened to always work before since we write names in order. That is, normally given segments A,B,C we'd write them in the names section as A,B,C. Then the reader, which had the bug, would always have i and index identical in value anyhow. But if a wasm producer used different indexes, a problem could happen. To test this, add a binary file that has a reversed name section. Fixes #6672
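
To make the difference concrete, here is a small runnable sketch of the buggy and fixed behavior. It models the name section as an already-decoded list of `(index, name)` pairs, which is an assumption for illustration; the function names are hypothetical.

```python
def read_names_buggy(entries, num_segments):
    """Assign names by loop position, ignoring the decoded index (the bug)."""
    names = [None] * num_segments
    for i, (index, name) in enumerate(entries):
        names[i] = name
    return names


def read_names_fixed(entries, num_segments):
    """Assign each name to the segment its entry actually refers to."""
    names = [None] * num_segments
    for index, name in entries:
        names[index] = name
    return names


in_order = [(0, "A"), (1, "B"), (2, "C")]
reversed_section = [(2, "C"), (1, "B"), (0, "A")]

# With in-order entries both versions agree, which is why the bug went unnoticed.
print(read_names_buggy(in_order, 3), read_names_fixed(in_order, 3))
# With a reversed name section only the fixed version names segments correctly.
print(read_names_buggy(reversed_section, 3), read_names_fixed(reversed_section, 3))
```
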
*	wasm2js: Support arbitrary temp variable types (#6661)	Alon Zakai	2024-06-17	1	-20/+18
\| \| \| \| \| \| \| \|	Previously only basic types were allowed. Generalizing this to arbitrary types means we use a map instead of a vector, which is slower, but I can't measure any noticeable difference. Temp vars are pretty rare, and there are just much slower parts of wasm2js, I think.
*	[Parser] Fix error message on required reftype (#6666)	Thomas Lively	2024-06-14	1	-9/+15
\| \| \| \| \| \| \| \| \|	Not all uses of the `reftype` parser handled the fact that it returned a `MaybeResult`. Change its name to `maybeReftype`, add a new `reftype` parser that returns an error if there is no reftype, and update all the use sites. Fixes #6655.
*	[Parser] Update requirements for implicit type uses (#6665)	Thomas Lively	2024-06-14	1	-1/+1
\| \| \| \| \| \| \|	As an abbreviation, a `typeuse` can be given as just a list of parameters and results, in which case it corresponds to the index of the first function type with the same parameters and results. That function type must also be an MVP function type, i.e. it cannot have a nontrivial rec group, be non-final, or have a declared supertype. The parser did not previously implement all of these rules.
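
The matching rule above can be sketched as a lookup over the module's type definitions. The `FuncType` record and function names are hypothetical, and returning `None` when nothing matches (leaving the caller to define a new type) is an assumption of this sketch rather than something stated in the commit.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class FuncType:
    params: Tuple[str, ...]
    results: Tuple[str, ...]
    rec_group_size: int = 1          # 1 means a trivial rec group
    final: bool = True
    supertype: Optional[int] = None  # index of the declared supertype, if any


def is_mvp_func_type(t: FuncType) -> bool:
    """An MVP function type: trivial rec group, final, no declared supertype."""
    return t.rec_group_size == 1 and t.final and t.supertype is None


def resolve_implicit_typeuse(
    types: Sequence[FuncType], params: Tuple[str, ...], results: Tuple[str, ...]
) -> Optional[int]:
    """Return the index of the first MVP function type with this signature."""
    for idx, t in enumerate(types):
        if t.params == params and t.results == results and is_mvp_func_type(t):
            return idx
    return None


types = [
    FuncType(("i32",), ("i32",), final=False),  # same signature, but non-final
    FuncType(("i32",), ("i32",)),               # plain MVP function type
]
print(resolve_implicit_typeuse(types, ("i32",), ("i32",)))
```
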
*	[threads] Binary reading and writing of shared composite types (#6664)	Thomas Lively	2024-06-14	3	-1/+10
\| \| \| \|	Also update the parser so that implicit type uses are not matched with shared function types.
*	[Parser][NFC] Make typeidx and maybeTypeidx return consistent types (#6663)	Thomas Lively	2024-06-14	2	-13/+12
\| \| \| \| \| \| \|	Since the BasicHeapTypes are in an enum, calling HeapType methods on them requires something like `HeapType(HeapType::func).someMethod()`. This is unnecessarily verbose, so add a new `HeapTypes` namespace that contains constexpr HeapType globals that can be used instead, shortening this to `HeapTypes::func.someMethod()`.
*	[NFC] Add constexpr HeapTypes for basic heap types (#6662)	Thomas Lively	2024-06-14	1	-0/+20
\| \| \| \| \| \| \|	Since the BasicHeapTypes are in an enum, calling HeapType methods on them requires something like `HeapType(HeapType::func).someMethod()`. This is unnecessarily verbose, so add a new `HeapTypes` namespace that contains constexpr HeapType globals that can be used instead, shortening this to `HeapTypes::func.someMethod()`.
*	[threads] Add a "shared-everything" feature (#6658)	Thomas Lively	2024-06-14	7	-6/+53
\| \| \| \| \|	Add the feature and flags to enable and disable it. Require the new feature to be enabled for shared heap types to validate. To make the test work, update the validator to actually check features for global types.
*	wasm2js: Generalize global initializer code to use the main codegen logic (#6659)	Alon Zakai	2024-06-13	1	-48/+19
\| \| \| \| \| \| \|	This avoids special-casing particular global init forms. After this we should support everything in global inits that we support anywhere else.
*	Add local.set/tee local type annotations to BINARYEN_PRINT_FULL (#6657)	Alon Zakai	2024-06-13	1	-12/+25
\| \| \| \| \| \| \| \| \| \| \| \| \|	With this we now print e.g. `(local.set $temp (; local type: i32 ;) ...`. This can be nice in large functions to avoid needing to scroll up to see the local type, e.g. when debugging why unsubtyping doesn't work somewhere. Also avoid `[ ]` in this mode, in favor of the standard `(; ;)`, and put those at the end rather than at the start.
*	wasm2js: Fix null handling and RefAsNonNull (#6656)	Alon Zakai	2024-06-12	2	-27/+31
\|
*	[threads] Parse, build, and print shared composite types (#6654)	Thomas Lively	2024-06-12	6	-13/+70
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Parse the text format for shared composite types as described in the shared-everything thread proposal. Update the parser to use 'comptype' instead of 'strtype' to match the final GC spec and add the new syntactic class 'sharecomptype'. Update the type canonicalization logic to take sharedness into account to avoid merging shared and unshared types. Make the same change in the TypeMerging pass. Ensure that shared and unshared types cannot be in a subtype relationship with each other. Follow-up PRs will add shared abstract heap types, binary parsing and emitting for shared types, and fuzzer support for shared types.
*	[DebugInfo] Copy debug info in call-utils.h (#6652)	Alon Zakai	2024-06-12	7	-52/+97
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We automatically copy debuginfo in replaceCurrent(), but there are a few places that do other operations than simple replacements. call-utils.h will turn a call_ref with a select target into two direct calls, and we were missing the logic to copy debuginfo from the call_ref to the calls. To make this work, refactor out the copying logic from wasm-traversal, into debuginfo.h, and use it in call-utils.h. debuginfo.h itself is renamed from debug.h (as now this needs to be included from wasm-traversal, which nearly everything does, and it turns out some files have internal stuff like a debug() helper that ends up conflicting with the old debug namespace). Also rename the old copyDebugInfo function to copyDebugInfoBetweenFunctions, which is more explicit. That is also moved from the header to a cpp file because it depends on wasm-traversal (so we'd end up with recursive headers otherwise). That is fine, as that method is called after copying a function, which is not that frequent. The new copyDebugInfoToReplacement (which was refactored out of wasm-traversal) is in the header because it can be called very frequently (every single instruction we optimize) and we want it to get inlined.
*	[Parser][NFC] Split parser into multiple compilation units (#6653)	Thomas Lively	2024-06-12	9	-167/+369
\| \| \| \| \| \| \| \| \| \| \| \|	Because the parser has five stages, it requires instantiating all of the templates in parsers.h with up to five different contexts. Instantiating all those templates in a single compilation unit takes a long time. On my machine, a release build of wat-parser.cpp.o took 32 seconds. To reduce the time of incremental rebuilds on machines with many cores, split the code across several compilation units so that the templates need to be instantiated for just a single context in each unit. On my machine the longest compilation time after this splitting is 17 seconds. The time for a full release build also drops from 42 seconds to 33 seconds. On machines with fewer cores, the benefit may be smaller or even negative, though.