forks/binaryen.git -

	Commit message	Author	Age	Files	Lines
*	[Parser] Replace Signedness with ternary Sign (#4698)	Thomas Lively	2022-05-27	1	-42/+42
\| \| \| \| \| \| \| \|	Previously we were tracking whether integer tokens were signed but we did not differentiate between positive and negative signs. Unfortunately, without differentiating them, there's no way to tell the difference between an in-bounds negative integer and a wildly out-of-bounds positive integer when trying to perform bounds checks for s32 tokens. Fix the problem by tracking not only whether there is a sign on an integer token, but also what the sign is.
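The bounds check described above can be sketched as follows. This is a minimal illustration, not binaryen's lexer code: `Sign`, `IntToken`, and `fitsS32` are hypothetical names, and the token is assumed to store the absolute value of the literal separately from its sign.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical three-way sign, mirroring the idea in the commit message.
enum class Sign { None, Pos, Neg };

// Hypothetical token: the magnitude is the absolute value of the literal.
struct IntToken {
  uint64_t magnitude;
  Sign sign;
};

// Returns true if the token is representable as a signed 32-bit integer.
bool fitsS32(const IntToken& tok) {
  const uint64_t twoTo31 = uint64_t(1) << 31;
  if (tok.sign == Sign::Neg) {
    // The most negative s32 is -2^31, so magnitudes up to 2^31 are in bounds.
    return tok.magnitude <= twoTo31;
  }
  // Unsigned or explicitly positive: the largest s32 is 2^31 - 1.
  return tok.magnitude <= twoTo31 - 1;
}
```

With only a signed/unsigned flag, the two branches could not be told apart, so `-2147483648` and `+2147483648` would have to be treated identically even though only the first is a valid s32.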
*	[Parser][NFC] Create a public wat-lexer.h header (#4695)	Thomas Lively	2022-05-27	2	-16/+34
\| \| \| \| \| \|	wat-parser-internal.h was already quite large after implementing just the lexer, so it made sense to rename it to be lexer-specific and start a new file for the higher-level parser. Also make it a proper .cpp file and split the testable interface out into wat-lexer.h.
*	OptimizeInstructions: Turn call_ref of a select into an if over two direct calls (#4660)	Alon Zakai	2022-05-27	2	-34/+136
\| \| \| \| \| \|	This extends the existing call_indirect code to do the same for call_ref, basically. The shared code is added to a new helper utility.
*	[EH] Export tags (#4691)	Heejin Ahn	2022-05-26	4	-0/+15
\|	This adds exported tags to the `exports` section in wasm-emscripten-finalize metadata so Emscripten can use it.
\|
\|	Also fixes a bug in the parser. We only recognized this export format:
\|	```wasm
\|	(tag $e2 (param f32))
\|	(export "e2" (tag $e2))
\|	```
\|	and ignored this format:
\|	```wasm
\|	(tag $e1 (export "e1") (param i32))
\|	```
\|
\|	Companion patch: https://github.com/emscripten-core/emscripten/pull/17064
*	[Parser] Lex floating point values (#4693)	Thomas Lively	2022-05-26	1	-4/+490
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Rather than trying to actually implement the parsing of float values, which cannot be done naively due to precision concerns, just parse the float grammar then postprocess the parsed text into a form we can pass to `strtod` to do the actual parsing of the value. Since the float grammar reuses `num` and `hexnum` from the integer grammar but does not care about overflow, add a mode to `LexIntCtx`, `num`, and `hexnum` to allow parsing overflowing numbers. For NaNs, store the payload as a separate value rather than as part of the parsed double. The payload will be injected into the NaN at a higher level of the parser once we know whether we are parsing an f64 or an f32 and therefore know what the allowable payload values are.
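The "lex the grammar, then hand the text to `strtod`" approach can be sketched like this. It is only an illustration under stated assumptions: `parseLexedFloat` is a hypothetical helper, the input is assumed to have already been validated against the float grammar, and the only postprocessing shown is stripping the `_` digit separators that the text format allows but `strtod` does not. It also assumes the "C" locale, since `strtod` is locale-sensitive.

```cpp
#include <cassert>
#include <cstdlib>
#include <string>
#include <string_view>

// Hypothetical helper, not binaryen's actual code. Converts float text that
// has already passed grammar validation into a double.
double parseLexedFloat(std::string_view text) {
  std::string cleaned;
  cleaned.reserve(text.size());
  for (char c : text) {
    // Drop digit separators; everything else is passed through unchanged.
    if (c != '_') {
      cleaned.push_back(c);
    }
  }
  // strtod performs correctly rounded decimal and hex-float conversion.
  return std::strtod(cleaned.c_str(), nullptr);
}
```

NaN payloads are deliberately left out of this sketch; as the message says, they are stored separately and applied later, once the target width is known.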
*	[Parser] Lex keywords (#4688)	Thomas Lively	2022-05-25	1	-0/+29
\| \| \| \| \|	Also include reserved words that look like keywords to avoid having to find and enumerate all the valid keywords. Invalid keywords will be rejected at a higher level in the parser instead.
*	[Parser] Lex strings (#4687)	Thomas Lively	2022-05-25	1	-0/+102
\|
*	[Parser] Lex idchar and identifiers (#4686)	Thomas Lively	2022-05-25	1	-0/+21
\|
*	[Wasm GC] Fix CFG traversal of call_ref and add missing validation check (#4690)	Alon Zakai	2022-05-25	3	-0/+60
\| \| \| \| \| \| \| \|	We were missing CallRef in the CFG traversal code in a place where we note possible exceptions. As a result we thought CallRef cannot throw, and were missing some control flow edges. To actually detect the problem, we need to validate non-nullable locals properly, which we were not doing. This adds that as well.
*	[Parser] Start a new text format parser (#4680)	Thomas Lively	2022-05-24	2	-0/+350
\|	Begin implementing a new text format parser that will accept the standard text format. Start with a lexer that can iterate over tokens in an underlying text buffer. The initial supported tokens are integers, parentheses, and whitespace including comments.
\|
\|	The implementation is in a new private internal header so it can be included into a gtest source file even though it is not meant to be a public API. Once the parser is more complete, there will be an additional public header exposing a more concise public API and the private header will be included into a source file that implements that public API.
\|
\|	The new parser will improve on the existing text format parser not only because it will accept the full standard text format, but also because its code will be simpler and easier to maintain and because it will hopefully be faster as well. The new parser will be built out of small functions that closely mirror the grammar productions given in the spec and will heavily use C++17 features like string_view, optional, and variant to provide more self-documenting and efficient code.
\|
\|	Future PRs will add support for lexing other kinds of tokens followed by support for parsing more complex constructs.
*	Add C and JS API functions for accessing memory info (#4682)	Jackson Gardner	2022-05-24	2	-0/+44
\| \| \|	Based on #3573 plus minor fixes
*	[Nominal Fuzzing] Fix SignatureRefining by updating types fully at once (#4665)	Alon Zakai	2022-05-23	1	-0/+46
\| \| \| \| \| \| \| \| \| \|	Optionally avoid updating types in TypeUpdating::updateParamTypes(). That update is incomplete if the function signature is also changing, which is the case in SignatureRefining (but not DeadArgumentElimination). "Incomplete" means that we updated the local.get type, but the function signature does not match yet. That incomplete state can hit an internal error in GlobalTypeRewriter::updateSignatures where it updates types. To avoid that, do the entire full update only there (in GlobalTypeRewriter::updateSignatures).
*	Fix binary parsing of the prototype nominal format (#4679)	Thomas Lively	2022-05-19	1	-13/+13
\| \| \| \| \| \|	We were checking that nominal modules only had a single element in their type sections, but that's not correct for the prototype nominal binary format we still want to support. The test for this missed catching the bug because it wasn't actually parsing in nominal mode.
*	Validator: Check features for ref.null's type (#4677)	Alon Zakai	2022-05-18	1	-0/+19
\|
*	[GC Fuzzing] Avoid non-nullable eqref without GC (#4675)	Alon Zakai	2022-05-18	1	-28/+34
\| \| \| \| \| \|	With only reference types but not GC, we cannot easily create a constant for eqref for example. Only GC adds i31.new etc. To avoid assertions in the fuzzer, avoid randomly picking (ref eq) etc., that is, keep it nullable so that we can emit a (ref.null eq) if we need a constant value of that type.
*	Allow TypeBuilder::grow to take 0 as an argument (#4668)	Thomas Lively	2022-05-16	1	-0/+2
\| \| \| \| \| \| \|	There's no reason not to allow growing by zero slots, but previously doing so would trigger an assertion. This caused a crash when roundtripping a trivial module. Fixes #4667.
*	Ensure symmetric results in PossibleConstantValues (#4662)	Alon Zakai	2022-05-13	1	-4/+4
\|	Previously we could return different results depending on the order we noted things:
\|
\|	  note(anyref.null); note(funcref.null); get() => anyref.null
\|	  note(funcref.null); note(anyref.null); get() => funcref.null
\|
\|	This is correct, as nulls are equal anyhow, and any could be used in the location we are optimizing. However, it can lead to nondeterminism if the caller's order of notes is nondeterministic. That is the case in DeadArgumentElimination, where we scan functions in parallel, then merge them without special ordering.
\|
\|	To fix this, make the note operation symmetric. That seems simplest and least likely to be confusing. We can use the LUB to do that.
\|
\|	To avoid duplicating the null logic, refactor note() to use combine().
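A toy model of why using the LUB makes the result independent of the order of notes. Everything here is hypothetical: `NullType`, `lub`, and `PossibleNull` stand in for binaryen's real types, and the lattice is reduced to two entries where `Func` is a subtype of `Any`.

```cpp
#include <cassert>

// Toy type lattice (an assumption for illustration): Func <: Any.
enum class NullType { Func, Any };

// Least upper bound in the toy lattice. It is symmetric by construction.
NullType lub(NullType a, NullType b) {
  return a == b ? a : NullType::Any;
}

// Hypothetical stand-in for a "possible constant null" tracker.
struct PossibleNull {
  bool hasValue = false;
  NullType type = NullType::Any;

  // Combine the new observation with what we have via the LUB, instead of
  // keeping whichever null happened to be noted first.
  void note(NullType t) {
    type = hasValue ? lub(type, t) : t;
    hasValue = true;
  }
};
```

Because `lub(a, b) == lub(b, a)`, merging results from parallel scans in any order yields the same type.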
*	Add SubTypes::getAllSubTypes variant which includes the type itself (#4649)	Alon Zakai	2022-05-13	2	-0/+31
\| \| \| \|	This also includes the type itself in the returned vector. This will be useful in a future PR.
*	Costs: Increase cost of casts (#4661)	Alon Zakai	2022-05-12	1	-0/+109
\| \| \| \| \|	Casts involve branches in the VM, so adding a cast in return for removing a branch (like If=>Select) is not beneficial. We don't want to ever do any more casts than we already are.
*	Add ref.cast_nop_static (#4656)	Thomas Lively	2022-05-11	1	-0/+18
\| \| \| \| \| \|	This unsafe experimental instruction is semantically equivalent to ref.cast_static, but V8 will unsafely turn it into a nop. This is meant to help us measure cast overhead more precisely than we can by globally turning all casts into nops.
*	[NominalFuzzing] Fix SignaturePruning on types with a super (#4657)	Alon Zakai	2022-05-11	1	-0/+18
\| \| \| \| \| \| \|	Do not prune parameters if there is a supertype that is a signature. Without this we crash on an assertion in TypeBuilder when we try to recreate the types (as we try to make a subtype with fewer fields than the super).
*	[Fuzzer] Fix another reference types vs gc types issue (#4647)	Alon Zakai	2022-05-06	1	-39/+27
\| \| \| \| \| \| \| \| \| \|	Diff without whitespace is smaller. We can't emit HeapType::data without GC. Fixing that by switching to func, another problem was uncovered: makeRefFuncConst had a TODO to handle the case where we need a function to refer to but have created none yet. In fact that TODO was done at the end of the function. Fix up the logic in between to actually get there.
*	Fix fuzzer's choosing of reference types (#4642)	Alon Zakai	2022-05-05	1	-32/+37
\|	* Don't emit "i31" or "data" if GC is not enabled, as only the GC feature adds those.
\|	* Don't emit "any" without GC either. While it is allowed, fuzzer limitations prevent this atm (see details in comment - it's fixable).
*	Parse the prototype nominal binary format (#4644)	Thomas Lively	2022-05-04	2	-0/+27
\| \| \| \| \| \|	In f124a11ca3 we removed support for the prototype nominal binary format entirely, but that means that we can no longer parse older binary modules that used that format. Fix this regression by restoring the ability to parse the prototype binary format.
*	Update StackCheck for memory64 (#4636)	Sam Clegg	2022-05-04	2	-1/+94
\|
*	Remove externref (#4633)	Thomas Lively	2022-05-04	32	-425/+380
\| \| \| \| \| \|	Remove `Type::externref` and `HeapType::ext` and replace them with uses of anyref and any, respectively, now that we have unified these types in the GC proposal. For backwards compatibility, continue to parse `extern` and `externref` and maintain their relevant C API functions.
*	Replace 64K sparse matrix testcase with 8K (#4635)	Alon Zakai	2022-05-03	3	-2/+2
\| \| \| \|	Helps #4632: This makes it take 4 seconds instead of 5 minutes.
*	Update nominal type ordering (#4631)	Thomas Lively	2022-05-03	12	-54/+86
\| \| \| \| \| \|	V8 requires that supertypes come before subtypes when it parses isorecursive (i.e. standards-track) type definitions. Since 2268f2a we are emitting nominal types using the standard isorecursive format, so respect the ordering requirement.
*	Handle call.without.effects in RemoveUnusedModuleElements (#4624)	Alon Zakai	2022-05-02	1	-0/+112
\|	We assume a closed world atm in the GC space, but the call.without.effects intrinsic sort of breaks that: that intrinsic looks like an import, but we really need to care about what is sent to it even in a closed world:
\|
\|	  (call $call-without-effects
\|	    (ref.func $target-keep)
\|	  )
\|
\|	That reference cannot be ignored, as logically it is called just as if there were a call_ref there. This adds support for that, fixing the combination of #4621 and using call.without.effects.
\|
\|	Also flip the vector of ref.func names to a set. I realized that in a very large program we might see the same name many times.
*	Update the type section binary format (#4625)	Thomas Lively	2022-05-02	1	-0/+36
\| \| \| \| \| \| \| \| \| \|	Print subtype declarations using the standards-track format with a vector of supertypes followed by a normal type declaration rather than our interim nominal format that used alternative versions of the func, struct, and array forms. Desugar the nominal format to additionally emit all the types into a single large recursion group. Currently V8 is performing this desugaring, but after this change and a future change that fixes the order of nominal types to ensure supertypes precede subtypes, it will no longer need to.
*	Lift the restriction in liveness-traversal.h that supported max 65535 locals in a function. (#4567)	juj	2022-04-28	5	-1/+56
\|	* Lift the restriction in liveness-traversal.h that supported max 65535 locals in a function.
\|	* Lint
\|	* Fix typo
\|	* Fix static
\|	* Lint
\|	* Lint
\|	* Lint
\|	* Add needed canRun function
\|	* lint
\|	* Use either a sparse or a dense matrix for tracking liveness copies, depending on the locals count.
\|	* Lint
\|	* Fix lint
\|	* Lint
\|	* Implement sparse_square_matrix class and use that as a backing.
\|	* Lint
\|	* Lint
\|	* Lint #includes
\|	* Lint
\|	* Lint includes
\|	* Remove unnecessary code
\|	* Fix canonical accesses to copies matrix
\|	* Lint
\|	* Add missing variable update
\|	* Remove canRun() function
\|	* Address review
\|	* Update expected test results
\|	* Update test name
\|	* Add asserts to sparse_square_matrix set and get functions that they are not out of bound.
\|	* Lint includes
\|	* Update test expectation
\|	* Use .clear() + .resize() to reset totalCopies vector
*	RemoveUnusedModuleElements: Track CallRef/RefFunc more precisely (#4621)	Alon Zakai	2022-04-28	3	-22/+226
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If we see (ref.func $foo) that does not mean that $foo is reachable - we must also see a (call_ref ..) of the proper type. Only after seeing both should we mark the function as reachable, which this PR does. This adds some complexity as we need to track intermediate state as we go, since we could see the RefFunc before the CallRef or vice versa. We also need to handle the case of a RefFunc without a CallRef properly: We cannot remove the function, as the RefFunc must refer to it, but at least we can empty out the body since we know it is never reached. This removes an old wasm-opt test which is now superseded by a new lit test. On J2Wasm output this removes 3% of all functions, which account for 2.5% of total code size.
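The "reachable only after seeing both a RefFunc and a matching CallRef" bookkeeping can be sketched as below. This is not the pass's real implementation: `Analyzer` and its members are hypothetical names, functions and signature types are plain strings, and subtyping between signature types is ignored for brevity.

```cpp
#include <cassert>
#include <map>
#include <set>
#include <string>

// Hypothetical sketch of the intermediate state described above.
struct Analyzer {
  // Functions known to be reachable.
  std::set<std::string> reachable;
  // Signature types for which a call_ref has been seen.
  std::set<std::string> calledTypes;
  // Functions seen in a ref.func whose type has no call_ref yet, by type.
  std::map<std::string, std::set<std::string>> uncalledRefFuncs;

  void noteRefFunc(const std::string& func, const std::string& type) {
    if (calledTypes.count(type)) {
      // A call_ref of this type was already seen: the function can be called.
      reachable.insert(func);
    } else {
      // Remember it in case a matching call_ref shows up later.
      uncalledRefFuncs[type].insert(func);
    }
  }

  void noteCallRef(const std::string& type) {
    calledTypes.insert(type);
    auto it = uncalledRefFuncs.find(type);
    if (it != uncalledRefFuncs.end()) {
      // Every pending ref.func of this type is now reachable.
      reachable.insert(it->second.begin(), it->second.end());
      uncalledRefFuncs.erase(it);
    }
  }
};
```

Anything still left in `uncalledRefFuncs` at the end corresponds to the case in the message where the function must be kept, because a ref.func names it, but its body can be emptied out.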
*	OptimizeInstructions: Refinalize after a cast removal (#4611)	Alon Zakai	2022-04-25	1	-3/+55
\| \| \| \| \| \| \| \| \|	Casts can replace a type with a subtype, which normally has no downsides, but in a corner case of struct types it can lead to us needing to refinalize higher up too, see details in the comment. We have avoided any Refinalize calls in OptimizeInstructions, but the case handled here requires it sadly. I considered moving it to another pass, but this is a peephole optimization so there isn't really a better place.
*	[NominalFuzzing] SignatureRefining: Ignore exported functions (#4601)	Alon Zakai	2022-04-22	1	-0/+31
\| \| \|	This hits the fuzzer when it tries to call reference exports with a null.
*	[NominalFuzzing] Fix getHeapTypeCounts() on unreachable casts (#4609)	Alon Zakai	2022-04-22	1	-0/+24
\| \| \| \| \| \| \| \| \| \| \|	The cast instruction may be unreachable but the intended type for the cast still needs to be collected. Otherwise we end up with problems both during optimizations that look at heap types and in printing (which will use the heap type in code but not declare it). Diff without whitespace is much smaller: this just moves code around so that we can use a template to avoid code duplication. The actual change is just to scan ->intendedType unconditionally, and not ignore it if the cast is unreachable.
*	[NominalFuzzing] GTO: trap on null ref in removed struct.set (#4607)	Alon Zakai	2022-04-21	1	-3/+9
\|	When a field has no reads, we remove all its writes, but we did this:
\|
\|	  (struct.set $foo A B)
\|	  =>
\|	  (drop A) (drop B)
\|
\|	We also need to trap if A, the reference, is null, which this PR fixes:
\|
\|	  (struct.set $foo A B)
\|	  =>
\|	  (drop (ref.as_non_null A)) (drop B)
*	[NominalFuzzing] MergeSimilarFunctions: handle nominal types properly (#4602)	Alon Zakai	2022-04-21	1	-0/+323
\| \| \| \| \| \|	This fixes two bugs: First, we need to compare the nominal types of function constants when looking for constants to "merge", not just their structure. Second, when creating the new function we must use the proper type of those constants, and not just another type.
*	Rename asyncify-side-module to asyncify-relocatable (#4596)	かめのこにょこにょこ	2022-04-18	1	-1/+1
\|	Related: emscripten-core/emscripten#15893 (comment)
\|
\|	The --pass-arg=asyncify-side-module option will be used not only from side modules, but also from main modules.
*	Revert "Re-enable a previously flaky type test (#4582)" (#4591)	Thomas Lively	2022-04-11	1	-1/+1
\| \| \|	This reverts commit 40a998c00eb42b65ddc1d42c1c009690bbd05cca.
*	Implement relaxed SIMD dot product instructions (#4586)	Thomas Lively	2022-04-11	1	-0/+112
\| \| \|	As proposed in https://github.com/WebAssembly/relaxed-simd/issues/52.
*	[Inlining] Preserve return_calls when possible (#4589)	Thomas Lively	2022-04-11	1	-0/+62
\| \| \| \| \| \| \| \| \|	We can preserve return_calls in inlined functions when the inlined call site is itself a return_call, since the call result types must transitively match in that case. This solves a problem where the previous inlining logic could introduce stack exhaustion by downgrading recursive return_calls to normal calls. Fixes #4587.
*	[SIMD] Make swizzle's opcode name consistent (NFC) (#4585)	Heejin Ahn	2022-04-09	1	-1/+1
\| \| \| \|	Other opcode names end with `Inxm` or `Fnxm` (where n and m are integers), while `i8x16.swizzle`'s opcode name doesn't have an `I` in there.
*	Implement i16x8.relaxed_q15mulr_s (#4583)	Thomas Lively	2022-04-07	1	-0/+26
\| \| \|	As proposed in https://github.com/WebAssembly/relaxed-simd/issues/40.
*	Re-enable a previously flaky type test (#4582)	Thomas Lively	2022-04-05	1	-1/+1
\| \| \| \| \|	I don't know what exactly was causing this test to flake, but since it was disabled we added the type fuzzer and fixed a lot of bugs, so I hope it is no longer flaky. If that turns out to be wrong, I can dig deeper.
*	Fix MemoryPacking bug (#4579)	Thomas Lively	2022-04-05	1	-0/+26
\| \| \| \| \| \| \| \|	247f4c20a1 introduced a bug that caused expressions that refer to data segments to be associated with the wrong segments in the presence of other segments that have no referring expressions at all. Fixes #4569. Fixes #4571.
*	[Wasm GC] Fix unreachable local.gets of non-nullable locals in CoalesceLocals (#4574)	Alon Zakai	2022-04-05	2	-1/+26
\| \| \| \| \| \| \| \|	Normally we just replace unreachable local.gets with a constant (0, or null), but if the local is non-nullable we can't do that. Fixes #4573
*	Use LiteralUtils::canMakeZero before calling makeZero (#4568)	Alon Zakai	2022-04-01	2	-4/+91
\|	Fixes #4562
\|	Fixes #4564
*	Port memory-packing tests to lit (#4559)	Thomas Lively	2022-04-01	6	-2146/+2286
\|
*	[NFC] Refactor Feature::All to match FeatureSet.setAll() (#4557)	Alon Zakai	2022-03-31	2	-2/+2
\| \| \| \| \| \| \| \| \| \| \|	As we recently noted in #4555, the fact that Feature::All and FeatureSet.setAll() are different is potentially confusing. I think the best thing is to make them identical. This does that, and adds a new Feature::AllPossible which is everything possible and not just the set of all features that are enabled by -all. This undoes part of #4555 as now the old/simpler code works properly.
*	[Wasm GC] Fix stacky non-nullable tuples (#4561)	Alon Zakai	2022-03-31	2	-0/+112
\| \| \| \| \|	#4555 fixed validation for such tuples, but we also did not handle them in "stacky" code using pops etc., due to a logic bug in the binary reading code.