forks/binaryen.git -

	Commit message (Collapse)	Author	Age	Files	Lines
*	Fuzzer: Add support for creating structs and arrays in makeConst (#4707)	Alon Zakai	2022-06-01	1	-8/+20
\| \| \| \| \| \| \| \|	#4659 adds a testcase with an import of (ref $struct). This could cause an error in the fuzzer, since it wants to remove imports (because the various fuzzers cannot pass in custom imports - they want to just run the wasm). When it tries to remove that import it tries to create a constant for a struct reference, and fails. To fix that, add enough support to create structs and arrays at least in the simple case where all their fields are defaultable.
*	Fuzzer: Refactor makeConst into separate functions [NFC] (#4709)	Alon Zakai	2022-06-01	2	-85/+118
\| \| \|	This just moves code around + adds assertions.
*	[Parser] Token classification (#4699)	Thomas Lively	2022-06-01	3	-30/+704
\| \| \| \| \| \| \| \| \| \| \| \|	Add methods to `Token` for determining whether the token can be interpreted as a particular token type, returning the interpreted value as appropriate. These methods perform additional bounds checks for integers and NaN payloads that could not be done during the initial lexing because the lexer did not know what the intended token type was. The float methods also reinterpret integer tokens as floating point tokens since the float grammar is a superset of the integer grammar and inject the NaN payloads into parsed NaN values. Move all bounds checking to these new classifier functions to have it in one place.
*	Global Struct Inference pass: Infer two constants in struct.get (#4659)	Alon Zakai	2022-06-01	8	-0/+1059
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This optimizes constants in the megamorphic case of two: when we know two function references are possible, we could in theory emit this: (select (ref.func A) (ref.func B) (ref.eq (..ref value..) ;; globally, only 2 things are possible here, and one has ;; ref.func A as its value, and the other ref.func B (ref.func A)) That is, compare to one of the values, and emit the two possible values there. Other optimizations can then turn a call_ref on this select into an if over two direct calls, leading to devirtualization. We cannot compare a ref.func directly (since function references are not comparable), and so instead we look at immutable global structs. If we find a struct type that has only two possible values in some field, and the structs are in immutable globals (which happens in the vtable case in j2wasm for example), then we can compare the references of the struct to decide between the two values in the field.
*	[NFC] Refactor EHUtils::findPops() method (#4704)	Alon Zakai	2022-06-01	3	-25/+53
\| \| \| \|	This moves it out of the validator so it can be used elsewhere. It will be used in #4685
*	[WasmGC] Fix try-catch and non-nullable local sets in values in ↵	Alon Zakai	2022-05-31	2	-6/+85
\| \| \| \| \| \| \| \| \|	SimplifyLocals (#4705) Followup to #4703, this also handles the case where there is a non- nullable local.set in the value of a nullable one, which we also cannot optimize. Fixes #4702
*	Updating spec test readme (#4706)	Ashley Nelson	2022-05-31	2	-113/+6
\| \| \| \| \|	* Updating command to run spec tests * Delete run.py
*	[WasmGC] Fix non-nullable locals and try-catch in SimplifyLocals (#4703)	Alon Zakai	2022-05-31	2	-3/+128
\| \| \| \| \| \| \|	Binaryen will not change dominance in SimplifyLocals, however, the current spec's notion of dominance is simpler than ours, and we must not optimize certain cases in order to still validate. See details in the comment and test. Helps #4702
*	wasm-emscripten-finalize: Improve detection of mainReadsParams (#4701)	Sam Clegg	2022-05-31	3	-8/+12
\| \| \| \| \| \|	The first way to should detect this is if the main function actually doesn't take any params. They we fallback to looking deeper. In preparation for https://reviews.llvm.org/D75277
*	Remove renameMainArgcArgv from wasm-emscripten-finalize (#4700)	Sam Clegg	2022-05-31	3	-24/+0
\| \| \| \| \| \| \| \| \|	This part to finalize is currently not used and was added in preparation for https://reviews.llvm.org/D75277. However, the better solution to dealing with this alternative name for main is on the emscripten side. The main reason for this is that doing the rename here in binaryen would require finalize to always re-write the binary, which is expensive.
*	[Parser] Replace Signedness with ternary Sign (#4698)	Thomas Lively	2022-05-27	3	-67/+65
\| \| \| \| \| \| \| \|	Previously we were tracking whether integer tokens were signed but we did not differentiate between positive and negative signs. Unfortunately, without differentiating them, there's no way to tell the difference between an in-bounds negative integer and a wildly out-of-bounds positive integer when trying to perform bounds checks for s32 tokens. Fix the problem by tracking not only whether there is a sign on an integer token, but also what the sign is.
*	[Parser][NFC] Improve comments about default NaN payloads (#4697)	Thomas Lively	2022-05-27	2	-0/+7
\|
*	[Parser][NFC] Remove extraneous braces from std::optional returns (#4696)	Thomas Lively	2022-05-27	1	-9/+9
\|
*	[Parser][NFC] Create a public wat-lexer.h header (#4695)	Thomas Lively	2022-05-27	5	-257/+317
\| \| \| \| \| \|	wat-parser-internal.h was already quite large after implementing just the lexer, so it made sense to rename it to be lexer-specific and start a new file for the higher-level parser. Also make it a proper .cpp file and split the testable interface out into wat-lexer.h.
*	OptimizeInstructions: Turn call_ref of a select into an if over two direct ↵	Alon Zakai	2022-05-27	5	-101/+355
\| \| \| \| \| \|	calls (#4660) This extends the existing call_indirect code to do the same for call_ref, basically. The shared code is added to a new helper utility.
*	Fuzzer: Pass v8 flag for extended-const (#4543)	Alon Zakai	2022-05-27	1	-1/+2
\| \| \| \| \|	This flag is needed now that we have testcases using extended-const in our test suite, as the fuzzer will grab random testcases, modify them, and run them through v8.
*	[EH] Export tags (#4691)	Heejin Ahn	2022-05-26	6	-1/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds exported tags to `exports` section in wasm-emscripten-finalize metadata so Emscripten can use it. Also fixes a bug in the parser. We have only recognized the export format of ```wasm (tag $e2 (param f32)) (export "e2" (tag $e2)) ``` and ignored this format: ```wasm (tag $e1 (export "e1") (param i32)) ``` Companion patch: https://github.com/emscripten-core/emscripten/pull/17064
*	[Parser][NFC] Clarify escaped string lexing (#4694)	Thomas Lively	2022-05-26	1	-24/+27
\| \| \| \|	Improve comments and variable names to make it clear that we allocate and build a separate string only when necessary to handle escape sequences.
*	[Parser] Lex floating point values (#4693)	Thomas Lively	2022-05-26	2	-29/+721
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Rather than trying to actually implement the parsing of float values, which cannot be done naively due to precision concerns, just parse the float grammar then postprocess the parsed text into a form we can pass to `strtod` to do the actual parsing of the value. Since the float grammar reuses `num` and `hexnum` from the integer grammar but does not care about overflow, add a mode to `LexIntCtx`, `num`, and `hexnum` to allow parsing overflowing numbers. For NaNs, store the payload as a separate value rather than as part of the parsed double. The payload will be injected into the NaN at a higher level of the parser once we know whether we are parsing an f64 or an f32 and therefore know what the allowable payload values are.
*	[Parser] Lex keywords (#4688)	Thomas Lively	2022-05-25	2	-1/+65
\| \| \| \| \|	Also include reserved words that look like keywords to avoid having to find and enumerate all the valid keywords. Invalid keywords will be rejected at a higher level in the parser instead.
*	[Parser] Lex strings (#4687)	Thomas Lively	2022-05-25	2	-1/+266
\|
*	[Parser] Lex idchar and identifiers (#4686)	Thomas Lively	2022-05-25	2	-3/+103
\|
*	[Wasm GC] Fix CFG traversal of call_ref and add missing validation check (#4690)	Alon Zakai	2022-05-25	5	-1/+94
\| \| \| \| \| \| \| \|	We were missing CallRef in the CFG traversal code in a place where we note possible exceptions. As a result we thought CallRef cannot throw, and were missing some control flow edges. To actually detect the problem, we need to validate non-nullable locals properly, which we were not doing. This adds that as well.
*	[Parser] Start a new text format parser (#4680)	Thomas Lively	2022-05-24	3	-0/+907
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Begin implementing a new text format parser that will accept the standard text format. Start with a lexer that can iterate over tokens in an underlying text buffer. The initial supported tokens are integers, parentheses, and whitespace including comments. The implementation is in a new private internal header so it can be included into a gtest source file even though it is not meant to be a public API. Once the parser is more complete, there will be an additional public header exposing a more concise public API and the private header will be included into a source file that implements that public API. The new parser will improve on the existing text format parser not only because it will accept the full standard text format, but also because its code will be simpler and easier to maintain and because it will hopefully be faster as well. The new parser will be built out of small functions that closely mirror the grammar productions given in the spec and will heavily use C++17 features like string_view, optional, and variant to provide more self-documenting and efficient code. Future PRs will add support for lexing other kinds of tokens followed by support for parsing more complex constructs.
*	[Fuzzer] Ignore relaxed-simd test for initial contents (#4683)	Alon Zakai	2022-05-24	1	-0/+8
\| \| \| \|	If we use it as initial contents, we will try to execute it, and hit the TODOs in the interpreter for unimplemented parts.
*	Updating wasm-opt example to an available pass (#4684)	Ashley Nelson	2022-05-24	1	-7/+6
\| \| \|	The wasm-opt example was using a test pass that was deleted. Updating to use one of the test passes in test/lit/passes.
*	Add C and JS API functions for accessing memory info (#4682)	Jackson Gardner	2022-05-24	5	-2/+101
\| \| \|	Based on #3573 plus minor fixes
*	[Nominal Fuzzing] Fix SignatureRefining by updating types fully at once (#4665)	Alon Zakai	2022-05-23	4	-20/+112
\| \| \| \| \| \| \| \| \| \|	Optionally avoid updating types in TypeUpdating::updateParamTypes(). That update is incomplete if the function signature is also changing, which is the case in SignatureRefining (but not DeadArgumentElimination). "Incomplete" means that we updated the local.get type, but the function signature does not match yet. That incomplete state can hit an internal error in GlobalTypeRewriter::updateSignatures where it updates types. To avoid that, do the entire full update only there (in GlobalTypeRewriter::updateSignatures).
*	Fix binary parsing of the prototype nominal format (#4679)	Thomas Lively	2022-05-19	2	-16/+13
\| \| \| \| \| \|	We were checking that nominal modules only had a single element in their type sections, but that's not correct for the prototype nominal binary format we still want to support. The test for this missed catching the bug because it wasn't actually parsing in nominal mode.
*	[NFC] Deduplicate imported global parsing (#4678)	Thomas Lively	2022-05-19	1	-23/+7
\| \| \| \| \| \| \| \| \| \|	Share the logic for parsing imported and non-imported globals of the formats: (import "module" "base" (global $name? type)) (global $name? type init) This fixes #4676, since the deleted logic for parsing imported globals did not handle parsing GC types correctly.
*	Validator: Check features for ref.null's type (#4677)	Alon Zakai	2022-05-18	3	-0/+26
\|
*	[GC Fuzzing] Avoid non-nullable eqref without GC (#4675)	Alon Zakai	2022-05-18	2	-30/+56
\| \| \| \| \| \|	With only reference types but not GC, we cannot easily create a constant for eqref for example. Only GC adds i31.new etc. To avoid assertions in the fuzzer, avoid randomly picking (ref eq) etc., that is, keep it nullable so that we can emit a (ref.null eq) if we need a constant value of that type.
*	Update README.md (#4674)	Ashley Nelson	2022-05-18	1	-1/+1
\| \| \|	Removing the command prompt so copy/paste is easier.
*	wasm-reduce: Fix order in shrinkByReduction call (#4673)	Alon Zakai	2022-05-17	1	-1/+4
\| \| \| \| \| \|	The old code would short-circuit and not do anything after we managed any reduction in the loop here. That would end up doing entire iterations of the whole pipeline before removing another element segment, which could be slow.
*	Fuzzer: Show a clear error if a given wasm fails to run (#4672)	Alon Zakai	2022-05-17	1	-1/+5
\| \| \| \| \| \|	Without this the error from a bad given wasm file - either by the user, or during a reduction where smaller wasms are given - could be very confusing. A bad given wasm file during reduction, in particular, indicates a bug in the reducer most likely.
*	[NFC] Remove unneeded include (#4670)	Alon Zakai	2022-05-17	1	-1/+0
\|
*	[NFC] Make InsertOrderedMap::insert's param const (#4669)	Alon Zakai	2022-05-17	1	-1/+1
\| \| \| \| \|	Being a const reference allows writing insert({a, b}), which will be useful in a future PR, and there is no reason to actually update the reference.
*	[NFC] Make PossibleConstantValues::dump() a const function (#4671)	Alon Zakai	2022-05-17	1	-1/+1
\|
*	[Fuzzer] Reduce trap probability in function ref fallback code (#4653)	Alon Zakai	2022-05-16	1	-10/+15
\| \| \| \| \| \|	Also improve comments. As suggested in #4647
*	Allow TypeBuilder::grow to take 0 as an argument (#4668)	Thomas Lively	2022-05-16	2	-1/+3
\| \| \| \| \| \| \|	There's no reason not to allow growing by zero slots, but previously doing so would trigger an assertion. This caused a crash when roundtripping a trivial module. Fixes #4667.
*	Ensure symmetric results in PossibleConstantValues (#4662)	Alon Zakai	2022-05-13	2	-20/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously we could return different results depending on the order we noted things: note(anyref.null); note(funcref.null); get() => anyref.null note(funcref.null); note(anyref.null); get() => funcref.null This is correct, as nulls are equal anyhow, and any could be used in the location we are optimizing. However, it can lead to nondeterminism if the caller's order of notes is nondeterministic. That is the case in DeadArgumentElimination, where we scan functions in parallel, then merge them without special ordering. To fix this, make the note operation symmetric. That seems simplest and least likely to be confusing. We can use the LUB to do that. To avoid duplicating the null logic, refactor note() to use combine().
*	Followups to #3841 (#3842)	Alon Zakai	2022-05-13	1	-6/+0
\| \| \| \| \|	If we don't think that preventing copies in assignment makes sense by itself (since we allow them on construction) then I think we can just remove the restriction and also the implicit copy constructor.
*	Add SubTypes::getAllSubTypes variant which includes the type itself (#4649)	Alon Zakai	2022-05-13	5	-17/+46
\| \| \| \|	This also includes the type itself in the returned vector. This will be useful in a future PR.
*	[NFC] Make Literal::makeNull take a HeapType (#4664)	Alon Zakai	2022-05-13	4	-8/+7
\| \| \| \| \| \| \| \|	Taking a Type is redundant as we only care about the heap type - the nullability must be Nullable. This avoids needing an assertion in the function, that is, it makes the API more type-safe.
*	Make RefCast safe by default (#4663)	Thomas Lively	2022-05-12	1	-1/+1
\| \| \| \|	This prevents new `RefCast` expressions that don't explicitly have their safety set from getting an unitialized safety value.
*	Costs: Increase cost of casts (#4661)	Alon Zakai	2022-05-12	3	-13/+136
\| \| \| \| \|	Casts involve branches in the VM, so adding a cast in return for removing a branch (like If=>Select) is not beneficial. We don't want to ever do any more casts than we already are.
*	Release 108 (#4658)	Alon Zakai	2022-05-11	2	-1/+8
\|
*	Add ref.cast_nop_static (#4656)	Thomas Lively	2022-05-11	11	-9/+63
\| \| \| \| \| \|	This unsafe experimental instruction is semantically equivalent to ref.cast_static, but V8 will unsafely turn it into a nop. This is meant to help us measure cast overhead more precisely than we can by globally turning all casts into nops.
*	[NominalFuzzing] Fix SignaturePruning on types with a super (#4657)	Alon Zakai	2022-05-11	2	-0/+27
\| \| \| \| \| \| \|	Do not prune parameters if there is a supertype that is a signature. Without this we crash on an assertion in TypeBuilder when we try to recreate the types (as we try to make a a subtype with fewer fields than the super).
*	Fix setting of stack size when building with clang for windows (#4654)	TheGreatRambler	2022-05-10	1	-1/+5
\|