forks/binaryen.git -

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Fix CFGWalker issue in single-threaded mode (#6573)	许鑫权	2024-05-09	1	-0/+2
\| \| \| \| \|	In that mode a walk on an entire module will reuse the same CFGWalker instance, so we must manually clear some fields, and we forgot some before.
*	Fuzzer: Stop emitting nullable stringviews (#6574)	Alon Zakai	2024-05-08	2	-5/+26
\| \| \| \| \| \| \| \| \| \| \| \| \|	As of https://chromium-review.googlesource.com/c/v8/v8/+/5471674 V8 requires stringviews to be non-nullable. It might be possible to make that change in our IR, or to remove views entirely, but for now this PR makes the fuzzer stop emitting nullable stringviews as a workaround to allow us to fuzz current V8. There are still rare corner cases where this pattern is emitted, that we have not tracked down, and so this also makes the fuzzer ignore the error for now.
*	wasm-split: Handle RefFuncs (#6513)	Alon Zakai	2024-05-08	1	-8/+85
\| \| \| \| \|	When we have a ref.func that refers to the secondary module then make a trampoline that calls it directly. The trampoline's call is then fixed up like all direct calls to the secondary module.
*	Allow DWARF and multivalue together (#6570)	Heejin Ahn	2024-05-06	1	-9/+29
\| \| \| \| \| \| \| \| \|	This allows writing of binaries with DWARF info when multivalue is enabled. Currently we just crash when both are enabled together. This just assumes, unless we have run DWARF-invalidating passes, all locals added for tuples or scratch locals would have been added at the end of the local list, so just printing all locals in order would preserve the DWARF info. Tuple locals are expanded in place and scratch locals are added at the end.
*	Source map fixes (#6550)	Jérôme Vouillon	2024-05-02	6	-25/+44
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* Keep debug locations at function start The `fn_prolog_epilog.debugInfo` test is failing otherwise, since there was debug information associated to the nop instruction at the beginning of the function. * Do not clear the debug information when reaching the end of the source map The last segment should extend to the end of the function. * Propagate debug location from the function prolog to its first instruction * Fix printing of epilogue location The text parser no longer propagates locations to the epilogue, so we should always print the location if there is one. * Fix debug location smearing The debug location of the last instruction should not smear into the function epilogue, and a debug location from a previous function should not smear into the prologue of the current function.
*	Respect the Web limitation on Table size (#6567)	Alon Zakai	2024-05-01	6	-9/+43
\| \| \| \| \|	Without this the fuzzer can error on differences in behavior between V8 and us. Also move the limitations constants to their own header.
*	[StackIR] Support source maps and DWARF with StackIR (#6564)	Alon Zakai	2024-05-01	3	-5/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Helping #6509, this fixes debugging support for StackIR, which makes it more possible to use StackIR in more places. The fix is basically just to pass around some more state, and then to call the parent with "please write debug info" at the correct times, mirroring the similar calls in BinaryenIRWriter. The relevant Emscripten tests pass, and the source map test modified here produces identical output in StackIR and non-StackIR modes (the test is also simplified to remove --new-wat-parser which is no longer needed, after which the test can clearly show that StackIR has the same output as BinaryenIR).
*	J2CLOpts: Add "precompute" and "remove-unused-brs" as additional cleanup	Roberto Lublinerman	2024-04-30	1	-0/+2
\| \| \| \|	This makes the cleanup of bodies of functions that have had constants hoisted from them more effective.
*	[Strings] Limit string allocations like we do arrays (#6562)	Alon Zakai	2024-04-29	1	-3/+8
\| \| \| \| \| \| \| \|	When we concat strings, check if their length exceeds a reasonable limit. (We do not need to do this for string.new as that reads from an array, which is already properly limited.) This avoids very slow pauses in the fuzzer (that sometimes OOM).
*	[Strings] wasm-ctor-eval: Stop on seeing a string view, which we cannot ↵	Alon Zakai	2024-04-29	1	-0/+8
\| \| \| \|	precompute (#6561)
*	[Parser] Re-use blocks instead of wrapping where possible (#6552)	Thomas Lively	2024-04-29	1	-6/+11
\| \| \| \| \| \| \|	When the input has branches to block scope, IR builder generally has to add a wrapper block with a label name for the branch to target. To reduce the parsed IR size, add a special case for when the wrapped expression is already an unnamed block. In that case we can simply add the label to the existing block instead of creating a new wrapper block.
*	[Strings] Work around ref.cast not working on string views, and add fuzzing ↵	Alon Zakai	2024-04-29	3	-4/+52
\| \| \| \| \| \| \| \| \| \| \| \| \|	(#6549) As suggested in #6434 (comment) , lower ref.cast of string views to ref.as_non_null in binary writing. It is a simple hack that avoids the problem of V8 not allowing them to be cast. Add fuzzing support for the last three core string operations, after which that problem becomes very frequent. Also add yet another makeTrappingRefUse that was missing in that fuzzer code.
*	[Parser][NFC] Clean up the lexer index/pos API (#6553)	Thomas Lively	2024-04-29	3	-32/+30
\| \| \| \| \|	The lexer previously had both `getPos` and `getIndex` APIs that did different things, but after a recent refactoring there is no difference between the index and the position. Deduplicate the API surface.
*	[NFC] Use the new wat parser in RemoveNonJSOps (#6554)	Thomas Lively	2024-04-29	1	-5/+4
\|
*	Use the new wat parser in the C API (#6555)	Thomas Lively	2024-04-29	1	-7/+4
\|
*	Improve return validation (#6551)	Thomas Lively	2024-04-29	1	-10/+18
\| \| \| \|	Disallow returns from having any children, even unreachable children, in function that do not return any values.
*	Fix a bug with unreachable control flow in IRBuilder (#6558)	Jérôme Vouillon	2024-04-29	2	-14/+30
\| \| \| \| \| \| \| \| \| \| \| \|	When branches target control flow structures other than blocks or loops, the IRBuilder wraps those control flow structures with an extra block for the branches to target in Binaryen IR. When the control flow structure is unreachable because all its bodies are unreachable, the wrapper block may still need to have a non-unreachable type if it is targeted by branches. This is achieved by tracking whether the wrapper block will be targeted by any branches and use the control flow structure's original, non-unreachable type if so. However, this was not properly tracked when moving into the `else` branch of an `if` or the `catch`/`cath_all` handlers of a `try` block.
*	[jspi] - Support new version of JSPI for module splitting. (#6546)	Brendan Dahl	2024-04-29	1	-6/+20
\| \| \| \| \| \|	With the old version of JSPI, the JSPI pass was required to be run before splitting and would automatically add an export to be able to find the load_secondary_module function. Now that the pass is no longer needed, just add an import manually for the load_secondary_module function.
*	[Parser] Do not eagerly lex numbers (#6544)	Thomas Lively	2024-04-25	2	-293/+141
\| \| \| \|	Lex integers and floats on demand to avoid wasted work. Remove `Token` completely now that all kinds of tokens are lexed on demand.
*	[Parser] Do not eagerly lex strings (#6543)	Thomas Lively	2024-04-25	2	-49/+25
\| \| \|	Lex them on demand instead to avoid wasted work.
*	[Parser] Do not eagerly lex IDs (#6542)	Thomas Lively	2024-04-25	2	-43/+23
\| \| \|	Lex them on demand instead to avoid wasted work.
*	[Parser] Do not eagerly lex keywords (#6541)	Thomas Lively	2024-04-25	2	-85/+56
\| \| \|	Lex them on demand instead to avoid wasted work.
*	[Parser] Do not eagerly lex parens (#6540)	Thomas Lively	2024-04-25	3	-66/+36
\| \| \| \| \| \| \| \| \| \| \|	The lexer currently lexes tokens eagerly and stores them in a `Token` variant ahead of when they are actually requested by the parser. It is wasteful, however, to classify tokens before they are requested by the parser because it is likely that the next token will be precisely the kind the parser requests. The work of checking and rejecting other possible classifications ahead of time is not useful. To make incremental progress toward removing `Token` completely, lex parentheses on demand instead of eagerly.
*	[Strings] Fix effects of string.compare and add fuzzing (#6547)	Alon Zakai	2024-04-25	3	-12/+34
\| \| \| \| \| \| \| \|	We added string.compare late in the spec process, and forgot to add effects for it. Unlike string.eq, it can trap. Also use makeTrappingRefUse in recent fuzzer string generation places that I forgot, which should reduce the amount of traps in fuzzer output.
*	[Parser] Enable the new text parser by default (#6371)	Thomas Lively	2024-04-25	2	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The new text parser is faster and more standards compliant than the old text parser. Enable it by default in wasm-opt and update the tests to reflect the slightly different results it produces. Besides following the spec, the new parser differs from the old parser in that it: - Does not synthesize `loop` and `try` labels unnecessarily - Synthesizes different block names in some cases - Parses exports in a different order - Parses `nop`s instead of empty blocks for empty control flow arms - Does not support parsing Poppy IR - Produces different error messages - Cannot parse `pop` except as the first instruction inside a `catch`
*	GUFA: Handle bottom types in filterDataContents() (#6545)	Alon Zakai	2024-04-25	1	-1/+7
\| \| \| \| \| \|	Normally a bottom type cannot reach there, as we ignore unreachable GC operations early on. However, we can infer a bottom type later during the flow, so we need to handle that (just not error on it, and for clarity during debugging we also clear the contents).
*	[Strings] Fuzz string.encode (#6539)	Alon Zakai	2024-04-25	2	-15/+52
\| \| \| \| \| \| \|	A little trickier than the others due to the risk of trapping, which this handles like the other array operations. Also stop using immutable i16 arrays for string operations - only mutable ones work atm.
*	Do not add an extra null character when reading files (#6538)	Thomas Lively	2024-04-24	5	-11/+5
\| \| \| \| \| \| \| \|	The new wat parser currently considers itself to be at the end of the file whenever it cannot lex another token. This is not quite right, but fixing it causes parser errors because of the extra null character we were appending to files when we read them. This null character is not useful since we can already read files as `std::string`, which always has an implicit null character, so remove it. Clean up some users of `read_file` while we're at it.
*	Fuzzer: Update the typeLocals data structure before mutation (#6537)	Alon Zakai	2024-04-24	2	-2/+20
\| \| \| \| \|	Rather than compute the map of type to locals of that type once, at the start, also update it when relevant, as we can add more locals in some cases. This allows us to local.get from those late-added locals too.
*	[Strings] Implement string.measure_wtf16 in interpreter (#6535)	Alon Zakai	2024-04-24	1	-1/+1
\|
*	Add a flag to opt in to the old WAT parser (#6536)	Thomas Lively	2024-04-24	1	-0/+7
\| \| \| \| \|	This flag is intended to help users gracefully migrate to the new wat parser. It will be removed again not too long after the new wat parser is enabled by default in wasm-opt.
*	[Parser] Use the new parser in wasm-shell and wasm-as (#6529)	Thomas Lively	2024-04-24	6	-74/+136
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Updating just one or the other of these tools would cause the tests spec/import-after-.fail.wast to fail, since only the updated tool would correctly fail to parse its contents. To avoid this, update both tools at once. (The tests erroneously pass before this change because check.py does not ensure that .fail.wast tests fail, only that failing tests end in .fail.wast.) In wasm-shell, to minimize the diff, only use the new parser to parse modules and instructions. Continue using the legacy parsing based on s-expressions for the other wast commands. Updating the parsing of the other commands to use `Lexer` instead of `SExpressionParser` is left as future work. The boundary between the two parsing styles is somewhat hacky, but it is worth it to enable incremental development. Update the tests to fix incorrect wast rejected by the new parser. Many of the spec/old_ tests use non-standard forms from before Wasm MVP was standardized, so fixing them would have been onerous. All of these tests have non-old_* variants, so simply delete them.
*	[wasm-shell] Error on unknown commands (#6528)	Thomas Lively	2024-04-24	1	-0/+2
\| \| \| \| \| \| \| \| \|	We previously ignored unknown wast commands, which could lead to the mistaken impression that we were passing test cases that we were in fact not running at all. Clarify matters by having wasm-shell error out on unrecognized commands, and comment out all such commands in our versions of the spec test. As we work toward being able to run the upstream spec tests, having these unsupported commands explicitly commented out will make it easier to track progress toward full support.
*	[Strings] Fuzzer: Emit StringConcat (#6532)	Alon Zakai	2024-04-24	2	-67/+90
\| \| \|	Also refactor the code a little to make it easier to add this (mostly whitespace).
*	[Strings] Do not reuse mutable globals in StringGathering (#6531)	Alon Zakai	2024-04-24	1	-1/+2
\| \| \| \| \|	We were reusing mutable globals in StringGathering, which meant that we'd use a global to represent a particular string but if it was mutated then it could contain a different string during execution.
*	Fuzzer: Compare strings (#6530)	Alon Zakai	2024-04-24	1	-11/+15
\|
*	Source maps: Fix missing debug info in nested blocks (#6525)	Jérôme Vouillon	2024-04-24	1	-0/+1
\| \| \|	The special block nesting logic also needs to handle emitting debug info.
*	[Strings] Add the string heaptype to core fuzzer places (#6527)	Alon Zakai	2024-04-23	1	-20/+23
\| \| \| \| \| \| \|	With this we emit strings spontaneously (as opposed to just getting them from initial contents). The relevant -ttf test has been tweaked slightly to show the impact of this change: now there are some string.new/const in the output.
*	[Strings] Fuzz and interpret all relevant StringNew methods (#6526)	Alon Zakai	2024-04-23	3	-41/+94
\| \| \| \|	This adds fuzzing for string.new_wtf16_array and string.from_code_point. The latter was also missing interpreter support, which this adds.
*	[EH] Fix assumption that all throw_refs are created from rethrows (#6524)	Heejin Ahn	2024-04-24	1	-5/+7
\| \| \| \| \| \| \| \|	`shouldBeRef` incorrectly assumed that all `throw_ref`s within a `catch` body had been generated from `rethrow`s, which was not true, because `throw_ref`s are also created when translating `try`-`delegate`s: https://github.com/WebAssembly/binaryen/blob/219e668e87b012c0634043ed702534b8be31231f/src/passes/TranslateEH.cpp#L304 This fixes the assumption and changes `cast` to `dynCast`.
*	[EH] Fix missing outer block for catchless try (#6519)	Heejin Ahn	2024-04-24	1	-8/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When translating a `try` expression, we may need an 'outer' block that wraps the newly generated `try_table` so we can jump out of the expression when an exception does not occur. (The condition we use is when the `try` has any catches or if the `try` is a target of any inner `try-delegate`s: https://github.com/WebAssembly/binaryen/blob/219e668e87b012c0634043ed702534b8be31231f/src/passes/TranslateEH.cpp#L677) In case the `try` has either of `catch` or `delegate`, when we have the 'outer' block, we add the newly created `try_table` in the 'outer' block and replace the whole expression with the block: https://github.com/WebAssembly/binaryen/blob/219e668e87b012c0634043ed702534b8be31231f/src/passes/TranslateEH.cpp#L670 https://github.com/WebAssembly/binaryen/blob/219e668e87b012c0634043ed702534b8be31231f/src/passes/TranslateEH.cpp#L332-L340 But in case of a catchless `try`, we forgot to do that: https://github.com/WebAssembly/binaryen/blob/219e668e87b012c0634043ed702534b8be31231f/src/passes/TranslateEH.cpp#L388 So this PR fixes it.
*	Precompute: Ignore mutable arrays in StringNew (#6522)	Alon Zakai	2024-04-23	1	-0/+22
\| \| \| \| \|	All Struct/Array operations must ignore mutable fields in Precompute atm, which we did, but StringNew has an array variant that does an effective ArrayGet operation, which we didn't handle.
*	OptimizeInstructions: Optimize subsequent struct.sets after ↵	Alon Zakai	2024-04-23	1	-12/+21
\| \| \| \| \| \| \| \| \|	struct.new_with_default (#6523) Before we preferred not to add default values, as that increases code size. But since #6495 we turn more things into struct.new_with default, so it is important to handle this. It seems likely that in most cases the code size downside of adding default values is offset by avoiding a local.set later, so always do this (rather than add some kind of heuristic).
*	[EH] Fix delegating to caller when func result is concrete (#6518)	Heejin Ahn	2024-04-23	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	hen the function return type is concrete, the translation result of a `try`-`delegate` that targets the caller includes a `return` that returns the whole function body: https://github.com/WebAssembly/binaryen/blob/219e668e87b012c0634043ed702534b8be31231f/src/passes/TranslateEH.cpp#L751-L763 We should do that based on the function's return type, not the function body's return type. The previous code didn't handle the case where the function's return type is concrete but the function body's return type is unreachable.
*	[wasm-split] Do not split out functions referring to segments (#6517)	Thomas Lively	2024-04-23	1	-4/+62
\| \| \| \| \| \| \|	Since data and elem segments cannot be imported or exported, there is no way to access them from the secondary module, so functions that need to refer to them cannot be split out. Fixes #6512.
*	DebugLocationPropagation: pass debuglocation from parent node to chil… (#6500)	许鑫权	2024-04-21	5	-45/+106
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This PR creates a pass to propagate debug location from parent node to child nodes which has no debug location with pre-order traversal. This is useful for compilers that use Binaryen API to generate WebAssembly modules. It behaves like `wasm-opt` read text format file: children are tagged with the debug info of the parent, if they have no annotation of their own. For compilers that use Binaryen API to generate WebAssembly modules, it is a bit redundant to add debugInfo for each expression, Especially when the compiler wrap expressions. With this pass, compilers just need to add debugInfo for the parent node, which is more convenient. For example: ``` (drop (call $voidFunc) ) ``` Without this pass, if the compiler only adds debugInfo for the wrapped expression `drop`, the `call` expression has no corresponding source code mapping in DevTools debugging, which is obviously not user-friendly.
*	[Parser][NFC] Do less work when parsing function types (#6516)	Thomas Lively	2024-04-19	2	-3/+11
\| \| \| \| \| \| \|	After the initial parsing pass to find the locations of all the module elements and after the type definitions have been parsed, the next phase of parsing is to visit all of the module elements and parse their types. This phase does not require parsing function bodies, but it previously parsed entire functions anyway for simplicity. To improve performance, skip that useless work.
*	[Parser][NFC] Improve performance of idchar lexing (#6515)	Thomas Lively	2024-04-19	1	-30/+18
\| \| \| \| \|	The parsing of idchars was hot enough to show up while profiling the parsing of a very large module. Optimize it to speed up the overall parse by about 16% in a very unscientific measurement.
*	[Parser][NFC] Solve performance issue by adding maybeLabelidx (#6514)	Thomas Lively	2024-04-18	1	-7/+21
\| \| \| \| \| \| \| \| \| \| \| \| \|	Creating an error in the parser is an extremely expensive operation for very large files because it has to traverse the input buffer and count newlines to compute the error message. Despite that, there are a few places were we create errors just to discard them and continue parsing. The most notable of these places was where we parsed the list of label index immediates for the br_table instruction. The parser determined the end of the list by intercepting the error produced when trying to parse one more label index. Fix this significant performance problem causing parsing to be quadratic by introducing and using `maybeLabelidx`, which tries to parse a label index but does not produce an error if it fails.
*	[Strings] Fix finalize() of StringNew on arrays (#6511)	Alon Zakai	2024-04-18	1	-1/+3
\|