forks/binaryen.git -

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Fix incorrect wat in tests (#6207)	Thomas Lively	2024-01-08	15	-103/+101
\| \| \| \| \| \| \| \| \|	The new wat parser is much more strict than the legacy wat parser; the latter accepts all sorts of things that the spec does not allow. To ease an eventual transition to using the new wat parser by default, update the tests to use the standard text format in many places where they previously did not. We do not yet have a way to prevent new errors from being introduced into the test suite, but at least there will now be many fewer errors when it comes time to make the switch.
*	Require `then` and `else` with `if` (#6201)	Thomas Lively	2024-01-04	87	-6119/+12155
\| \| \| \| \| \| \| \| \| \| \| \|	We previously supported (and primarily used) a non-standard text format for conditionals in which the condition, if-true expression, and if-false expression were all simply s-expression children of the `if` expression. The standard text format, however, requires the use of `then` and `else` forms to introduce the if-true and if-false arms of the conditional. Update the legacy text parser to require the standard format and update all tests to match. Update the printer to print the standard format as well. The .wast and .wat test inputs were mechanically updated with this script: https://gist.github.com/tlively/85ae7f01f92f772241ec994c840ccbb1
*	Use the standard shared memory text format (#6200)	Thomas Lively	2024-01-03	24	-52/+48
\| \| \| \| \|	Update the legacy text parser and all tests to use the standard text format for shared memories, e.g. `(memory $m 1 1 shared)` rather than `(memory $m (shared 1 1))`. Also remove support for non-standard in-line "data" or "segment" declarations. This change makes the tests more compatible with the new text parser, which only supports the standard format.
*	Drop support for type annotations on array.len (#6197)	Thomas Lively	2024-01-03	1	-2/+2
\| \| \| \| \| \|	These type annotations were removed during the development of the GC proposal, but we maintained support for parsing them to ease the transition. Now that GC is shipped, remove support for the non-standard annotation and update our tests accordingly.
*	Drop support for non-standard quoted function names (#6188)	Thomas Lively	2023-12-20	22	-153/+151
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We previously supported a non-standard `(func "name" ...` syntax for declaring functions exported with the quoted name. Since that is not part of the standard text format, drop support for it, replacing it with the standard `(func $name (export "name") ...` syntax instead. Also replace our other usage of the quoted form in our text output, which was where we quoted names containing characters that are not allowed to appear in standard names. To handle that case, adjust our output from `"$name"` to `$"name"`, which is the standards-track way of supporting such names. Also fix how we detect non-standard name characters to match the spec. Update the lit test output generation script to account for these changes, including by making the `$` prefix on names mandatory. This causes the script to stop interpreting declarative element segments with the `(elem declare ...` syntax as being named "declare", so prevent our generated output from regressing by counting "declare" as a name in the script.
*	Add an arity immediate to tuple.extract (#6172)	Thomas Lively	2023-12-12	2	-6/+6
\| \| \| \| \| \| \| \|	Once support for tuple.extract lands in the new WAT parser, this arity immediate will let the parser determine how many values it should pop off the stack to serve as the tuple operand to `tuple.extract`. This will usually coincide with the arity of a tuple-producing instruction on top of the stack, but in the spirit of treating the input as a proper stack machine, it will not have to and the parser will still work correctly.
*	Add a `tuple.drop` text pseudoinstruction (#6170)	Thomas Lively	2023-12-12	2	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We previously overloaded `drop` to mean both normal drops of single values and also drops of tuple values. That works fine in the legacy text parser since it can infer parent-child relationships directly from the s-expression structure of the input, so it knows that a drop should drop an entire tuple if the tuple-producing instruction is a child of the drop. The new text parser, however, is much more like the binary parser in that it uses instruction types to create parent-child instructions. The new parser always assumes that `drop` is meant to drop just a single value because that's what it does in WebAssembly. Since we want to continue to let `Drop` IR expressions consume tuples, and since we will need a way to write tests for that IR pattern that work with the new parser, introduce a new pseudoinstruction, `tuple.drop`, to represent drops of tuples. This pseudoinstruction only exists in the text format and it parses to normal `Drop` expressions. `tuple.drop` takes the arity of its operand as an immediate, which will let the new parser parse it correctly in the future.
*	Update `tuple.make` text format to include arity (#6169)	Thomas Lively	2023-12-12	10	-24/+24
\| \| \| \| \| \| \| \| \| \|	Previously, the number of tuple elements was inferred from the number of s-expression children of the `tuple.make` expression, but that scheme would not work in the new wat parser, where s-expressions are optional and cannot be semantically meaningful. Update the text format to take the number of tuple elements (i.e. the tuple arity) as an immediate. This new format will be able to be implemented in the new parser as follow-on work.
*	Fuzzer: Better handling of globals from initial content (#6072)	Alon Zakai	2023-11-01	1	-45/+32
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously the fuzzer never added gets or sets of globals from initial content. That was an oversight, I'm pretty sure - it's just that the code that sets up the lists from which we pick globals for gets and sets was in another place. That is, any globals in the initial content file were never used in new random code the fuzzer generates (only new globals the fuzzer generated were used there). This PR allows us to use those globals, but also ignores them with some probability, to avoid breaking patterns like "once" globals (that we want to only be used from initial content, at least much of the time). Also simplify the code here: we don't need isInvalidGlobal just to handle the hang limit global, which is already handled by not being added to the lists we pick names from anyhow.
*	Partially revert #6026 (#6043)	Alon Zakai	2023-10-23	1	-3/+3
\| \| \|	That optimization uncovered some LLVM and Binaryen bugs.
*	RemoveUnusedModuleElements: Make exports skip trampolines (#6026)	Alon Zakai	2023-10-19	1	-3/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If we export a function that just calls another function, we can export that one instead. Then the one in the middle may be unused, function foo() { return bar(); } export foo; // can be an export of bar This saves a few bytes in rare cases, but probably more important is that it saves the trampoline, so if this is on a hot path, we save a call. Context: emscripten-core/emscripten#20478 (comment) In general this is not needed as inlining helps us out by inlining foo() into the caller (since foo is tiny, that always ends up happening). But exports are a case the inliner cannot handle, so we do it here.
*	Reuse existing function types for blocks (#6022)	Thomas Lively	2023-10-18	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Type annotations on multivalue blocks (and loops, ifs, and trys) are type indices that refer to function types in the type section. For these type annotations, the identities of the function types does not matter. As long as the referenced type has the correct parameters and results, it will be valid to use. Previously, when collecting module types, we always used the "default" function type for multivalue control flow, i.e. we used a final function type with no supertypes in a singleton rec group. However, in cases where the program already contains another function type with the expected signature, using the default type is unnecessary and bloats the type section. Update the type collecting code to reuse existing function types for multivalue control flow where possible rather than unconditionally adding the default function type. Similarly, update the binary writer to use the first heap type with the required signature when emitting annotations on multivalue control flow structures. To make this all testable, update the printer to print the type annotations as well, rather than just the result types. Since the parser was not able to parse those newly emitted type annotations, update the parser as well.
*	Fuzzer: Allow non-nullable locals (#6019)	Alon Zakai	2023-10-18	1	-42/+41
\| \| \| \| \| \| \|	Remove the code that avoided such locals. To avoid much new trapping, add logic to set a value to such locals if they have accesses that are not dominated by a set already. Also in makeTrivial only rarely emit a local.get of a non-nullable local (prefer a constant).
*	SimplifyGlobals: Run on function code in missing places (#6020)	Alon Zakai	2023-10-18	2	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The pass was written before we had relevant code in module locations, but now with GC we can have global.gets of more things. The scanner did not run on global code in a way that is not a problem yet, but will be for a later PR I'll open. It will be tested there. That is, right now there is no optimization that is confused by the fact that we did not scan code at the module level, but the next PR will do that. The use modifier did not run on global code either, which was an actual missed optimization opportunity: There are cases where we want to modify a global.get to point to another one, and such a get might be in global code, not just in a function. A test is added for that.
*	Add getGeneralSuperType() that includes basic supers, and use in fuzzer (#6005)	Alon Zakai	2023-10-17	1	-39/+42
\| \| \| \|	With this, the fuzzer can replace e.g. an eq expression with a specific struct type, because now it is away that struct types have eq as their ancestor.
*	Fuzzer: Move logic for adding a new local on demand to local.get (#6008)	Alon Zakai	2023-10-17	2	-62/+68
\| \| \| \| \| \| \|	Previously makeTrappingRefUse would add a local on demand if one was missing for the type, and add a tee for it. This PR moves that logic to makeLocalGet so that we get those benefits any time we want to emit a local.get of a local type that does not exist (including from makeTrappingRefUse which calls makeLocalGet).
*	[typed-cont] Add feature flag (#5996)	Frank Emrich	2023-10-05	1	-0/+1
\| \| \| \| \| \| \|	This PR is part of a series that adds basic support for the [typed continuations proposal](https://github.com/wasmfx/specfx). This particular PR simply extends `FeatureSet` with a corresponding entry for this proposal.
*	RemoveUnusedBrs: Allow less unconditional work and in particular division ↵	Alon Zakai	2023-10-03	1	-4/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(#5989) Fixes #5983: The testcase from there is used here in a new testcase remove-unused-brs_levels in which we check if we are willing to unconditionally do a division operation. Turning an if with an arm that does a division into a select, which always does the division, is almost 5x slower, so we should probably be extremely careful about doing that. I took some measurements and have some suggestions for changes in this PR: * Raise the cost of div/rem to what I measure on my machine, which is 5x slower than an add, or worse. * For some reason we added the if arms rather than take the max of them, so fix that. This does not help the issue, but was confusing. * Adjust TooCostlyToRunUnconditionally in the pass from 9 to 8 (this helps balance the last point). * Use half that value when not optimizing for size. That is, we allow only 4 extra unconditional work normally, and 8 in -Os, and when -Oz then we allow any extra amount. Aside from the new testcases, some existing ones changed. They all appear to change in a reasonable way, to me. We should perhaps go even further than this, and not even run a division unconditionally in -Os, but I wasn't sure it makes sense to go that far as other benchmarks may be affected. For now, this makes the benchmark in #5983 run at full speed in -O3 or -Os, and it remains slow in -Oz. The modified version of the benchmark that only divides in the if (no other operations) is still fast in -O3, but it become slow in -Os as we do turn that if into a select (but again, I didn't want to go that far as to overfit on that one benchmark).
*	[NFC] Port stack IR test to lit (#5957)	Thomas Lively	2023-09-18	2	-2297/+0
\| \| \| \|	Fix some whitespace, and name and reorder a few items to make the output better match the input, but otherwise port the tests to lit unmodified.
*	Remove legacy type defintion text syntax (#5948)	Thomas Lively	2023-09-18	1	-1/+1
\| \| \| \| \| \| \|	Remove support for the "struct_subtype", "array_subtype", "func_subtype", and "extends" notations we used at various times to declare WasmGC types, leaving only support for the standard text fromat for declaring types. Update all the tests using the old formats and delete tests that existed solely to test the old formats.
*	Replace i31.new with ref.i31 everywhere (#5931)	Thomas Lively	2023-09-13	1	-1/+1
\| \| \| \| \|	Replace i31.new with ref.i31 in the printer, tests, and source code. Continue parsing i31.new for the time being to allow a graceful transition. Also update the JS API to reflect the new instruction name.
*	Replace I31New with RefI31 everywhere (#5930)	Thomas Lively	2023-09-13	1	-1/+1
\| \| \| \| \| \| \| \|	Globally replace the source string "I31New" with "RefI31" in preparation for renaming the instruction from "i31.new" to "ref.i31", as implemented in the spec in https://github.com/WebAssembly/gc/pull/422. This would be NFC, except that it also changes the string in the external-facing C APIs. A follow-up PR will make the corresponding behavioral change.
*	Make final types the default (#5918)	Thomas Lively	2023-09-09	2	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \|	Match the spec and parse the shorthand binary and text formats as final and emit final types without supertypes using the shorthands as well. This is a potentially-breaking change, since the text and binary shorthands can no longer be used to define types that have subtypes. Also make TypeBuilder entries final by default to better match the spec and update the internal APIs to use the "open" terminology rather than "final" terminology. Future changes will update the text format to use the standard "sub open" rather than the current "sub final" keywords. The exception is the new wat parser, which supporst "sub open" as of this change, since it didn't support final types at all previously.
*	Simplify and consolidate type printing (#5816)	Thomas Lively	2023-08-24	105	-10837/+10837
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When printing Binaryen IR, we previously generated names for unnamed heap types based on their structure. This was useful for seeing the structure of simple types at a glance without having to separately go look up their definitions, but it also had two problems: 1. The same name could be generated for multiple types. The generated names did not take into account rec group structure or finality, so types that differed only in these properties would have the same name. Also, generated type names were limited in length, so very large types that shared only some structure could also end up with the same names. Using the same name for multiple types produces incorrect and unparsable output. 2. The generated names were not useful beyond the most trivial examples. Even with length limits, names for nontrivial types were extremely long and visually noisy, which made reading disassembled real-world code more challenging. Fix these problems by emitting simple indexed names for unnamed heap types instead. This regresses readability for very simple examples, but the trade off is worth it. This change also reduces the number of type printing systems we have by one. Previously we had the system in Print.cpp, but we had another, more general and extensible system in wasm-type-printing.h and wasm-type.cpp as well. Remove the old type printing system from Print.cpp and replace it with a much smaller use of the new system. This requires significant refactoring of Print.cpp so that PrintExpressionContents object now holds a reference to a parent PrintSExpression object that holds the type name state. This diff is very large because almost every test output changed slightly. To minimize the diff and ease review, change the type printer in wasm-type.cpp to behave the same as the old type printer in Print.cpp except for the differences in name generation. These changes will be reverted in much smaller PRs in the future to generally improve how types are printed.
*	Use the standard syntax for ref.cast, ref.test and array.new_fixed (#5894)	Jérôme Vouillon	2023-08-23	2	-14/+14
\| \| \| \| \| \| \| \| \|	* Update text output for `ref.cast` and `ref.test` * Update text output for `array.new_fixed` * Update tests with new syntax for `ref.cast` and `ref.test` * Update tests with new `array.new_fixed` syntax
*	Rename multimemory flag (#5890)	Ashley Nelson	2023-08-21	1	-1/+1
\| \| \|	Renaming the multimemory flag in Binaryen to match its naming in LLVM.
*	Remove legacy WasmGC instructions (#5861)	Thomas Lively	2023-08-09	1	-1/+1
\| \| \| \| \|	Remove old, experimental instructions and type encodings that will not be shipped as part of WasmGC. Updating the encodings and text format to match the final spec is left as future work.
*	[NFC] Port passes remove-unused-brs_all-features.wast to lit (#5843)	Thomas Lively	2023-07-27	2	-215/+0
\| \| \| \|	Port the test automatically using the port_passes_tests_to_lit.py script. As a drive-by, fix a typo in the script as well.
*	Fuzzer: Emit more variations of If (#5806)	Alon Zakai	2023-07-10	2	-58/+66
\| \| \| \| \| \|	Before we always created if-elses. Now we also create an If with one arm some of the time, when we can. Also, sometimes make one if arm unreachable, if we have two arms.
*	Initial support for `final` types (#5803)	Thomas Lively	2023-07-06	1	-44/+34
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Implement support in the type system for final types, which are not allowed to have any subtypes. Final types are syntactically different from similar non-final types, so type canonicalization is made aware of finality. Similarly, TypeMerging and TypeSSA are updated to work correctly in the presence of final types as well. Implement binary and text parsing and emitting of final types. Use the standard text format to represent final types and interpret the non-standard "struct_subtype" and friends as non-final. This allows a graceful upgrade path for users currently using the non-standard text format, where they can update their code to use final types correctly at the point when they update to use the standard format. Once users have migrated to using the fully expanded standard text format, we can update update Binaryen's parsers to interpret the MVP shorthands as final types to match the spec without breaking those users. To make it safe for V8 to independently start interpreting types declared without `sub` as final, also reserve that shorthand encoding only for types that have no strict subtypes.
*	Print supertype declarations using the standard format (#5801)	Thomas Lively	2023-07-06	1	-1/+1
\| \| \| \| \| \|	Use the standard "(sub $super ...)" format instead of the non-standard "XXX_supertype ... $super" format. In a follow-on PR implementing final types, this will allow us to print and parse the standard text format for final types right away with a smaller diff.
*	Fuzzing for Try and Throw (#5776)	Alon Zakai	2023-06-21	1	-34/+40
\|
*	Update br_on_cast binary and text format (#5762)	Thomas Lively	2023-06-12	2	-8/+8
\| \| \| \| \| \| \| \| \| \| \| \|	The final versions of the br_on_cast and br_on_cast_fail instructions have two reference type annotations: one for the input type and one for the cast target type. In the binary format, this is represented as a flags byte followed by two encoded heap types. Upgrade all of the tests at once to use the new versions of the instructions and drop support for the old instructions from the text parser. Keep support in the binary parser to avoid breaking users, though. Drop some binary tests of deprecated instruction encodings that would be more effort to update than they're worth. Re-land with fixes of #5734
*	StackIR: Remove nops (#5746)	Alon Zakai	2023-05-30	1	-9/+0
\| \| \| \| \| \| \|	No nop instruction is necessary in wasm, so in StackIR we can simply remove them all. Fixes #5745
*	Revert "Update br_on_cast binary and text format (#5734)" (#5740)	Alon Zakai	2023-05-23	2	-8/+8
\| \| \| \| \| \| \|	This reverts commit b7b1d0df29df14634d2c680d1d2c351b624b4fbb. See comment at the end of #5734: It turns out that dropping the old opcodes causes problems for current users, so let's revert this for now, and later we can figure out how best to do the update.
*	Fuzzer: Limit ArrayNew sizes most of the time (#5738)	Alon Zakai	2023-05-22	1	-42/+31
\|
*	Update br_on_cast binary and text format (#5734)	Thomas Lively	2023-05-19	2	-8/+8
\| \| \| \| \| \| \| \| \| \|	The final versions of the br_on_cast and br_on_cast_fail instructions have two reference type annotations: one for the input type and one for the cast target type. In the binary format, this is represented as a flags byte followed by two encoded heap types. Since these instructions have been in flux for a while, do not attempt to maintain backward compatibility with older versions of the instructions. Instead, upgrade all of the tests at once to use the new versions of the instructions. Drop some binary tests of deprecated instruction encodings that would be more effort to update than they're worth.
*	Print function types on function imports in the text format (#5727)	Alon Zakai	2023-05-17	8	-10026/+10026
\| \| \| \|	The function type should be printed there just like for non-imported functions.
*	Port vacuum_all-features test to lit (#5708)	Thomas Lively	2023-05-09	2	-1294/+0
\| \| \|	Do the port automatically using the port_passes_tests_to_lit.py script.
*	Remove the ability to construct basic types in a TypeBuilder (#5678)	Thomas Lively	2023-04-19	1	-34/+42
\| \| \| \| \| \| \| \| \| \| \|	This capability was originally introduced to support calculating LUBs in the equirecursive type system, but has not been needed for anything except tests since the equirecursive type system was removed. Since building basic heap types is no longer useful and was a source of significant complexity, remove the APIs that allowed it and the tests that used those APIs. Also remove test/example/type-builder.cpp, since a significant portion of it tested the removed APIs and the rest is already better tested in test/gtest/type-builder.cpp.
*	[Wasm GC] Fuzz array.copy and array.fill (#5663)	Alon Zakai	2023-04-17	1	-27/+38
\|
*	[Wasm GC] Improve GC operation coverage by using locals more (#5661)	Alon Zakai	2023-04-17	1	-36/+27
\| \| \| \| \| \| \| \| \| \| \| \|	When we emit e.g. a struct.get's reference, this PR makes us prefer a non-nullable value, and even to reuse an existing local if possible. By doing that we reduce the risk of a trap, and also by using locals we end up testing operations on the same data, like this: x = new A(); x.a = .. foo(x.a) In contrast, without this PR each of those x. uses might be new A().
*	Fuzzer: When nested under makeTrivial(), avoid normal make() (#5657)	Alon Zakai	2023-04-12	1	-29/+31
\| \| \| \| \| \| \| \|	Without this, in certain complex operations we could end up calling a nested make() operation that included nontrivial things, which could cause problems. The specific problem I encountered was in fixAfterChanges() we tried to fix up a duplicate label, but calling makeTrivial() emitted something very large that happened to include a new block with a new label nested under a struct.get, and that block's label conflicted with a label we'd already processed.
*	[Wasm GC] Fuzz struct.set and array.set (#5655)	Alon Zakai	2023-04-12	1	-34/+33
\|
*	Add a name hint to getValidName() (#5653)	Alon Zakai	2023-04-11	1	-4/+4
\| \| \| \| \| \| \|	Without the hint, we always look for a valid name using name$0, $1, $2, etc., starting from 0, and in some cases that can lead to quadratic behavior. Noticed on a testcase in the fuzzer that runs for over 24 seconds (I gave up at that point) but takes only 2 seconds with this.
*	[Wasm GC] Fuzz struct.get and array.get (#5651)	Alon Zakai	2023-04-10	1	-27/+29
\|
*	Fuzzer: Improve mutate() (#5631)	Alon Zakai	2023-04-05	2	-60/+59
\| \| \|	Don't use a fixed 10% chance to mutate, but pick a mutation rate in each function.
*	[Wasm GC] Fuzz struct.new and array.new (#5622)	Alon Zakai	2023-04-04	1	-41/+35
\| \| \| \| \| \| \| \| \|	Repurpose makeBasicRef, makeCompoundRef to generate not just "constant" refs but any reference, and use those to create StructNew/ArrayNew. The key changes are to add makeCompoundRef to make(), and to make the function call make() for children, where possible, instead of just makeTrivial(). We also replace the i31-specific path with a call to makeBasicRef which handles i31 among other things.
*	Use Names instead of indices to identify segments (#5618)	Thomas Lively	2023-04-04	27	-100098/+100098
\| \| \| \| \| \| \| \| \| \|	All top-level Module elements are identified and referred to by Name, but for historical reasons element and data segments were referred to by index instead. Fix this inconsistency by using Names to refer to segments from expressions that use them. Also parse and print segment names like we do for other elements. The C API is partially converted to use names instead of indices, but there are still many functions that refer to data segments by index. Finishing the conversion can be done in the future once it becomes necessary.
*	[Wasm GC] Fuzz RefCast (#5617)	Alon Zakai	2023-04-03	1	-32/+34
\|