visitBlock() and validateCallParamsAndResult() both assumed they were
running inside a function, but they might be called on global code too.
Calls and blocks are invalid in global positions, so we should report an
error there, but we must do so properly, without a null dereference.
Fixes #6847
Fixes #6848
|
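To illustrate the shape of the fix, here is a minimal, self-contained sketch of the guard pattern, with hypothetical names rather than the validator's real API:

```
#include <iostream>
#include <optional>
#include <string>

// Illustrative only: a visitor whose current function may be absent when
// it runs on global code. All names here are hypothetical.
struct Validator {
  std::optional<std::string> currentFunction;

  void visitBlock() {
    if (!currentFunction) {
      // Global position: report a proper error instead of dereferencing
      // a null function context.
      std::cerr << "error: block not allowed in global position\n";
      return;
    }
    // ... normal validation, which may safely use *currentFunction ...
  }
};

int main() {
  Validator v;
  v.visitBlock();          // global scope: reports an error, no crash
  v.currentFunction = "f";
  v.visitBlock();          // inside a function: validates as usual
}
```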
|
Spec tests use constants like `ref.array` and `ref.eq` to assert that
exported functions return references of the correct types. Support more
such constants in the wast parser.
Also fix a bug where the interpretation of `array.new_data` for arrays
of packed fields was not properly truncating the packed data. Move the
function for reading fields from memory from literal.cpp to
wasm-interpreter.h, where the function for truncating packed data lives.
Other bugs prevent us from enabling any more spec tests as a result of
this change, but we can get farther through several of them before
failing. Update the comments about the failures accordingly.
|
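For context, a self-contained sketch of the truncation that packed fields require; this is illustrative arithmetic, not Binaryen's actual helper functions:

```
#include <cassert>
#include <cstdint>

// Illustrative only: a value stored into a packed i8 field keeps just its
// low byte, and a signed read must sign-extend from the packed width.
uint32_t truncateToI8(uint32_t value) { return value & 0xFF; }

int32_t readI8Signed(uint32_t packed) {
  return static_cast<int8_t>(packed & 0xFF);
}

int main() {
  assert(truncateToI8(0x1234) == 0x34); // only the low byte survives
  assert(readI8Signed(0xFF) == -1);     // sign extension on a signed get
}
```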
|
Ensure the "fp16" feature is enabled for FP16 instructions.
|
When precomputing fails on a child block of a parent block, there is no
point in precomputing the parent, as that will fail as well.
This makes --precompute on Emscripten's test_biggerswitch go from 1.44
seconds to 0.02 seconds (not a typo, that is 72x faster). The absolute
number is not that big, but we do run this pass more than once, so it
saves a noticeable chunk of time.
|
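A rough sketch of the idea, assuming a post-order walk over nested blocks; the structure and names are hypothetical, not the pass's real code:

```
#include <iostream>
#include <unordered_set>
#include <vector>

// Illustrative only: once a child fails to precompute, record that, so
// every enclosing parent can bail out instead of re-attempting the
// (potentially expensive) evaluation.
struct Node {
  bool computable;
  std::vector<Node*> children;
};

bool tryPrecompute(Node* node, std::unordered_set<Node*>& failed) {
  for (Node* child : node->children) {
    tryPrecompute(child, failed);
  }
  for (Node* child : node->children) {
    if (failed.count(child)) {
      failed.insert(node); // a failed child means the parent fails too
      return false;
    }
  }
  if (!node->computable) {
    failed.insert(node);
    return false;
  }
  return true;
}

int main() {
  Node leaf{false, {}};
  Node parent{true, {&leaf}};
  std::unordered_set<Node*> failed;
  std::cout << tryPrecompute(&parent, failed) << "\n"; // 0: parent bails
}
```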
|
This is in quite ancient code, so it's a long-standing issue, but it got
worse when we enabled StackIR in more situations (#6568), which made it
more noticeable, I think.
For example, testing on test_biggerswitch in Emscripten, the LLVM part
is pretty slow too, so the Binaryen slowdown didn't stand out hugely, but
just doing
wasm-opt --optimize-level=2 input.wasm -o output.wasm
(that is, do no work, but set the optimize level to 2 so that StackIR
opts are run) used to take 28 seconds (!). With this PR that goes down
to less than 1 second.
|
The best way to lower strings is via the "magic imports" API that uses
the names of imported string globals as their values. This approach only
works for valid UTF-8 strings, though. The existing
string-lowering-magic-imports pass falls back to putting non-UTF-8
strings in a JSON custom section, but this requires the runtime to
support that custom section for correctness. To help catch errors early
when runtimes do not support the strings custom section, add a new pass
that uses magic imports and raises an error if there are any invalid
strings.
|
Specified at
https://github.com/WebAssembly/half-precision/blob/main/proposals/half-precision/Overview.md
|
possible-contents.h hashes the location for caught exnrefs by hashing an
arbitrary string, "caught-exnref-location". It previously used
`std::hash<const char*>` for this, but some standard library
implementations report an error when this template instantiation is used
because hashing the location of a string is almost never correct. In
this case it is fine, so switch to using `std::hash<const void*>`.
|
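A self-contained illustration of the two hash instantiations; the surrounding Binaryen context is as described above:

```
#include <cstdio>
#include <functional>

int main() {
  const char* tag = "caught-exnref-location";
  // Hashing the pointer value itself is the intent here: the address is
  // just an arbitrary, stable token. std::hash<const void*> makes that
  // explicit and is accepted everywhere.
  std::size_t h = std::hash<const void*>{}(tag);
  std::printf("%zu\n", h);
  // std::hash<const char*>{}(tag) would also hash the address, not the
  // string contents, which is why some standard libraries flag that
  // instantiation as a likely bug.
}
```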
|
Spec tests pass the value `ref.extern n`, where `n` is some integer,
into exported functions that expect externrefs, and receive such values
back out as return values. The payload serves to distinguish externrefs
so the test can assert that the correct one was returned.
Parse these values in wast scripts and represent them as externalized
i31refs carrying the payload. We will need a different representation
eventually, since some tests explicitly expect these externrefs to not
be i31refs, but this suffices to get several new tests passing.
To get the memory64 version of table_grow.wast passing, additionally fix
the interpreter to handle growing 64-bit tables correctly.
Delete the local versions of the upstream tests that can now be run
successfully.
|
Add comments to the spec test skip list briefly explaining why each
skipped spec test must be skipped.
|
The leading bytes that indicate what kind of heap type is being defined
are single bytes, but we were previously treating them as
SLEB128-encoded values. Since we emit the smallest LEB encodings
possible, we were writing the correct bytes in output files, but we were
also improperly accepting binaries that used more than one byte to
encode these values. This was caught by an upstream spec test.
|
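To make the distinction concrete, here is a sketch of a strict one-byte read versus a general SLEB128 read; the decoder is illustrative, not Binaryen's:

```
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Illustrative only. A heap type definition's kind code is one fixed
// byte, so a strict reader consumes exactly one byte.
uint8_t readKindByte(const std::vector<uint8_t>& in, size_t& pos) {
  return in[pos++];
}

// A general SLEB128 reader also accepts redundant multi-byte encodings
// of the same value, which a correct decoder must reject here.
int64_t readSLEB128(const std::vector<uint8_t>& in, size_t& pos) {
  int64_t result = 0;
  int shift = 0;
  uint8_t byte;
  do {
    byte = in[pos++];
    result |= int64_t(byte & 0x7F) << shift;
    shift += 7;
  } while (byte & 0x80);
  if (shift < 64 && (byte & 0x40)) {
    result |= -(int64_t(1) << shift); // sign extend
  }
  return result;
}

int main() {
  // {0xDE, 0x7F} is a redundant two-byte SLEB128 encoding of -0x22,
  // whose canonical one-byte encoding is 0x5E.
  std::vector<uint8_t> redundant{0xDE, 0x7F};
  size_t pos = 0;
  assert(readSLEB128(redundant, pos) == -0x22);
  // readKindByte would see 0xDE, which matches no kind code: rejected.
}
```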
|
Run the upstream tests by default, except for a large list of them that
do not successfully run. Remove the local versions of those that do run
successfully, where the local version is entirely subsumed by the
upstream version.
|
* Add interpreter support for exnref values.
* Fix optimization passes to support try_table.
* Enable the interpreter (but not in V8, see code) on exceptions.
|
IRBuilder is responsible for validation involving type annotations on GC
instructions because those type annotations may not be preserved in the
built IR to be used by the main validator. For `array.init_elem`, we
were not using the type annotation to validate the element segment,
which allowed us to parse invalid modules when the reference operand was
a nullref. Add the missing validation in IRBuilder and fix a relevant
spec test.
|
We previously printed explicit typeuses (e.g. `(type $f)`) in function
signatures when GC was enabled. But even when GC is not enabled,
function types may use non-MVP features that require the explicit
typeuse to be printed. Fix the printer to always print the explicit
typeuse for such types.
Fixes #6850.
|
Replace code that checked `isStruct()`, `isArray()`, etc. in sequence
with uses of `HeapType::getKind()` and switch statements. This will make
it easier to find the code that needs updating if/when we add new heap
type kinds in the future. It also makes it much easier to find code that
already needs updating to handle continuation types by grepping for
"TODO: cont".
|
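As a self-contained illustration of the pattern; the enumerator names here are assumptions modeled on this description, not copied from wasm-type.h:

```
#include <cstdio>

// Illustrative analog of HeapType::getKind(). With -Wswitch (or
// -Werror=switch), adding a new enumerator makes the compiler flag every
// switch that fails to handle it, unlike chains of isStruct()/isArray()
// calls, which fail silently.
enum class HeapTypeKind { Basic, Func, Struct, Array, Cont };

const char* describe(HeapTypeKind kind) {
  switch (kind) {
    case HeapTypeKind::Basic:  return "basic";
    case HeapTypeKind::Func:   return "func";
    case HeapTypeKind::Struct: return "struct";
    case HeapTypeKind::Array:  return "array";
    case HeapTypeKind::Cont:   return "cont"; // TODO: cont
  }
  return "unreachable";
}

int main() {
  std::printf("%s\n", describe(HeapTypeKind::Struct));
}
```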
|
Most of our type optimization passes emit all non-public types as a
single large rec group, which trivially ensures that different types
remain different, even if they are optimized to have the same structure.
Usually emitting a single large rec group is fine, but it also means
that if the module is split, all of the types will need to be repeated
in all of the split modules. To better support this use case, add a pass
that can split the large rec group back into minimal rec groups, taking
care to preserve separate type identities by emitting different
permutations of the same group where possible or by inserting unused
brand types to differentiate them.
|
Audit the remaining occurrences of `== HeapType::` and fix those that
did not handle shared types correctly. Add tests for some of the fixes;
others are NFC but clarify the code.
|
Also use TableInit in the interpreter to initialize a module's table
state, which now handles traps properly.
Fixes #6431
|
Also, we had a mix of os.environ.get and os.getenv. Prefer the former:
our default values do actual work, so it is a little more efficient not
to compute them unnecessarily. That is, os.getenv('X', work()) always
evaluates work(), while os.environ.get('X') or work() only runs work()
when 'X' is unset.
|
We don't properly validate that yet. E.g.:
```
(module
  (rec
    (type $func (func))
    (type $unused (sub (struct (field v128))))
  )
  (func $func (type $func))
)
```
That v128 is not used, but it ends up in the output because it is in a
rec group that is used. At the moment we do not require that SIMD be
enabled in such a case, which can trip up the fuzzer.
Context: #6820. For now, modify the test that uncovered this.
|
The previous rules for stale types were complicated and hard to
remember: in general it was ok for result types to be further refinable
as long as they were not refinable all the way to `unreachable`, but
control flow structures had a carve-out and it was ok for them to be
refinable all the way to `unreachable`.
Simplify the rules so that further refinable result types are always ok,
no matter what they can be refined to and no matter what kind of
instruction is being validated. This will be much easier to remember and
reason about.
This relaxation of the rules strictly increases the set of valid IR, so
no passes or tests need to be updated. It does make it possible for us
to miss type refinement opportunities that previously would have been
validation errors, but only in cases where non-control-flow instructions
could have been refined all the way to `unreachable`, so the risk seems
small.
|
Diff without whitespace is smaller.
* HeapType::ext was handled in two places. The second place was wrong, but not reached.
* Near the end all we have left are refs, so no need to check isRef etc.
* Simplify the code to get the heap type once.
|
This is based on these two proposals:
* https://github.com/WebAssembly/tool-conventions/blob/main/BuildId.md
* https://github.com/tc39/source-map/blob/main/proposals/debug-id.md
|
Since reference types only introduced function and extern references,
all of the types in the `any` hierarchy require GC, including `none`.
Fixes #6839.
|
Previously we included supertypes, but did not increase their count.
This was done so that the output for the nominal type system, which
introduced explicit supertypes, would more closely match the output
with the old equirecursive type system. Neither type system exists
anymore and we only support the single, standard isorecursive type
system, so we can now properly count supertypes. It turns out it doesn't
make much of a difference in the test outputs anyway.
|
The argument is the minimum benefit we must see to decide to optimize,
e.g.
--monomorphize --pass-arg=monomorphize-min-benefit@50
When the minimum benefit is 50%, we only optimize if monomorphization
reduces the cost by at least 50%. 95% would only optimize when we remove
almost all the cost, etc.
In practice I see that 95% will actually tend to reduce code size
overall: while we add monomorphized versions of functions, we only do so
when we remove a lot of work and size, and after inlining we gain
benefits. However, 50% or even lower can lead to better benchmark
results, in return for larger code size, just like with inlining. To be
careful, the default is set to 95%.
Previously we optimized whenever we saw any benefit at all, which is the
same as requiring a minimum benefit of 0%. Old tests have the flag
applied in this PR to set that value, so they do not change.
|
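The threshold arithmetic, as described, amounts to the following sketch (hypothetical names; "cost" is whatever metric the pass computes):

```
#include <cstdio>

// Illustrative only: optimize when the percentage improvement from
// monomorphization meets the configured minimum benefit.
bool shouldMonomorphize(double costBefore, double costAfter,
                        double minBenefitPercent) {
  double benefitPercent = (costBefore - costAfter) / costBefore * 100.0;
  return benefitPercent >= minBenefitPercent;
}

int main() {
  // Cost drops from 100 to 40, a 60% benefit.
  std::printf("%d\n", shouldMonomorphize(100, 40, 50)); // 1: optimize
  std::printf("%d\n", shouldMonomorphize(100, 40, 95)); // 0: keep as-is
}
```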
|
Previously we tracked only whether an expression was relevant to the
analysis, that is, whether it interacted with the allocation whose
behavior we were tracing. That is not enough for all cases, though, so
also track the form of the interaction, namely whether the allocation
flows through or is fully consumed. An example where that matters:
(ref.eq
  (struct.get $A 0
    (local.tee $x
      (struct.new_default $A)
    )
  )
  (local.get $x)
)
Here the local.get flows out the allocation, but the struct.get only
fully consumes it. Before this PR we thought the struct.get flowed the
allocation, and we misoptimized this to 1.
To make this possible, do a bunch of minor refactoring:
* Move ParentChildInteraction out of the class.
* Add a "None" interaction there.
* Replace the set of reached expressions with a map of them to their interactions.
* Add helper functions to get an expression's interaction or to update it when replacing.
The new testcase here shows the main fix. The new assertions are covered
by existing testcases.
|
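A sketch of the refactored bookkeeping, assuming shapes along the lines the list above describes; the names are modeled on that description, not taken from the actual pass:

```
#include <cstdio>
#include <unordered_map>

struct Expression;

// Illustrative only: track not just whether an expression interacts with
// the traced allocation, but how.
enum class ParentChildInteraction {
  None,          // no interaction with the allocation
  Flows,         // the allocation flows through this expression
  FullyConsumed, // the allocation is consumed here (e.g. struct.get)
};

using InteractionMap =
  std::unordered_map<Expression*, ParentChildInteraction>;

ParentChildInteraction getInteraction(const InteractionMap& map,
                                      Expression* expr) {
  auto it = map.find(expr);
  return it == map.end() ? ParentChildInteraction::None : it->second;
}

int main() {
  InteractionMap map;
  Expression* structGet = nullptr; // stand-in key for this sketch
  map[structGet] = ParentChildInteraction::FullyConsumed;
  std::printf("%d\n", (int)getInteraction(map, structGet)); // 2
}
```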
|
Fixes #6833
|
Previously a module's type names were updated in
`GlobalTypeRewriter::rebuildTypes`, which builds new versions of the
existing types, rather than `GlobalTypeRewriter::mapTypes`, which
otherwise handles replacing old types with new types everywhere in a
module, but should not necessarily replace names. So that users of
`mapTypes` who are building their own versions of existing types can
also easily update type names, split type name mapping logic out into a
new method `GlobalTypeRewriter::mapTypeNames`.
|
Given a function that maps the old child heap types to new child heap
types, the new API takes care of copying the rest of the structure of a
given heap type into a TypeBuilder slot.
Use the new API in GlobalTypeRewriter::rebuildTypes. It will also be
used in an upcoming type optimization. This refactoring also required
adding the ability to clear the supertype of a TypeBuilder slot, which
was previously not possible.
|
Before, we only removed fields from the end of a struct. If we had, say:
struct Foo {
  int x;
  int y;
  int z;
};
// Add no fields but inherit the parent's.
struct Bar : Foo {};
If y is only used in Bar, but never in Foo, then we still kept it
around, because if we removed it from Foo we'd end up with Foo = {x, z},
Bar = {x, y, z}, which is invalid - Bar no longer extends Foo. But we
can do this if we first reorder the two:
struct Foo {
  int x;
  int z;
  int y; // now y is at the end
};
struct Bar : Foo {};
And the optimized form is:
struct Foo {
  int x;
  int z;
};
struct Bar : Foo {
  int y; // now y is added in Bar
};
This lets us remove all fields possible in all cases, AFAIK.
This situation is not super-common, as most fields are actually used
both up and down the hierarchy (if they are used at all), but testing on
some large real-world codebases, I see 10 fields removed in Java, 45 in
Kotlin, and 31 in Dart testcases.
The NFC change to src/wasm-type-ordering.h was needed for this to
compile.
|
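A compact sketch of the reorder-then-truncate idea in ordinary C++ terms; this is a hypothetical model, since the pass operates on wasm GC struct types, not C structs:

```
#include <algorithm>
#include <cstdio>
#include <string>
#include <vector>

// Illustrative only: sink fields used only in subtypes to the end of the
// parent, then truncate them from the parent; subtypes re-declare them.
struct Field {
  std::string name;
  bool usedInParent;
};

std::vector<Field> optimizeParent(std::vector<Field> fields) {
  // A stable partition preserves the relative order of kept fields.
  std::stable_partition(fields.begin(), fields.end(),
                        [](const Field& f) { return f.usedInParent; });
  while (!fields.empty() && !fields.back().usedInParent) {
    fields.pop_back(); // remove the now-trailing unused fields
  }
  return fields;
}

int main() {
  auto foo = optimizeParent({{"x", true}, {"y", false}, {"z", true}});
  for (auto& f : foo) {
    std::printf("%s ", f.name.c_str()); // prints: x z
  }
}
```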
|
Without this, all the newly created thunks lack names in the name
section.
|
The syntax for handler clauses in `resume` instructions has recently
changed, using `on` instead of `tag` now.
Instead of
```
(resume $ct (tag $tag0 $block0) ... (tag $tagn $blockn))
```
we now have
```
(resume $ct (on $tag0 $block0) ... (on $tagn $blockn))
```
This PR adapts parsing, printing, and some tests accordingly.
(Note that this PR deliberately makes none of the other changes that
will arise from implementing the new, combined stack switching proposal
yet.)
|
Specified at
https://github.com/WebAssembly/half-precision/blob/main/proposals/half-precision/Overview.md
|
Specified at
https://github.com/WebAssembly/half-precision/blob/main/proposals/half-precision/Overview.md
|
Make `TopologicalOrders` its own iterator rather than having a separate
iterator class that wraps a pointer to `TopologicalOrders`. This
simplifies usage in cases where an iterator needs to be persistently
stored. Notably, all of the tests continue working as they are.
|
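A self-contained sketch of the "type is its own iterator" idiom, using a trivial counting range as a stand-in for `TopologicalOrders`:

```
#include <cstdio>

// Illustrative only: the object is both the range and the iterator, so a
// user can store it persistently without wrapping a pointer to a
// separate parent object.
struct CountUp {
  int current;
  int limit;

  CountUp begin() const { return *this; }
  CountUp end() const { return {limit, limit}; }

  int operator*() const { return current; }
  CountUp& operator++() {
    ++current;
    return *this;
  }
  bool operator!=(const CountUp& other) const {
    return current != other.current;
  }
};

int main() {
  CountUp orders{0, 3}; // storable directly, like an iterator
  for (int i : orders) {
    std::printf("%d ", i); // prints: 0 1 2
  }
}
```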
|
This is very similar to the internal utilities for canonicalizing rec
groups in the type system implementation, except that the new utility
also supports ordered comparison of rec groups, and of course the new
utility only uses the public type API.
A follow-up PR will replace the internal implementation of rec group
comparison and hashing in the type system with this one.
Another follow-up PR will use this new utility in a type optimization.
|
The optimization is to only use ChildLocalizer, which moves children to
locals, if we actually have a reason to use it. It is simple enough to
see if we are removing fields with side effects here, and only call
ChildLocalizer if we are not. However, this will become much more
complicated in a subsequent PR that will reorder fields, which allows
removing yet more of them (without reordering, we can only remove fields
at the end, if any subtype needs the field).
This is a pretty minor optimization, as it avoids adding a few locals in
the rare case of struct.new operands having side effects. We run --gto
at the start of the pipeline, so later opts will clean that up anyhow.
(Though this might make us a little less efficient, the following PR
will justify this regression.)
|
Match the current spec and clarify terminology by renaming the old
`deftype` to `rectype` and renaming the old `subtype` to `typedef`. Also
split the parser for actual `subtype` out of the parser for the newly
named `typedef`.
|
The type index from the TypeBuilder error was mapped to a file location
incorrectly, resulting in an assertion failure.
Fixes #6816.
|
Specified at
https://github.com/WebAssembly/half-precision/blob/main/proposals/half-precision/Overview.md
|
PR #6803 proposed removing Type::isString and HeapType::isString in
favor of more explicit, verbose callsites. There was no consensus to
make this change, but it was accidentally committed as part of #6804.
Revert the accidental change, except for the useful, noncontroversial
parts, such as fixing the `isString` implementation and a few other
locations to correctly handle shared types.
|
Single-segment mappings were already handled in readNextDebugLocation,
but not in readSourceMapHeader.
|
The code for collecting inhabitable types incorrectly considered shared,
non-nullable externrefs to be inhabitable. This disagreed with the code
for rewriting types to be inhabitable, which was correct, causing the
type fuzzer to report an error.
|
The HeapType API has functions like `isBasic()`, `isStruct()`,
`isSignature()`, etc. to test the classification of a heap type. Many
users have to call these functions in sequence and handle all or most of
the possible classifications. When we add a new kind of heap type,
finding and updating all these sites is a manual and error-prone
process.
To make adding new heap type kinds easier, introduce a new API that
returns an enum classifying the heap type. The enum can be used in
switch statements and the compiler's exhaustiveness checker will flag
use sites that need to be updated when we add a new kind of heap type.
This commit uses the new enum internally in the type system, but
follow-on commits will add new uses and convert uses of the existing
APIs to use `getKind` instead.
|
The `timport$` prefix is already used for tables, so the binary parser
currently uses `eimport$` to name tags (I guess because they are
normally exception tags?).
|
As a follow-up we could probably make these more consistent. For
example, we could use a single-char prefix for defined
functions/tables/globals (e.g. f0/t0/g0).