forks/binaryen.git -

	Commit message (Collapse)	Author	Age	Files	Lines
*	Support atomic struct accessors (#7155)	Thomas Lively	2024-12-18	16	-42/+302
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Implement support for both sequentially consistent and acquire-release variants of `struct.atomic.get` and `struct.atomic.set`, as proposed by shared-everything-threads. Introduce a new `MemoryOrdering` enum for describing different levels of atomicity (or the lack thereof). This new enum should eventually be adopted by linear memory atomic accessors as well to support acquire-release semantics, but for now just use it in `StructGet` and `StructSet`. In addition to implementing parsing and emitting for the instructions, validate that shared-everything is enabled to use them, mark them as having synchronization side effects, and lightly optimize them by relaxing acquire-release accesses to non-shared structs to normal, unordered accesses. This is valid because such accesses cannot possibly synchronize with other threads. Also update Precompute to avoid optimizing out synchronization points. There are probably other passes that need to be updated to avoid incorrectly optimizing synchronizing accesses, but identifying and fixing them is left as future work.
*	RemoveUnusedBrs: Avoid an error on loops with unreachable ifs (#7156)	Alon Zakai	2024-12-17	1	-3/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We normally like to move brs after ifs into the if, when in a loop: (loop $loop (if .. (unreachable) (code) ) (br $loop) ) => (loop $loop (if .. (unreachable) (block (code) (br $loop) ;; moved in ) ) ) However this may be invalid to do if the if condition is unreachable, as then one arm may be concrete (`code` in the example could be an `i32`, for example). As this is dead code anyhow, leave it for DCE.
*	[wasm-reduce] Add an option to save all interim working files as we reduce ↵	Alon Zakai	2024-12-16	1	-2/+29
\| \| \| \| \| \| \|	(#7154) With this option, each time we reduce we save a file w.wasm.17 or such, incrementing that counter. This is useful when debugging the reducer, but might have more uses.
*	Fuzz JSPI (#7148)	Alon Zakai	2024-12-16	3	-0/+35
\| \| \| \| \| \| \| \| \| \|	* Add a new "sleep" fuzzer import, that does a sleep for some ms. * Add JSPI support in fuzz_shell.js. This is in the form of commented-out async/await keywords - commented out so that normal fuzzing is not impacted. When we want to fuzz JSPI, we uncomment them. We also apply the JSPI operations of marking imports and exports as suspending/promising. JSPI fuzzing is added to both fuzz_opt.py and ClusterFuzz's run.py.
*	Address comments from #7149 (#7152)	Thomas Lively	2024-12-13	1	-1/+1
\| \| \|	The PR was accidentally merged without these fixes included.
*	[NFC] Move HeapType::isBottom() to header (#7150)	Thomas Lively	2024-12-13	2	-25/+27
\| \| \| \| \|	This makes Precompute about 5% faster on a WasmGC binary. Inspired by #6931.
*	Support control flow inputs in IRBuilder (#7149)	Thomas Lively	2024-12-13	5	-112/+201
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Since multivalue was standardized, WebAssembly has supported not only multiple results but also an arbitrary number of inputs on control flow structures, but until now Binaryen did not support control flow input. Binaryen IR still has no way to represent control flow input, so lower it away using scratch locals in IRBuilder. Since both the text and binary parsers use IRBuilder, this gives us full support for parsing control flow inputs. The lowering scheme is mostly simple. A local.set writing the control flow inputs to a scratch local is inserted immediately before the control flow structure begins and a local.get retrieving those inputs is inserted inside the control flow structure before the rest of its body. The only complications come from ifs, in which the inputs must be retrieved at the beginning of both arms, and from loops, where branches to the beginning of the loop must be transformed so their values are written to the scratch local along the way. Resolves #6407.
*	Execution results: JS traps on exnref on the boundary (#7147)	Alon Zakai	2024-12-12	1	-5/+5
\| \| \|	Fixes #7145
*	[NFC] Encode reference types with bit packing (#7142)	Thomas Lively	2024-12-10	2	-435/+139
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Value types were previously represented internally as either enum values for "basic," i.e. non-reference, non-tuple types or pointers to `TypeInfo` structs encoding either references or tuples. Update the representation of reference types to use one bit to encode nullability and the rest of the bits to encode the referenced heap type. This allows canonical reference types to be created with a single logical or rather than by taking a lock on a global type store and doing a hash map lookup to canonicalize. This change is a massive performance improvement and dramatically improves how performance scales with threads because the removed lock was highly contended. Even with a single core, the performance of an O3 optimization pipeline on a WasmGC module improves by 6%. With 8 cores, the improvement increases to 29% and with all 128 threads on my machine, the improvement reaches 46%. The full new encoding of types is as follows: - If the type ID is within the range of the basic types, the type is the corresponding basic type. - Otherwise, if bit 0 is set, the type is a tuple and the rest of the bits are a canonical pointer to the tuple. - Otherwise, the type is a reference type. Bit 1 determines the nullability and the rest of the bits encode the heap type. Also update the encodings of basic heap types so they no longer use the low two bits to avoid conflicts with the use of those bits in the encoding of types.
*	[NFC] Simplify TypeGraphWalker in wasm-type.cpp (#7143)	Thomas Lively	2024-12-10	1	-213/+139
\| \| \| \| \| \| \| \| \|	Co-locate the declaration and implementation of TypeGraphWalkerBase and its subtypes in wasm-type.cpp and simplify the implementation. Remove the preVisit and postVisit tasks for both Types and HeapTypes since overriding scanType and scanHeapType is sufficient for all users. Stop scanning the HeapTypes in reference types because a follow-on change (#7142) will make that much more complicated, and it turns out that it is not necessary.
*	[GC] Fix TypeRefining on StructGets without content but with a reachable ref ↵	Alon Zakai	2024-12-09	1	-62/+42
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	(#7138) If we see a StructGet with no content (the type it reads from has no writes) then we can make it unreachable. The old code literally just changed the type to unreachable, which would later work out with refinalization - but only if the StructGet's ref was unreachable. But it is possible for this situation to occur without that, and if so, this hit the validation error "can't have an unreachable node without an unreachable child". To fix this, merge all code paths that handle "impossible" situations, which simplifies things, and add this situation. This uncovered an existing bug where we noted default values of refs, but not non-refs (which could lead us to think that a field of a struct that only was ever created by struct.new_default, was never created at all). Fixed as well.
*	Fuzzer: Add call-ref, call-ref-catch imports (#7137)	Alon Zakai	2024-12-09	3	-40/+143
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Similar to call-export, these imports call a wasm function from outside the module. The difference is that we send a function reference for them to call (rather than an export index). This gives more coverage, first by sending a ref from wasm to JS, and also since we will now try to call anything that is sent. Exports, in comparison, are filtered by the fuzzer to things that JS can handle, so this may lead to more traps, but maybe also some new situations. This also leads to adding more logic to execution-results.h to model JS trapping properly. fuzz_shell.js is refactored to allow sharing code between call-export and call-ref*.
*	Add bulk-memory-opt feature and ignore call-indirect-overlong (#7139)	Derek Schuff	2024-12-06	8	-19/+61
\| \| \| \| \| \| \| \| \| \|	LLVM recently split the bulk-memory-opt feature out from bulk-memory, containing just memory.copy and memory.fill. This change follows that, making bulk-memory-opt also enabled when all of bulk-memory is enabled. It also introduces call-indirect-overlong following LLVM, but ignores it, since Binaryen has always allowed the encoding (i.e. command line flags enabling or disabling the feature are accepted but ignored).
*	Remove incorrect warning when reading name section (#7140)	Thomas Lively	2024-12-06	1	-5/+0
\| \| \| \| \| \| \| \| \|	When we refactored how the name section is read, we accidentally left an old warning about invalid field name indices in place. The old warning code compares the type index from the names section to the size of the parsed type vector to determine if the index is out-of-bounds. Now that we parse the name section before the type section, this is no longer correct. Delete the old warning; we already have a new, correct warning for out-of-bound indices when we parse the type section.
*	[NFC] Send the closed-world flag to TranslateToFuzzReader (#7136)	Alon Zakai	2024-12-05	3	-9/+23
\| \| \| \| \| \| \| \| \| \| \|	This sends --closed-world to wasm-opt from the fuzzer, when we use that flag (before we just used it on optimizations, but not fuzz generation). And TranslateToFuzzReader now stores a boolean about whether we are in closed- world mode or not. This has no effect so far, and is a refactoring for a later PR, where we must generate code differently based on whether we are in closed-world mode or not.
*	Remove separate Table64Lowering pass (#7131)	Sam Clegg	2024-12-04	4	-167/+119
\| \| \| \| \| \| \| \| \|	This pass is now just part of Memory64Lowering. Once this lands we can remove the `--table64-lowering` flag from emscripten. Because I've used an alias here there will be some interim period where emscripten will run this pass twice since it passed both flags. However, this will only be temporary and that second run will be a no-op since the first one will remove the feature.
*	Fix GUFA on calls to function refs in open world (#7135)	Alon Zakai	2024-12-04	1	-13/+30
\| \| \| \|	In open world we must assume that a funcref that escapes to the outside might be called.
*	[NFC] Encapsulate source map reader state (#7132)	Thomas Lively	2024-12-03	17	-369/+370
\| \| \| \| \| \| \| \| \| \| \| \|	Move all state relevant to reading source maps out of WasmBinaryReader and into a new utility, SourceMapReader. This is a prerequisite for parallelizing the parsing of function bodies, since the source map reader state is different at the beginning of each function. Also take the opportunity to simplify the way we read source maps, for example by deferring the reading of anything but the position of a debug location until it will be used and by using `std::optional` instead of singleton `std::set`s to store function prologue and epilogue debug locations.
*	Fixup block-nested pops even when EH is not enabled (#7130)	Thomas Lively	2024-12-03	3	-5/+15
\| \| \| \| \| \| \| \| \| \|	While parsing a binary file, there may be pops that need to be fixed up even if EH is not (yet) enabled because the target features section has not been parsed yet. Previously `EHUtils::handleBlockNestedPops` did not do anything if EH was not enabled, so the binary parser would fail to fix up pops in that case. Add an optional parameter to override this behavior so the parser can fix up pops unconditionally. Fixes #7127.
*	Do not sink blocks into ifs with unreachable conditions (#7129)	Thomas Lively	2024-12-02	1	-0/+6
\| \| \| \| \| \| \| \| \| \|	RemoveUnusedBrs sinks blocks into If arms when those arms contain branches to the blocks and the other arm and condition do not. Now that we type Ifs with unreachable conditions as unreachable, it is possible for the If arms to have a different type than the block that would be sunk, so sinking the block would produce invalid IR. Fix the problem by never sinking blocks into Ifs with unreachable conditions. Fixes #7128.
*	[GC] Fix trapping on array.new_data of dropped segments of offset > 0 (#7124)	Alon Zakai	2024-12-02	1	-3/+12
\| \| \| \|	Even if the size is 0, if the offset is > 0 then we should trap.
*	[NFC] Rename {F32,F64}NearestInt to {F32,F64}Nearest (#7089)	Thomas Lively	2024-11-27	3	-9/+5
\| \| \| \| \| \|	Rename the opcode values in wasm-binary.h to better match the names of the corresponding instructions. This also makes these names match the scheme used by the rest of the basic unary operations, allowing for more macro use in the binary reader.
*	Use IRBuilder in the binary parser (#6963)	Thomas Lively	2024-11-26	4	-4419/+1487
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	IRBuilder is a utility for turning arbitrary valid streams of Wasm instructions into valid Binaryen IR. It is already used in the text parser, so now use it in the binary parser as well. Since the IRBuilder API for building each intruction requires only the information that the binary and text formats include as immediates to that instruction, the parser is now much simpler than before. In particular, it does not need to manage a stack of instructions to figure out what the children of each expression should be; IRBuilder handles this instead. There are some differences between the IR constructed by IRBuilder and the IR the binary parser constructed before this change. Most importantly, IRBuilder generates better multivalue code because it avoids eagerly breaking up multivalue results into individual components that might need to be immediately reassembled into a tuple. It also parses try-delegate more correctly, allowing the delegate to target arbitrary labels, not just other `try`s. There are also a couple superficial differences in the generated label and scratch local names. As part of this change, add support for recording binary source locations in IRBuilder.
*	Make more Ifs unreachable (#7094)	Thomas Lively	2024-11-27	6	-41/+48
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously the only Ifs that were typed unreachable were those in which both arms were unreachable and those in which the condition was unreachable that would have otherwise been typed none. This caused problems in IRBuilder because Ifs with unreachable conditions and value-returning arms would have concrete types, effectively hiding the unreachable condition from the logic for dropping concretely typed expressions preceding an unreachable expression when finishing a scope. Relax the conditions under which an If can be typed unreachable so that all Ifs with unreachable conditions or two unreachable arms are typed unreachable. Propagating unreachability more eagerly this way makes various optimizations of Ifs more powerful. It also requires new handling for unreachable Ifs with concretely typed arms in the Printer to ensure that printed wat remains valid. Also update Unsubtyping, Flatten, and CodeFolding to account for the newly unreachable Ifs.
*	Handle concrete values in CodeFolding (#7117)	Thomas Lively	2024-11-26	1	-123/+111
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	CodeFolding previously only worked on blocks that did not produce values. It worked on Ifs that produced values, but only by accident; the logic for folding matching tails was not written to support tails producing concrete values, but it happened to work for Ifs because subsequent ReFinalize runs fixed all the incorrect types it produced. Improve the power of the optimization by explicitly handling tails that produce concrete values for both blocks and ifs. Now that the core logic handles concrete values correctly, remove the unnecessary ReFinalize run. Also remove the separate optimization of Ifs with identical arms; this optimization requires ReFinalize and is already performed by OptimizeInstructions.
*	ReFinalize after merging siblings in TypeMerging (#7121)	Thomas Lively	2024-11-26	1	-0/+11
\| \| \| \| \| \| \| \|	The LUB of sibling types is their common supertype, but after the sibling types are merged, their LUB is the merged type, which is a strict subtype of the previous LUB. This means that merging sibling types causes `selects` to have stale types when the two select arms previously had the two merged sibling types. To fix any potential stale types, ReFinalize after merging sibling types.
*	[wasm2js] Run LLVM nontrapping fptoint lowering when running for emscripten ↵	Derek Schuff	2024-11-26	1	-0/+3
\| \| \| \| \| \|	(#7116) Lower away saturating fptoint operations when we know we are using emscripten.
*	Fix ArenaVector::swap (#7115)	Alon Zakai	2024-11-26	1	-6/+3
\| \| \| \| \|	This was never right for over a decade, and just never used I suppose... it should have been called "take" since it grabbed data from the other item and then set that other item to empty. Fix it so it swaps properly.
*	Fix memory.grow bounds and overflow checks for mem64 (#7112)	Thomas Lively	2024-11-25	1	-2/+7
\| \| \| \| \|	Previously the interpreter only executed overflow and bounds checks for memory.grow on 32-bit memories. Run the checks on 64-bit memories as well.
*	Handle unoptimized branches in CodeFolding (#7111)	Thomas Lively	2024-11-25	1	-8/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	CodeFolding previously did not consider br_on_* instructions at all, so it would happily merge tails even if there were br_on_* branches to the same label with non-matching tails. Fix the bug by making any label targeted by any instruction not explicitly handled by CodeFolding unoptimizable. This will gracefully handle other branching instructions like `resume` and `resume_throw` as well. Folding these branches properly is left as future work. Also rename the test file from code-folding_enable-threads.wast to just code-folding.wast and enable all features instead of just threads. The old name was left over from when the test was originally ported to lit, and the new feature is necessary because the new test uses GC instructions.
*	[GC] Refinalize after selectify in RemoveUnusedBrs (#7104)	Alon Zakai	2024-11-25	1	-2/+12
\| \| \| \|	Replacing an if with a select may have refined the type. Without this fix, the sharper stale type checks complain.
*	Remove AutoDrop (#7106)	Thomas Lively	2024-11-22	6	-105/+1
\| \| \| \|	The only internal use was in wasm2js, which doesn't need it. Fix API tests to explicitly drop expressions as necessary.
*	Print castType for unreachable br_on_cast{_fail} (#7107)	Thomas Lively	2024-11-22	1	-6/+4
\| \| \| \| \| \| \| \|	I forgot that there is a validation rule that the output type for br_on_cast and br_on_cast_fail must be a subtype of the input type. We were previously printing bottom input types in cases where the cast operand was unreachable, but that's only valid if the cast type is the same bottom type. Instead print the most precise valid input type, which is the cast type itself.
*	Print unreachable loads with valid types (#7108)	Thomas Lively	2024-11-22	1	-1/+9
\| \| \| \| \| \| \| \| \|	Since Load expressions use their `type` field to encode the type of the loaded value, unreachable loads need to come up with some other valid type to print. Previously we always chose i32 as that type, but that's not valid when the load was originally a v128 load with an alignment of 8, since 8 is greater than the maximum valid alignment of 4 for an i32. Fix the problem by taking alignment into account when choosing a type for the unreachable load.
*	Propagate public visibility through all types (#7105)	Thomas Lively	2024-11-21	1	-11/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Previously the classification of public types propagated public visibility only through types that had previously been collected by `collectHeapTypes`. Since there are settings that cause `collectHeapTypes` to collect fewer types, it was possible for public types to be missed if they were only public because they were reached by an uncollected types. Ensure that all public heap types are properly classified by propagating public visibility even through types that are not part of the collected output. Fixes #7103.
*	Fix printing of unreachable br_on_cast{_fail} (#7102)	Thomas Lively	2024-11-21	1	-2/+15
\| \| \| \| \| \| \| \| \| \| \| \| \|	br_on_cast and br_on_cast_fail have two type annotations: one for their input type and one for their cast type. In cases where their operands were unreachable, we were previously printing "unreachable" for the input type annotation. This is not valid wat because "unreachable" is not a reference type. To fix the problem, print the bottom type of the cast type's hierarchy as the input type for br_on_cast and br_on_cast_fail when the operand is unreachable. This ensures that the instructions have the most precise possible output type according to Wasm typing rules, so it maximizes the number of contexts in which the printed instructions are valid.
*	Make validation of stale types stricter (#7097)	Thomas Lively	2024-11-21	15	-63/+36
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We previously allowed valid expressions to have stale types as long as those stale types were supertypes of the most precise possible types for the expressions. Allowing stale types like this could mask bugs where we failed to propagate precise type information, though. Make validation stricter by requiring all expressions except for control flow structures to have the most precise possible types. Control flow structures are exempt because many passes that can refine types wrap the refined expressions in blocks with the old type to avoid the need for refinalization. This pattern would be broken and we would need to refinalize more frequently without this exception for control flow structures. Now that all non-control flow expressions must have precise types, remove functionality relating to building select instructions with non-precise types. Since finalization of selects now always calculates a LUB rather than using a provided type, remove the type parameter from BinaryenSelect in the C and JS APIs. Now that stale types are no longer valid, fix a bug in TypeSSA where it failed to refinalize module-level code. This bug previously would not have caused problems on its own, but the stale types could cause problems for later runs of Unsubtyping. Now the stale types would cause TypeSSA output to fail validation. Also fix a bug where Builder::replaceWithIdenticalType was in fact replacing with refined types. Fixes #7087.
*	[wasm2js] Properly handle loops without labels (#7100)	Alon Zakai	2024-11-21	1	-2/+7
\| \| \| \| \| \| \| \| \| \| \|	When a loop has no name, the name does not matter, but we also cannot emit the same name for all such loops, as that is invalid JS. Just do not emit a while(){} at all in that case, as no continue can exist anyhow. Fixes #7099 Also fix two missing * in error reporting logic, that was printing pointers rather than the expression we wanted to print. I think we changed how iostream prints things years ago, and forgot to update these.
*	Fuzzer: Legalize and prune the JS interface in pickPasses (#7092)	Alon Zakai	2024-11-20	1	-0/+7
\| \| \| \|	Also add a test that the ClusterFuzz run.py does not warn, which was helpful when debugging this.
*	Improve fuzzing of both closed and open world styles of modules (#7090)	Alon Zakai	2024-11-19	6	-21/+177
\| \| \| \| \| \| \| \| \| \|	Before, we would simply not export a function that had an e.g. anyref param. As a result, the modules were effectively "closed", which was good for testing full closed-world mode, but not for testing degrees of open world. To improve that, this PR allows the fuzzer to export such functions, and an "enclose world" pass is added that "closes" the wasm (makes it more compatible with closed-world) that is run 50% of the time, giving us coverage of both styles.
*	Add nontrapping-fptoint lowering pass (#7016)	Derek Schuff	2024-11-19	4	-0/+186
\| \| \| \| \| \| \| \| \| \| \|	This pass lowers nontrapping FP to int instructions to implement LLVM's conversion behavior. This means that they are not fully complete lowerings according to the wasm spec, but have the same undefined behavior that LLM does. This keeps the pass simpler and preserves existing behavior when compiling without nontrapping-ft. This will be used in emscripten, so that we can build libraries with nontrapping-fp and lower them away after link if desired.
*	Fuzzing: ClusterFuzz integration (#7079)	Alon Zakai	2024-11-19	2	-14/+123
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The main addition here is a bundle_clusterfuzz.py script which will package up the exact files that should be uploaded to ClusterFuzz. It also documents the process and bundling and testing. You can do bundle.py OUTPUT_FILE.tgz That bundles wasm-opt from ./bin., which is enough for local testing. For actually uploading to ClusterFuzz, we need a portable build, and @dschuff had the idea to reuse the emsdk build, which works nicely. Doing bundle.py OUTPUT_FILE.tgz --build-dir=/path/to/emsdk/upstream/ will bundle wasm-opt (+libs) from the emsdk. I verified that those builds work on ClusterFuzz. I added several forms of testing here. First, our main fuzzer fuzz_opt.py now has a ClusterFuzz testcase handler, which simulates a ClusterFuzz environment. Second, there are smoke tests that run in the unit test suite, and can also be run separately: python -m unittest test/unit/test_cluster_fuzz.py Those unit tests can also run on a given bundle, e.g. one created from an emsdk build, for testing right before upload: BINARYEN_CLUSTER_FUZZ_BUNDLE=/path/to/bundle.tgz python -m unittest test/unit/test_cluster_fuzz.py A third piece of testing is to add a --fuzz-passes test. That is a mode for -ttf (translate random data into a valid wasm fuzz testcase) that uses random data to pick and run a set of passes, to further shape the wasm. (--fuzz-passes had no previous testing, and this PR fixes it and tidies it up a little, adding some newer passes too). Otherwise this PR includes the key run.py script that is bundled and then executed by ClusterFuzz, basically a python script that runs wasm-opt -ttf [..] to generate testcases, sets up their JS, and emits them. fuzz_shell.js, which is the JS to execute testcases, will now check if it is provided binary data of a wasm file. If so, it does not read a wasm file from argv[1]. (This is needed because ClusterFuzz expects a single file for the testcase, so we make a JS file with bundled wasm inside it.)
*	Use hints when generating fresh labels in IRBuilder (#7086)	Thomas Lively	2024-11-18	2	-4/+9
\| \| \| \| \| \| \| \| \| \| \| \|	IRBuilder often has to generate new label names for blocks and other scopes. Previously it would generate each new name by starting with "block" or "label" and incrementing a suffix until finding a fresh name, but this made name generation quadratic in the number of names to generate. To spend less time generating names, track a hint index at which to start looking for a fresh name and increment it every time a name is generated. This speeds up a version of the binary parser that uses IRBuilder by about 15%.
*	[NFC] Finalize blocks with explicit breakability in IRBuilder (#7085)	Thomas Lively	2024-11-18	2	-4/+16
\| \| \| \| \| \|	Since IRBuilder already knows what labels are used by branches, it is easy for it to pass that information when finalizing blocks. This avoids finalization having to walk the blocks looking for branches, speeding up a future version of the binary parser that uses IRBuilder by 10%.
*	Rename memory-copy-fill-lowering pass (#7082)	Derek Schuff	2024-11-16	4	-8/+8
\| \| \| \|	Since the resulting code has the same undefined behavior as LLVM, make the pass name reflect that.
*	[NFC] Remove redundant [[nodiscard]] attributes (#7084)	Thomas Lively	2024-11-15	2	-150/+140
\| \| \| \| \| \|	Now that Result and MaybeResult are annotated [[nodiscard]] at the type level, individual functions and methods that return these types do not need to be annotated [[nodiscard]] themselves. Remove the newly redundant annotations.
*	Mark Result and MaybeResult [[nodiscard]] (#7083)	Thomas Lively	2024-11-15	2	-5/+5
\| \| \| \| \| \|	Since these types may be carrying errors that need to be handled or propagated, it is always an error not to use them in some way. Adding the [[nodiscard]] attribute caused the compiler to find a few instances where we were incorrectly ignoring results. Fix these places.
*	Reset function context when ending a function in IRBuilder (#7081)	Thomas Lively	2024-11-15	4	-10/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	IRBuilder contains a pointer to the current function that is used to create scratch locals, look up the operand types for returns, etc. This pointer is nullable because IRBuilder can also be used in non-function contexts such as global initializers. Visiting the start of a function sets the function pointer, and after this change visiting the end of a function resets the pointer to null. This avoids potential problems where code outside a function would be able to incorrectly use scratch locals and returns if the IRBuilder had previously been used to build a function. This change requires some adjustments to Outlining, which visits code out of order, so ends up visiting code from inside a function after visiting the end of the function. To support this use case, add a `setFunction` method to IRBuilder that lets the user explicitly control its function context. Also remove the optional function pointer parameter to the IRBuilder constructor since it is less flexible and not used.
*	Use empty blocks instead of nops for empty scopes in IRBuilder (#7080)	Thomas Lively	2024-11-14	2	-3/+5
\| \| \| \| \| \| \| \| \| \|	When IRBuilder builds an empty non-block scope such as a function body, an if arm, a try block, etc, it needs to produce some expression to represent the empty contents. Previously it produced a nop, but change it to produce an empty block instead. The binary writer and printer have special logic to elide empty blocks, so this produces smaller output. Update J2CLOpts to recognize functions containing empty blocks as trivial to avoid regressing one of its tests.
*	Record binary locations for nested blocks (#7078)	Thomas Lively	2024-11-14	1	-0/+20
\| \| \| \| \| \| \| \| \| \| \|	The binary reader has special handling for blocks immediately nested inside other blocks to eliminate recursion while parsing very deep stacks of blocks. This special handling did not record binary locations for the nested blocks, though. Add logic to record binary locations for nested blocks. This binary reading code is about to be replaced with completely different code that uses IRBuilder instead, but this change will eliminate some test differences that we would otherwise see when we make that change.