| Commit message | Author | Age | Files | Lines |
|
|
|
|
|
| |
We have `WasmBinaryBuilder`, which reads binary into Binaryen IR, and
`WasmBinaryWriter`, which writes Binaryen IR to binary. To me
`WasmBinaryBuilder` sounds similar to `WasmBinaryWriter`, as if it also
builds binary. How about renaming it to `WasmBinaryReader`?
|
|
|
|
| |
This code predates our adoption of C++14 and can now be removed in favor of
`std::make_unique`, which should be more efficient.
|
|
|
|
|
|
|
|
|
|
|
| |
Followup to #5293, this fixes a small regression there regarding assertions. We still need
to visit non-instrumented functions if we want assertions, as we assert on some
things there, namely that such functions do not change the state (if they changed it,
we'd need to instrument them to handle that properly).
This moves that logic into a new pass, which we run when assertions are enabled.
The test diff basically undoes part of the test diff from that earlier PR for that one file.
|
|
|
|
|
|
|
|
|
| |
Add a way to proxy passes and the addition of passes in pass runners. With
that we can make Asyncify only modify the functions it actually needs to. On a
project where Asyncify only needs to modify a few functions, this can save
a huge amount of time, as it avoids flattening and optimizing the majority of
the module.
Fixes #4822
|
| |
|
| |
|
|
|
|
| |
This is more modern and (IMHO) easier to read than that old C typedef
syntax.
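For illustration, the kind of change this implies (hypothetical names, not the actual Binaryen typedefs):
```cpp
#include <map>
#include <string>

// Old C-style typedef syntax.
typedef void (*Callback)(int);

// Equivalent modern alias declaration.
using CallbackAlias = void (*)(int);

// Unlike typedef, alias declarations also work for templates.
template<typename T>
using NameToPtrMap = std::map<std::string, T*>;
```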
|
|
|
| |
Adds support for the Asyncify pass to use Multi-Memories. This is enabled by passing the flag --asyncify-in-secondary-memory. Another flag, --asyncify-secondary-memory-size, specifies the initial and maximum size of the secondary memory.
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
With the goal of supporting null characters (i.e. zero bytes) in strings,
rewrite the underlying interned `IString` to store a `std::string_view` rather
than a `const char*`, reduce the number of map lookups necessary to intern a
string, and present a more immutable interface.
Most importantly, replace the `c_str()` method that returned a `const char*`
with a `toString()` method that returns a `std::string`. This new method can
correctly handle strings containing null characters. A `const char*` can still
be had by calling `data()` on the `std::string_view`, although this usage should
be discouraged.
This change is NFC in spirit, although not in practice. It does not intend to
support any particular new functionality, but it is probably now possible to use
strings containing null characters in at least some cases. At least one parser
bug is also incidentally fixed. Follow-on PRs will explicitly support and test
strings containing nulls for particular use cases.
The C API still uses `const char*` to represent strings. As strings containing
nulls become better supported by the rest of Binaryen, this will no longer be
sufficient. Updating the C and JS APIs to use pointer, length pairs is left as
future work.
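A minimal sketch of the resulting shape, assuming a simplified stand-in for the real `IString` (hypothetical member list):
```cpp
#include <string>
#include <string_view>

// Simplified stand-in: the interned string is backed by a string_view into
// a stable, interned buffer, so embedded null bytes keep their length.
struct InternedString {
  std::string_view str;

  // Copies the bytes, preserving any embedded null characters.
  std::string toString() const { return std::string(str); }

  // Raw pointer access remains possible but is discouraged, since a bare
  // const char* loses the length when the string contains null bytes.
  const char* data() const { return str.data(); }
};
```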
|
|
|
|
|
| |
The emscripten side is a little tricky but I've got some tests passing.
Currently blocked on:
https://github.com/emscripten-core/emscripten/issues/17969
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Previously only WalkerPasses had access to the `getPassRunner` and
`getPassOptions` methods. Move those methods to `Pass` so all passes can use
them. As a result, the `PassRunner` passed to `Pass::run` and
`Pass::runOnFunction` is no longer necessary, so remove it.
Also update `Pass::create` to return a unique_ptr, which is more efficient than
having it return a raw pointer only to have the `PassRunner` wrap that raw
pointer in a `unique_ptr`.
Delete the unused template `PassRunner::getLast()`, which looks like it was
intended to enable retrieving previous analyses and has been in the code base
since 2015 but is not implemented anywhere.
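A rough sketch of the resulting interface (simplified, not the exact Binaryen declarations):
```cpp
#include <memory>

struct Module {};
struct PassOptions {};
class PassRunner;

class Pass {
public:
  virtual ~Pass() = default;

  // Previously only WalkerPasses had these; moving them here lets every
  // pass query its runner and options.
  PassRunner* getPassRunner() { return runner; }
  PassOptions& getPassOptions() { return options; }
  void setPassRunner(PassRunner* r) { runner = r; }

  // The runner argument is gone from run()...
  virtual void run(Module* module) = 0;

  // ...and create() hands ownership to the runner directly.
  virtual std::unique_ptr<Pass> create() { return nullptr; }

private:
  PassRunner* runner = nullptr;
  PassOptions options;
};
```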
|
|
|
|
|
|
|
| |
This PR removes the single memory restriction in IR, adding support for a single module to reference multiple memories. To support this change, a new memory name field was added to 13 memory instructions in order to identify the memory for the instruction.
It is a goal of this PR to maintain backwards compatibility with existing text and binary wasm modules, so memory indexes remain optional for memory instructions. Similarly, the JS API makes assumptions about which memory is intended when only one memory is present in the module. Another goal of this PR is that existing tests' behavior be unaffected. That said, tests must now explicitly define a memory before invoking memory instructions or exporting a memory, and memory names are now printed for each memory instruction in the text format.
There remain quite a few places where a hardcoded reference to the first memory persists (memory flattening, for example, will return early if more than one memory is present in the module). Many of these call sites, particularly within passes, will require us to rethink how the optimization works in a multi-memories world. Other call sites may necessitate more invasive code restructuring to fully convert away from relying on a globally available, single memory pointer.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Lift the restriction in liveness-traversal.h that supported max 65535 locals in a function. (#4567)
* Lift the restriction in liveness-traversal.h that supported max 65535 locals in a function.
* Lint
* Fix typo
* Fix static
* Lint
* Lint
* Lint
* Add needed canRun function
* lint
* Use either a sparse or a dense matrix for tracking liveness copies, depending on the locals count.
* Lint
* Fix lint
* Lint
* Implement sparse_square_matrix class and use that as a backing.
* Lint
* Lint
* Lint #includes
* Lint
* Lint includes
* Remove unnecessary code
* Fix canonical accesses to copies matrix
* Lint
* Add missing variable update
* Remove canRun() function
* Address review
* Update expected test results
* Update test name
* Add asserts to sparse_square_matrix set and get functions that they are not out of bounds.
* Lint includes
* Update test expectation
* Use .clear() + .resize() to reset totalCopies vector
|
|
|
|
|
|
|
| |
Related: emscripten-core/emscripten#15893 (comment)
The --pass-arg=asyncify-side-module option will be used not only for
side modules, but also for main modules.
|
|
|
|
|
| |
Rewrite AsyncifyFlow::process to use an explicit stack instead of recursive calls.
This patch resolves #4401.
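The general technique is to replace the native call stack with an explicit worklist; a standalone sketch (not the actual AsyncifyFlow code):
```cpp
#include <stack>
#include <vector>

struct Node {
  std::vector<Node*> children;
};

// Recursive form: deep trees can overflow the native call stack.
void processRecursive(Node* node) {
  // ... handle node ...
  for (auto* child : node->children) {
    processRecursive(child);
  }
}

// Iterative form: the explicit stack grows on the heap instead.
void processIterative(Node* root) {
  std::stack<Node*> work;
  work.push(root);
  while (!work.empty()) {
    Node* node = work.top();
    work.pop();
    // ... handle node ...
    for (auto* child : node->children) {
      work.push(child);
    }
  }
}
```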
|
|
|
|
|
|
|
|
|
|
|
| |
This PR is part of the solution to emscripten-core/emscripten#15594.
Emscripten's Asyncify won't work properly in side modules, because the
globals __asyncify_state and __asyncify_data are not synchronized
between the main module and side modules.
A new pass arg, asyncify-side-module, is added to make
__asyncify_state and __asyncify_data imported globals in the instrumented
wasm.
|
| |
|
|
|
| |
Helps #3739
|
|
|
|
| |
relevantLiveLocals (#4108)
|
|
|
|
|
|
|
|
|
| |
When using nominal types, func.ref of two functions with identical signatures
but different HeapTypes will yield different types. To preserve these semantics,
Functions need to track their HeapTypes, not just their Signatures.
This PR replaces the Signature field in Function with a HeapType field and adds
new utility methods to make it almost as simple to update and query the function
HeapType as it was to update and query the Function Signature.
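Schematically, the change looks something like this (a simplified sketch with stand-in types, not the actual declarations):
```cpp
// Stand-ins for the real Binaryen types.
struct Type {};
struct Signature {
  Type params;
  Type results;
};
struct HeapType {
  // In the IR a HeapType identifies a specific (possibly nominal) type;
  // here it just carries a signature for illustration.
  Signature sig;
  Signature getSignature() const { return sig; }
};

struct Function {
  // Previously: Signature sig. Keeping the HeapType means two functions
  // with identical signatures can still have distinct types.
  HeapType type;

  // Utility accessors keep querying as simple as before.
  Signature getSig() const { return type.getSignature(); }
  Type getParams() const { return getSig().params; }
  Type getResults() const { return getSig().results; }
};
```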
|
| |
|
|
|
|
|
|
|
|
|
| |
As found in #3682, the current implementation of type ordering is not correct,
and although the immediate issue would be easy to fix, I don't think the current
intended comparison algorithm is correct in the first place. Rather than try to
switch to using a correct algorithm (which I am not sure I know how to
implement, although I have an idea) this PR removes Type ordering entirely. In
places that used Type ordering with std::set or std::map because they require
deterministic iteration order, this PR uses InsertOrdered{Set,Map} instead.
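The idea behind an insert-ordered container is simple; a minimal sketch (Binaryen's `InsertOrderedSet` differs in detail):
```cpp
#include <iterator>
#include <list>
#include <unordered_map>

// Deterministic iteration in insertion order, with no requirement that the
// element type define an ordering (operator<).
template<typename T>
struct SimpleInsertOrderedSet {
  std::list<T> items;                                            // iteration order
  std::unordered_map<T, typename std::list<T>::iterator> index; // fast lookups

  bool insert(const T& x) {
    if (index.count(x)) {
      return false;
    }
    items.push_back(x);
    index[x] = std::prev(items.end());
    return true;
  }

  bool count(const T& x) const { return index.count(x) > 0; }
};
```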
|
|
|
|
|
|
|
|
|
| |
When Functions, Globals, Events, and Exports are added to a module, if they are
not already in std::unique_ptrs, they are wrapped in a new std::unique_ptr owned
by the Module. This adds an extra layer of indirection when accessing those
elements, which can be avoided by allocating the elements as std::unique_ptrs in the first place.
This PR updates wasm-builder to allocate module elements via std::make_unique
rather than `new`. In the future, we should remove the raw pointer versions of
Module::add* to encourage using std::unique_ptrs more broadly.
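The pattern, in a simplified sketch (hypothetical signatures rather than the exact wasm-builder/Module code):
```cpp
#include <memory>
#include <string>
#include <utility>
#include <vector>

struct Function {
  std::string name;
};

struct Module {
  std::vector<std::unique_ptr<Function>> functions;

  // Ownership is handed over directly; no wrapping of a raw pointer.
  Function* addFunction(std::unique_ptr<Function> func) {
    functions.push_back(std::move(func));
    return functions.back().get();
  }
};

Function* makeFunction(Module& module, const std::string& name) {
  auto func = std::make_unique<Function>();
  func->name = name;
  return module.addFunction(std::move(func));
}
```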
|
|
|
|
|
|
|
|
|
|
| |
CallRef (#3355)
This is in preparation for CallRef, which takes a reference to a function
and calls it.
CallGraphPropertyAnalysis needs to be aware of anything that is not a
direct call, and "NonDirect" is meant to cover both CallIndirect and
CallRef.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
The DCE pass is one of the oldest in binaryen, and had quite a lot of
cruft from the changes in unreachability and other stuff in wasm and
binaryen's history. This PR rewrites it from scratch, making it about
1/3 the size.
I noticed this when looking for places to use code autogeneration.
The old version had annoying boilerplate, while the new one avoids
any need for it.
There may be noticeable differences, as the old pass did more than
it needed to. It overlapped with remove-unused-names for some
reason I don't remember. The new pass leaves that to the other
pass to do. I added another run of remove-unused-names to avoid
noticeable differences in optimized builds, but you can see
differences in the testcases that only run DCE by itself. (The test
differences in this PR are mostly whitespace.)
(The overlap is that if a block ended up not needed, that is, all
branches to it were removed, the old DCE would remove the block.)
This pass is about 15% faster than the old version. However, when
adding another run of remove-unused-names the difference
basically vanishes, so this isn't a speedup.
|
|
|
| |
Since they make the code clearer and more self-documenting.
|
|
|
| |
This leads to simpler code and is a prerequisite for #3012, which makes it so that not all `Type`s are backed by vectors that `expand` could return.
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
This logs out the decisions made about instrumenting functions, which
can help figure out why a function is instrumented, or help get a list of
what might need to be.
As the test shows, it can print things like this:
[asyncify] import is an import that can change the state
[asyncify] calls-import can change the state due to import
[asyncify] calls-calls-import can change the state due to calls-import
[asyncify] calls-calls-calls-import can change the state due to calls-calls-import
(the test has calls-calls-calls-import => calls-calls-import => calls-import => import).
|
| |
|
|
|
|
|
|
|
|
|
| |
This finds out which locals are live at call sites that might pause/resume,
which is the set of locals we need to actually save/load. That is, if a local
is not live at any call site in the function, then its value doesn't need to
stay alive while sleeping.
This saves about 10% of locals that are saved/loaded, and about 1.5%
in final code size.
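Conceptually, the set of locals to persist is just the union of the live sets at possibly-pausing call sites; a standalone sketch (not the actual pass):
```cpp
#include <set>
#include <vector>

using LocalSet = std::set<unsigned>;

struct CallSite {
  bool mayPause;       // could this call unwind/rewind?
  LocalSet liveLocals; // locals live across this call
};

// Only locals that are live at some possibly-pausing call site need to be
// saved to and restored from the Asyncify stack.
LocalSet localsToPersist(const std::vector<CallSite>& calls) {
  LocalSet result;
  for (const auto& call : calls) {
    if (call.mayPause) {
      result.insert(call.liveLocals.begin(), call.liveLocals.end());
    }
  }
  return result;
}
```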
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
(#2913)
When doing manual tuning of calls using asyncify lists, we want it to
be possible to write out all the functions that can be on the stack when
pausing, and to have that just work. This did not quite work right with the
ignore-indirect option: that would ignore all indirect calls all the
time, so that if foo() calls bar() indirectly, that indirect call was
not instrumented (we didn't check for a pause around it), even if
both foo() and bar() were listed. There was no way to make that
work (except for not ignoring indirect calls at all).
This PR makes the add-list and only-list fully instrument the functions
mentioned in them: both themselves, and indirect calls from them.
(Note that direct calls need no special handling - we can just add
the direct call target to the add-list or only-list.)
This may add some overhead to existing users, but only in a function
that is instrumented anyhow, and also indirect calls are slow anyhow,
so it's probably fine. And it is simpler to do it this way instead of
adding another list for indirect call handling.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Asyncify does a whole-program analysis to figure out the list of functions
to instrument. In
emscripten-core/emscripten#10746 (comment)
we realized that we need another type of list there, an "add list" which is a
list of functions to add to the instrumented functions list, that is, that we
should definitely instrument.
The use case in that link is that we disable indirect calls, but there is
one special indirect call that we do need to instrument. Being able to add
just that one can be much more efficient than assuming all indirect calls in
a big codebase need instrumentation. Similar issues can come up if we
add a profile-guided option to asyncify, which we've discussed.
The existing lists were not good enough to allow that, so a new option
is needed. I took the opportunity to rename the old ones to something
better and more consistent, so after this PR we have 3 lists as follows:
* The old "remove list" (previously "blacklist") which removes functions
from the list of functions to be instrumented.
* The new "add list" which adds to that list (note how add/remove are
clearly parallel).
* The old "only list" (previously "whitelist") which simply replaces the
entire list, and so only those functions are instrumented and no other.
This PR temporarily still supports the old names in the commandline
arguments, to avoid immediate breakage for our CI.
|
|
|
|
|
| |
Instead of adding globals for hardcoded basic types, traverse the
module to collect all call types that might need to be handled and
emit a global for each of them. Adapted from #2712.
|
|
|
|
|
|
|
|
| |
This reverts commit 5ddda8d2e6a3287ff6adcd69493e1e1c8b6c3872.
We decided it would be easier to allow tuple-typed globals than to
make calls work here after all. Reverts that change, but keeps small
improvements it made for clarity.
|
|
|
|
|
| |
This will be easier to extend for tuples.
Also add more clarifying comments.
|
|
|
| |
Iterate over tuple locals and separately load or store each component.
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
We assumed that the imports were already properly named (in their
internal names). When processing a binary file without
names, or if the names don't match in general, that's not true.
To fix this, use ModuleUtils::renameFunctions to do a proper
renaming up front.
Also fix renameFunctions to not assert on the case of
renaming a function to the same name it already has.
Helps #2680
|
|
|
|
|
|
|
|
|
|
| |
Normally, a wrapper has to track state separately to know when to
unwind/rewind and when to actually call import functions.
Exposing Asyncify state can help avoid this duplication and avoid
subtle bugs when internal and wrapper state get out of sync.
Since this is a tiny function and it's useful for any Asyncify
embedder, I've decided to expose it by default rather than hide it behind an option.
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
We ignored them, which is a bad default, as typically they imply
we can call anything in the table (and the table might change).
Instead, notice indirect calls during traversal, and force the user
to decide whether to ignore them or not.
This was only an issue in PostEmscripten because the other
user, Asyncify, already had indirect call analysis because it
needed it for other things.
Fixes a bug uncovered by #2619 and fixes the current binaryen
roll.
|
| |
|
|
|
|
|
|
|
|
|
|
|
| |
Several type-related functions currently exist outside of the `Type`
class and thus in the `wasm` namespace, which is effectively global. This moves
these functions into the `Type` class, making them either member functions
or static functions.
This also renames `getSize` to `getByteSize` so that it is not
confused with `size`, which returns the number of types a `Type`
holds. It also reorders the functions in `wasm-type.cpp` to
match the order in `wasm-type.h`.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Function signatures were previously redundantly stored on Function
objects as well as on FunctionType objects. These two signature
representations had to always be kept in sync, which was error-prone
and needlessly complex. This PR takes advantage of the new ability of
Type to represent multiple value types by consolidating function
signatures as a pair of Types (params and results) stored on the
Function object.
Since there are no longer module-global named function types,
significant changes had to be made to the printing and emitting of
function types, as well as their parsing and manipulation in various
passes.
The C and JS APIs and their tests also had to be updated to remove
named function types.
|
|
|
|
|
| |
This works more like LLVM's unreachable handler in that it preserves
information even in release builds.
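The general pattern is a macro that reports and aborts even when `NDEBUG` is defined; a hypothetical sketch (not the exact Binaryen macro):
```cpp
#include <cstdio>
#include <cstdlib>

// Unlike assert(), this still reports the message and location in
// release (NDEBUG) builds before aborting.
#define UNREACHABLE_WITH_MESSAGE(msg)                                   \
  do {                                                                  \
    std::fprintf(stderr, "unreachable: %s at %s:%d\n", msg, __FILE__,   \
                 __LINE__);                                             \
    std::abort();                                                       \
  } while (0)
```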
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Adds the ability to create multivalue types from vectors of concrete value
types. All types are transparently interned, so their representation is still a
single uint32_t. Types can be extracted into vectors of their component parts,
and all the single value types expand into vectors containing themselves.
Multivalue types are not yet used in the IR, but their creation and inspection
functionality is exposed and tested in the C and JS APIs.
Also makes common type predicates methods of Type and improves the ergonomics of
type printing.
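To illustrate the interning idea with a toy sketch (not Binaryen's actual implementation): equal tuples of value types map to the same small id, so a multivalue type still fits in a single uint32_t.
```cpp
#include <cstdint>
#include <map>
#include <vector>

using ValueType = uint32_t;

struct TypeInterner {
  std::map<std::vector<ValueType>, uint32_t> ids;
  std::vector<std::vector<ValueType>> tuples;

  // Creating a multivalue type from a vector of value types returns an id;
  // identical vectors always get the same id.
  uint32_t intern(const std::vector<ValueType>& parts) {
    auto [it, inserted] = ids.emplace(parts, (uint32_t)tuples.size());
    if (inserted) {
      tuples.push_back(parts);
    }
    return it->second;
  }

  // Expanding an id recovers the component value types.
  const std::vector<ValueType>& expand(uint32_t id) const {
    return tuples[id];
  }
};
```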
|
|
|
|
|
|
|
|
|
|
|
| |
This moves code out of Asyncify into a general helper class. The class
automates scanning the functions for a property, then propagating it to
functions that call them. In Asyncify, the property is "may call something
that leads to sleep", and we propagate backwards to callers, to find
all those that may sleep.
This will be useful in a future exceptions-optimizing pass I want to write,
where the property will be "may throw". We will then be able to remove
exceptions overhead in cases that definitely do not throw.
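The core of such a helper is a fixed-point propagation over the reverse call graph; a standalone sketch (not the actual Binaryen class):
```cpp
#include <map>
#include <string>
#include <vector>

struct FuncInfo {
  bool hasProperty = false;         // e.g. "may sleep" or "may throw"
  std::vector<std::string> callers; // reverse call graph edges
};

// Propagate the property backwards: if a function has it, every caller
// (transitively) has it too.
void propagateToCallers(std::map<std::string, FuncInfo>& funcs) {
  std::vector<std::string> work;
  for (auto& [name, info] : funcs) {
    if (info.hasProperty) {
      work.push_back(name);
    }
  }
  while (!work.empty()) {
    auto name = work.back();
    work.pop_back();
    for (auto& caller : funcs[name].callers) {
      if (!funcs[caller].hasProperty) {
        funcs[caller].hasProperty = true;
        work.push_back(caller);
      }
    }
  }
}
```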
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
These passes are meant to be run after Asyncify has been run; they modify its
output. We can assume that we will always unwind if we reach an import, or
that we will never unwind, etc.
This is meant to help with lazy code loading, that is, the ability for an
initially-downloaded wasm to not contain all the code; if code not present
there is called, we download the rest and continue with that. That could
work something like this:
* The wasm is created. It contains calls to a special import for lazy code
loading.
* Asyncify is run on it.
* The initially downloaded wasm is created by running
--mod-asyncify-always-and-only-unwind: if the special import for lazy code
loading is called, we will definitely unwind, and we won't rewind in this binary.
* The lazily downloaded wasm is created by running --mod-asyncify-never-unwind:
we will rewind into this binary, but no longer need support for unwinding.
(Optionally, there could also be a third wasm, which has not had Asyncify run
on it, and which we'd swap to for max speed.)
These --mod-asyncify passes allow the optimizer to do a lot of work, especially
for the initially downloaded wasm if we have lots of calls to the lazy code
loading import. In that case the optimizer will see that those calls unwind,
which means the code after them is not reached, potentially making lots of code
dead and removable.
|
|
|
|
|
|
|
|
|
| |
This is part of the fix for
https://logs.chromium.org/logs/emscripten-releases/buildbucket/cr-buildbucket.appspot.com/8901492015302662960/+/steps/Emscripten_testsuite__upstream__other_/0/stdout
Specifically, it fixes the escaping: the name shown there should not be escaped.
Followup for #2344
|
|
|
| |
See emscripten-core/emscripten#9381 for rationale.
|