forks/binaryen.git -

	Commit message (Collapse)	Author	Age	Files	Lines
*	[Wasm GC] [ctor-eval] Evaluate and serialize GC data (#4491)	Alon Zakai	2022-02-03	9	-0/+264
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This ended up simpler than I thought. We can simply emit global and local data as we go, creating globals as necessary to contain GC data, and referring to them using global.get later. That will ensure that data identity works (things referring to the same object in the interpreter will refer to the same object when the wasm is loaded). In more detail, each live GC item is created in a "defining global", a global that is immutable and of the precise type of that data. Then we just read from that location in any place that wants to refer to that data. That is, something like function foo() { var x = Bar(10); var y = Bar(20); var z = x; z.value++; // first object now contains 11 ... } will be evalled into something like var define$0 = Bar(11); // note the ++ has taken effect here var define$1 = Bar(20); function foo() { var x = define$0; var y = define$1; var z = define$0; ... } This PR should handle everything but "cycles", that is, GC data that at runtime ends up forming a loop. Leaving that for later work (not sure how urgent it is to fix).
*	Interpreter: Remove GlobalManager (#4486)	Alon Zakai	2022-01-31	6	-4/+59
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	GlobalManager is another class that added complexity in the interpreter logic, and did not help. In fact it hurts extensibility, as when one wants to extend the interpreter one has another class to customize, and it is templated on the main runner, so again as #4479 we end up with annoying template cycles. This simply removes that class. That makes the interpreter code strictly simpler. Applying that change to wasm-ctor-eval also ends up fixing a pre-existing bug, so this PR gets testing through that. The ctor-eval issue was that we did not extend the GlobalManager properly in the past: we checked for accesses on imported globals there, but not in the main class, i.e., not on global.get operations. Needing to do things in two places is an example of the previous complexity. The fix is simply to implement visitGlobalGet in one place, and remove all the GlobalManager logic added in ctor-eval, which then gets a lot simpler as well. The new imported-global-2.wast checks for that bug (a global.get of an import should stop us from evalling). Existing tests cover the other cases, like it being ok to read a non-imported global, etc. The existing test indirect-call3.wast required a slight change: There was a global.get of an imported global, which was ignored in the place it happened (an init of an elem segment); the new code checks all global.gets, so it now catches that.
*	[ctor-eval] Eval functions with params if ignoring external input (#4446)	Alon Zakai	2022-01-12	7	-9/+71
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When ignoring external input, assume params have a value of 0. This makes it possible to eval main(argc, argv) if one is careful and does not actually use those values. This is basically a workaround for main always receiving argc/argv, even if the C code has no args (in that case the compiler emits __original_main for the user's main, and wraps it with a main that adds the args, hence the problem). This is similar to the existing support for handling wasi_args_get when ignoring external input, although it just sets values of zeros for the params. Perhaps it could check for main() specifically and return 1 for argc and a proper buffer for argv somehow, but I think if a program wants to use --ignore-external-input it can avoid actually reading argc/argv.
*	[ctor-eval] Eval functions with a return value (#4443)	Alon Zakai	2022-01-12	3	-12/+97
\| \| \|	This is necessary for e.g. main() which returns an i32.
*	[ctor-eval] Stop if there are any memory.init instructions (#4442)	Alon Zakai	2022-01-11	3	-0/+41
\| \| \| \| \| \| \| \|	This tool depends (atm) on flattening memory segments. That is not compatible with memory.init which cares about segment identities. This changes flatten() only by adding the check for MemoryInit. The rest is unchanged, although I saw the other two params are not needed and I removed them while I was there.
*	[ctor-eval] Add an option to keep some exports (#4441)	Alon Zakai	2022-01-11	3	-5/+85
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	By default wasm-ctor-eval removes exports that it manages to completely eval (if it just partially evals then the export remains, but points to a function with partially-evalled contents). However, in some cases we do want to keep the export around even so, for example during fuzzing (as the fuzzer wants to call the same exports before and after wasm-ctor-eval runs) and also if there is an ABI we need to preserve (like if we manage to eval all of main()), or if the function returns a value (which we don't support yet, but this is a PR to prepare for that). Specifically, there is now a new option: --kept-exports foo,bar That is a list of exports to keep around. Note that when we keep around an export after evalling the ctor we make the export point to a new function. That new function just contains a nop, so that nothing happens when it is called. But the original function is kept around as it may have other callers, who we do not want to modify.
*	[ctor-eval] Fix evalling of overlapping table segments (#4440)	Alon Zakai	2022-01-11	3	-0/+49
\|
*	[ctor-eval] Partial evaluation (#4438)	Alon Zakai	2022-01-11	20	-23/+246
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This lets us eval part of a function but not all, which is necessary to handle real-world things like __wasm_call_ctors in LLVM output, as that is the single ctor that is exported and it has calls to the actual ctors. To do so, we look for a toplevel block and execute its items one by one, in a FunctionScope. If we stop in the middle, then we are performing a partial eval. In that case, we only remove the parts of the function that we removed, and we also serialize the locals whose values we read from the FunctionScope. For example, consider this: function foo() { return 10; } function __wasm_call_ctors() { var x; x = foo(); x++; // We stop evalling here. import1(); import2(x); } We can eval x = foo() and x++, but we must stop evalling when we reach the first of those imports. The partially-evalled function then looks like this: function __wasm_call_ctors() { var x; x = 11; import1(); import2(x); } That is, we evalled two lines of executing code and simply removed them, and then we wrote out the value of the local at that point, and then the rest of the code in the function is as it used to be.
*	[ctor-eval] Eval and store changes to globals (#4430)	Alon Zakai	2022-01-07	12	-50/+45
\| \| \| \| \| \| \| \| \| \|	This is necessary for being able to optimize real-world code, as it lets us use the stack pointer for example. With this PR we allow changes to globals, and we simply store the final state of the global in the global at the end. Basically the same as we do for memory, but for globals. Remove a test that now fails ("imported2"). Replace it with a nicer test of saving the values of globals. Also add a test for an imported global, which we do not allow (we never did, but I don't see a test for it).
*	[ctor-eval] Add --ignore-external-input option (#4428)	Alon Zakai	2022-01-06	3	-0/+81
\| \| \| \| \| \| \| \| \| \| \| \|	This is meant to address one of the main limitations of wasm-ctor-eval in emscripten atm, that libc++ global ctors will read env vars, which means they call an import, which stops us from evalling, emscripten-core/emscripten#15403 (comment) To handle that, this adds an option to ignore external input. When set, we can assume that no env vars will be read, no reading from stdin, no arguments to main(), etc. Perhaps these could each be separate options, but I think keeping it simple for now might be good enough.
*	[ctor-eval] Remove stack hacks (#4429)	Alon Zakai	2022-01-06	9	-131/+0
\| \| \| \| \| \| \| \| \|	Remove some hackish code for fastcomp's stack handling. The stack pointer arrives in an imported global there. Upstream does not do this, so this code is completely unneeded these days (and, frankly, kind of scary as I read it now... it modeled the stack as separate memory from the heap...). Remove the tests for this as well. I verified that there was nothing else in those tests that we need to keep.
*	Remove Type ordering (#3793)	Thomas Lively	2021-05-18	1	-1/+1
\| \| \| \| \| \| \| \| \|	As found in #3682, the current implementation of type ordering is not correct, and although the immediate issue would be easy to fix, I don't think the current intended comparison algorithm is correct in the first place. Rather than try to switch to using a correct algorithm (which I am not sure I know how to implement, although I have an idea) this PR removes Type ordering entirely. In places that used Type ordering with std::set or std::map because they require deterministic iteration order, this PR uses InsertOrdered{Set,Map} instead.
*	Reorder global definitions in Print pass (#3770)	Abbas Mashayekh	2021-04-02	1	-1/+1
\| \| \| \|	This is needed to make sure globals are printed before element segments, where `global.get` can appear both as offset and an expression.
*	Properly use text format type names in printing (#3591)	Alon Zakai	2021-02-23	5	-7/+7
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds a TypeNames entry to modules, which can store names for types. So far this PR uses that to store type names from text format. Future PRs will add support for field names and for the binary format. (Field names are added to wasm.h here to see if we agree on this direction.) Most of the work here is threading a module through the various functions in Print.cpp. This keeps the module optional, so that we can still print an expression independently of a module, which has always been the case, and which I think we should keep (but, if a module was mandatory perhaps this would be a little simpler, and could be refactored into a form that depends on that). 99% of this diff are test updates, since almost all our tests use the text format, and many of them specify a type name but we used to ignore it. This is a step towards a proper solution for #3589
*	[reference-types] remove single table restriction in IR (#3517)	Abbas Mashayekh	2021-02-09	3	-3/+3
\| \| \|	Adds support for modules with multiple tables. Adds a field for the table name to `CallIndirect` and updates the C/JS APIs accordingly.
*	Remove exnref and br_on_exn (#3505)	Heejin Ahn	2021-01-22	2	-3/+3
\| \| \|	This removes `exnref` type and `br_on_exn` instruction.
*	Update reference types (#3084)	Daniel Wirtz	2020-09-09	2	-2/+2
\| \| \| \| \| \| \|	Align with the current state of the reference types proposal: * Remove `nullref` * Remove `externref` and `funcref` subtyping * A `Literal` of a nullable reference type can now represent `null` (previously was type `nullref`) * Update the tests and temporarily comment out those tests relying on subtyping
*	Remove function index printing (#2742)	Thomas Lively	2020-04-09	11	-17/+17
\| \| \| \| \| \| \| \|	`BinaryIndexes` was only used in two places (Print.cpp and wasm-binary.h), so it didn't seem to be a great fit for module-utils.h. This change moves it to wasm-binary.h and removes its usage in Print.cpp. This means that function indexes are no longer printed, but those were of limited utility and were the source of annoying noise when updating tests, anyway.
*	Trap when call_indirect's signatures mismatch (#2636)	Heejin Ahn	2020-02-03	3	-0/+39
\| \| \| \| \| \| \| \| \| \| \|	This makes the interpreter trap when the signature in `call_indirect` instruction and that of the actual function in the table mismatch. This also makes the `wasm-ctor-eval` not evaluate `call_indirect` in case the signatures mismatch. Before we only compared the arguments' signature and the function signature, which was sufficient before we had subtypes, but now the signature in `call_indirect` and that of the actual function can be different even if the argument's signature is OK.
*	Remove FunctionType (#2510)	Thomas Lively	2019-12-11	10	-27/+27
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Function signatures were previously redundantly stored on Function objects as well as on FunctionType objects. These two signature representations had to always be kept in sync, which was error-prone and needlessly complex. This PR takes advantage of the new ability of Type to represent multiple value types by consolidating function signatures as a pair of Types (params and results) stored on the Function object. Since there are no longer module-global named function types, significant changes had to be made to the printing and emitting of function types, as well as their parsing and manipulation in various passes. The C and JS APIs and their tests also had to be updated to remove named function types.
*	Refactor type and function parsing (#2143)	Heejin Ahn	2019-05-24	7	-19/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	- Refactored & fixed typeuse parsing rules so now the rules more closely follow the spec. There have been multiple parsing rules that were different in subtle ways, which are supposed to be the same according to the spec. - Duplicate types, i.e., types with the same signature, in the type section are allowed as long as they don't have the same given name. If a name is given, we use it; if type name is not given, we generate one in the form of `$FUNCSIG$` + signature string. If the same generated name already exists in the type section, we append `_` at the end. This causes most of the changes in the autogenerated type names in test outputs. - A typeuse has to be in the order of (type) -> (param) -> (result), if more than one of them exist. In case of function definitions, (local) has to be after all of these. Fixed some test cases that violate this rule. - When only (param)/(result) are given, its type will be the type with the smallest existing type index whose parameter and result are the same. If there's no such type, a new type will be created and inserted. - Added a test case `duplicate_types.wast` to test type namings for duplicate types. - Refactored `parseFunction` function. - Add more overrides to helper functions: `getSig` and `ensureFunctionType`.
*	Massive renaming (#1855)	Thomas Lively	2019-01-07	13	-41/+41
\| \| \| \| \| \|	Automated renaming according to https://github.com/WebAssembly/spec/issues/884#issuecomment-426433329.
*	wasm-ctor-eval: handle the stack going either up or down	Alon Zakai	2018-12-11	3	-0/+33
\|
*	Generate sexp instruction parser (#1754)	Thomas Lively	2018-11-19	2	-4/+8
\| \| \|	Also fix broken tests surfaced by the new parser.
*	Emit imports before defined things in text format (#1715)	Alon Zakai	2018-11-01	3	-5/+5
\| \| \| \| \|	That is the correct order in the text format, wabt errors otherwise. See AssemblyScript/assemblyscript#310
*	Unify imported and non-imported things (#1678)	Alon Zakai	2018-09-19	8	-54/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Fixes #1649 This moves us to a single object for functions, which can be imported or nor, and likewise for globals (as a result, GetGlobals do not need to check if the global is imported or not, etc.). All imported things now inherit from Importable, which has the module and base of the import, and if they are set then it is an import. For convenient iteration, there are a few helpers like ModuleUtils::iterDefinedGlobals(wasm, [&](Global* global) { .. use global .. }); as often iteration only cares about imported or defined (non-imported) things.
*	Misc tiny fuzz fixes (#1668)	Alon Zakai	2018-09-12	14	-25/+8
\| \| \| \| \| \| \| \| \| \| \| \|	* show a proper error for an empty asm2wasm input * handle end of input in processExpressions in binary reading * memory segment sizes should be unsigned * validate input in wasm-ctor-eval * update tests
*	wasm-ctor-eval improvements (#1631)	Alon Zakai	2018-08-07	12	-126/+14
\| \| \| \| \|	* When we eval a ctor, don't just nop the function body that no longer needs to be executed, also remove the export (as we report the ctor being evalled, and the outside will no longer call it). * Run the pass to remove unused global things. This can usually remove evalled ctors (unless something else happens to call them, which can't happen normally as LLVM wouldn't use a ctor in another place, but e.g. duplicate function merging might merge a ctor with another function).
*	Update call_indirect text syntax to match spec update (#1281)	Derek Schuff	2017-11-13	7	-7/+7
\| \| \| \|	Function type gets its own element rather than being a part of the call_indirect (see WebAssembly/spec#599)
*	Emit binary function index in comment in text format, for convenience (#1232)	Alon Zakai	2017-10-20	15	-35/+35
\|
*	ctor-eval fixes (#996)	Alon Zakai	2017-05-05	17	-31/+97
\| \| \| \| \| \| \| \|	* fix wasm-ctor-eval, we need to look for the STACKTOP etc. imports, they may not be named, if this build is not with -g * pack memory after ctor evalling, since we merge it up which is less efficient * do some useful opts after ctor-evalling, to clean things up
*	ctor evaller (#982)	Alon Zakai	2017-04-28	42	-0/+616
	Add wasm-ctor-eval, which evaluates functions at compile time - typically static constructor functions - and applies their effects into memory, saving work at startup. If we encounter something we can't evaluate at compile time in our interpreter, stop there. This is similar to ctor_evaller.py in emscripten (which was for asm.js).