forks/binaryen.git -

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Add Features.MVP and Features.All to binaryen.js (#2148)	Heejin Ahn	2019-05-29	2	-8/+6
	This adds `Features.MVP` and `Features.All` to binaryen.js and makes test cases use them.
*	Add BinaryenModuleWriteSExpr to write a module to a string in s-expr format (#2106)	Siddharth	2019-05-21	2	-0/+35
	Fixes #2103.
*	Reflect instruction renaming in code (#2128)	Heejin Ahn	2019-05-21	16	-214/+214
	- Reflected new renamed instruction names in code and tests:
	  - `get_local` -> `local.get`
	  - `set_local` -> `local.set`
	  - `tee_local` -> `local.tee`
	  - `get_global` -> `global.get`
	  - `set_global` -> `global.set`
	  - `current_memory` -> `memory.size`
	  - `grow_memory` -> `memory.grow`
	- Removed APIs related to old instruction names in Binaryen.js and added APIs with new names if they are missing.
	- Renamed `typedef SortedVector LocalSet` to `SetsOfLocals` to prevent name clashes.
	- Resolved several TODO renaming items in wasm-binary.h:
	  - `TableSwitch` -> `BrTable`
	  - `I32ConvertI64` -> `I32WrapI64`
	  - `I64STruncI32` -> `I64SExtendI32`
	  - `I64UTruncI32` -> `I64UExtendI32`
	  - `F32ConvertF64` -> `F32DemoteI64`
	  - `F64ConvertF32` -> `F64PromoteF32`
	- Renamed `BinaryenGetFeatures` and `BinaryenSetFeatures` to `BinaryenModuleGetFeatures` and `BinaryenModuleSetFeatures` for consistency.
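For readers migrating old text-format snippets, the instruction renames listed above can be captured in a small lookup table. This is only an illustrative sketch: `kRenamedInstructions` and `renameInstruction` are hypothetical names, not part of Binaryen, and the table contains only the pairs given in the commit message.

```cpp
#include <cassert>
#include <string>
#include <unordered_map>

// Old-to-new instruction names, copied from the commit message above.
// The table and the helper below are hypothetical, not Binaryen APIs.
static const std::unordered_map<std::string, std::string> kRenamedInstructions = {
  {"get_local", "local.get"},
  {"set_local", "local.set"},
  {"tee_local", "local.tee"},
  {"get_global", "global.get"},
  {"set_global", "global.set"},
  {"current_memory", "memory.size"},
  {"grow_memory", "memory.grow"},
};

// Returns the new spelling of a renamed instruction, or the input unchanged
// when the name is not in the table.
std::string renameInstruction(const std::string& name) {
  auto it = kRenamedInstructions.find(name);
  return it == kRenamedInstructions.end() ? name : it->second;
}
```

Names that were not renamed, such as `i32.add`, pass through untouched.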
*	Features C/JS API (#2049)	Thomas Lively	2019-05-17	2	-0/+30
	Add feature handling to the C/JS APIs. No features are enabled by default, so all used features will have to be explicitly enabled in order for modules to validate.
*	Allow color API to enable and disable colors (#2111)	Siddharth	2019-05-17	1	-0/+16
	This is useful for front-ends which wish to selectively enable or disable coloring. Also expose these APIs from the C API.
*	Add missing methods for globals to binaryen.js (#2099)	Heejin Ahn	2019-05-13	1	-2/+2
	- Print `globals` array in the tracing mode like other arrays (`functions`, `exports`, `imports`, ...)
	- Add accessor functions for globals
*	Add except_ref type (#2081)	Heejin Ahn	2019-05-07	2	-1/+3
	This adds the except_ref type, which is part of the exception handling proposal.
*	Passive segments (#1976)	Thomas Lively	2019-04-05	14	-23/+31
	Adds support for the bulk memory proposal's passive segments. Uses a new (data passive ...) s-expression syntax to mark sections as passive.
*	Update v128.const text formats (#1934)	Thomas Lively	2019-03-19	2	-630/+630
	Parse the formats allowed by the spec proposal and emit the i32x4 canonical format.
*	Align v128 text format with WABT (#1930)	Daniel Wirtz	2019-03-04	2	-630/+630
	This PR changes the formatting of v128.const literals in text format / stack ir like so:
	- v128.const i32 0x1 0x2 0x3 0x4 0x5 0x6 0x7 0x8 0x9 0xa 0xb 0xc 0xd 0xe 0xf 0x80
	+ v128.const i32 0x04030201 0x08070605 0x0c0b0a09 0x800f0e0d
	Recently hit this when trying to load Binaryen generated text format with WABT, which errored with `error: unexpected token 0x5, expected )`.
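The relationship between the two spellings above is a little-endian packing of the 16 bytes into four 32-bit lanes. The following is a minimal sketch of that conversion, assuming the byte order shown in the commit message; `packLanes` and `formatV128` are hypothetical helper names, not Binaryen functions.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstdio>
#include <string>

// Packs 16 bytes into four 32-bit lanes, least significant byte first.
std::array<uint32_t, 4> packLanes(const std::array<uint8_t, 16>& bytes) {
  std::array<uint32_t, 4> lanes{};
  for (std::size_t lane = 0; lane < 4; ++lane) {
    for (std::size_t byte = 0; byte < 4; ++byte) {
      lanes[lane] |= static_cast<uint32_t>(bytes[4 * lane + byte]) << (8 * byte);
    }
  }
  return lanes;
}

// Formats the lanes the way the "+" line of the commit message shows.
std::string formatV128(const std::array<uint8_t, 16>& bytes) {
  const auto lanes = packLanes(bytes);
  char buffer[64];
  std::snprintf(buffer, sizeof buffer,
                "v128.const i32 0x%08x 0x%08x 0x%08x 0x%08x",
                static_cast<unsigned>(lanes[0]), static_cast<unsigned>(lanes[1]),
                static_cast<unsigned>(lanes[2]), static_cast<unsigned>(lanes[3]));
  return buffer;
}
```

Feeding in the bytes from the "-" line (0x1 through 0xf, then 0x80) yields the string on the "+" line.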
*	SmallVector (#1912)	Alon Zakai	2019-02-25	2	-0/+69
	Trying to refactor the code to be simpler and less redundant, I ran into some perf issues that it seems like a small vector, with fixed-size storage and optional additional storage as needed, might help with. This implements that class and uses it in a few places.
	This seems to help, I see some 1-2% fewer instructions and cycles in `perf stat`, but it's hard to tell if it really makes a noticeable difference.
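The idea described above, fixed-size inline storage with optional extra storage once it fills up, can be sketched roughly as follows. This is not the class added by the commit: `SmallVectorSketch` and its members are hypothetical, and the sketch assumes `T` is default-constructible and copyable.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <vector>

// A vector that keeps its first N elements inline and only touches the heap
// (through std::vector) when more than N elements are stored.
template <typename T, std::size_t N>
class SmallVectorSketch {
  std::array<T, N> fixed{};   // inline storage for the first N elements
  std::size_t usedFixed = 0;  // number of inline slots in use
  std::vector<T> flexible;    // overflow storage, empty in the common case

public:
  void push_back(const T& value) {
    if (usedFixed < N) {
      fixed[usedFixed++] = value;
    } else {
      flexible.push_back(value);
    }
  }

  void pop_back() {
    assert(size() > 0);
    if (!flexible.empty()) {
      flexible.pop_back();
    } else {
      --usedFixed;
    }
  }

  T& operator[](std::size_t index) {
    assert(index < size());
    return index < N ? fixed[index] : flexible[index - N];
  }

  std::size_t size() const { return usedFixed + flexible.size(); }
  bool usesHeap() const { return !flexible.empty(); }
};
```

With `N` chosen to cover the common case, most instances never allocate, which is the kind of saving the commit message measures with `perf stat`.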
*	Bulk memory operations (#1892)	Thomas Lively	2019-02-05	3	-70/+161
	The only parts missing are the interpreter implementation and spec tests.
*	Massive renaming (#1855)	Thomas Lively	2019-01-07	13	-235/+235
	Automated renaming according to https://github.com/WebAssembly/spec/issues/884#issuecomment-426433329.
*	LocalCSE: Consider pass options, both size and cost (#1840)	Alon Zakai	2018-12-21	2	-0/+19
	With this we can optimize redundant global accesses fairly well (at least locally; licm also works), see #1831
*	SIMD (#1820)	Thomas Lively	2018-12-13	3	-305/+3876
	Implement and test the following functionality for SIMD.
	- Parsing and printing
	- Assembling and disassembling
	- Interpretation
	- C API
	- JS API
*	Use template magic for tracing expressions (#1815)	Thomas Lively	2018-12-10	1	-8/+8
*	Implement nontrapping float-to-int instructions (#1780)	Thomas Lively	2018-12-04	3	-196/+342
*	Add v128 type (#1777)	Thomas Lively	2018-11-29	2	-1/+3
*	Relooper: Merge consecutive blocks (#1770)	Alon Zakai	2018-11-26	5	-141/+385
	That is, A -> B where no other branches go to B. In that case we are guaranteed to not increase code size.
*	Merge-Blocks improvements (#1760)	Alon Zakai	2018-11-26	1	-3/+3
	Previously we didn't try to merge a block into the parent if the block had a name. This lets us merge part of it, that is:

	  (block
	    (..a..)
	    (block $child
	      (..b..)
	      (.. some br to $child ..)
	      (..c..)
	    )
	  )
	=>
	  (block
	    (..a..)
	    (..b..) ;; moved out
	    (block $child
	      (.. some br to $child ..)
	      (..c..)
	    )
	  )

	This is beneficial for 2 reasons: the child may now be a singleton, so we can remove the block; or, now that we canonicalized the br-containing code to the head of the child, we may be able to turn it into an if.
*	Relooper CFG optimizations (#1759)	Alon Zakai	2018-11-21	20	-124/+2411
	Previously the relooper would do some optimizations when deciding when to use an if vs a switch, how to group blocks, etc. This PR adds an additional pre-optimization phase with some basic but useful simplify-cfg style passes:
	* Skip empty blocks when they have just one exit.
	* Merge exiting branches when they are equivalent.
	* Canonicalize block contents to make such comparisons more useful.
	* Turn a trivial one-target switch into a simple branch.
	This can help in noticeable ways when running the rereloop pass, e.g. on LLVM wasm backend output.
	Also:
	* Binaryen C API changes to the relooper, which now gets a Module for its constructor. It needs it for the optimizations, as it may construct new nodes.
	* Many relooper-fuzzer improvements.
	* Clean up HashType usage.
*	Emit imports before defined things in text format (#1715)	Alon Zakai	2018-11-01	4	-10/+10
	That is the correct order in the text format; wabt errors otherwise. See AssemblyScript/assemblyscript#310
*	Shared memory support for add memory import and set memory functions. (#1686)	Nidin Vinayakan	2018-10-11	6	-8/+8
*	Add initial/maximum table size parameters to C/JS API (#1687)	Daniel Wirtz	2018-09-28	2	-2/+2
*	Unify imported and non-imported things (#1678)	Alon Zakai	2018-09-19	8	-90/+90
	Fixes #1649
	This moves us to a single object for functions, which can be imported or not, and likewise for globals (as a result, GetGlobals do not need to check if the global is imported or not, etc.). All imported things now inherit from Importable, which has the module and base of the import, and if they are set then it is an import.
	For convenient iteration, there are a few helpers like
	  ModuleUtils::iterDefinedGlobals(wasm, [&](Global* global) { .. use global .. });
	as often iteration only cares about imported or defined (non-imported) things.
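The shape of this design can be sketched in a few lines. The types below are simplified stand-ins rather than Binaryen's real classes: the sketch assumes that "set" means a non-empty module name, and `iterImportedGlobals` is a hypothetical counterpart to the `iterDefinedGlobals` helper named above.

```cpp
#include <cassert>
#include <functional>
#include <memory>
#include <string>
#include <utility>
#include <vector>

// Simplified stand-ins for the types named in the commit message.
struct Importable {
  std::string module; // import module name, empty for defined things
  std::string base;   // import base name, empty for defined things
  // Assumption: a non-empty module name marks the object as an import.
  bool imported() const { return !module.empty(); }
};

struct Global : Importable {
  std::string name;
};

struct Module {
  std::vector<std::unique_ptr<Global>> globals;
};

// Visits only globals defined in this module (not imported).
void iterDefinedGlobals(Module& wasm, const std::function<void(Global*)>& visitor) {
  for (auto& global : wasm.globals) {
    if (!global->imported()) {
      visitor(global.get());
    }
  }
}

// Hypothetical counterpart that visits only imported globals.
void iterImportedGlobals(Module& wasm, const std::function<void(Global*)>& visitor) {
  for (auto& global : wasm.globals) {
    if (global->imported()) {
      visitor(global.get());
    }
  }
}
```

Because imported and defined globals live in the same list, code that does not care about the distinction can iterate `wasm.globals` directly.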
*	Binary format local parsing fixes (#1664)	Alon Zakai	2018-09-11	2	-8/+8
	* Error if there are more locals than browsers allow (50,000). We usually just warn about stuff like this, but we do need some limit (or else we hang or OOM), and if so, why not use the agreed-upon Web limit?
	* Do not generate nice string names for locals in binary parsing - the name is just $var$x instead of $x, so not much benefit, and worse as our names are interned this is actually slow (which is why the fuzz testcase here hangs instead of OOMing).
	Testcases and bug report in #1663.
*	BinaryenSetFunctionTable now accepts array of func names not funcs. (#1650)	Jay Phelps	2018-09-01	2	-5/+6
	This allows using imports in the table. Fixes #1645
*	Stack IR (#1623)	Alon Zakai	2018-07-30	2	-4/+4
	This adds a new IR, "Stack IR". This represents wasm at a very low level, as a simple stream of instructions, basically the same as wasm's binary format. This is unlike Binaryen IR which is structured and in a tree format.

	This gives some small wins on binary sizes, less than 1% in most cases, usually 0.25-0.50% or so. That's not much by itself, but looking forward this prepares us for multi-value, which we really need an IR like this to be able to optimize well. Also, it's possible there is more we can do already - currently there are just a few stack IR optimizations implemented:
	* DCE
	* local2stack - check if a set_local/get_local pair can be removed, which keeps the set's value on the stack, which if the stars align it can be popped instead of the get.
	* Block removal - remove any blocks with no branches, as they are valid in wasm binary format.

	Implementation-wise, the IR is defined in wasm-stack.h. A new StackInst is defined, representing a single instruction. Most are simple reflections of Binaryen IR (an add, a load, etc.), and just pointers to them. Control flow constructs are expanded into multiple instructions, like a block turns into a block begin and end, and we may also emit extra unreachables to handle the fact Binaryen IR has unreachable blocks/ifs/loops but wasm does not. Overall, all the Binaryen IR differences with wasm vanish on the way to stack IR.

	Where this IR lives: Each Function now has a unique_ptr to stack IR, that is, a function may have stack IR alongside the main IR. If the stack IR is present, we write it out during binary writing; if not, we do the same binaryen IR => wasm binary process as before (this PR should not affect speed there).

	This design lets us use normal Passes on stack IR, in particular this PR defines 3 passes:
	* Generate stack IR
	* Optimize stack IR (might be worth splitting out into separate passes eventually)
	* Print stack IR for debugging purposes

	Having these as normal passes is convenient as then they can run in parallel across functions and all the other conveniences of our current Pass system. However, a downside of keeping the second IR as an option on Functions, and using normal Passes to operate on it, is that we may get out of sync: if you generate stack IR, then modify binaryen IR, then the stack IR may no longer be valid (for example, maybe you removed locals or modified instructions in place etc.). To avoid that, Passes now define if they modify Binaryen IR or not; if they do, we throw away the stack IR.

	Miscellaneous notes:
	* Just writing Stack IR, then writing to binary - no optimizations - is 20% slower than going directly to binary, which is one reason why we still support direct writing. This does lead to some "fun" C++ template code to make that convenient: there is a single StackWriter class, templated over the "mode", which is either Binaryen2Binary (direct writing), Binaryen2Stack, or Stack2Binary. This avoids a lot of boilerplate as the 3 modes share a lot of code in overlapping ways.
	* Stack IR does not support source maps / debug info. We just don't use that IR if debug info is present.
	* A tiny text format comment (if emitting non-minified text) indicates stack IR is present, if it is ((; has Stack IR ;)). This may help with debugging, just in case people forget. There is also a pass to print out the stack IR for debug purposes, as mentioned above.
	* The sieve binaryen.js test was actually not validating all along - these new opts broke it in a more noticeable manner. Fixed.
	* Added extra checks in pass-debug mode, to verify that if stack IR should have been thrown out, it was. This should help avoid any confusion with the IR being invalid.
	* Added a comment about the possible future of stack IR as the main IR, depending on optimization results, following some discussion earlier today.
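The invalidation rule described above (a pass that modifies Binaryen IR causes any cached stack IR to be thrown away) can be illustrated with a toy model. Everything here is a simplified assumption: the types are stand-ins, `runPass` is a hypothetical driver, and the real pass runner may discard the stack IR at a different point than this sketch does.

```cpp
#include <cassert>
#include <memory>
#include <vector>

// Toy stand-ins for the types described in the commit message.
struct StackInst { int op; };
using StackIR = std::vector<StackInst>;

struct Function {
  // Optional second IR that lives alongside the main (tree) IR.
  std::unique_ptr<StackIR> stackIR;
};

struct Pass {
  virtual ~Pass() = default;
  // Passes declare whether they touch the main IR; assumed true by default.
  virtual bool modifiesBinaryenIR() const { return true; }
  virtual void run(Function& func) = 0;
};

// Hypothetical driver: after a pass that modifies the main IR, any cached
// stack IR may be stale, so it is discarded.
void runPass(Pass& pass, Function& func) {
  pass.run(func);
  if (pass.modifiesBinaryenIR()) {
    func.stackIR.reset();
  }
}

struct GenerateStackIRSketch : Pass {
  bool modifiesBinaryenIR() const override { return false; }
  void run(Function& func) override { func.stackIR = std::make_unique<StackIR>(); }
};

struct SomeOptimizationSketch : Pass {
  void run(Function&) override { /* would rewrite the main IR here */ }
};
```

Running the generating pass leaves stack IR attached to the function; running any ordinary optimization afterwards drops it, so the binary writer falls back to the direct path.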
*	Optimize validation of many nested blocks (#1576)	Alon Zakai	2018-05-30	2	-26/+0
	On the testcase from https://github.com/tweag/asterius/issues/19#issuecomment-393052653 this makes us almost 3x faster, and use 25% less memory.
	The main improvement here is to simplify and optimize the data structures the validator uses to validate br targets: use unordered maps, and use one less of them. Also some speedups from using that map more effectively (use of iterators to avoid multiple lookups).
	Also move the duplicate-node checks to the internal IR validation section, which makes more sense anyhow (it's not wasm validation, it's internal IR validation, which like the check for stale internal types, we do only if debugging).
*	More simple math opts (#1414)	Alon Zakai	2018-02-14	2	-10/+10
	* optimize more simple math operations: mul of 0, or of 0, and of 0, mul of 1, mul of a power of 2, urem of a power of 2
	* fix asm2wasm callImport parsing: the optimizer may get rid of the added offset to a function table
	* update js builds
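The commit message names the patterns but not what they are rewritten to. A common strength reduction for the last two, assumed here purely for illustration, is a shift for unsigned multiplication by a power of 2 and a mask for `urem` by a power of 2. The helper names are hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// True when exactly one bit of x is set.
bool isPowerOfTwo(uint32_t x) { return x != 0 && (x & (x - 1)) == 0; }

// Index of the single set bit of a power of two.
uint32_t log2OfPowerOfTwo(uint32_t c) {
  assert(isPowerOfTwo(c));
  uint32_t shift = 0;
  while ((c >> shift) != 1) {
    ++shift;
  }
  return shift;
}

// x * c, computed as a left shift when c is a power of two.
uint32_t mulByPowerOfTwo(uint32_t x, uint32_t c) {
  return x << log2OfPowerOfTwo(c);
}

// x % c (unsigned), computed as a bit mask when c is a power of two.
uint32_t uremByPowerOfTwo(uint32_t x, uint32_t c) {
  assert(isPowerOfTwo(c));
  return x & (c - 1);
}
```

Both identities hold for 32-bit unsigned arithmetic with wraparound, which matches wasm's `i32.mul` and `i32.rem_u` semantics.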
*	ThreadPool refactoring (#1389)	Alon Zakai	2018-01-26	2	-0/+62
	Refactor ThreadPool code for clarity and to fix some bugs with using the pool from different threads in parallel.
	We have a singleton pool, and need to ensure it is created only once and used only by one thread at a time. This model is a simple way to ensure we use a number of threads equal to the number of cores, more or less (a pool per Module might lead to number of cores * number of Modules being optimized).
	This refactoring adds a parent pointer in the worker threads (giving them direct access to the pool makes it simpler to make sure that pool and thread creation and teardown are threadsafe). This commit also adds proper locking around pool creation and pool usage.
*	Also clear imports and exports maps in BinaryenModuleDispose (#1372)	Daniel Wirtz	2018-01-19	1	-1/+1
	fixes #1369
	* Update binaries and kitchen-sink test
*	Refactor optimization defaults (#1366)	Alon Zakai	2018-01-17	1	-1/+16
	Followup to #1357. This moves the optimization settings into pass.h, and uses it from there in the various places.
	This also splits up huge lines from the tracing code, which put all block children (whose number can be arbitrarily large) on one line. This seems to have caused random errors on the bots, I suspect from overflowing a buffer. Anyhow, it's much more clear to split the lines at a reasonable length.
*	Add optimize, shrink level and debug info options to C/JS (#1357)	Daniel Wirtz	2018-01-17	2	-0/+3
	* Add optimize, shrink level and debug info options to C/JS
	* Add instantiate functionality for creating additional unique instances of the API
	* Use a workaround when running tests in node
	  Tests misuse a module as a script by concatenating, so instead of catching this case in the library, catch it there
	* Update sieve test
	  Seems optimized output changed due to running with optimize levels 2/1 now
	* Use the options with all pass runners
	* Update relooper-fuzz C-API test
	* Share defaults between tools and the C-API
	* Add a test for optimize levels
	* Unify node test support in check.py and auto_update_tests.py
	* Also add getters for optimize levels and test them
	* Also test debugInfo
	* Add debug info to C tests that used it as well
	* Fix missing NODEJS import in auto_update_tests
	* Detect node.js version (WASM support)
	* Update hello-world JS test (now also runs with node)
	* feature-test WebAssembly in node instead
	* Document that these options apply globally, and where
	* Make sure hello-world.js output doesn't differ between mozjs/node
*	Optimize out memory and table when possible (#1352)	Alon Zakai	2018-01-10	3	-12/+0
	We can remove the memory/table (itself, or an import if imported) if they are not used. This is pretty minor on a large wasm file, but when reading small wasts it's very noticeable to have an unused memory and table all the time.
*	Add getters for various specific expression fields to C/JS (#1332)	Daniel Wirtz	2017-12-20	7	-190/+200
*	Provide AddImport/AddExport for each element in the C-API (#1292)	Daniel Wirtz	2017-11-22	6	-27/+27
*	Update call_indirect text syntax to match spec update (#1281)	Derek Schuff	2017-11-13	2	-4/+4
	Function type gets its own element rather than being a part of the call_indirect (see WebAssembly/spec#599)
*	Fix yet another BinaryenAddGlobal tracing issue (#1283)	Daniel Wirtz	2017-11-13	3	-4/+19
	Now also includes a test.
*	Emit binary function index in comment in text format, for convenience (#1232)	Alon Zakai	2017-10-20	6	-69/+69
*	Expressions should not appear twice in the ast (#1191)	Alon Zakai	2017-09-18	2	-245/+279
*	Add missing finalize() call to C API for call_indirect (#1184)	Sergey Pepyakin	2017-09-14	2	-0/+24
*	Avoid new blocks in binary reading/writing (#1165)	Alon Zakai	2017-09-12	1	-31/+27
	* don't emit a toplevel block if we don't need to, as in wasm it is a list context
	* don't create unnecessary blocks in wasm reading
*	Ignore unreachable code in wasm binaries (#1122)	Alon Zakai	2017-08-22	1	-11/+0
	Ignoring unreachable code in wasm binaries lets us avoid corner cases with unstructured code in wasm binaries that is a poor fit for Binaryen's structured IR.
*	Emit optimal-size LEBs in section/subsection/function body sizes (#1128)	Alon Zakai	2017-08-15	1	-1/+1
	* emit optimal-size LEBs in section/subsection/function body sizes, instead of preallocating 5 bytes
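The difference between an optimal-size LEB and a preallocated 5-byte one can be seen in a short sketch of unsigned LEB128 encoding. The function names are hypothetical and this is not Binaryen's writer code; it only shows the two encodings of the same 32-bit value.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Minimal-length unsigned LEB128: 7 payload bits per byte, with the high bit
// set on every byte except the last.
std::vector<uint8_t> encodeU32LEB(uint32_t value) {
  std::vector<uint8_t> out;
  do {
    uint8_t byte = value & 0x7f;
    value >>= 7;
    if (value != 0) {
      byte |= 0x80; // more bytes follow
    }
    out.push_back(byte);
  } while (value != 0);
  return out;
}

// Fixed-width variant: always 5 bytes, padded with continuation bytes. This
// is what preallocating 5 bytes for a size field amounts to.
std::vector<uint8_t> encodeU32LEBPadded(uint32_t value) {
  std::vector<uint8_t> out(5);
  for (int i = 0; i < 5; ++i) {
    out[i] = static_cast<uint8_t>((value & 0x7f) | (i < 4 ? 0x80 : 0x00));
    value >>= 7;
  }
  return out;
}
```

A function body of size 3 takes one byte in the first form and five in the second, so the saving is up to four bytes per section, subsection, and function body.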
*	Support new result syntax for if/loop/block (#1047)	Sam Clegg	2017-06-12	2	-12/+12
	Support both syntax formats in input since the old spec tests still need to be parsable.
*	Optimize/merge duplicate function types (#1041)	Alon Zakai	2017-06-12	2	-9/+0
*	Update binaryen-c/binaryen.js, fixes #1028, fixes #1029 (#1030)	Daniel Wirtz	2017-06-07	6	-76/+88
	This PR adds global variable support (addGlobal, getGlobal, setGlobal), host operations (currentMemory, growMemory), a few utility functions (removeImport, removeExport, getFunctionTypeBySignature with the latter being scheduled for removal once a better alternative is in place) and it introduces an additional argument to specify the result type in BinaryenBlock (effectively breaking the C-API but retaining previous behaviour by introducing the BinaryenUndefined() type for this purpose).
	Additionally, it enables compilation with exception support in build-js.sh as exceptions are thrown and caught when optimizing endless loops, intentionally resulting in an unreachable opcode.
	Affected test cases have been updated accordingly.
*	relooper improvements	Alon Zakai (kripken)	2017-05-20	1	-0/+3
*	merge blocks before and after remove-unused-brs	Alon Zakai (kripken)	2017-05-10	1	-10/+12