forks/binaryen.git -

	Commit message (Collapse)	Author	Age	Files	Lines
*	Handle tuples in RemoveUnusedBrs (#2693)	Thomas Lively	2020-03-16	2	-102/+140
\| \| \| \| \|	RemoveUnusedBrs produces selects for some patterns, but selects of multivalue types are not valid. This change checks that types are not tuple types before producing selects.
*	Collect signatures from all block kinds (#2691)	Thomas Lively	2020-03-16	4	-10/+30
\| \| \| \| \| \| \| \| \|	Previously the signature collection mechanism responsible for populating the type section with signatures used by instructions only collected signatures from indirect call and block instructions. This works as long as all other control flow constructs like ifs, loops, and tries contain blocks with the same signature. But it is possible to have an if with non-block children, and we would need to collect its signature as well.
*	Interpret tuple locals and tail-calls correctly (#2690)	Thomas Lively	2020-03-16	1	-2/+12
\|
*	Handle unreachable in TupleExtract::finalize (#2692)	Thomas Lively	2020-03-16	4	-33/+69
\|
*	Update RedundantSetElimination to work with tuples (#2688)	Thomas Lively	2020-03-11	2	-24/+47
\| \| \| \| \| \| \| \|	Also makes it work with any other constant expression such as a ref.func or ref.null instructions. This optimization may not be very important, but it illustrates how simple it can be to update a pass to handle tuples (and also I was already looking at it because of the prior changes that had to be made to it).
*	Update Precompute to handle tuples (#2687)	Thomas Lively	2020-03-10	4	-1/+87
\| \| \| \| \| \|	This involves replacing `Literal::makeZero` with `Literal::makeZeroes` and `Literal::makeSingleZero` and updating `isConstantExpression` to handle constant tuples as well. Also makes `Literals` its own struct and adds convenience methods on it.
*	Handle multivalue returns in the interpreter (#2684)	Thomas Lively	2020-03-10	1	-0/+10
\| \| \| \|	Updates the interpreter to properly flow vectors of values, including at function boundaries. Adds a small spec test for multivalue return.
*	Asyncify: Fix wasm-only instrumentation of unnamed imports (#2682)	Alon Zakai	2020-03-05	1	-5/+14
\| \| \| \| \| \| \| \| \| \| \| \| \|	We assumed that the imports were already named (in their internal name) properly. When processing a binary file without names, or if the names don't match in general, that's not true. To fix this, use ModuleUtils::renameFunctions to do a proper renaming up front. Also fix renameFunctions to not assert on the case of renaming a function to the same name it already has. Helps #2680
*	Initial multivalue support (#2675)	Thomas Lively	2020-03-05	7	-26/+1233
\| \| \| \| \| \| \| \| \|	Implements parsing and emitting of tuple creation and extraction and tuple-typed control flow for both the text and binary formats. TODO: - Extend Precompute/interpreter to handle tuple values - C and JS API support/testing - Figure out how to lower in stack IR - Fuzzing
*	DWARF: Ignore a compile unit with no abbreviations (#2678)	Alon Zakai	2020-03-04	3	-0/+1
\| \| \| \| \| \| \| \| \| \|	Such a module can't have valid DIEs, since we have no way to interpret them. Also check if DWARF sections from LLVM have contents - when they are empty the section may exist but have a null for its data. Fixes #2673
*	Expose asyncify state via a getter (#2679)	Alon Zakai	2020-03-04	12	-3/+70
\| \| \| \| \| \| \| \| \| \|	Normally, a wrapper has to track state separately to know when to unwind/rewind and when to actually call import functions. Exposing Asyncify state can help avoid this duplication and avoid subtle bugs when internal and wrapper state get out of sync. Since this is a tiny function and it's useful for any Asyncify embedder, I've decided to expose it by default rather than hide behind an option.
*	Add multivalue feature (#2668)	Thomas Lively	2020-02-27	5	-9/+14
\|
*	Add EH support for CodeFolding (#2665)	Heejin Ahn	2020-02-26	2	-0/+125
\| \| \| \| \| \| \| \| \| \| \|	This does two things: - Treats the target branch of `br_on_exn` as unoptimizables, because it is a conditional branch. - Makes sure we don't move expressions that contain `exnref.pop`, which should follow right after `catch`. - Adds `containsChild` utility function, which can search all children, optionally with limited depth. This was actually added to be used in CodeFolding but ended up not being used, but wasn't removed in case there will be uses later.
*	Add the GetFunctionTable JS API (#2554)	COFFEETALES	2020-02-20	4	-33/+68
\|
*	Add br_on_exn support for UniqueNameMapper (#2659)	Heejin Ahn	2020-02-19	2	-4/+65
\| \| \| \|	This adds support for UniqueNameMapper, and adds a test in Inlining pass, which uses UniqueNameMapper.
*	Code pushing support for br_on_exn (#2660)	Heejin Ahn	2020-02-19	2	-0/+31
\| \| \| \| \| \| \| \| \|	Like `br_if`, `br_on_exn` is a conditional branch and across which code can be pushed past when conditions are satisfied. Also adds a few lines of comments and NFC changes in a couple places. Changes in Vacuum are NFC because they were being handled in `default:` in the same way anyway, but I added them to be more explicit and consistent with existing code.
*	Concise error output (#2652)	Alon Zakai	2020-02-18	3	-20/+41
\| \| \| \| \| \| \| \| \| \| \|	Don't print the entire module on an error. Instead, just print the validation errors. However, if the user passed --print, then do print it, as otherwise nothing would get printed - the error would be before the pass to print happens. And in general a user passing in a request to print would expect a printed module anyhow. fixes #2634
*	DWARF: Fix debug_range handling of invalid entries (#2662)	Alon Zakai	2020-02-18	3	-0/+709
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	If an invalid entry appears - either it began as such, or became invalid after optimization - we should not emit (0, 0) which is an end marker. Instead, emit an invalid entry marker, something with (0, x) for x != 0. As a bonus, if a test/passes case has "noprint" in the name, don't print the wasm, which we do by default. In the testcase here for example we just care about the dwarf, and the printed module would be quite large. Thank you to @paolosevMSFT for identifying and suggesting the fix.
*	Add C-/JS-APIs for inlining options (#2655)	Daniel Wirtz	2020-02-13	2	-0/+43
\| \| \| \| \| \| \| \| \| \|	Allows a user to modify the inlining limits using the C- and JS-APIs. * binaryen.getAlwaysInlineMaxSize(): `number` * binaryen.setAlwaysInlineMaxSize(size: `number`): `void` * binaryen.getFlexibleInlineMaxSize(): `number` * binaryen.setFlexibleInlineMaxSize(size: `number`): `void` * binaryen.getOneCallerInlineMaxSize(): `number` * binaryen.setOneCallerInlineMaxSize(size: `number`): `void`
*	Convert remaining python scripts to run under python3 (#2643)	Sam Clegg	2020-02-11	2	-2/+2
\|
*	Fix missing newline after // EMSCRIPTEN_START_FUNCS and // ↵	juj	2020-02-10	6	-12/+24
\| \| \| \| \| \| \| \| \| \|	EMSCRIPTEN_END_FUNCS markers. (#2626) * Fix missing newline after // EMSCRIPTEN_START_FUNCS and // EMSCRIPTEN_END_FUNCS markers. * Flake * Update tests
*	Optimize base64 decoding (#2623)	juj	2020-02-09	8	-150/+182
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	* Optimize base64 decoding (about 7x-10x faster and temporary garbage-free compared to the original version) * new Uint8Array * Reuse Uint8Array view * Fix end handling * Code format * Update tests
*	Optimize Try in Vacuum (#2644)	Heejin Ahn	2020-02-07	2	-0/+54
\| \| \| \|	If try's body does not throw, the whole try-catch can be replaced with the try body.
*	Add C-/JS-APIs for lowMemoryUnused and pass arguments (#2639)	Daniel Wirtz	2020-02-07	4	-0/+157
\| \| \| \| \| \| \| \| \|	Allows a user to enable/disable the `lowMemoryUnused` option and to get/set/clear arbitrary pass arguments when using the C- or JS-APIs. * binaryen.getLowMemoryUnused(): `boolean` * binaryen.setLowMemoryUnused(on: `boolean`): `void` * binaryen.getPassArgument(key: `string`): `string \| null` * binaryen.setPassArgument(key: `string`, value: `string \| null`): `void` * binaryen.clearPassArguments(): `void`
*	DWARF: Disable optimization passes not fully compatible with DWARF yet (#2640)	Alon Zakai	2020-02-06	1	-1523/+2496
\| \| \| \| \| \| \| \| \| \| \|	Anything that merges/swaps/etc. locals, or inlines, or merges functions, must be disabled for now. However, that does still leave almost all passes, so this should not affect output sizes much (and the full LLVM optimizer can be run before too). Over time we can resolve each of those FIXMEs. The test output here shows how disabling those allows over twice as much debug_line info to be preserved.
*	Fix LocalCSE's usable local selection (#2638)	Heejin Ahn	2020-02-05	2	-3/+38
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Now that we have subtypes, we cannot reuse any local that contains the same expression, because that local's type can be a supertype. For example: ``` (local $0 anyref) (local $1 nullref) ... (local.set $0 (ref.null)) (local.set $1 (ref.null)) ;; cannot be replaced with (local.get $0) ``` This extends `usables` map's key to contain both `HashedExpression` and the local's type, so we can get the right usable local in presence of subtypes.
*	Add EH support for OptimizeInstructions (#2608)	Heejin Ahn	2020-02-05	4	-30/+240
\| \| \| \| \| \|	- Adds support for `Try` in `optimizeBoolean` function - Adds support for `Try` in `getFallThrough` function - Adds approximate cost values for instructions in EH and reference types proposals.
*	Trap when call_indirect's signatures mismatch (#2636)	Heejin Ahn	2020-02-03	4	-0/+53
\| \| \| \| \| \| \| \| \| \| \|	This makes the interpreter trap when the signature in `call_indirect` instruction and that of the actual function in the table mismatch. This also makes the `wasm-ctor-eval` not evaluate `call_indirect` in case the signatures mismatch. Before we only compared the arguments' signature and the function signature, which was sufficient before we had subtypes, but now the signature in `call_indirect` and that of the actual function can be different even if the argument's signature is OK.
*	Add EH support for EffectAnalyzer (#2631)	Heejin Ahn	2020-02-03	7	-101/+3841
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds EH support to `EffectAnalyzer`. Before `throw` and `rethrow` conservatively set property. Now `EffectAnalyzer` has a new property `throws` to represent an expression that can throw, and expression that can throw sets `throws` correctly. When EH is enabled, any calls can throw too, so we cannot reorder them with another expression with any side effects, meaning all calls should be treated in the same way as branches when evaluating `invalidate`. This prevents many reorderings, so this patch sets `throws` for calls only when the exception handling features is enabled. This is also why I passed `--disable-exception-handling` to `wasm2js` tests. Most of code changes outside of `EffectAnalyzer` class was made in order to pass `FeatureSet` to it. `throws` isn't always set whenever an expression contains a throwable instruction. When an throwable instruction is within an inner try, it will be caught by the corresponding inner catch, so it does not set `throws`.
*	DWARF: Fix debug_abbrev section (#2630)	Alon Zakai	2020-01-28	4	-4/+44
\| \| \| \| \| \| \| \| \|	Each compilation unit's abbreviations must be terminated by a zero, so that we use the right abbreviations. This adds that support to the YAML layer, both adding the zeros and parsing them to look in the right abbreviation section at the right time. Also add two large testcases, zlib and cubescript, which crash without this and the last PR.
*	Optionally minify imported module names (#2620)	Alon Zakai	2020-01-27	2	-0/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This replaces imports like env.foo with a.foo, which can save a bunch of bytes when there are many imported functions. Note that by changing all the import names to a it ends up requiring a single merged import module. Note also that when doing this we modify all the imports, minifying their modules and names (since it makes no sense to be careful about minifying only modules known to us - env/wasi - if we are minifyin the names of all modules). This will require an emscripten PR to benefit from it.
*	DWARF: Properly emit signed 32 bit values for advance_line (#2625)	Alon Zakai	2020-01-24	4	-2333/+2333
\| \| \| \| \| \| \| \| \|	The LLVM SData field is 64-bit (to support 64-bit addresses I suppose) so when we assigned to it we actually led it to emit an LEB for a signed 64-bit value that is an unsigned 32-bit one. This worked in LLVM (where I guess it forces the value to 32-bit anyhow?) but failed in gimli (where I guess it doesn't?).
*	Reland "Fix renaming in FixInvokeFunctionNamesWalker (#2513)" (#2622)	Sam Clegg	2020-01-24	5	-47/+423
\| \| \| \| \|	This reverts commit 132daae1e9154782bb1afa5df80dfe7ea35f0369. This change is the same as before but the fix in #2619 should now make it safe.
*	Handle indirect calls in CallGraphPropertyAnalysis (#2624)	Alon Zakai	2020-01-24	2	-1/+46
\| \| \| \| \| \| \| \| \| \| \| \| \|	We ignored them, which is a bad default, as typically they imply we can call anything in the table (and the table might change). Instead, notice indirect calls during traversal, and force the user to decide whether to ignore them or not. This was only an issue in PostEmscripten because the other user, Asyncify, already had indirect call analysis because it needed it for other things. Fixes a bug uncovered by #2619 and fixes the current binaryen roll.
*	DWARF: Update .debug_loc (#2616)	Alon Zakai	2020-01-23	2	-216/+216
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Add support for that section to the YAML layer, and add code to update it. The updating is slightly tricky - unlike .debug_ranges, the size of entries is not fixed. So we can't just skip entries, as the end marker is smaller than a normal entry. Instead, replace now-invalid segments with (1, 1) which is of size 0 and so should be ignored by the debugger (we can't use (0, 0) as that would be an end marker, and (-1, *) is the special base marker). In the future we probably do want to do this in a more sophisticated manner, completely rewriting the indexes into the section as well. For now though this should be enough for when binaryen does not optimize (as we don't move/reorder anything). Note that this doesn't update the location description (like where on the wasm expression stack the value is). Again, that is correct for when binaryen doesn't optimize, but for fully optimized builds we would need to track things (which would be hard!). Also clean up some code that uses "Extra" instead of "Delimiter" that was missed before, and shorten some unnecessarily long names.
*	Limit the number of passive segments to work around a Chrome bug (#2613)	Thomas Lively	2020-01-22	2	-0/+590
\| \| \| \| \| \|	Chrome is currently decoding the segment indices as signed numbers, so some ranges of indices greater than 63 do not work. As a temporary workaround, limit the number of segments produced by MemoryPacking to 63 when bulk-memory is enabled.
*	DWARF: Update debug_ranges (#2612)	Alon Zakai	2020-01-22	5	-44/+39
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Pretty straightforward given all we have so far. Note that fannkuch3_manyopts has an example of a sequence of ranges of which some must be skipped while others must not, showing we handle that by skipping the bad ones and updating the remaining. That is, if that we have a sequence of two (begin, end) spans [(10, 20), (30, 40)] It's possible (10, 20) maps in the new binary to (110, 120) while (30, 40) was eliminated by the optimizer and we have nothing valid to map it to. In that case we emit [(110, 120)]
*	DWARF: Fix debug lines in fannkuch -O0 (#2611)	Alon Zakai	2020-01-22	4	-294/+10460
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Just some trivial fixes: * Properly reset prologue after each line (unlike others, this flag should be reset immediately). * Test for a function's end address first, as LLVM output appears to use 1-past-the-end-of-the-function as a location in that function, and not the next (note the first byte of the next function, which is ambiguously identical to that value, is used at least in low_pc; I'm not sure if it's used in debug lines too). * Ignore the same address if LLVM emitted it more than once, which it does sometimes.
*	DWARF: Use end_sequence and copy properly (#2610)	Alon Zakai	2020-01-22	5	-2440/+2262
\| \| \| \| \| \| \|	We need to track end_sequence directly, and use either end_sequence or copy (copy emits a line without marking it as ending a sequence). After this, fib2 debug line output looks perfect.
*	DWARF: Allow debug lines with column 0 (#2609)	Alon Zakai	2020-01-22	5	-1786/+1833
\| \| \| \|	While line and address values of 0 should be skipped, it seems like column 0 are valid lines emitted by LLVM.
*	DWARF: Track more function locations (#2604)	Alon Zakai	2020-01-22	1	-4/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	DWARF from LLVM can refer to the first byte belonging to the function, where the size LEB is, or to the first byte after that, where the local declarations are, or the end opcode, or to one byte past that which is one byte past the bytes that belong to the function. We aren't sure why LLVM does this, but track it all for now. After this all debug line positions are identified. However, in some cases a debug line refers to one past the end of the function, which may be an LLVM bug. That location is ambiguous as it could also be the first byte of the next function (what made this discovery possible was when this happened to the last function, after which there is another section).
*	DWARF: Track the positions of 'end', 'else', 'catch' binary locations (#2603)	Alon Zakai	2020-01-21	2	-1128/+1170
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Control flow structures have those in addition to the normal span of (start, end), and we need to track them too. Tracking them during reading requires us to track control flow structures while parsing, so that we can know to which structure an end/else/catch refers to. We track these locations using a map on the side of instruction to its "extra" locations. That avoids increasing the size of the tracking info for the much more common non-control flow instructions. Note that there is one more 'end' location, that of the function (not referring to any instruction). I left that to a later PR to not increase this one too much.
*	Unify JS memory segment API (#2533)	Daniel Wirtz	2020-01-21	1	-3/+5
\| \| \| \| \| \| \| \| \| \|	Binaryen.js now uses offset instead of byteOffset when inspecting a memory segment, matching the arguments on memory segment creation. Also adds inspection of the passive property. Previously, one would specify { offset, data, passive } on creation and get back { byteOffset, data } upon inspection. This PR unifies both to the keys on creation while also adding the respective C-API to retrieve passive status, which was missing.
*	Update debug line info with function entries (#2600)	Alon Zakai	2020-01-17	5	-756/+827
\| \| \| \| \| \|	LLVM points to the start of the function in some debug line entries - right after the size LEB of the function, which is where the locals are declared, and before any instructions.
*	Expose ExpressionAnalyzer in C-/JS-API (#2585)	Daniel Wirtz	2020-01-17	2	-0/+106
\| \| \| \| \| \| \|	Instead of reinventing the wheel on our side, this adds ExpressionAnalyzer bindings to the C- and JS-APIs, which can be useful for generators. For example, a generator may decide to simplify a compilation step if a subexpression doesn't have any side effects, or simply skip emitting something that is likely to compile to a drop or an empty block right away.
*	DWARF: high_pc computation (#2595)	Alon Zakai	2020-01-16	5	-12/+12
\| \| \| \| \| \| \|	Update high_pc values. These are interesting as they may be a relative offset compared to the low_pc. For functions we already had both a start and an end. Add such tracking for instructions as well.
*	Add EH support for CFGWalker (#2597)	Heejin Ahn	2020-01-16	2	-0/+204
\| \| \| \| \| \| \| \|	This adds EH instruction support for `CFGWalker`. This also implements `call` instruction handling within a try-catch; every call can possibly throw and unwind to the innermost catch block. This adds tests for RedundantSetElimination pass, which uses `CFGWalker`.
*	Update DWARF testcases (#2594)	Alon Zakai	2020-01-16	6	-8703/+9118
\| \| \| \| \|	This only touches test code. The files are compiled with latest LLVM + https://reviews.llvm.org/D71681 in order to get more realistic DWARF content.
*	DWARF: Function location tracking (#2592)	Alon Zakai	2020-01-16	4	-5/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Track the beginning and end of each function, both when reading and writing. We track expressions and functions separately, instead of having a single big map of (oldAddr) => (newAddr) because of the potentially ambiguous case of the final expression in a function: it's end might be identical in offset to the end of the function. So we have two different things that map to the same offset. However, if the context is "the end of the function" then the updated address is the new end of the function, even if the function ends with a different instruction now, as the old last instruction might have moved or been optimized out. Concretely, we have getNewExprAddr and getNewFuncAddr, so we can ask to update the location of either an expression or a function, and use that contextual information. This checks for the DIE tag in order to know what we are looking for. To be safe, if we hit an unknown tag, we halt, so that we don't silently miss things. As the test updates show, the new things we can do thanks to this PR are to update compile unit and subprogram low_pc locations. Note btw that in the first test (dwarfdump_roundtrip_dwarfdump.bin.txt) we change 5 to 0: that is correct since that test does not write out DWARF (it intentionally has no -g), so we do not track binary locations while writing, and so we have nothing to update to (the other tests show actual updating). Also fix the order in the python test runner code to show a diff of expected to encountered, and not the reverse, which confused me.
*	Optimize passive segments in memory-packing (#2426)	Thomas Lively	2020-01-15	3	-38/+1362
\| \| \| \| \| \| \| \| \|	When memory is packed and there are passive segments, bulk memory operations that reference those segments by index need to be updated to reflect the new indices and possibly split into multiple instructions that reference multiple split segments. For some bulk-memory operations, it is necessary to introduce new globals to explicitly track the drop state of the original segments, but this PR is careful to only add globals where necessary.