forks/binaryen.git -

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	Some refactorings in addition to #3338 (#3336)	Max Graey	2020-11-12	1	-16/+10
\| \| \| \|	See discussion in #3303
*	OptimizeInstructions: Fix regression from #3303 / #3275 (#3338)	Alon Zakai	2020-11-12	1	-6/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	X - Y <= 0 => X <= Y That is true mathematically, but not in the case of an overflow, e.g. X=10, Y=0x8000000000000000. X - Y is a negative number, so X - Y <= 0 is true. But it is not true that X <= Y (as Y is negative, but X is not). See discussion in #3303 (comment) The actual regression was in #3275, but the fuzzer had an easier time finding it due to #3303
*	Optimize i32(x) % C_pot in boolean context (#3307)	Max Graey	2020-11-10	1	-2/+17
\| \| \| \| \| \|	bool(i32(x) % C_pot) -> bool(i32(x) & (C_pot - 1)) bool(i32(x) % min_s) -> bool(i32(x) & max_s) For all other situations we already do this for (i32\|i64).rem_s
*	Canonicalize subtraction with constant on the right to addition (#3321)	Max Graey	2020-11-10	1	-59/+79
\| \| \| \| \| \| \|	Using addition in more places is better for gzip, and helps simplify the optimizer as well. Add a FinalOptimizer phase to do optimizations like our signed LEB tweaks, to reduce binary size in the rare case when we do want a subtraction.
*	Optimize signed / unsigned relationals when RHS is min or max constant (#3314)	Max Graey	2020-11-04	1	-7/+89
\|
*	Slight refactoring of handOptimize (#3305)	Max Graey	2020-11-03	1	-15/+15
\| \| \|	Move the checks for most unoptimizable expression types out into visitExpression and simplify some other code.
*	Optimize x * -1.0 in non-fastMath case (#3315)	Max Graey	2020-11-03	1	-3/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	We can still make x * -1.0 cheaper for non-fastMath mode as: x * -1.0 -> -0.0 - x Should at least help baseline compilers. Also it could enable further optimizations, e.g.: a + b * -1 a + (-0.0 - b) (a - 0.0) - b a - b
*	Canonicalize relationals as well (#3303)	Max Graey	2020-10-30	1	-4/+82
\|
*	Fold subtraction of sums or differences from constants (#3295)	Max Graey	2020-10-29	1	-0/+33
\| \| \| \| \|	`C1 - (x + C2)` -> `(C1 - C2) - x` `C1 - (x - C2)` -> `(C1 + C2) - x` `C1 - (C2 - x)` -> `x + (C1 - C2)`
*	Optimize negative one on LHS for some shift operations (#3292)	Max Graey	2020-10-29	1	-2/+13
\|
*	Fix pow2 util and avoid pow2 for left shifting in ZeroRemover (#3293)	Max Graey	2020-10-28	1	-8/+10
\| \| \| \| \| \| \| \|	Fixes a fuzz bug that was triggered by https://github.com/WebAssembly/binaryen/pull/3015#issuecomment-718001620 but was actually a pre-existing bug in pow2, that that PR just happened to uncover.
*	Propagate sign to constants for float point expressions (#3289)	Max Graey	2020-10-27	1	-12/+25
\|
*	Replace x * 2 with x + x for floats (#3016)	Max Graey	2020-10-27	1	-0/+13
\| \| \|	But only when doing so doesn't require adding a new local.
*	Drop RHS of shift if effective shift is zero (#3209)	Max Graey	2020-10-26	1	-0/+8
\|
*	Сonstant value truncation during store operation (#3117)	Max Graey	2020-10-26	1	-0/+13
\|
*	[NFC] `using namespace Abstract` to make matchers more compact (#3284)	Thomas Lively	2020-10-26	1	-71/+56
\| \| \| \| \| \| \| \| \|	This change makes matchers in OptimizeInstructions more compact and readable by removing the explicit `Abstract::` namespace from individual operations. In some cases, this makes multi-line matcher expressions fit on a single line. This change is only possible because it also adds an explicit "RMW" prefix to each element of the `AtomicRMWOp` enumeration. Without that, their names conflicted with the names of Abstract ops.
*	Optimize relations of subtractions and zero (#3275)	Max Graey	2020-10-25	1	-15/+102
\|
*	OptimizeInstructions: More 64-bit integer patterns (#3015)	Max Graey	2020-10-23	1	-42/+56
\| \| \| \| \|	Extend ZeroRemover and optimizeAddedConstants to handle 64-bit integers as well. Use Literal.makeFromInt64 to make this easier.
*	Add float simplifications for absolute binary expressions (#3013)	Max Graey	2020-10-21	1	-0/+49
\|
*	Optimize signed division when RHS is signed minimum (#3221)	Max Graey	2020-10-20	1	-0/+17
\|
*	Optimize comparisons with 0/1 in boolean context (#3240)	Max Graey	2020-10-18	1	-15/+15
\| \| \| \| \| \| \| \| \| \|	i32(bool(x)) != 0 ==> i32(bool(x)) i64(bool(x)) & 1 ==> i64(bool(x)) Also: * clean up related matching rules in optimizeWithConstantOnRight * add more explanations about isPowerOf2Float & rename to isPowerOfTwoInvertibleFloat
*	Improve partial evaluation (#3236)	Max Graey	2020-10-14	1	-2/+11
\|
*	Optimize power of two float divisions (#3018)	Max Graey	2020-10-13	1	-5/+44
\|
*	Optimize unsigned divisions when rhs is negative constant (#2991)	Max Graey	2020-10-13	1	-7/+22
\| \| \| \| \| \| \| \|	`(uint32_t)x / C` --> `x >= C`, where `C > 2^31` `(uint32_t)x / -1` --> `x != -1` and for `shrinkLevel == 0`: `(uint64_t)x / C` --> `uint64_t(x >= C)`, where `C > 2^63` `(uint64_t)x / -1` --> `x != -1`
*	Refactor naming convention for functions handling tuples (#3196)	Max Graey	2020-10-09	1	-7/+7
\| \| \|	When there are two versions of a function, one handling tuples and the other handling non-tuple values, the previous naming convention was to have "Single" in the name of the non-tuple handling function. This PR simplifies the convention and shortens function names by making the names plural for the tuple-handling version and singular for the non-tuple-handling version.
*	Remove old attempted DSL beginnings from OptimizeInstructions (#3200)	Alon Zakai	2020-10-08	1	-23/+0
\| \| \| \| \| \|	Wasm turned out to not be that good for a DSL for such peephole optimizations, so that never made progress. Meanwhile we have the new matcher stuff which works well.
*	Add optimization rules for some shift operations (#3099)	Max Graey	2020-10-07	1	-0/+35
\| \| \|	Specifically, truncates constant shift values that are greater than the number of bits available and optimizes out explicit masking of the shift value that is redundant with the implicit masking performed by shift operations.
*	Revert some changes for #3193 (#3197)	Max Graey	2020-10-06	1	-15/+14
\| \| \|	`(signed)x % (i32\|i64).min_s ==> (x & (i32\|i64).max_s)` is not valid unless compared to zero.
*	fast-math: Fold `fp * -1` to `-fp` (#3189)	Max Graey	2020-10-05	1	-2/+5
\|
*	Generalize transforms for #3153 (#3193)	Max Graey	2020-10-05	1	-6/+17
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Implement a more general (additional) version of #3153 which also handles negative constant divisors: `(int32)x % -4 == 0` --> `(x & 3) == 0` `x % -C_pot == 0` --> `(x & (abs(C_pot) - 1)) == 0` and special two-complement values as well: `(int32)x % 0x80000000 == 0` --> `(x & 0x7fffffff) == 0` `(int64)x % 0x8000000000000000 == 0` --> `(x & 0x7fffffffffffffff) == 0` as separete rules: `(int32)x % 0x80000000` --> `x & 0x7fffffff` `(int64)x % 0x8000000000000000` --> `x & 0x7fffffffffffffff` The [previous pr](https://github.com/WebAssembly/binaryen/pull/3153) didn't use these possibilities.
*	Ordering correction fix in OptimizeInstructions for #3047 (#3195)	Alon Zakai	2020-10-05	1	-2/+12
\| \| \| \| \| \| \| \| \| \| \| \|	(found by the fuzzer) It is not valid to replace x \| (y \| x) ==> y \| x, if x, y cannot be reordered. It is also not valid to replace x ^ (y ^ x) ==> y, if x, y cannot be reordered, for a more subtle reason: if they cannot be reordered then y can affect the value of x (the opposite is not possible as we checked x for side effects so that we could remove one copy). If so, then the second appearance of x could be different, if e.g. it reads a local y writes to. Whereas, if it's ok to reorder, then it's ok to do x ^ (y ^ x) ==> x ^ (x ^ y) ==> y.
*	Optimize "clear bit mask" combination to cyclic rotation over preinverted ↵	Max Graey	2020-10-01	1	-0/+14
\| \| \| \|	mask (#3184)
*	Add comment about signed => unsigned lowering (#3187)	Max Graey	2020-10-01	1	-0/+5
\|
*	Clean up support/bits.h (#3177)	Thomas Lively	2020-09-30	1	-9/+9
\| \| \| \| \|	Use overloads instead of templates where applicable and change function names from PascalCase to camelCase. Also puts the functions in the Bits namespace to avoid naming conflicts.
*	Add --fast-math mode (#3155)	Alon Zakai	2020-09-30	1	-8/+11
\| \| \| \| \| \| \| \| \| \| \| \|	Similar to clang and gcc, --fast-math makes us ignore corner cases of floating-point math like NaN changes and (not done yet) lack of associativity and so forth. In the future we may want to have separate fast math flags for each specific thing, like gcc and clang do. This undoes some changes (#2958 and #3096) where we assumed it was ok to not change NaN bits, but @binji corrected us. We can only do such things in fast math mode. This puts those optimizations behind that flag, adds tests for it, and restores the interpreter to the simpler code from before with no special cases.
*	Fold i32.eqz(wrap_i64(x)) -> i64.eqz(x) where possible (#3181)	Max Graey	2020-09-30	1	-0/+12
\| \| \|	Specifically, when `x` has at most 32 bits so that wrapping doesn't change its value.
*	Simplify signed remainders compared with zero (#3153)	Max Graey	2020-09-29	1	-9/+29
\| \| \| \| \| \|	Specifically when the divisor is a power of two. `eqz((signed)x % C_pot)` -> `eqz(x & (C_pot - 1))` `(signed)x % C_pot != 0` -> `x & (C_pot - 1) != 0`
*	Add also non-equal with zero simplification for boolean context (#3178)	Max Graey	2020-09-29	1	-2/+3
\|
*	Lower signed binops to unsigned binops when possible (#2988)	Max Graey	2020-09-28	1	-10/+59
\| \| \|	This can unlock further instruction optimizations that do not apply to signed operations.
*	Expression matching API (#3134)	Thomas Lively	2020-09-18	1	-352/+368
\| \| \| \| \| \| \| \| \| \| \|	Provides an easily extensible layered API for matching expression patterns and extracting their components. The low-level API provides modular building blocks for creating matchers for any data type and the high-level API provides a succinct and flexible interface for matching expressions and extracting useful information from them. Matchers are currently provided for Const, Unary, Binary, and Select instructions. Adding a matcher for a new type of expression is straightforward enough that I expect to add them as they become useful as part of other changes.
*	Initial implementation of "Memory64" proposal (#3130)	Wouter van Oortmerssen	2020-09-18	1	-6/+12
\| \| \|	Also includes a lot of new spec tests that eventually need to go into the spec repo
*	Optimize binary operations with 1-bit on lhs and 1 const on rhs (#2948)	Max Graey	2020-09-17	1	-10/+45
\| \| \| \| \| \| \|	`expr \| 1` --> `1` `expr & 1` --> `expr` `expr == 1` --> `expr` `expr != 1` --> `!expr` where `maxBits(expr) == 1` i.e `expr` is boolean
*	Unary and binary duplicate expression elimination (#3047)	Max Graey	2020-09-17	1	-15/+139
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Simplifies patterns in which an expression is applied twice to its operands. `abs(abs(x))` -> `abs(x)` `ceil(ceil(x))` -> `ceil(x)` `floor(floor(x))` -> `floor(x)` `trunc(trunc(x))` -> `trunc(x)` `nearest(nearest(x))` -> `nearest(x)` `eqz(eqz(bool(x)))` -> `bool(x)` `sext(sext(x))` -> `sext(x)` `neg(neg(x))` -> `x` `y - (y - x)` -> `x` `(x ^ y) ^ y` -> `x` `(x \| y) \| y` -> `x \| y` `(x & y) & y` -> `x & y` `(x % y) % y` -> `x % y`
*	Add float operations for isSymmetric util (#3127)	Max Graey	2020-09-14	1	-2/+25
\| \| \|	Add floating point Eq and Ne operators to Properties::isSymmetric. Also treat additional float ops as symmetric specifically in OptimizeInstructions when their operands are known to be non-NaN.
*	Simplify subtracting zero from float expressions (#3125)	Max Graey	2020-09-13	1	-0/+21
\| \| \| \| \| \| \|	`x - 0.0` -> `x` `x + (-0.0)` -> `x` `x - (-0.0)` -> `x + 0.0` where `x` is `f32` or `f64`.
*	also drop size for memory.copy(x, x, y) (#3075)	Max Graey	2020-08-24	1	-2/+5
\| \| \|	This fixes a bug in which a side effect in the calculation of the size could be lost.
*	memory.copy: use nop reductions only for ignoreImplicitTraps (#3074)	Max Graey	2020-08-24	1	-3/+15
\| \| \| \| \| \| \| \| \|	According to changes in spec: WebAssembly/bulk-memory-operations#124 WebAssembly/bulk-memory-operations#145 we unfortunately can't fold to nop even for memory.copy(x, y, 0). So this PR revert all reductions to nop but do this only under ignoreImplicitTraps flag
*	Remove optimization for memory.copy(x, x, C) (#3073)	Max Graey	2020-08-23	1	-11/+1
\| \| \| \| \|	That can trap, so we can only remove it if traps are ignored, which was not handled properly. Revert it as we consider the options.
*	OptimizeInstructions on memory.copy: check size for side effect as well (#3072)	Max Graey	2020-08-23	1	-0/+2
\| \| \|	Fix issue found by fuzzer: #3038 (comment)
*	Optimize bulk memory.copy (#3038)	Max Graey	2020-08-22	1	-0/+68
\| \| \|	Replace it with a load and a store when the size is a small constant and remove it entirely when it would be a nop.