forks/binaryen.git -

	Commit message (Collapse)	Author	Age	Files	Lines
*	Use IRBuilder in the binary parser (#6963)	Thomas Lively	2024-11-26	1	-13/+19
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	IRBuilder is a utility for turning arbitrary valid streams of Wasm instructions into valid Binaryen IR. It is already used in the text parser, so now use it in the binary parser as well. Since the IRBuilder API for building each intruction requires only the information that the binary and text formats include as immediates to that instruction, the parser is now much simpler than before. In particular, it does not need to manage a stack of instructions to figure out what the children of each expression should be; IRBuilder handles this instead. There are some differences between the IR constructed by IRBuilder and the IR the binary parser constructed before this change. Most importantly, IRBuilder generates better multivalue code because it avoids eagerly breaking up multivalue results into individual components that might need to be immediately reassembled into a tuple. It also parses try-delegate more correctly, allowing the delegate to target arbitrary labels, not just other `try`s. There are also a couple superficial differences in the generated label and scratch local names. As part of this change, add support for recording binary source locations in IRBuilder.
*	Source map fixes (#6550)	Jérôme Vouillon	2024-05-02	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* Keep debug locations at function start The `fn_prolog_epilog.debugInfo` test is failing otherwise, since there was debug information associated to the nop instruction at the beginning of the function. * Do not clear the debug information when reaching the end of the source map The last segment should extend to the end of the function. * Propagate debug location from the function prolog to its first instruction * Fix printing of epilogue location The text parser no longer propagates locations to the epilogue, so we should always print the location if there is one. * Fix debug location smearing The debug location of the last instruction should not smear into the function epilogue, and a debug location from a previous function should not smear into the prologue of the current function.
*	Require `then` and `else` with `if` (#6201)	Thomas Lively	2024-01-04	1	-5/+5
\| \| \| \| \| \| \| \| \| \| \| \|	We previously supported (and primarily used) a non-standard text format for conditionals in which the condition, if-true expression, and if-false expression were all simply s-expression children of the `if` expression. The standard text format, however, requires the use of `then` and `else` forms to introduce the if-true and if-false arms of the conditional. Update the legacy text parser to require the standard format and update all tests to match. Update the printer to print the standard format as well. The .wast and .wat test inputs were mechanically updated with this script: https://gist.github.com/tlively/85ae7f01f92f772241ec994c840ccbb1
*	Improve debug info printing with depth (#5903)	JesseChen	2023-08-28	1	-0/+5
\| \| \| \| \| \|	Skip repeated identical debug info only of more-nested nodes. Before this PR we skipped sibling nodes and even parent nodes, which could be confusing. After this PR there is a more clear connection: child nodes have the same debug location as the parent, by default, and so there is no need to print it again.
*	Simplify and consolidate type printing (#5816)	Thomas Lively	2023-08-24	1	-5/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When printing Binaryen IR, we previously generated names for unnamed heap types based on their structure. This was useful for seeing the structure of simple types at a glance without having to separately go look up their definitions, but it also had two problems: 1. The same name could be generated for multiple types. The generated names did not take into account rec group structure or finality, so types that differed only in these properties would have the same name. Also, generated type names were limited in length, so very large types that shared only some structure could also end up with the same names. Using the same name for multiple types produces incorrect and unparsable output. 2. The generated names were not useful beyond the most trivial examples. Even with length limits, names for nontrivial types were extremely long and visually noisy, which made reading disassembled real-world code more challenging. Fix these problems by emitting simple indexed names for unnamed heap types instead. This regresses readability for very simple examples, but the trade off is worth it. This change also reduces the number of type printing systems we have by one. Previously we had the system in Print.cpp, but we had another, more general and extensible system in wasm-type-printing.h and wasm-type.cpp as well. Remove the old type printing system from Print.cpp and replace it with a much smaller use of the new system. This requires significant refactoring of Print.cpp so that PrintExpressionContents object now holds a reference to a parent PrintSExpression object that holds the type name state. This diff is very large because almost every test output changed slightly. To minimize the diff and ease review, change the type printer in wasm-type.cpp to behave the same as the old type printer in Print.cpp except for the differences in name generation. These changes will be reverted in much smaller PRs in the future to generally improve how types are printed.
*	Fix sourcemap nesting in reading and writing (#5504)	JesseCodeBones	2023-02-24	1	-1/+1
\| \| \| \|	The stack logic was incorrect, and led to source locations being emitted on parents instead of children.
*	Remove Type ordering (#3793)	Thomas Lively	2021-05-18	1	-3/+3
\| \| \| \| \| \| \| \| \|	As found in #3682, the current implementation of type ordering is not correct, and although the immediate issue would be easy to fix, I don't think the current intended comparison algorithm is correct in the first place. Rather than try to switch to using a correct algorithm (which I am not sure I know how to implement, although I have an idea) this PR removes Type ordering entirely. In places that used Type ordering with std::set or std::map because they require deterministic iteration order, this PR uses InsertOrdered{Set,Map} instead.
*	Assign import names consistently between text and binaryn reader (#3238)	Sam Clegg	2020-10-14	1	-4/+4
\| \| \| \| \| \| \| \| \|	The s-parser was assigning numbers names per-type where as the binaryn reader was using the global import count as the number to append. This change switches to use per-element count which I think it preferable as it increases the stability of the auto-generated names. e.g. memory is now always named `$mimport0`.
*	Prototype extended-name-section proposal (#3162)	Daniel Wirtz	2020-09-29	1	-1/+1
\| \| \|	Implements the parts of the Extended Name Section Proposal that are trivially applicable to Binaryen, in particular table, memory and global names. Does not yet implement label, type, elem and data names.
*	Remove function index printing (#2742)	Thomas Lively	2020-04-09	1	-7/+7
\| \| \| \| \| \| \| \|	`BinaryIndexes` was only used in two places (Print.cpp and wasm-binary.h), so it didn't seem to be a great fit for module-utils.h. This change moves it to wasm-binary.h and removes its usage in Print.cpp. This means that function indexes are no longer printed, but those were of limited utility and were the source of annoying noise when updating tests, anyway.
*	Remove FunctionType (#2510)	Thomas Lively	2019-12-11	1	-12/+12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Function signatures were previously redundantly stored on Function objects as well as on FunctionType objects. These two signature representations had to always be kept in sync, which was error-prone and needlessly complex. This PR takes advantage of the new ability of Type to represent multiple value types by consolidating function signatures as a pair of Types (params and results) stored on the Function object. Since there are no longer module-global named function types, significant changes had to be made to the printing and emitting of function types, as well as their parsing and manipulation in various passes. The C and JS APIs and their tests also had to be updated to remove named function types.
*	Print custom section contents if printable (#2326)	Ingvar Stepanyan	2019-09-06	1	-1/+1
\| \| \|	This helps with debugging human-readable sections like sourceMappingURL.
*	Massive renaming (#1855)	Thomas Lively	2019-01-07	1	-60/+60
\| \| \| \| \| \|	Automated renaming according to https://github.com/WebAssembly/spec/issues/884#issuecomment-426433329.
*	Emit imports before defined things in text format (#1715)	Alon Zakai	2018-11-01	1	-1/+1
\| \| \| \| \|	That is the correct order in the text format, wabt errors otherwise. See AssemblyScript/assemblyscript#310
*	Unify imported and non-imported things (#1678)	Alon Zakai	2018-09-19	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Fixes #1649 This moves us to a single object for functions, which can be imported or nor, and likewise for globals (as a result, GetGlobals do not need to check if the global is imported or not, etc.). All imported things now inherit from Importable, which has the module and base of the import, and if they are set then it is an import. For convenient iteration, there are a few helpers like ModuleUtils::iterDefinedGlobals(wasm, [&](Global* global) { .. use global .. }); as often iteration only cares about imported or defined (non-imported) things.
*	Add debug information locations to the function prolog/epilog (#1674)	Yury Delendik	2018-09-17	1	-4/+0
\| \| \| \| \| \| \|	The current patch: * Preserves the debug locations from function prolog and epilog * Preserves the debug locations of the nested blocks
*	Binary format local parsing fixes (#1664)	Alon Zakai	2018-09-11	1	-55/+55
\| \| \| \| \| \|	* Error if there are more locals than browsers allow (50,000). We usually just warn about stuff like this, but we do need some limit (or else we hang or OOM), and if so, why not use the agreed-upon Web limit. * Do not generate nice string names for locals in binary parsing - the name is just $var$x instead of $x, so not much benefit, and worse as our names are interned this is actually slow (which is why the fuzz testcase here hangs instead of OOMing). Testcases and bugreport in #1663.
*	ensure unique import names for each type, by giving them a prefix, avoiding ↵	Alon Zakai	2018-02-22	1	-20/+20
\| \| \| \|	collisions between say a global import and a function with a name from the name section that happens to match it (#1424)
*	Emit binary function index in comment in text format, for convenience (#1232)	Alon Zakai	2017-10-20	1	-7/+7
\|
*	Avoid new blocks in binary reading/writing (#1165)	Alon Zakai	2017-09-12	1	-12/+12
\| \| \| \| \| \|	* don't emit a toplevel block if we don't need to, as in wasm it is a list context * don't create unnecessary blocks in wasm reading
*	Ignore unreachable code in wasm binaries (#1122)	Alon Zakai	2017-08-22	1	-5/+0
\| \| \|	Ignoring unreachable code in wasm binaries lets us avoid corner cases with unstructured code in wasm binaries that is a poor fit for Binaryen's structured IR.
*	Exporting/importing debug location information from .wast/.asm.js/.s formats ↵	Yury Delendik	2017-06-01	1	-0/+228
	(#1017) * Extends wasm-as, wasm-dis and s2wasm to consume debug locations. * Exports source map from asm2wasm