forks/binaryen.git -

	Commit message (Collapse)	Author	Age	Files	Lines
*	subtype-exprs.h additions [NFC] (#6323)	Alon Zakai	2024-02-20	1	-8/+31
\| \| \| \| \| \|	This pulls out the subtype-exprs.h parts of #6108 These are NFC in the current codebase, but are fixes for that unlanded PR, and another unrelated PR that will be opened shortly.
*	StringLowering: Escape the JSON in the custom section (#6316)	Alon Zakai	2024-02-20	4	-12/+104
\| \| \| \|	Also add an end-to-end test using node to verify we can parse the escaped content properly using TextDecoder+JSON.parse.
*	JS Bindings: Use stringToUTF8OnStack instead of deprecated ↵	Alon Zakai	2024-02-20	1	-1/+1
\| \| \| \| \|	allocateUTF8OnStack (#6324) This avoids a warning on recent Emscripten.
*	[Parser] Simplify the lexer interface (#6319)	Thomas Lively	2024-02-20	3	-318/+252
\| \| \| \| \| \| \| \| \| \| \|	The lexer was previously an iterator over tokens, but that expressivity is not actually used in the parser. Instead, we have `input.h` that adapts the token iterator interface into an iterface that is actually useful. As a first step toward simplifying the lexer implementation to no longer be an iterator over tokens, update its interface by moving the adaptation from input.h to the lexer itself. This requires extensive changes to the lexer unit tests, which will not have to change further when we actually simplify the lexer implementation.
*	SetGlobals: Fix segfault on invalid input (#6321)	Nikolay Khitrin	2024-02-20	1	-1/+1
\|
*	StringLowering: Lower nulls in call params (#6317)	Alon Zakai	2024-02-20	1	-0/+10
\|
*	StringLowering: Properly handle nullable inputs to StringAs (#6307)	Alon Zakai	2024-02-14	1	-1/+11
\| \| \|	StringAs's output must be non-nullable, so add a cast.
*	StringLowering: Fix up nulls written to struct.new fields (#6306)	Alon Zakai	2024-02-14	1	-16/+36
\|
*	Strings: Add some interpreter support (#6304)	Alon Zakai	2024-02-14	2	-4/+57
\| \| \| \| \| \| \|	This adds just enough support to be able to --fuzz-exec a small but realistic fuzz testcase from Java. To that end, just implement the minimal ops we need, which are all related to JS-style strings.
*	[NFC] Avoid a warning on an unused var (#6300)	Alon Zakai	2024-02-14	1	-1/+2
\|
*	StringLowering: Use an array16 type in its own rec group (#6302)	Alon Zakai	2024-02-13	1	-9/+25
\| \| \| \| \| \| \| \| \| \| \| \|	The input module might use an array of 16-bit elements type that is somewhere in a giant rec group, but that is not valid for imported strings: that array type is now on an import and must match the expected ABI, which is to be in its own personal rec group. The old array16 type remains in the module after this transformation, but all uses of it are replaced with uses of the new array16 type. Also move makeImports to after updateTypes: there are no types to update in the new imports. That does not matter but it can make debugging less pleasant, so improve it.
*	Fix --spill-pointers for the stack growing down (#6294)	YAMAMOTO Takashi	2024-02-13	1	-11/+11
\| \| \| \|	The LLVM wasm backend grows the stack downwards, and this pass did not fully account for that before.
*	StringLowering: Hack around if issue with bottom types (#6303)	Alon Zakai	2024-02-13	1	-0/+21
\| \| \| \| \|	Replacing the string heap type with extern is dangerous as they do not share top/bottom types. In practice this works out almost everywhere except for a few ifs, which we can fix up as a hack for now.
*	StringLowering: Modify string=>extern also in public types (#6301)	Alon Zakai	2024-02-13	3	-5/+31
\| \| \| \|	We want to actually remove all stringref appearances, in both public and private types.
*	Precompute: Optimize array.len (#6299)	Alon Zakai	2024-02-12	1	-1/+1
\| \| \|	Arrays have immutable length, so we can optimize them like immutable fields.
*	Fuzzer: Do not emit huge and possibly non-validating tables (#6288)	Alon Zakai	2024-02-12	1	-0/+17
\|
*	[Parser] Parse `resume` (#6295)	Thomas Lively	2024-02-09	4	-11/+97
\|
*	[Parser] Support references to struct fields by name (#6293)	Thomas Lively	2024-02-08	2	-11/+28
\| \| \| \|	Construct a mapping from heap type and field name to field index, then use it while parsing instructions.
*	Update lit tests to parse with the new parser (#6290)	Thomas Lively	2024-02-08	1	-1/+1
\| \| \| \| \| \| \| \| \|	Get as many of the lit tests as possible to parse with the new parser, mostly by moving declared module items to be after imports. Also fix a bug in the new parser's pop validation to allow supertypes of the expected type. The two big issues that still prevent some lit tests from working correctly under the new parser are missing support for symbolic field names and missing support for source map annotations.
*	Remove support for legacy stringref text syntax (#6289)	Thomas Lively	2024-02-08	1	-85/+16
\| \| \| \|	Removing support for the legacy syntax will allow us to avoid implementing support for it in the new text parser.
*	[NFC] Add links to specs in StringLowering (#6292)	Alon Zakai	2024-02-08	1	-0/+4
\|
*	Add a pass to propagate global constants to other globals (#6287)	Alon Zakai	2024-02-08	3	-2/+21
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	SimplifyGlobals already does this, so this is a subset of that pass, and does not add anything new. It is useful for testing, however. In particular it allows testing that we propagate subsequent globals in a single pass, that is if one global reads from another and becomes constant, then it can be propagated as well. SimplifyGlobals runs multiple passes so this always worked, but with this pass we can test that we do it efficiently in one pass. This will also be useful for comparing stringref to imported strings, as it allows gathered strings to be propagated to other globals (possible with stringref, but not imported strings) but not anywhere else (which might have downsides as it could lead to more allocations). Also add an additional test for simplify-globals that we do not get confused by an unoptimizable global.get in the middle (see last part).
*	StringLowering: Lower all remaining important string operations (#6283)	Alon Zakai	2024-02-08	1	-0/+84
\| \| \|	All those in the list from #6271 (comment)
*	[Parser] Do not involve IRBuilder for imported functions (#6286)	Thomas Lively	2024-02-07	4	-13/+14
\| \| \| \| \| \| \| \| \| \|	We previously had a bug where we would begin and end an IRBuilder context for imported functions even though they don't have bodies. For functions that return results, ending this empty scope should have produced an error except that we had another bug where we only produced that error for multivalue functions. We did not previously have imported multivalue functions in wat-kitchen-sink.wast, so both of these bugs went undetected. Fix both bugs and update the test to include an imported multivalue function so that it would have failed without this fix.
*	SimplifyGlobals: Propagate constant globals into nested gets in other ↵	Alon Zakai	2024-02-07	1	-2/+4
\| \| \| \| \|	globals (#6285) Before we propagated to the top level, but not to anything interior.
*	Get more tests working with the new text parser (#6284)	Thomas Lively	2024-02-07	2	-0/+4
\| \| \| \| \| \| \| \|	The new parser enforces the rule that imports must come before declarations (except for type declarations). The old parser does not enforce this rule, so many of our tests did not follow it. Fix them to follow that rule and fix other invalid syntax. Also add missing finalization of Load expressions in wasm-builder.h that was causing a test to fail under the new parser and guard against an error case in wasm-ir-builder.cpp that used to cause a segfault.
*	[NFC] Move code to string.cpp (#6282)	Thomas Lively	2024-02-06	2	-84/+92
\| \| \| \|	Now that we have a .cpp file, none of the code that was in string.h needs to be in a header any more.
*	StringLowering: Start to lower instructions (#6281)	Alon Zakai	2024-02-06	1	-0/+82
\|
*	Properly stringify names in tests (#6279)	Thomas Lively	2024-02-06	7	-130/+199
\| \| \| \| \| \| \| \| \| \| \| \| \|	Update identifiers used in tests to use a format supported by the new text parser, i.e. either the standard format with its limited set of allowed characters or the non-standard `$"..."` format. Notably, any name containing square or curly braces now uses the string format. Input automatically updated with this script: https://gist.github.com/tlively/4e22311736661849e641d02e521a0748 The printer is updated to properly escape names in more places as well. The logic for escaping names is moved to a common location so that the type printing logic in wasm-type.cpp can use it as well.
*	[Parser] Support string-style identifiers (#6278)	Thomas Lively	2024-02-06	2	-29/+68
\| \| \| \| \| \| \| \| \| \|	In addition to normal identifiers, support parsing identifiers of the format `$"..."`. This format is not yet allowed by the standard, but it is a popular proposed extension (see https://github.com/WebAssembly/spec/issues/617 and https://github.com/WebAssembly/annotations/issues/21). Binaryen has historically allowed a similar format and has supported arbitrary non-standard identifier characters, so it's much easier to support this extended syntax than to fix everything to use the restricted standard syntax.
*	Make `array.new_fixed` length annotations mandatory (#6277)	Thomas Lively	2024-02-06	1	-11/+5
\| \| \| \| \|	They were previously optional to ease the transition to the standard text format, but now we can make them mandatory to match the spec. This will simplify the new text parser as well.
*	[EH] Add --experimental-new-eh option to wasm-opt (#6270)	Heejin Ahn	2024-02-06	1	-2/+37
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds `--experimental-new-eh` option to `wasm-opt`. The difference between this and `--translate-to-new-eh` is, `--translate-to-new-eh` just runs `TranslateToNewEH` pass, while `--experimental-new-eh` attaches `TranslateToNewEH` pass at the end of the whole optimization pipeline. So if no other passes or optimization options (`-On`) are specified, it is equivalent to `--translate-to-new-eh`. If other optimization passes are specified, it runs them and at the end run the translator to ensure the new EH instructions are emitted. The reason we are doing this this way is that the optimization pipeline as a whole does not support the new EH instruction yet, but we would like to provide an option to emit a reasonably OK code with the new EH instructions. This also means when the optimization level > 3, it will also run the StackIR + local2stack optimization after the translation. Not sure how to test the output of this option, given that there is not much point in testing the default optimization passes, and it is also not clear how to print the stack IR if the stack ir generation and optimization runs as a part of the pipeline and not the explicit command line options. This is created in favor of #6267, which added the option to `optimization-options.h`. It had a problem of running the translator multiple times when `-On` was given multiple times in the command line, which I learned was rather a common usage. This adds the option directly to `wasm-opt.cpp`, which avoids the problem. With this, it is still possible to create and optimize Stack IR unnecessarily, but that feels a better alternative.
*	StringLowering pass (#6271)	Alon Zakai	2024-02-05	3	-4/+64
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This extends StringGathering by replacing the gathered string globals to imported globals. It adds a custom section with the strings that the imports are expected to provide. It also replaces the string type with extern. This is a complete lowering of strings, except for string operations that are a TODO. After running this, no strings remain in the wasm, and the outside JS is expected to provide the proper imports, which it can do by processing the JSON of the strings in the custom section "string.consts", which looks like ["foo", "bar", ..] That is, an array of strings, which are imported as (import "string.const" "0" (global $string.const_foo (ref extern))) ;; foo (import "string.const" "1" (global $string.const_bar (ref extern))) ;; bar
*	wasm-ctor-eval: Properly eval strings (#6276)	Alon Zakai	2024-02-05	1	-8/+3
\| \| \| \| \| \| \|	#6244 tried to do this but was not quite right. It treated a string like an array or a struct, which means create a global for it. But just creating a global isn't enough, as it needs to also be sorted in the right place etc. which requires changes in other places. But there is a much simpler solution here: string constants are just constants, which we can emit in-line, so do that.
*	[Parser] Parse v128.const (#6275)	Thomas Lively	2024-02-05	5	-1/+142
\|
*	[Parser] Templatize lexing of integers (#6272)	Thomas Lively	2024-02-05	4	-108/+50
\| \| \| \| \| \|	Have a single implementation for lexing each of unsigned, signed, and uninterpreted integers, each generic over the bit width of the integer. This reduces duplication in the existing code and it will make it much easier to support lexing more 8- and 16-bit integers.
*	MemoryPacking: Handle non-empty trapping segments (#6261)	Alon Zakai	2024-02-01	1	-9/+68
\| \| \|	Followup to #6243 which handled empty ones.
*	JSON: Add simple printing and creation (#6265)	Alon Zakai	2024-02-01	3	-0/+46
\|
*	C API: Use segment names (#6254)	ericvergnaud	2024-02-01	3	-64/+74
\| \| \| \| \| \| \| \| \|	Move from segment indexes to names. This is a breaking change to make the API more capable and consistent. An effort has been made to reduce the burden on C API users where possible (specifically, you can avoid providing names and let Binaryen make them for you, which will basically be numbers that match the indexes from before). Fixes #6247
*	Allow updating basic HeapTypes in GlobalTypeRewriter::mapTypes (#6266)	Alon Zakai	2024-02-01	1	-3/+0
\|
*	GUFA: Propagate string literals (#6262)	Alon Zakai	2024-02-01	1	-1/+2
\| \| \|	We only noted the type but not the literal value.
*	Revert "Stop propagating/inlining string constants (#6234)" (#6258)	Alon Zakai	2024-01-31	2	-22/+6
\| \| \| \| \| \| \| \| \| \|	This reverts commit 9090ce56fcc67e15005aeedc59c6bc6773220f11. This has the effect of once more propagating string constants from globals to other places (and from non-globals too), which is useful for various optimizations even if it isn't useful in the final output. To fix the final output problem, #6257 added a pass that is run at the end to collect string.const to globals, which allows us to once more propagate strings in the optimizer, now without a downside.
*	StringGathering pass (#6257)	Alon Zakai	2024-01-31	5	-1/+190
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This pass finds all string.const and creates globals for them. After this transform, no string.const appears anywhere but in a global, and each string appears in one global which is then global.get-ed everywhere. This avoids overhead in VMs where executing a string.const is an allocation, and is also a good step towards imported strings. For that, this pass will be extended from gathering to a full lowering pass, which will first gather into globals as this pass does, and then turn each of those globals with a string.const into an imported externref. (For that reason this pass is in a file called StringLowering, as the two passes will share much of their code, and the larger pass should decide the name I think.) This pass runs in -O2 and above. Repeated executions have no downside (see details in code).
*	[PostEmscripten] Fix calcSegmentOffsets for large offsets (#6260)	Sam Clegg	2024-01-31	1	-4/+3
\| \| \| \|	Specifically offsets larger than 2^32 which were being interpreted misinterpreted here as very large int64_t values.
*	[EH] Change translator option name (#6259)	Heejin Ahn	2024-01-30	3	-24/+7
\| \| \| \| \| \|	The previous name feels too verbose and unwieldy. This also removes the "new-to-old EH" placeholder. I think it'd be better to add it back when it is actually added.
*	[Parser] Parse start declarations (#6256)	Thomas Lively	2024-01-30	3	-0/+37
\|
*	Directize: Handle overflows and out of bounds (#6255)	Alon Zakai	2024-01-30	1	-1/+8
\|
*	[Parser] Parse pops (by doing nothing) (#6252)	Thomas Lively	2024-01-30	4	-3/+33
\| \| \| \| \| \| \| \| \| \| \| \| \|	Parse pop expressions and check that they have the expected types, but do not actually create new Pop expressions or push anything onto the stack because we already create Pop expressions as necessary when visiting the beginning of catch blocks. Unlike the legacy text parser, the new text parser is not capable of parsing pops in invalid locations in the IR. This means that the new text parser will never be able to parse test/lit/catch-pop-fixup-eh-old.wast, which deliberately parses invalid IR to check that the pops can be fixed up and moved to the correct locations. It should be acceptable to delete that test when we turn on the new parser by default, though, so that won't be a problem.
*	Update pop text syntax (#6251)	Thomas Lively	2024-01-29	2	-9/+5
\| \| \| \| \| \|	Rather than `(pop valtype*)`, use `(pop valtype)`, where `valtype` is now allowed to be a tuple. This will make it possible to parse un-folded multivalue pops in the new text parser. The alternative would have been to put an arity in the syntax like we have for other tuple instructions, but that's much uglier.
*	[Parser] Parse local.set and global.set of tuple values correctly (#6250)	Thomas Lively	2024-01-29	2	-0/+20
\| \| \| \|	These instructions always pop a single value, except when tuples are involved, in which case they need special handling to know how many values to pop.