Unexpectedly changing the code when encountering parentheses in arrow function body #512

kevin940726 · 2018-06-25T05:05:22Z

Related to #504, #505.

Recast version: 0.15.0

As @cspotcode fixed a issue in #504, now recast will unexpectedly transform the code while it shouldn't.

Consider the following code

const x = () => ({})["foo"];

Running with recast will now print the following code back

const x = () => ({}["foo"]);

Note that the ending parentheses position has changed and therefore changing the source code unexpectedly. This will not happen when running with recast@0.14.7.

Any idea how to fix it?

The text was updated successfully, but these errors were encountered:

Recast has suffered for a long time because it did not have reliable access to the lexical analysis of source tokens during reprinting. Most importantly, accurate token information could be used to detect whether a node was originally wrapped with parentheses, even if the parentheses are separated from the node by comments or other incidental non-whitespace text, such as trailing commas. Here are just some of the issues that have resulted from the lack of reliable token information: - #533 - #528 - #513 - #512 - #366 - #327 - #286 With this change, every node in the AST returned by recast.parse will now have a node.loc.tokens array representing the entire sequence of original source tokens, as well as node.loc.{start,end}.token indexes into this array of tokens, such that node.loc.tokens.slice( node.loc.start.token, node.loc.end.token ) returns a complete list of all source tokens contained by the node. Note that some nodes (such as comments) may contain no source tokens, in which case node.loc.start.token === node.loc.end.token, which will be the index of the first token *after* the position where the node appeared. Most parsers can expose token information for free / very cheaply, as a byproduct of the parsing process. In case a custom parser is provided that does not expose token information, we fall back to Esprima's tokenizer. While there is considerable variation between different parsers in terms of AST format, there is much less variation in tokenization, so the Esprima tokenizer should be adequate in most cases (even for JS dialects like TypeScript). If it is not adequate, the caller should simply ensure that the custom parser exposes an ast.tokens array containing token objects with token.loc.{start,end}.{line,column} information.

Previously fixed by #505.

Recast has suffered for a long time because it did not have reliable access to the lexical analysis of source tokens during reprinting. Most importantly, accurate token information could be used to detect whether a node was originally wrapped with parentheses, even if the parentheses are separated from the node by comments or other incidental non-whitespace text, such as trailing commas. Here are just some of the issues that have resulted from the lack of reliable token information: - #533 - #528 - #513 - #512 - #366 - #327 - #286 With this change, every node in the AST returned by recast.parse will now have a node.loc.tokens array representing the entire sequence of original source tokens, as well as node.loc.{start,end}.token indexes into this array of tokens, such that node.loc.tokens.slice( node.loc.start.token, node.loc.end.token ) returns a complete list of all source tokens contained by the node. Note that some nodes (such as comments) may contain no source tokens, in which case node.loc.start.token === node.loc.end.token, which will be the index of the first token *after* the position where the node appeared. Most parsers can expose token information for free / very cheaply, as a byproduct of the parsing process. In case a custom parser is provided that does not expose token information, we fall back to Esprima's tokenizer. While there is considerable variation between different parsers in terms of AST format, there is much less variation in tokenization, so the Esprima tokenizer should be adequate in most cases (even for JS dialects like TypeScript). If it is not adequate, the caller should simply ensure that the custom parser exposes an ast.tokens array containing token objects with token.loc.{start,end}.{line,column} information.

Previously fixed by #505.

kevin940726 · 2019-12-12T07:47:21Z

I think it's already been fixed

benjamn mentioned this issue Sep 10, 2018

Use node.loc.tokens to improve handling of parentheses. #537

Merged

8 tasks

benjamn added a commit that referenced this issue Sep 10, 2018

Alternate fix for issues #504 and #512.

3d45d4a

Previously fixed by #505.

benjamn added a commit that referenced this issue Sep 11, 2018

Alternate fix for issues #504 and #512.

02cb38c

Previously fixed by #505.

kevin940726 closed this as completed Dec 12, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unexpectedly changing the code when encountering parentheses in arrow function body #512

Unexpectedly changing the code when encountering parentheses in arrow function body #512

kevin940726 commented Jun 25, 2018

kevin940726 commented Dec 12, 2019

Unexpectedly changing the code when encountering parentheses in arrow function body #512

Unexpectedly changing the code when encountering parentheses in arrow function body #512

Comments

kevin940726 commented Jun 25, 2018

kevin940726 commented Dec 12, 2019