diff --git a/spec.html b/spec.html index d7e6e45890a..fc8fb40ec0e 100644 --- a/spec.html +++ b/spec.html @@ -16043,40 +16043,7 @@

Syntax

Unicode Format-Control Characters

The Unicode format-control characters (i.e., the characters in category “Cf” in the Unicode Character Database such as LEFT-TO-RIGHT MARK or RIGHT-TO-LEFT MARK) are control codes used to control the formatting of a range of text in the absence of higher-level protocols for this (such as mark-up languages).

It is useful to allow format-control characters in source text to facilitate editing and display. All format control characters may be used within comments, and within string literals, template literals, and regular expression literals.

-

U+FEFF (ZERO WIDTH NO-BREAK SPACE) is a format-control character used primarily at the start of a text to mark it as Unicode and to allow detection of the text's encoding and byte order. <ZWNBSP> characters intended for this purpose can sometimes also appear after the start of a text, for example as a result of concatenating files. In ECMAScript source text <ZWNBSP> code points are treated as white space characters (see ).

-

The special treatment of certain format-control characters outside of comments, string literals, and regular expression literals is summarized in .

- - - - - - - - - - - - - - -
- Code Point - - Name - - Abbreviation - - Usage -
- `U+FEFF` - - ZERO WIDTH NO-BREAK SPACE - - <ZWNBSP> - - |WhiteSpace| -
-
+

U+FEFF (ZERO WIDTH NO-BREAK SPACE) is a format-control character used primarily at the start of a text to mark it as Unicode and to allow detection of the text's encoding and byte order. <ZWNBSP> characters intended for this purpose can sometimes also appear after the start of a text, for example as a result of concatenating files. In ECMAScript source text <ZWNBSP> code points are treated as white space characters (see ) outside of comments, string literals, template literals, and regular expression literals.