From 523dbfc312869ba056075b3f001dc3485f89484d Mon Sep 17 00:00:00 2001
From: Alex Burka
Date: Wed, 15 Jun 2016 16:35:03 -0400
Subject: [PATCH 1/2] update reference for #29734

---
 src/doc/reference.md | 8 ++------
 1 file changed, 2 insertions(+), 6 deletions(-)

diff --git a/src/doc/reference.md b/src/doc/reference.md
index fb8ea0f5661d3..6e794321f6444 100644
--- a/src/doc/reference.md
+++ b/src/doc/reference.md
@@ -114,12 +114,8 @@ Non-doc comments are interpreted as a form of whitespace.
 
 ## Whitespace
 
-Whitespace is any non-empty string containing only the following characters:
-
-- `U+0020` (space, `' '`)
-- `U+0009` (tab, `'\t'`)
-- `U+000A` (LF, `'\n'`)
-- `U+000D` (CR, `'\r'`)
+Whitespace is any non-empty string containing only characters that have the
+`Pattern_White_Space` Unicode property.
 
 Rust is a "free-form" language, meaning that all forms of whitespace serve only
 to separate _tokens_ in the grammar, and have no semantic significance.

From b54afbee1f303391fbaa908a5610436405c3b741 Mon Sep 17 00:00:00 2001
From: Alex Burka
Date: Wed, 15 Jun 2016 16:42:36 -0400
Subject: [PATCH 2/2] include list of characters

---
 src/doc/reference.md | 14 +++++++++++++-
 1 file changed, 13 insertions(+), 1 deletion(-)

diff --git a/src/doc/reference.md b/src/doc/reference.md
index 6e794321f6444..59dbffd6e28e7 100644
--- a/src/doc/reference.md
+++ b/src/doc/reference.md
@@ -115,7 +115,19 @@ Non-doc comments are interpreted as a form of whitespace.
 ## Whitespace
 
 Whitespace is any non-empty string containing only characters that have the
-`Pattern_White_Space` Unicode property.
+`Pattern_White_Space` Unicode property, namely:
+
+- `U+0009` (horizontal tab, `'\t'`)
+- `U+000A` (line feed, `'\n'`)
+- `U+000B` (vertical tab)
+- `U+000C` (form feed)
+- `U+000D` (carriage return, `'\r'`)
+- `U+0020` (space, `' '`)
+- `U+0085` (next line)
+- `U+200E` (left-to-right mark)
+- `U+200F` (right-to-left mark)
+- `U+2028` (line separator)
+- `U+2029` (paragraph separator)
 
 Rust is a "free-form" language, meaning that all forms of whitespace serve only
 to separate _tokens_ in the grammar, and have no semantic significance.
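
Reviewer note: the definition in the new text can be sketched as a small predicate. This is only an illustration of the eleven code points listed in PATCH 2/2, not the compiler's lexer code; `is_pattern_white_space` and `is_whitespace_string` are hypothetical names invented for this sketch. It also shows that the set differs from the one `char::is_whitespace` tests, since that method is based on the `White_Space` property.

```rust
/// Hypothetical helper (not a standard-library API): true when `c` is one of
/// the eleven `Pattern_White_Space` code points listed in the patch.
fn is_pattern_white_space(c: char) -> bool {
    matches!(
        c,
        '\u{0009}'..='\u{000D}' // tab, line feed, vertical tab, form feed, carriage return
            | '\u{0020}' // space
            | '\u{0085}' // next line
            | '\u{200E}' // left-to-right mark
            | '\u{200F}' // right-to-left mark
            | '\u{2028}' // line separator
            | '\u{2029}' // paragraph separator
    )
}

/// Hypothetical helper: true when `s` is whitespace in the patch's sense,
/// i.e. non-empty and made up only of `Pattern_White_Space` characters.
fn is_whitespace_string(s: &str) -> bool {
    !s.is_empty() && s.chars().all(is_pattern_white_space)
}

fn main() {
    assert!(is_whitespace_string(" \t\r\n"));
    assert!(is_whitespace_string("\u{200E}\u{2028}"));
    assert!(!is_whitespace_string("")); // the definition requires a non-empty string
    assert!(!is_whitespace_string(" a "));

    // U+00A0 (no-break space) has `White_Space` but not `Pattern_White_Space`.
    assert!('\u{00A0}'.is_whitespace());
    assert!(!is_pattern_white_space('\u{00A0}'));

    // U+200E (left-to-right mark) is the reverse case.
    assert!(is_pattern_white_space('\u{200E}'));
    assert!(!'\u{200E}'.is_whitespace());
}
```

The last four assertions are the reason the explicit list in PATCH 2/2 seems worth keeping: a reader who assumes "whitespace" means `char::is_whitespace` would get both U+00A0 and U+200E wrong.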