I notice the spec does not address many whitespace characters, including a ‘NO-BREAK SPACE’ U+00A0 ( a long term bane of mine) and many of the following (copied from Java Character documentation)
It is ‘\t’, U+0009 HORIZONTAL TABULATION.
It is ‘\n’, U+000A LINE FEED.
It is ‘\u000B’, U+000B VERTICAL TABULATION.
It is ‘\f’, U+000C FORM FEED.
It is ‘\r’, U+000D CARRIAGE RETURN.
It is ‘\u001C’, U+001C FILE SEPARATOR.
It is ‘\u001D’, U+001D GROUP SEPARATOR.
It is ‘\u001E’, U+001E RECORD SEPARATOR.
It is ‘\u001F’, U+001F UNIT SEPARATOR.