isSpace -package:wraxml
Returns True for any Unicode space character, and the control
characters \t, \n, \r, \f, \v.
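As a small illustration of this predicate, the sketch below assumes the entry refers to `Data.Char.isSpace` from base. The `trim` helper is a hypothetical name introduced here, not part of any listed package.

```haskell
import Data.Char (isSpace)
import Data.List (dropWhileEnd)

-- Hypothetical helper: strip leading and trailing white space.
trim :: String -> String
trim = dropWhileEnd isSpace . dropWhile isSpace

main :: IO ()
main = do
  -- The five control characters named above, plus the ASCII space.
  print (all isSpace "\t\n\r\f\v ")
  -- Letters and digits are not space characters.
  print (any isSpace "abc123")
  -- Trimming removes white space at both ends only.
  putStrLn (trim "\t hello world \r\n")
```

Because `dropWhileEnd` only looks at the end of the string, interior spaces such as the one in "hello world" are preserved.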
Fast predicate for matching ASCII space characters.
Note: This predicate only gives correct answers for the ASCII
encoding. For instance, it does not recognise U+00A0 (non-breaking
space) as a space character, even though it is a valid ISO-8859-15
byte. For a Unicode-aware and only slightly slower predicate, use
Data.Char.isSpace.
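The difference described in the note can be seen with a minimal sketch. `isSpaceAscii` is a hypothetical stand-in written for this example (it is not the library's definition), and the Unicode-aware predicate is assumed to be `Data.Char.isSpace`.

```haskell
import Data.Char (isSpace)

-- Hypothetical ASCII-only predicate: SPACE, or one of \t \n \v \f \r.
isSpaceAscii :: Char -> Bool
isSpaceAscii c = c == ' ' || ('\t' <= c && c <= '\r')

main :: IO ()
main = do
  let nbsp = '\x00A0'        -- U+00A0 NO-BREAK SPACE
  print (isSpaceAscii nbsp)  -- ASCII-only check
  print (isSpace nbsp)       -- Unicode-aware check from Data.Char
```

The two predicates agree on every ASCII character and differ only on code points above U+007F such as U+00A0.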
Fast Word8 predicate for matching ASCII space characters
isSpace w = w == 32 || w - 9 <= 4
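The one-line definition above relies on `Word8` arithmetic wrapping around modulo 256: for bytes below 9, `w - 9` wraps to a large value, so the single comparison `w - 9 <= 4` holds only for bytes 9 through 13. The sketch below checks this against an explicit range test over all 256 byte values. The names `isSpaceW8` and `isSpaceExplicit` are made up for the example.

```haskell
import Data.Word (Word8)

-- The definition quoted above, under a hypothetical name.
isSpaceW8 :: Word8 -> Bool
isSpaceW8 w = w == 32 || w - 9 <= 4

-- The same test with the range written out explicitly:
-- 32 is SPACE, 9..13 are \t \n \v \f \r.
isSpaceExplicit :: Word8 -> Bool
isSpaceExplicit w = w == 32 || (w >= 9 && w <= 13)

main :: IO ()
main = do
  -- Subtraction on Word8 wraps: 0 - 9 is 247, not -9.
  print (0 - 9 :: Word8)
  -- Compare the two predicates on every possible byte.
  print (all (\w -> isSpaceW8 w == isSpaceExplicit w) [minBound .. maxBound])
```

The wrapped form saves one comparison, which is presumably why the entry calls the predicate fast.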
Selects Unicode space characters (general category Space),
and the control characters \t, \n, \r, \f, \v.
Note: isSpace is not equivalent to isWhiteSpace.
isWhiteSpace selects the same characters as isSpace plus the following:
- U+0085 NEXT LINE (NEL)
- U+2028 LINE SEPARATOR
- U+2029 PARAGRAPH SEPARATOR
isSpace c == Data.Char.isSpace c
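The relationship between the two predicates can be sketched as follows. This is only an illustration built on `Data.Char.isSpace` and the three extra code points listed above. `isWhiteSpaceSketch` is a hypothetical name and is not the library's implementation of isWhiteSpace.

```haskell
import Data.Char (isSpace)

-- The three characters that isWhiteSpace adds, per the list above.
extraWhiteSpace :: [Char]
extraWhiteSpace =
  [ '\x0085'  -- NEXT LINE (NEL)
  , '\x2028'  -- LINE SEPARATOR
  , '\x2029'  -- PARAGRAPH SEPARATOR
  ]

-- Hypothetical approximation: everything isSpace selects, plus the extras.
isWhiteSpaceSketch :: Char -> Bool
isWhiteSpaceSketch c = isSpace c || c `elem` extraWhiteSpace

main :: IO ()
main = do
  -- None of the three extras is selected by Data.Char.isSpace.
  print (map isSpace extraWhiteSpace)
  -- All of them are selected by the sketch.
  print (map isWhiteSpaceSketch extraWhiteSpace)
```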
Selects white-space characters in the Latin-1 range
Selects words corresponding to white-space characters in the Latin-1 range
Fast Word8 predicate for matching ASCII space characters.
Is this white-space? Only works for the Latin1 subset, otherwise
returns sFalse.
Test if a character is the SPACE or the TAB character.
Test if a byte is the encoding of the SPACE or the TAB character.
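The two entries above describe the same test at the character level and at the byte level. A minimal sketch, using the hypothetical names `isBlankChar` and `isBlankByte` and assuming an ASCII-compatible encoding in which SPACE is byte 0x20 and TAB is byte 0x09:

```haskell
import Data.Char (ord)
import Data.Word (Word8)

-- Hypothetical character-level test: SPACE or TAB only.
isBlankChar :: Char -> Bool
isBlankChar c = c == ' ' || c == '\t'

-- Hypothetical byte-level test for the encodings of SPACE and TAB.
isBlankByte :: Word8 -> Bool
isBlankByte w = w == 0x20 || w == 0x09

main :: IO ()
main = do
  -- Newline is white space in the broader sense, but not SPACE or TAB.
  print (map isBlankChar " \t\n")
  -- The byte version agrees with the character version on ASCII input.
  print (map (isBlankByte . fromIntegral . ord) " \t\n")
```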
Determines whether a character is a space, tab, or line separator
(newline, carriage return, etc.). Given some UTF-8 text, obtain a
character value with utf8GetChar.
(Note: don't use this to do word breaking; you have to use Pango or
equivalent to get word breaking right, the algorithm is fairly
complex.)