diff options
author | John MacFarlane <jgm@berkeley.edu> | 2015-07-16 10:44:42 -0700 |
---|---|---|
committer | John MacFarlane <jgm@berkeley.edu> | 2015-07-16 10:44:42 -0700 |
commit | 333c7713fda24fe3444a59b664e3ca55fd45b32c (patch) | |
tree | e11c9918f9434827f917059b0032d62538f8cbcb | |
parent | 5971fa48b4a1c9f96011030e6e6b7550ce95e170 (diff) |
Clarify character = any code point.
-rw-r--r-- | spec.txt | 8 |
1 files changed, 6 insertions, 2 deletions
@@ -204,9 +204,13 @@ In the examples, the `→` character is used to represent tabs. Any sequence of [character]s is a valid CommonMark document. -A [character](@character) is a Unicode code point. +A [character](@character) is a Unicode code point. Although some +code points (for example, combining accents) do not correspond to +characters in an intuitive sense, all code points count as characters +for purposes of this spec. + This spec does not specify an encoding; it thinks of lines as composed -of characters rather than bytes. A conforming parser may be limited +of [character]s rather than bytes. A conforming parser may be limited to a certain encoding. A [line](@line) is a sequence of zero or more [character]s |