aboutsummaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorJohn MacFarlane <jgm@berkeley.edu>2015-07-16 10:44:42 -0700
committerJohn MacFarlane <jgm@berkeley.edu>2015-07-16 10:44:42 -0700
commit333c7713fda24fe3444a59b664e3ca55fd45b32c (patch)
treee11c9918f9434827f917059b0032d62538f8cbcb
parent5971fa48b4a1c9f96011030e6e6b7550ce95e170 (diff)
Clarify character = any code point.
-rw-r--r--spec.txt8
1 files changed, 6 insertions, 2 deletions
diff --git a/spec.txt b/spec.txt
index 2f7dbc9..753406a 100644
--- a/spec.txt
+++ b/spec.txt
@@ -204,9 +204,13 @@ In the examples, the `→` character is used to represent tabs.
Any sequence of [character]s is a valid CommonMark
document.
-A [character](@character) is a Unicode code point.
+A [character](@character) is a Unicode code point. Although some
+code points (for example, combining accents) do not correspond to
+characters in an intuitive sense, all code points count as characters
+for purposes of this spec.
+
This spec does not specify an encoding; it thinks of lines as composed
-of characters rather than bytes. A conforming parser may be limited
+of [character]s rather than bytes. A conforming parser may be limited
to a certain encoding.
A [line](@line) is a sequence of zero or more [character]s