Clarify character = any code point.

author: John MacFarlane <jgm@berkeley.edu> 2015-07-16 10:44:42 -0700
committer: John MacFarlane <jgm@berkeley.edu> 2015-07-16 10:44:42 -0700
commit: 333c7713fda24fe3444a59b664e3ca55fd45b32c (patch)
tree: e11c9918f9434827f917059b0032d62538f8cbcb
parent: 5971fa48b4a1c9f96011030e6e6b7550ce95e170 (diff)
1 files changed, 6 insertions, 2 deletions
diff --git a/spec.txt b/spec.txt
index 2f7dbc9..753406a 100644
--- a/spec.txt
+++ b/spec.txt
@@ -204,9 +204,13 @@ In the examples, the `→` character is used to represent tabs.
 Any sequence of [character]s is a valid CommonMark
 document.
 
-A [character](@character) is a Unicode code point.
+A [character](@character) is a Unicode code point.  Although some
+code points (for example, combining accents) do not correspond to
+characters in an intuitive sense, all code points count as characters
+for purposes of this spec.
+
 This spec does not specify an encoding; it thinks of lines as composed
-of characters rather than bytes.  A conforming parser may be limited
+of [character]s rather than bytes.  A conforming parser may be limited
 to a certain encoding.
 
 A [line](@line) is a sequence of zero or more [character]s
author	John MacFarlane <jgm@berkeley.edu>	2015-07-16 10:44:42 -0700
committer	John MacFarlane <jgm@berkeley.edu>	2015-07-16 10:44:42 -0700
commit	333c7713fda24fe3444a59b664e3ca55fd45b32c (patch)
tree	e11c9918f9434827f917059b0032d62538f8cbcb
parent	5971fa48b4a1c9f96011030e6e6b7550ce95e170 (diff)