From 333c7713fda24fe3444a59b664e3ca55fd45b32c Mon Sep 17 00:00:00 2001 From: John MacFarlane Date: Thu, 16 Jul 2015 10:44:42 -0700 Subject: Clarify character = any code point. --- spec.txt | 8 ++++++-- 1 file changed, 6 insertions(+), 2 deletions(-) diff --git a/spec.txt b/spec.txt index 2f7dbc9..753406a 100644 --- a/spec.txt +++ b/spec.txt @@ -204,9 +204,13 @@ In the examples, the `→` character is used to represent tabs. Any sequence of [character]s is a valid CommonMark document. -A [character](@character) is a Unicode code point. +A [character](@character) is a Unicode code point. Although some +code points (for example, combining accents) do not correspond to +characters in an intuitive sense, all code points count as characters +for purposes of this spec. + This spec does not specify an encoding; it thinks of lines as composed -of characters rather than bytes. A conforming parser may be limited +of [character]s rather than bytes. A conforming parser may be limited to a certain encoding. A [line](@line) is a sequence of zero or more [character]s -- cgit v1.2.3