From 7817cd7edd94b80e3d851189ad1290661e109c7c Mon Sep 17 00:00:00 2001 From: John MacFarlane Date: Sat, 3 Jan 2015 15:10:29 -0800 Subject: Make clear that any sequence of characters is a valid document. Closes #266 in a slightly different way than @anko's PR. --- spec.txt | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) (limited to 'spec.txt') diff --git a/spec.txt b/spec.txt index deb73df..c9b147a 100644 --- a/spec.txt +++ b/spec.txt @@ -202,17 +202,17 @@ In the examples, the `→` character is used to represent tabs. ## Characters and lines -The input is a sequence of zero or more [lines](#line). - -A [line](@line) -is a sequence of zero or more [characters](#character) followed by a -[line ending](#line-ending) or by the end of file. +Any sequence of [characters](#character) is a valid CommonMark +document. A [character](@character) is a unicode code point. This spec does not specify an encoding; it thinks of lines as composed of characters rather than bytes. A conforming parser may be limited to a certain encoding. +A [line](@line) is a sequence of zero or more [characters](#character) +followed by a [line ending](#line-ending) or by the end of file. + A [line ending](@line-ending) is, depending on the platform, a newline (`U+000A`), carriage return (`U+000D`), or carriage return + newline. -- cgit v1.2.3