diff options
author | John MacFarlane <jgm@berkeley.edu> | 2015-06-04 10:56:57 +0200 |
---|---|---|
committer | John MacFarlane <jgm@berkeley.edu> | 2015-06-04 10:56:57 +0200 |
commit | e0e8d35b1af33733c41465623ae4d497852c8a93 (patch) | |
tree | 44de1eace7560d8aab909a4e5944489170101c2b | |
parent | 02ca8128dea803ac03455e793ee4f33aad89ab58 (diff) |
Clarify that entities resolving to U+0000 are replaced by U+FFFD.
Closes #323.
-rw-r--r-- | spec.txt | 9 |
1 files changed, 5 insertions, 4 deletions
@@ -4285,13 +4285,14 @@ corresponding codepoints. [Decimal entities](@decimal-entities) consist of `&#` + a string of 1--8 arabic digits + `;`. Again, these entities need to be recognised and transformed into their corresponding -unicode codepoints. Invalid unicode codepoints will be written as the -"unknown codepoint" character (`0xFFFD`) +unicode codepoints. Invalid unicode codepoints will be replaced by +the "unknown codepoint" character (`U+FFFD`). For security reasons, +the codepoint `U+0000` will also be replaced by `U+FFFD`. . -# Ӓ Ϡ � +# Ӓ Ϡ � � . -<p># Ӓ Ϡ �</p> +<p># Ӓ Ϡ � �</p> . [Hexadecimal entities](@hexadecimal-entities) |