aboutsummaryrefslogtreecommitdiff
path: root/changelog.txt
blob: a3b63af554cf73d108c5b6214665db5d627ca4e5 (plain)
  1. [0.24]
  2.   * New format for spec tests, new lua formatter for specs.
  3. The format for the spec examples has changed from
  4. .
  5. markdown
  6. .
  7. html
  8. .
  9. to
  10. ```````````````````````````````` example
  11. markdown
  12. .
  13. html
  14. ````````````````````````````````
  15. One advantage of this is that `spec.txt` becomes a valid
  16. CommonMark file.
  17. * Change `tests/spec_test.py` to use the new format.
  18. * Replace `tools/makespec.py` with a lua script, `tools/make_spec.lua`,
  19. which uses the `lcmark` rock (and indirectly libcmark). It can
  20. generate HTML, LaTeX, and CommonMark versions of the spec. Pandoc
  21. is no longer needed for the latex/PDF version. And, since the new
  22. program uses the cmark API and operates directly on the parse tree,
  23. we get much more reliable translations than we got with the old
  24. Python script (#387).
  25. * Remove whitelist of valid schemes. Now a scheme is any sequence
  26. of 2-32 characters, beginning with an ASCII letter, and containing
  27. only ASCII letters, digits, and the symbols `-`, `+`, `.`.
  28. Discussion at <http://talk.commonmark.org/t/555>.
  29. * Added an example: URI schemes must be more than one character.
  30. * Disallow spaces in link destinations, even inside pointy braces.
  31. Discussion at <http://talk.commonmark.org/t/779> and
  32. <http://talk.commonmark.org/t/1287/12>.
  33. * Modify setext heading spec to allow multiline headings.
  34. Text like
  35. Foo
  36. bar
  37. ---
  38. baz
  39. is now interpreted as heading + paragraph, rather than
  40. paragraph + thematic break + paragraph.
  41. * Call U+FFFD the REPLACEMENT CHARACTER, not the "unknown code
  42. point character."
  43. * Change misleading undefined entity name example.
  44. * Remove misleading claim about entity references in raw HTML
  45. (a regression in 0.23). Entity references are not treated
  46. as literal text in raw HTML; they are just passed through.
  47. * CommonMark.dtd: allow `item` in `custom_block`.
  48. [0.23]
  49. * Don't allow space between link text and link label in a
  50. reference link. This fixes the problem of inadvertent capture:
  51. [foo] [bar]
  52. [foo]: /u1
  53. [bar]: /u2
  54. * Rename "horizontal rule" -> "thematic break". This matches the HTML5
  55. meaning for the hr element, and recognizes that the element may be
  56. rendered in various ways (not always as a horizontal rule).
  57. See http://talk.commonmark.org/t/horizontal-rule-or-thematic-break/912/3
  58. * Rename "header" -> "heading". This avoids a confusion that might arise
  59. now that HTML5 has a "header" element, distinct from the "headings"
  60. h1, h2, ... Our headings correspond to HTML5 headings, not HTML5 headers.
  61. The terminology of 'headings' is more natural, too.
  62. * ATX headers: clarify that a space (or EOL) is needed; other whitespace
  63. won't do (#373). Added a test case.
  64. * Rewrote "Entities" section with more correct terminology (#375).
  65. Entity references and numeric character references.
  66. * Clarified that spec does not dictate URL encoding/normalization policy.
  67. * New test case: list item code block with empty line (Craig M.
  68. Brandenburg).
  69. * Added example with escaped backslash at end of link label (#325).
  70. * Shortened an example so it doesn't wrap (#371).
  71. * Fixed duplicate id "attribute".
  72. * Fix four link targets (Lucas Werkmeister).
  73. * Fix typo for link to "attributes" (Robin Stocker).
  74. * Fix "delimiter" spellings and links (Sam Rawlins).
  75. * Consistent usage of "we" instead of "I" (renzo).
  76. * CommonMark.dtd - Rename `html` -> `html_block`,
  77. `inline_html` -> `html_inline` for consistency. (Otherwise it is too
  78. hard to remember whether `html` is block or inline, a source of
  79. some bugs.)
  80. * CommonMark.dtd - added `xmlns` attribute to document.
  81. * CommonMark.dtd - added `custom_block`, `custom_inline`.
  82. * CommonMark.dtd - renamed `hrule` to `thematic_break`.
  83. * Fixed some HTML inline tests, which were actually HTML blocks, given
  84. the changes to block parsing rules since these examples were written
  85. (#382).
  86. * Normalize URLs when comparing test output. This way we don't fail
  87. tests for legitimate variations in URL escaping/normalization policies
  88. (#334).
  89. * `normalize.py`: don't use `HTMLParseError`, which has been removed
  90. as of python 3.5 (#380).
  91. * Minor spacing adjustments in test output, to match cmark's output,
  92. since we test cmark without normalization.
  93. * `makespec.py`: remove need for link anchors to be on one line.
  94. * `makespec.py`: Only do two levels in the TOC.
  95. * Use `display:inline-block` rather than floats for side-by-side.
  96. This works when printed too.
  97. * Added better print CSS.
  98. [0.22]
  99. * Don't list `title` twice as HTML block tag (Robin Stocker).
  100. * More direct example of type 7 HTML block starting with closing tag.
  101. * Clarified rule 7 for HTML blocks. `pre`, `script`, and `style`
  102. are excluded because they're covered by other rules.
  103. * Clarified that type 7 HTML blocks can start with a closing tag (#349).
  104. * Removed `pre` tag from rule 6 of HTML blocks (#355).
  105. It is already covered by rule 1, so this removes an ambiguity.
  106. * Added `iframe` to list of tags that always start HTML blocks (#352).
  107. * Added example of list item starting with two blanks (#332).
  108. * Added test case clarifying laziness in block quotes (see
  109. jgm/commonmark.js#60).
  110. * Add an example with mixed indentation code block in "Tabs" section
  111. (Robin Stocker). This makes sure that implementations skip columns instead
  112. of offsets for continued indented code blocks.
  113. * Clarified that in ATX headers, the closing `#`s must be unescaped,
  114. and removed misleading reference to "non-whitespace character" in
  115. an example (#356).
  116. * Changed anchor for "non-whitespace character" to reflect new name.
  117. * Removed ambiguities concerning lines and line endings (#357, Lasse
  118. R.H. Nielsen). The previous spec allowed, technically, that a line
  119. ending in `\r\n` might be considered to have two line endings,
  120. or that the `\r` might be considered part of the line and the
  121. `\n` the line ending. These fixes rule out those interpretations.
  122. * Clarify that a character is any code point.
  123. * Space in "code point".
  124. * Capitalize "Unicode".
  125. * Reflow paragraph to avoid unwanted list item (#360, #361).
  126. * Avoid extra space before section number in `spec.md`.
  127. * `makespec.py`: Use `check_output` for simpler `pipe_through_prog`.
  128. * In README, clarified build requirements for `spec.html`, `spec.pdf`.
  129. * Fixed some encoding issues in `makespec.py` (#353).
  130. * Fixed various problems with `spec.pdf` generation (#353).
  131. * Added version to coverpage in PDF version of spec.
  132. [0.21.1]
  133. * Added date.
  134. [0.21]
  135. * Changed handling of tabs. Instead of having a preprocessing step
  136. where tabs are converted to spaces, we now handle tabs directly in
  137. the parser. This allows tabs to be retained in code blocks and code
  138. spans. This change adds some general language to the effect that,
  139. for purposes of determining block structure, tabs are to be treated
  140. just like equivalent spaces.
  141. * Completely rewrote spec for HTML blocks. The new spec provides
  142. better handling of tags like `<del>`, which can be either block
  143. or inline level content, better handling of custom tags, and
  144. better handling of verbatim contexts like `<pre>`, comments,
  145. and `<script>`.
  146. * Added 9-digit limit on ordered list start number.
  147. Some browsers use signed 32-bit integers for indexing
  148. the items of an ordered list, and this can lead to odd
  149. or undefined behavior if 10-digit start numbers are allowed.
  150. * Allow (non-initial) hyphens in tag names (#239). Custom
  151. tags in HTML5 must contain hyphens.
  152. * Clarified that HTML block is closed at end of containing
  153. block, not just end of the document (as with fenced code blocks.)
  154. * Specify nested link definition behavior in prose (Benjamin
  155. Dumke-von der Ehe).
  156. * Added test for edge case in link reference parsing
  157. (Benjamin Dumke-von der Ehe, see jgm/commonmark.js#49).
  158. * Added link tests with fragment identifiers and queries (David
  159. Green, #342).
  160. * Added test cases with a literal backslash in a link destination
  161. (see jgm/commonmark.js#45).
  162. * Added test for entity `&ngE;` which resolves to two code points.
  163. Put entity tests on several lines for readability (see
  164. jgm/commonmark.js#47).
  165. * Fixed broken "pre" literal HTML example. Contents
  166. weren't escaped properly.
  167. * Simplified defn of "unicode whitespace character,"
  168. rectifying omission of line tabulation, U+000B (#343).
  169. * Removed ambiguity in definition of "line" (#346).
  170. * Rewrapped two prose lines so `+` does not begin a line (#347).
  171. * Added another test with overlapping emphasis markers.
  172. * Fixed link to 'attributes'.
  173. * Revised appendix, "A parsing strategy," and
  174. added a description of emphasis/link parsing algorithm.
  175. * `spec_tests.py` - set options for conversions, set library
  176. paths in a more cross-platform way.
  177. * `spec_tests.py`: force utf-8 on test failure output and
  178. `--dump-tests` (#344, #345).
  179. * `spec_tests.py`: Properly handle visible tab `→` in expected output.
  180. * `normalize.py`: Don't collapse whitespace inside pre tag.
  181. * Added `spec.html` to `.gitignore` (#339).
  182. * Add `-dev` suffix to spec version after release (eksperimental).
  183. * Rename "non-space" to "non-whitespace" (Konstantin Zudov, #337).
  184. [0.20]
  185. * Require at least one nonspace character in a link label (#322).
  186. * Require replacement (rather than omission) of U+0000 (#327).
  187. * Clarified that entities resolving to U+0000 are to be
  188. replaced by U+FFFD (#323).
  189. * Removed statement that what counts as a line ending is
  190. platform-dependent (#326). We now count `\r`, `\n`,
  191. or `\r\n` as a line ending regardless of the platform.
  192. (The line ending styles can even be mixed in a single document.)
  193. * Defined "space."
  194. * Revised "non-space character". Previously a non-space character
  195. was defined as anything but a space (U+0020). Now it is anything
  196. that is not a whitespace character (as defined in the spec).
  197. * Clarified that tab expansion is a preprocessing step (#330).
  198. * Clarified lazy block quote examples (#328).
  199. * Clarified precedence of indentation that meets conditions for
  200. both list item continuation blocks and indented code.
  201. * Added a test case with `#` directly followed by a letter
  202. (not an ATX header).
  203. * Added two test cases illustrating that a list at the
  204. outer level can have items that are indented by more
  205. than four spaces (see commonmark.js#42 and
  206. <http://talk.commonmark.org/t/odd-list-behaviour/1189>).
  207. * Fixed typo in emphasis examples.
  208. [0.19]
  209. * Fixed rules for `_`-based emphasis and strong emphasis (#317).
  210. Previously `_(bar)_.` was not parsed as containing emphasis
  211. because the closing delimiter is both left- and right- flanking.
  212. This fix allows such delimiters, provided they're followed
  213. by punctuation (i.e., they have punctuation on both sides).
  214. Similarly, mutatis mutandis, for opening delimiters and for `__`.
  215. * Clarified definitions of left-flanking and right-flanking (#310).
  216. The spec now says explicitly that the beginning and end of line count
  217. as whitespace for purposes of this definition.
  218. * Clarified that a code fence followed by header line isn't a header (#306).
  219. * Fixed alignment in flankingness examples (cosmetic).
  220. * Fixed last "right flanking but not left flanking" example (#316).
  221. * Fixed a broken link (Konstantin Zudov).
  222. * Added link to list of implementations on wiki.
  223. * Fixed mistake in examples of left/right flanking delimiters
  224. (Konstantin Zudov).
  225. * Spell out `iff` (if and only if) the first time it is used (#309).
  226. * Fixed typos (isoroku, #309).
  227. * Clarified wording for soft line break: newline can't be preceded
  228. by two spaces or a backslash.
  229. * Replaced some references to UTF-8 that should be to unicode.
  230. * Fixed dingus link in tools/template.html.
  231. * Replaced obsolete reference to `spec2md.pl` in spec (#304).
  232. [0.18]
  233. * Added a shortcut link test with mismatched brackets (#282).
  234. * Added cases with newline whitespace around emphasis open delimiter
  235. (#282).
  236. * Added list item examples with no space after marker (#282).
  237. * Added additional test showing backslash escapes don't work in
  238. autolinks (#282).
  239. * Added test for multiline title in reference definition (#282).
  240. * Added a reference link definition test case (#282).
  241. * Clarified that link titles can't contain blank lines (#271).
  242. * Revised Rule 3 for list items (#275). Previously this just applied to
  243. empty list items. It has been rewritten to apply to any list item
  244. starting with a blank line, including items like:
  245. -
  246. ```
  247. code
  248. ```
  249. * Added U+000B and U+000C as whitespace characters (#300).
  250. * Added comment on sourcepos attribute format in DTD (#298).
  251. * Use `--smart` option in producing HTML version of spec.
  252. * Clarified that delimiter runs at beginning/end of line behave as
  253. if preceded/followed by whitespace (#302).
  254. * Ensure that URLs in examples have slash after domain.
  255. This helps with #9, making these tests less sensitive to
  256. the normalizer used.
  257. * Fixed typo (Robin Stocker).
  258. [0.17]
  259. * Improved rule limiting intraword `_` for emphasis and strong emphasis.
  260. To prevent intra-word emphasis, we used to check to see if
  261. the delimiter was followed/preceded by an ASCII alphanumeric.
  262. We now do something more elegant: whereas an opening `*` must
  263. be left-flanking, an opening `_` must be left-flanking *and
  264. not right-flanking*. And so on for the other cases.
  265. All the original tests passed except some tests with Russian
  266. text with internal `_`, which formerly created emphasis but no
  267. longer do with the new rule. These tests have been adjusted.
  268. A few new test cases have been added to illustrate the rule.
  269. * Added example with link break inside pointy braces (no link) (#295).
  270. * Added spec example: loose list with blank line after fenced code (#285).
  271. [0.16]
  272. * Rewrote beginning of Entities section, clarifying that only
  273. entities not in code blocks or spans are decoded.
  274. * Removed defective Example 449 (#284).
  275. * Fixed typo (#283).
  276. * Added intended two-space hard-breaks in Examples 521, 523.
  277. * Clarified that brackets in general don't take precedence over emph
  278. (#258).
  279. * Clarified that final newline is removed from paragraph content
  280. (#176).
  281. * Talk of "info string" rather than "attributes" for code blocks
  282. (#262).
  283. * Clarified precedence of code spans, HTML tags, autolinks (#259).
  284. * Fixed a number of internal links and duplicate references in the spec.
  285. * Linkify "info string" in spec.
  286. * Use shortcut reference links when possible in spec.txt.
  287. * cmark itself is now used to build spec.html, rather than pandoc.
  288. * Use shortcut reference links when possible in spec.txt. This
  289. relies on the new `spec2md.py` behavior of creating references
  290. for all internal anchors.
  291. * Moved some examples from block to inline HTML section.
  292. * Added examples of non-comments (#264).
  293. * Changed rule for comments to conform to HTML5 spec.
  294. * Made clear that any sequence of characters is a valid document
  295. (#266).
  296. * Changed wording: "is preferred" -> "takes precedence."
  297. * Regularized spelling of "non-space character" and added links
  298. (#260).
  299. * Use four spaces rather than five to show "four spaces is too much"
  300. (#261).
  301. [0.15]
  302. * Fixed some typos with "left-" and "right-flanking" delimiters in the
  303. section on emphasis and strong emphasis (#257).
  304. [0.14]
  305. * Clarified indented code blocks. Previously the spec said, wrongly,
  306. that a blank line was needed between a paragraph and a following
  307. code block. It is only needed between a code block and a following
  308. paragraph (due to lazy continuations). (Thanks to textnut.)
  309. * Added definitions of whitespace, unicode whitespace, punctuation,
  310. ASCII punctuation (#108).
  311. * Improved rules for emphasis and strong emphasis. This improves
  312. parsing of emphasis around punctuation. For background see
  313. <http://talk.commonmark.org/t/903/6>. The basic idea of the change
  314. is that if the delimiter is part of a delimiter clump that has
  315. punctuation to the left and a normal character (non-space,
  316. non-punctuation) to the right, it can only be an opener. If it has
  317. punctuation to the right and a normal character (non-space,
  318. non-punctuation) to the left, it can only be a closer. This handles
  319. cases like
  320. **Gomphocarpus (*Gomphocarpus physocarpus*, syn. *Asclepias
  321. physocarpa*)**
  322. and
  323. **foo "*bar*" foo**
  324. better than before.
  325. * Added test case for link-in-link-in-image (#252).
  326. * Fixed broken internal references.
  327. * Added another example of an unclarity in the canonical Markdown
  328. syntax description.
  329. * Reworded the principle of uniformity to be more general; it applies
  330. to all container blocks, not just list items.
  331. * Added a rule for empty list items (#242).
  332. * Clarified precedence of empty list items over setext header lines
  333. (#95).
  334. * Added an example with two blank lines in fenced code in a sublist (#180).
  335. * Added an explicit CC-BY-SA license to the spec (#55).
  336. [0.13]
  337. * Updated path of test program.
  338. * Use terminology "plain textual content" instead of "string."
  339. * Added condition that conforming parsers strip or replace NULL characters.
  340. * Changed Example 196 to reflect the spec's rules. It should not be a loose
  341. list as it has no blank lines.
  342. * Adjusted semantically insignificant formatting of HTML output.
  343. * Added example to spec of shortcut link with following space (#214).