FIXME: This chapter is unfinished --
currently contains 3-4 large chunks (separated by horisontal lines)
that need to be merged or maybe some parts dropped altogether...
This chapter will provide an analysis
of the Markdown processor Pandoc.
Many dialects of Markdown have evolved,
some tightening the language for parsing efficiency and disambiguation,
some extending to cover additional structures,
and some including support for a metadata header section.
Pandoc is a tool that can convert texts in Markdown dialects
into many document formats including HTML and (via LaTeX) PDF,
applying visual style and positioning throught templates.
Such document workflows,
including minimal structural markup as part of the creative writing,
applying visual layout as an automated templating process
tied to a target document format,
have been further streamlined for academic texts
in the Quarto document publishing system.
reads a text document,
parses its structural components into an AST,
and serialises and writes back into a text document.
The Pandoc AST deliberately prioritises structural information
and is relaxed about visual information,
to preserve literal content
while reducing format-specific stylistic details,
relevant especially when processing between different formats.
The most common use
is to read plaintext Markdown files and write LaTeX code,
which is then compiled into a PDF file.
The Markdown processor Pandoc can transform Markdown not only to HTML
but also to other output formats like PDF.
Pandoc offers an API for adapting its content processing
as well as a templating structure for customizing layout,
which is streamlined in the document authoring framework Quarto:
Pandoc with a set of plugins and templates
enables rendering of scholarly papers
conforming to prescribed style guides and document formats.
Pandoc is a document converter built around the Markdown markup language,
able to parse from and serialise to many Markdown dialects
as well as equivalent subsets of other text markup languages
including HTML, LaTeX (and by extension PDF),
Office Open XML (as used by recent releases of Microsoft Word),
and OpenDocument (as used by OpenOffice and LibreOffice).
Pandoc supports redefining input and output formats
and manipulating the internal document structure
as part of the automated parts of the framework.
Pandoc is extensible.
Source and output format can be changed or completely redefined
and the internal document structure manipulated,
in the automated parts of the framework.
Collection of interrelated POSIX scripts and Pandoc extensions
for enabling semantic annotations in Markdown-based authoring workflows.