pandoc - Conversion between markup formats

Age	Commit message	Author	Files	Lines
2019-06-04	Include trailing {}s in raw latex commands.	John MacFarlane	1	-2/+7
	Change is in rawLaTeXInline in LaTeX reader, but it affects the markdown reader and other readers that allow raw LaTeX. Previously, trailing `{}` would be included for unknown commands, but not for known commands. However, they are sometimes used to avoid a trailing space after the command. The chances that a `{}` after a LaTeX command is not part of the command are very small. Closes #5439.
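	The rule described above can be sketched in Python. This is an illustration only, not pandoc's Haskell parser: the function name `raw_latex_command` and the simplified pattern (letters-only command names, no arguments) are assumptions made for the example.

```python
import re
from typing import Optional

# Simplified, assumed pattern: a backslash, a letters-only command name,
# then any number of empty `{}` groups. Real LaTeX commands can also take
# arguments, which this sketch ignores.
_COMMAND_WITH_BRACES = re.compile(r"\\[A-Za-z]+(?:\{\})*")


def raw_latex_command(text: str, pos: int = 0) -> Optional[str]:
    """Return the command starting at `pos`, including trailing `{}`, or None."""
    match = _COMMAND_WITH_BRACES.match(text, pos)
    return match.group(0) if match else None
```

	With this pattern, the input `\LaTeX{} is` yields `\LaTeX{}`, so the braces stay attached to the command whether or not the command is a known one.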
2019-06-04	Docx reader: Add support for w:rtl (ltr annotation).	John MacFarlane	2	-4/+19
	Closes #5545.
2019-06-04	Markdown reader: don't create implicit reference for empty header.	John MacFarlane	1	-4/+7
	Closes #5549.
2019-05-29	HTML reader: misc. epub related fixes.	John MacFarlane	1	-30/+41
	- With epub extensions, check for epub:type in addition to type.
	- Fix problem with noteref parsing which caused block-level content to be eaten with the noteref.
	- Rename pAnyTag to pAny.
	- Refactor note resolution.
2019-05-27	consolidate simple-table detection (#5524)	Mauro Bieg	1	-7/+2
	Add `onlySimpleTableCells` to `Text.Pandoc.Shared` [API change]. This fixes an inconsistency in the HTML reader, which did not treat tables with `<p>` inside cells as simple.
2019-05-25	Muse reader: allow images inside link descriptions	Alexander Krotov	1	-5/+4

2019-05-25	HTML reader: trim definition list terms	Alexander Krotov	1	-1/+1

2019-05-13	Org reader: fix planning elements in headers level 3 and higher	Albert Krewinkel	1	-1/+1
	Planning info is now always placed before the subtree contents. Previously, the planning info was placed after the content if the header's subtree was converted to a list, which happens with headers of level 3 and higher per default. Fixes: #5494
2019-05-13	Org reader: omit, but warn about unknown export options	Albert Krewinkel	2	-4/+14
	Unknown export options are properly ignored and omitted from the output.
2019-05-11	FB2 reader: parse notes	Alexander Krotov	1	-3/+51
	Closes #5493
2019-05-11	FB2 reader: use XML.Light.Input.parseXMLDoc to parse the root element	Alexander Krotov	1	-12/+11

2019-05-11	Reduce the amount of state in FB2 reader	Alexander Krotov	1	-1/+3

2019-05-11	FB2 reader: use Text.XML.Light.unqual where possible	Alexander Krotov	1	-8/+8

2019-05-05	Org reader: prefer plain symbols over math symbols	Albert Krewinkel	1	-1/+1
	Symbols like `\alpha` are output plain and unemphasized, not as math. Fixes: #5483
2019-05-05	Org reader: recognize emphasis after TODO/DONE keyword	Albert Krewinkel	1	-1/+3
	Fixes: #5484
2019-05-03	LaTeX reader: Allow newlines in `\mintinline`.	John MacFarlane	1	-3/+7

2019-05-01	MediaWiki reader: handle multiple attributes in table row (#5471)	chinapedia	1	-2/+2

2019-04-10	LaTeX reader: add braces when resolving `\DeclareMathOperator`.	John MacFarlane	1	-1/+2
	These seem to be needed for xelatex but not pdflatex. Closes #5441.
2019-04-05	Vimwiki reader: improve handling of internal links.	John MacFarlane	1	-5/+12
	1) Don't append `.html`
	2) Add `wikilink` title
	This mirrors behavior of other wiki readers. Generally the `.html` extension is not wanted. It may be important for output to HTML in certain circumstances, but it can always be added using a filter that matches on links with title `wikilink`.
	Note that if you have a workflow that uses pandoc to convert vimwiki to readable HTML pages, you may need to add such a filter to reproduce current behavior. Here is a filter that does the job:
```lua
function Link(el)
  if el.title == 'wikilink' then
    el.target = el.target .. ".html"
  end
  return el
end
```
	Save this as `fixlinks.lua` and use with `--lua-filter fixlinks.lua`. Closes #5414.
2019-04-01	Dokuwiki Reader fix: parse single curly brace (#5417)	Mauro Bieg	1	-1/+1
	fixes #5416
2019-03-30	ipynb reader/writer: use format 'ipynb' for raw cell where no format given.	John MacFarlane	1	-2/+3
	According to nbformat docs, this is supposed to render in every format. We don't do that, but we at least preserve it as a raw block in markdown, so you can round-trip.
2019-03-28	Markdown reader: fenced div takes priority over setext header.	John MacFarlane	1	-2/+2
	For
	    ::: {.cell}
	    ---
	    :::
2019-03-28	Ipynb reader: use `html` for a raw cell with no format.	John MacFarlane	1	-1/+1
	The nbformat spec says that when no format is specified, the raw cell will be rendered in every markup format. Pandoc doesn't have a construct that works this way, so we just fall back to `html`.
2019-03-27	ipynb reader: avoid introducing spurious `.0` on integers in metadata.	John MacFarlane	1	-1/+4

2019-03-27	Drop support for ghc < 8.	John MacFarlane	1	-3/+0

2019-03-25	HTML reader: read `data-foo` attribute into `foo`.	John MacFarlane	1	-1/+2
	The HTML writer adds the `data-` prefix for HTML5 for nonstandard attributes. But the attributes are represented in the AST without the `data-` prefix, so we should strip this when reading HTML. Closes #5392.
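	As a rough Python sketch of the stripping step (pandoc itself is written in Haskell; the helper name `strip_data_prefix` and the list-of-pairs attribute representation are assumptions for illustration):

```python
from typing import List, Tuple

DATA_PREFIX = "data-"


def strip_data_prefix(attrs: List[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Drop a leading `data-` from each attribute name, leaving values as is."""
    result = []
    for name, value in attrs:
        # Only strip when something remains after the prefix.
        if name.startswith(DATA_PREFIX) and len(name) > len(DATA_PREFIX):
            name = name[len(DATA_PREFIX):]
        result.append((name, value))
    return result
```

	Under these assumptions, an element read with `data-foo="1"` ends up with the attribute `foo` in the AST, which matches what the HTML5 writer would turn back into `data-foo`.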
2019-03-14	Markdown writer: be sure implicit figures work in list contexts.	John MacFarlane	1	-11/+13
	Previously they would sometimes not work: e.g., when they occurred in final paragraphs in lists that were originally parsed as Plain and converted later using PlainToPara. Closes #5368.
2019-03-10	LaTeX reader: support `\underline`, `\ul`, `\uline` (#5359)	Paul Tilley	1	-0/+5
	These are parsed as a Span with class `underline`, as with other readers.
2019-03-10	ipynb reader: removed vestigial ReaderOptions param.	John MacFarlane	1	-18/+16

2019-03-09	ipynb reader: remove sensitivity to `raw_html`, `raw_tex` extensions.	John MacFarlane	1	-6/+2
	We now include every output format. Pruning is handled by `--ipynb-output=`.
2019-03-09	Ipynb reader/writer: better handling of cell metadata.	John MacFarlane	1	-7/+10
	We now handle even complex cell metadata in the Div's attributes. Simple metadata fields are rendered as a plain string, and complex ones as JSON.
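	The split between simple and complex fields might look like the following Python sketch. It is not the reader's actual code: the name `metadata_to_attributes` is hypothetical, and treating only string values as "simple" is an assumption made here.

```python
import json
from typing import Any, Dict, List, Tuple


def metadata_to_attributes(meta: Dict[str, Any]) -> List[Tuple[str, str]]:
    """Turn cell metadata into key/value string pairs for a Div's attributes."""
    attributes = []
    for key, value in meta.items():
        if isinstance(value, str):
            # Assumed "simple" case: keep the string unchanged.
            attributes.append((key, value))
        else:
            # Assumed "complex" case: serialize lists, objects, etc. as JSON.
            attributes.append((key, json.dumps(value)))
    return attributes
```

	Serializing the complex values as JSON keeps them recoverable, so a writer can parse them back when producing a notebook again.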
2019-03-07	Add inNote to Footcite and Footcites	John MacFarlane	1	-2/+2

2019-03-02	JATS reader: Support fig-group block element (#5317).	John MacFarlane	1	-1/+4

2019-03-01	Remove license boilerplate.	John MacFarlane	51	-940/+0
	The haddock module header contains essentially the same information, so the boilerplate is redundant and just one more thing to get out of sync.
2019-02-28	Markdown Reader: yamlToMeta respects extensions (#5276)	Mauro Bieg	1	-3/+2
	Add ReaderOptions parameter to yamlToMeta [API change]. fixes #5272
2019-02-23	JATS reader: fix parsing of figures.	John MacFarlane	1	-18/+27
	This ensures that a figure containing a single image is parsed as a pandoc "implicit figure" (i.e., a Para with a single Image whose title attribute begins with `fig:`). More complex figures will still be parsed as divs. Closes #5321.
2019-02-21	Docx reader: Start adding comment to combine module	Jesse Rosenthal	1	-0/+40
	This module is one of the most opaque parts of the docx reader: it deals with the fact that runs have non-nesting formatting, so we have to figure out the nesting on the fly as we combine them. We are starting to add comments so that new developers can understand and, if necessary, modify this module. Specific function comments will be added in the future, but this offers a global description of the purpose of the module.
2019-02-18	Docx reader: Trim space inside the last inline.	Jesse Rosenthal	1	-1/+2
	We have to add one final mempty when we're combining in order to trim inlines appropriately. (We need to use our own trimming routines here due to the way that formatted inlines are smushed together when converting from docx.) Closes #5273
2019-02-18	hlint Muse	Alexander Krotov	1	-1/+1

2019-02-18	Muse reader: add secondary note support	Alexander Krotov	1	-5/+11

2019-02-15	Markdown reader: fix bug parsing fenced code blocks.	John MacFarlane	1	-2/+3
	Previously parsing would break if the code block contained a string of backticks of sufficient length followed by something other than end of line. Closes #5304.
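	The distinction at issue can be illustrated with a small Python check. This is a simplified sketch of the usual closing-fence rule, not pandoc's parser: the name `is_closing_fence` and the exact pattern (up to three leading spaces, backticks only, optional trailing blanks) are assumptions.

```python
import re

# Assumed shape of a fence line: optional indentation, a run of at least
# three backticks, then nothing but spaces or tabs up to the end of line.
_FENCE_LINE = re.compile(r" {0,3}(`{3,})[ \t]*")


def is_closing_fence(line: str, opening_length: int) -> bool:
    """True if `line` closes a block opened with `opening_length` backticks."""
    match = _FENCE_LINE.fullmatch(line.rstrip("\r\n"))
    return match is not None and len(match.group(1)) >= opening_length
```

	A line such as four backticks followed by other text is long enough to close a three-backtick block but is not followed by end of line, so this check treats it as ordinary code content.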
2019-02-15	JATS reader: handle citations with multiple references.	John MacFarlane	1	-7/+10
	The rid attribute can have a space-separated list of ids. Closes #5310.
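	In Python terms, handling such an attribute amounts to splitting on whitespace. The helper name `citation_ids` below is hypothetical and stands in for whatever the JATS reader does internally.

```python
from typing import List


def citation_ids(rid: str) -> List[str]:
    """Split a `rid` attribute value into its individual reference ids."""
    # str.split() with no argument splits on any run of whitespace and
    # drops empty strings, so extra spaces are harmless.
    return rid.split()
```

	Each resulting id can then be treated as one reference within the same citation.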
2019-02-12	Docx reader: unwrap sdt elements in footnotes and comments.	Jesse Rosenthal	1	-3/+3
	We had previously walked the document to unwrap sdt/sdtContent and smartTag tags in `word/document.xml`, but not in the `word/{foot/end}note.xml` and `word/comments.xml`. Closes #5302
2019-02-11	Remove redundant import.	John MacFarlane	1	-1/+0

2019-02-10	ipynb writer: keep plain text fallbacks in output...	John MacFarlane	1	-26/+14
	even if a richer format is included. We don't know what output format will be needed. The fallback can always be weeded out using a filter. Closes #5293.
2019-02-08	Make --metadata-file use pandoc-markdown (#5279)	Mauro Bieg	1	-1/+2
	see #5272
2019-02-08	Docx reader: fix paths in archive to prevent Windows failure	Jesse Rosenthal	1	-1/+6
	Some paths in archives are absolute (have an opening slash) which, for reasons unknown, produces a failure in the test suite on MS Windows. This fixes that by removing the leading slash if it exists. Closes #5277 (previously closed with 4cce0ef but reopened due to this bug).
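	A minimal Python sketch of the normalization described above (the function name `normalize_archive_path` is an assumption; the actual fix lives in the Haskell docx reader):

```python
def normalize_archive_path(path: str) -> str:
    """Remove a single leading slash so the path is relative to the archive root."""
    return path[1:] if path.startswith("/") else path
```

	For example, `/word/document.xml` becomes `word/document.xml`, while paths that are already relative pass through unchanged.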
2019-02-07	Revert "Docx reader: Fix windows error"	Jesse Rosenthal	1	-2/+1
	This reverts commit 2142bbe572cea00b7bb5ad3e10a3afb26845a1f7.
2019-02-07	Docx reader: Fix windows error	Jesse Rosenthal	1	-1/+2
	Try fixing a parsing error on Windows by insisting that the parser use a Posix filepath library for splitting doc paths in a zipfile. (It might default on Windows to using a backslash as a separator, while it's always a forward-slash in zip archives.)
2019-02-07	Docx reader: Some code cleanup	Jesse Rosenthal	1	-15/+25
	* Clarify function name. We had previously used `getDocumentPath`, but `Document` is an overdetermined term here. Use `getDocumentXmlPath` to make clear what we're doing.
	* Use field notation for setting ReaderEnv. As we've added (and continue to add) fields, the assignment by position has gotten harder to read.
	* Figure out document.xml path once at the beginning of parsing, and add it to the environment, so we can avoid repeated lookups.