pandoc - Conversion between markup formats

Age	Commit message (Collapse)	Author	Files	Lines
2021-10-20	Markdown reader: don't parse links or bracketed spans as citations.	John MacFarlane	1	-2/+4
	Previously pandoc would parse [link to (@a)](url) as a citation; similarly [(@a)]{#ident} This is undesirable. One should be able to use example references in citations, and even if `@a` is not defined as an example reference, `[@a](url)` should be a link containing an author-in-text citation rather than a normal citation followed by literal `(url)`. Closes #7632.
2021-10-18	Docx reader: fix handling of empty fields	Milan Bracke	1	-0/+4
	Some fields only have an instrText and no content, Pandoc didn't understand these, causing other fields to be misunderstood because it seemed like a field was still open when it wasn't.
2021-10-18	Docx parser: implement PAGEREF fields	Milan Bracke	2	-0/+26
	These fields, often used in tables of contents, can be a hyperlink.
2021-10-18	Docx reader: fix handling of nested fields	Milan Bracke	2	-115/+150
	Fields delimited by fldChar elements can contain other fields. Before, the nested fields would be ignored, except for the end, which would be considered the end of the parent field. To fix this issue, fields needed to be considered containing ParParts instead of Runs, since a Run can't represent complex enough structures. This also impacted Hyperlinks since they can originate from a field.
2021-10-14	DocBook reader: honor linenumbering attribute	Samuel Tardieu	1	-0/+1
	The attribute DocBook linenumbering="numbered" attribute on code blocks maps to "numberLines" internally.
2021-10-13	Fix markdown parsing bug for math in bracketed spans and links.	John MacFarlane	1	-0/+1
	This affects math with unbalanced brackets (e.g. `$(0,1]$`) inside links, images, bracketed spans. Closes #7623.
2021-10-11	LaTeX reader: Implement siunitx v3 commands.	John MacFarlane	1	-1/+5
	We support `\unit`, `\qty`, `\qtyrange`, and `\qtylist` as synonynms of `\si`, `\SI`, `\SIrange`, and `\SIlist`. Closes #7614.
2021-10-10	Avoid blockquote when parent style has more indent	Milan Bracke	3	-53/+66
	When a paragraph has an indentation different from the parent (named) style, it used to be considered a blockquote. But this only makes sense when the paragraph has more indentation. So this commit adds a check for the indentation of the parent style.
2021-10-10	LaTeX reader: Properly handle `\^` followed by group closing.	John MacFarlane	1	-3/+3
	Closes #7615.
2021-09-30	Docx reader: Add placeholder for word diagram	Ezwal	2	-0/+17

2021-09-23	HTML reader: handle empty tbody element in table.	John MacFarlane	1	-5/+8
	Closes #7589.
2021-09-19	LaTeX reader: Recognize that `\vadjust` sometimes takes "pre".	John MacFarlane	1	-0/+7
	Closes #7531.
2021-09-19	Ignore (and gobble parameters of) CSLReferences environment.	John MacFarlane	1	-0/+1
	Otherwise we get the parameters as numbers in the output. Closes #7531.
2021-09-17	Fix linter warning.	John MacFarlane	1	-4/+3

2021-09-16	Fix code blocks using `--preserve-tabs`.	John MacFarlane	1	-1/+7
	Previously they did not behave as the equivalent input with spaces would. Closes #7573.
2021-09-13	RST reader: handle escaped colons in reference definitions.	John MacFarlane	1	-1/+2
	Cloess #7568.
2021-09-10	feat(ipynb reader): get cell output mime from raw_mimetype too	Kolen Cheung	1	-1/+2
	While the spec defined format, in practice raw_mimetype is used. See jupyter/nbformat#229
2021-09-10	feat(ipynb reader): add more Jupyter's "Raw NBConvert Format"	Kolen Cheung	1	-6/+10
	This adds most of the available formats selectable from Jupyter's interface "Raw NBConvert Format".
2021-09-10	fix!: rst mime type	Kolen Cheung	1	-1/+1
	BREAKING CHANGE: fix rst mime type according to https://docutils.sourceforge.io/FAQ.html
2021-09-10	Remove redundant import.	John MacFarlane	1	-1/+1

2021-09-10	Org reader: don't parse a list as first item in a list item.	John MacFarlane	1	-1/+4
	Closes #7557.
2021-09-10	Ipynb reader handleData: support text/markdown (#7561)	Kolen Cheung	1	-0/+3
	`text/markdown` is now a supported mime type for raw output.
2021-09-08	RTF reader: support `\binN` for binary image data.	John MacFarlane	1	-11/+22

2021-09-04	RTF reader: better handling of `\*` and bookmarks.	John MacFarlane	1	-8/+8
	We now ensure that groups starting with `\*` never cause text to be added to the document. In addition, bookmarks now create a span between the start and end of the bookmark, rather than an empty span.
2021-09-04	Minor renaming to avoid shadowing.	John MacFarlane	1	-2/+2

2021-09-03	RTF reader: if doc begins with {\rtf1 ... } only parse its contents.	John MacFarlane	1	-1/+7
	Some documents seem to have non-RTF (e.g. XML) material after the `{\rtf1 ... }` group.
2021-09-03	RTF reader: Ignore `\pgdsc` group.	John MacFarlane	1	-0/+1
	Otherwise we get style names treated as test.
2021-08-23	Markdown reader: fix interaction of --strip-comments and list	John MacFarlane	1	-1/+1
	parsing. Use of `--strip-comments` was causing tight lists to be rendered as loose (as if the comment were a blank line). Closes #7521.
2021-08-21	LaTeX-parser: restrict \endinput to current file	Simon Schuster	2	-1/+9

2021-08-20	RST reader: Fix `:literal:` includes.	John MacFarlane	1	-5/+2
	These should create code blocks, not insert raw RST. Closes #7513.
2021-08-19	Improve docx reader's robustness in extracting images.	John MacFarlane	1	-5/+6
	The docx reader made a couple assumptions about how docx containers were laid out that were not always true, with the result that some images in documents did not get found/extracted. Closes #7511.
2021-08-16	Fix bug in last commit due to removal of take1WhileP.	John MacFarlane	1	-2/+2

2021-08-15	Multimarkdown sub- and superscripts (#5512) (#7188)	OCzarnecki	1	-8/+16
	Added an extension `short_subsuperscripts` which modifies the behavior of `subscript` and `superscript`, allowing subscripts or superscripts containing only alphanumerics to end with a space character (eg. `x^2 = 4` or `H~2 is combustible`). This improves support for multimarkdown. Closes #5512. Add `Ext_short_subsuperscripts` constructor to `Extension` [API change]. This is enabled by default for `markdown_mmd`.
2021-08-13	LaTeX reader: proper implicit grouping around environment macros.	John MacFarlane	1	-1/+2

2021-08-12	Use Prelude from base-compat for ghc 8.4 too.	John MacFarlane	1	-5/+1
	We were having trouble building on ghc 8.4 because of the lack of a Foldable instance for (Alt Maybe) in base < 4.12. Mystery: for some reason our builds were failing for gitit but not in the pandoc CI.
2021-08-11	Try fixing compile error on older ghcs.	John MacFarlane	1	-1/+5
	See https://github.com/jgm/gitit/runs/3308381697
2021-08-11	Fix some lint issues.	John MacFarlane	2	-6/+5

2021-08-11	LaTeX reader: Support `\global` before `\def`, `\let`, etc.	John MacFarlane	1	-2/+10
	See #7494.
2021-08-11	Fix scope for LaTeX macros.	John MacFarlane	3	-55/+100
	They should by default scope over the group in which they are defined (except `\gdef` and `\xdef`, which are global). In addition, environments must be treated as groups. We handle this by making sMacros in the LaTeX parser state a STACK of macro tables. Opening a group adds a table to the stack, closing one removes one. Only the top of the stack is queried. This commit adds a parameter for scope to the Macro constructor (not exported). Closes #7494.
2021-08-11	LaTeX reader: improve handling of plain TeX macro primitives.	John MacFarlane	2	-6/+29
	- Fixed semantics for `\let`. - Implement `\edef`, `\gdef`, and `\xdef`. - Add comment noting that currently `\def` and `\edef` set global macros (so are equivalent to `\gdef` and `\xdef`). This should be fixed by scoping macro definitions to groups, in a future commit. Closes #7474.
2021-08-10	HTML reader: treat commments as blank when parsing.	John MacFarlane	1	-5/+7
	This modifies pBlank. Previously comments could sometimes flummox the parser. Cloes #7482.
2021-08-10	Fix RTF table parsing bug that created undesired nested tables.	John MacFarlane	1	-1/+1
	Closes #7488.
2021-08-10	Add RTF reader.	John MacFarlane	1	-0/+1333
	- `rtf` is now supported as an input format as well as output. - New module Text.Pandoc.Readers.RTF (exporting `readRTF`). [API change] Closes #3982.
2021-08-03	Stop using the HTTP package. (#7456)	mt_caret	1	-2/+2
	We only depend on the urlEncode function in the package, which is also provided by http-types. The HTTP package also depends on the network package, which has difficulty building on ghcjs. Add internal module Text.Pandoc.Network.HTTP, exporting `urlEncode`.
2021-07-17	LaTeX reader: avoid trailing hyphen in translating languages.	John MacFarlane	1	-2/+2
	Previously `\foreignlanguage{english}` turned into `<span lang="en-">`. The same issue affected Arabic. Closes #7447.
2021-07-16	DocBook reader: handle images with imageobjectco elements.	John MacFarlane	1	-3/+3
	Closes #7440.
2021-07-16	LaTeX reader: Support `\cline` in LaTeX tables.	John MacFarlane	1	-0/+1
	Closes #7442.
2021-07-11	DocBook reader: add support for citerefentry (#7437)	Jan Tojnar	1	-1/+5
	Originally intended for referring to UNIX manual pages, either part of the same DocBook document as refentry element, or external – hence the manvolnum element. These days, refentry is more general, for example the element documentation pages linked below are each a refentry. As per the Processing expectations section of citerefentry, the element is supposed to be a hyperlink to a refentry (when in the same document) but pandoc does not support refentry tag at the moment so that is moot. https://tdg.docbook.org/tdg/5.1/citerefentry.html https://tdg.docbook.org/tdg/5.1/manvolnum.html https://tdg.docbook.org/tdg/5.1/refentry.html This roughly corresponds to a `manpage` role in rST syntax, which produces a `Code` AST node with attributes `.interpreted-text role=manpage` but that does not fit DocBook parser. https://www.sphinx-doc.org/en/master/usage/restructuredtext/roles.html#role-manpage
2021-07-11	Improved parsing of raw LaTeX from Text streams (rawLaTeXParser).	John MacFarlane	2	-11/+37
	We now use source positions from the token stream to tell us how much of the text stream to consume. Getting this to work required a few other changes to make token source positions accurate. Closes #7434.
2021-07-09	RST reader: fix regression with code includes.	John MacFarlane	1	-1/+5
	With the recent changes to include infrastructure, included code blocks were getting an extra newline. Closes #7436. Added regression test.