pandoc - Conversion between markup formats

Age	Commit message	Author	Files	Lines
2020-12-27	Powerpoint writer: allow arbitrary OOXML in raw inline elements	Albert Krewinkel	1	-22/+27
	The raw text is now included verbatim in the output. Previously it was parsed into XML elements, which prevented the inclusion of partial XML snippets.
2020-12-24	Citeproc: fix handling of empty URL variables (`DOI`, etc.).	John MacFarlane	1	-1/+3
	The `linkifyVariables` function was changing these to links which then got treated as non-empty by citeproc, leading to wrong results (e.g. ignoring nonempty URL when empty DOI is present). Addresses part 2 of jgm/citeproc#41.
2020-12-20	HTML writer: don't include p tags in CSL bibliography entries.	John MacFarlane	1	-2/+7
	Fixes a regression in 2.11.3. Closes #6966
2020-12-20	LaTeX writer: support colspans and rowspans in tables. (#6950)	Albert Krewinkel	4	-96/+236
	Note that the multirow package is needed for rowspans. It is included in the latex template under a variable, so that it won't be used unless needed for a table.
2020-12-16	Fix citeproc regression with duplicate references.	John MacFarlane	1	-1/+2
	- Use dev version of citeproc, which handles duplicate ids better, preferring the last one in the list and discarding the rest. - Ensure that inline citations take priority over external ones. See jgm/citeproc#36. This restores the behavior of pandoc-citeproc.
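The duplicate-handling rule described in this commit can be illustrated with a minimal Python sketch. It is not citeproc's implementation (citeproc is written in Haskell); the function name `merge_references` and the dict-based reference shape are hypothetical, and the sketch assumes that inline references are appended after external ones so that "last one wins" also gives inline references priority.

```python
def merge_references(external, inline):
    """Combine two reference lists, keeping one entry per id.

    Assumption for illustration: later entries override earlier ones,
    and inline references come after external ones, so they win.
    """
    merged = {}
    for ref in list(external) + list(inline):
        # Assigning to an existing key keeps its original position
        # in the dict but replaces the stored value.
        merged[ref["id"]] = ref
    return list(merged.values())


external = [
    {"id": "a", "title": "ext A"},
    {"id": "b", "title": "ext B1"},
    {"id": "b", "title": "ext B2"},  # duplicate id: the last one is kept
]
inline = [{"id": "a", "title": "inline A"}]
result = merge_references(external, inline)
```

Here `result` holds one entry per id: the inline version of `a` and the last external version of `b`.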
2020-12-16	Support Lua marshalling of doctemplates BoolVal.	John MacFarlane	1	-0/+1
	This updates T.P.Lua.Marshaling.Context for doctemplates >= 0.9.
2020-12-15	Properly handle boolean values in writing YAML metadata.	John MacFarlane	2	-2/+3
	(Markdown writer.) This requires doctemplates >= 0.9. Closes #6388.
2020-12-15	Use fetchItem to get external bibliography.	John MacFarlane	1	-8/+7
	This means that: - a URL may be provided, and pandoc will fetch the resource. - Pandoc will search the resource path for the bibliography if it is not found relative to the working directory. Closes #6940.
2020-12-15	Allow both inline and external references to be used	John MacFarlane	1	-14/+15
	with `--citeproc`. This fixes a regression, since pandoc-citeproc allowed these to be combined. Closes #6951.
2020-12-14	ImageSize: use exif width and height when available.	John MacFarlane	1	-0/+13
	After the move to JuicyPixels, we were getting incorrect width and height information for some images (see #6936, test-3.jpg). The correct information was encoded in Exif tags that JuicyPixels seemed to ignore. So we check these first before looking at the Width and Height identified by JuicyPixels. Closes #6936.
2020-12-13	RST writer: better image handling.	John MacFarlane	1	-9/+21
	- An image alone in its paragraph (but not a figure) is now rendered as an independent image, with an `alt` attribute if a description is supplied. - An inline image that is not alone in its paragraph will be rendered, as before, using a substitution. Such an image cannot have a "center", "left", or "right" alignment, so the classes `align-center`, `align-left`, or `align-right` are ignored. However, `align-top`, `align-middle`, `align-bottom` will generate a corresponding `align` attribute. Closes #6948.
2020-12-13	Merge pull request #6941 from tarleb/docx-raw	John MacFarlane	1	-59/+78
	Docx writer: keep raw openxml strings verbatim
2020-12-13	ImageSize: use JuicyPixels to extract size...	John MacFarlane	1	-305/+8
	...for png, jpeg, gif, instead of doing our own binary parsing. See #6936.
2020-12-13	ImageSize: use JuicyPixels to determine png size.	John MacFarlane	1	-31/+19

2020-12-13	Docx writer: keep raw openxml strings verbatim.	Albert Krewinkel	1	-2/+5
	Closes: #6933
2020-12-13	Docx writer: use Content instead of Element.	Albert Krewinkel	1	-59/+75

2020-12-12	Merge pull request #6946 from mb21/icml-image-fit	John MacFarlane	1	-1/+6
	ICML writer: fix image bounding box for custom widths/heights
2020-12-12	LaTeX writer: extract table handling into separate module.	Albert Krewinkel	5	-237/+355

2020-12-12	ICML writer: fix image bounding box for custom widths/heights	mb21	1	-1/+6
	fixes #6936
2020-12-10	HTML reader: pay attention to lang attributes on body.	John MacFarlane	1	-3/+6
	These (as well as lang attributes on html) should update lang in metadata. See #6938.
2020-12-10	HTML reader: retain attribute prefixes and avoid duplicates.	John MacFarlane	2	-24/+24
	Previously we stripped attribute prefixes, reading `xml:lang` as `lang` for example. This resulted in duplicate `lang` attributes when `xml:lang` and `lang` were both used. This commit causes the prefixes to be retained, and also avoids invalid duplicate attributes. Closes #6938.
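The problem this commit describes can be sketched in a few lines of Python. This is an illustration only, not the HTML reader's code: `strip_prefix` and `dedupe_attributes` are hypothetical names, and keeping the first occurrence of a repeated attribute is an assumption, since the commit message does not say which duplicate is retained.

```python
def strip_prefix(name):
    """Old behaviour as described: drop any namespace prefix."""
    return name.split(":", 1)[-1]


def dedupe_attributes(attrs):
    """Keep only the first occurrence of each attribute name (an assumption)."""
    seen = set()
    result = []
    for name, value in attrs:
        if name not in seen:
            seen.add(name)
            result.append((name, value))
    return result


attrs = [("xml:lang", "en"), ("lang", "en")]
stripped = [(strip_prefix(n), v) for n, v in attrs]  # yields two `lang` entries
retained = dedupe_attributes(attrs)                  # prefixes kept, so no clash
```

With prefixes stripped, both entries collapse to the name `lang`; with prefixes retained, the two names stay distinct and only genuine repeats are dropped.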
2020-12-10	Add sourcepos extension for commonmark	John MacFarlane	3	-5/+12
	* Add `Ext_sourcepos` constructor for `Extension`. * Add `sourcepos` extension (only for commonmark). * Bump to 2.11.3. With the `sourcepos` extension set, `data-pos` attributes are added to the AST by the commonmark reader. No other readers are affected. The `data-pos` attributes are put on elements that accept attributes; for other elements, an enclosing Div or Span is added to hold the attributes. Closes #4565.
2020-12-10	Commonmark reader: refactor specFor, set input name to "".	John MacFarlane	1	-2/+8

2020-12-07	Parsing: Small code improvements.	John MacFarlane	1	-3/+4

2020-12-07	Parsing: More minor performance improvements.	John MacFarlane	1	-10/+13

2020-12-07	Small efficiency improvement in uri parser	John MacFarlane	1	-1/+14

2020-12-07	Bibtex parser: avoid noneOf.	John MacFarlane	1	-2/+2

2020-12-07	Parsing: in nonspaceChar use satisfy instead of oneOf.	John MacFarlane	1	-1/+7
	For efficiency.
2020-12-07	Dokuwiki reader: handle unknown interwiki links better.	John MacFarlane	1	-1/+1
	DokuWiki lets users define their own Interwiki links. Previously pandoc reacted to these by emitting a Google search link, which is not helpful. Instead, we now just emit the full URL including the wikilink prefix, e.g. `faquk>FAQ-mathml`. This at least gives users the ability to modify the links using filters. Closes #6932.
2020-12-07	Merge pull request #6922 from jtojnar/db-writer-admonitions	John MacFarlane	1	-19/+45
	Docbook writer: handle admonitions
2020-12-07	Docbook writer: Handle admonition titles from Markdown reader	Jan Tojnar	1	-0/+2
	Docbook reader produces a `Div` with `title` class for `<title>` element within an “admonition” element. Markdown writer then turns this into a fenced div with `title` class attribute. Since fenced divs are block elements, their content is recognized as a paragraph by the Markdown reader. This is an issue for Docbook writer because it would produce an invalid DocBook document from such AST – the `<title>` element can only contain “inline” elements. Let’s handle this invalid special case separately by unwrapping the paragraph before creating the `<title>` element.
2020-12-07	Docbook writer: Use correct id attribute consistently	Jan Tojnar	1	-10/+16
	DocBook5 should always use xml:id instead of id so let’s use it everywhere.
2020-12-07	Docbook writer: handle admonitions	Jan Tojnar	1	-12/+30
	Similarly to https://github.com/jgm/pandoc/commit/d6fdfe6f2bba2a8ed25d6c9f11861774001f7a91, we should handle admonitions.
2020-12-05	Org reader: preserve targets of spurious links	Albert Krewinkel	1	-5/+4
	Links with (internal) targets that the reader doesn't know about are converted into emphasized text. Information on the link target is now preserved by wrapping the text in a Span of class `spurious-link`, with an attribute `target` set to the link's original target. This allows broken or unknown links to be recovered and fixed with filters. See: #6916
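As a rough sketch of how a filter might recover these targets, the following Python walks a pandoc JSON AST and collects the `target` attribute of every `spurious-link` Span. The function name `spurious_targets` is hypothetical, and the sample AST (a Span nested inside an Emph) is an assumed shape for illustration; the walker does not depend on the exact nesting.

```python
def spurious_targets(node):
    """Yield the `target` attribute of every Span with class `spurious-link`."""
    if isinstance(node, dict):
        if node.get("t") == "Span":
            # A Span's content is [attr, inlines]; attr is [id, classes, key-values].
            (_ident, classes, keyvals), _inlines = node["c"]
            if "spurious-link" in classes:
                for key, value in keyvals:
                    if key == "target":
                        yield value
        for child in node.values():
            yield from spurious_targets(child)
    elif isinstance(node, list):
        for child in node:
            yield from spurious_targets(child)


# Hypothetical AST fragment in pandoc's JSON encoding.
sample = {
    "t": "Para",
    "c": [
        {"t": "Emph", "c": [
            {"t": "Span", "c": [
                ["", ["spurious-link"], [["target", "missing-heading"]]],
                [{"t": "Str", "c": "text"}],
            ]},
        ]},
    ],
}
targets = list(spurious_targets(sample))
```

A real filter would read the JSON document from standard input and could replace each Span with a repaired link instead of only listing the targets.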
2020-12-05	OpenDocument writer: Allow references for internal links (#6774)	Nils Carlson	2	-18/+77
	This commit adds two extensions to the OpenDocument writer, `xrefs_name` and `xrefs_number`. Links to headings, figures and tables inside the document are substituted with cross-references that will use the name or caption of the referenced item for `xrefs_name` or the number for `xrefs_number`. For `xrefs_number` to be useful, heading numbers must be enabled in the generated document, and table and figure captions must be enabled using, for example, the `native_numbering` extension. In order for numbers and reference text to be updated, the generated document must be refreshed. Co-authored-by: Nils Carlson <nils.carlson@ludd.ltu.se>
2020-12-05	LaTeX reader: don't apply theorem default styling to a figure inside.	John MacFarlane	1	-0/+1
	If we put an image in italics, then when rendering to Markdown we no longer get an implicit figure. Closes #6925.
2020-12-04	Docbook writer: add XML namespaces to top-level elements (#6923)	Jan Tojnar	1	-8/+20
	Previously, we only added xmlns attributes to chapter elements, even when running with --top-level-division=section. Let’s add the namespaces to part and section elements too, when they are the selected top-level divisions. We do not need to add namespaces to documents produced with the --standalone flag, since those will already have an xmlns attribute on the root element in the template.
2020-12-04	Markdown writer: ensure that a new csl-block begins on a new line.	John MacFarlane	1	-1/+6
	This just looks better and doesn't affect the semantics. See #6921.
2020-12-04	LaTeX writer: Fix bug with nested csl- display Spans.	John MacFarlane	1	-36/+32
	See #6921.
2020-12-04	HTML writer: Fix handling of nested csl- display spans.	John MacFarlane	1	-20/+12
	Previously inner Spans used to represent CSL display attributes were not rendered as div tags. See #6921.
2020-12-03	EPUB writer: include title page in landmarks.	John MacFarlane	1	-2/+7
	Closes #6919. Note that the toc is also included if `--toc` is specified.
2020-12-03	EPUB writer: add frontmatter type on body element for nav.xhtml.	John MacFarlane	1	-1/+3
	Closes #6918.
2020-12-03	Docx writer: Support bold and italic in "complex script."	John MacFarlane	1	-2/+6
	Previously bold and italics didn't work properly in LTR text. This commit causes the w:bCs and w:iCs attributes to be used, in addition to w:b and w:i, for bold and italics respectively. Closes #6911.
2020-12-02	Citeproc: ensure that BCP47 lang codes can be used.	John MacFarlane	1	-2/+17
	We ignore the variants and just use the base lang code and country code when passing off to citeproc.
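The reduction described here (keep the base language and country code, ignore variants) might look roughly like the Python sketch below. It is not pandoc's code: the function name `base_lang_and_region` is hypothetical, and the subtag rules are a simplified reading of BCP 47 (region is two letters or three digits; script, variant and extension subtags are skipped).

```python
def base_lang_and_region(tag):
    """Split a BCP 47 tag into (language, region), ignoring other subtags.

    Simplified sketch: the first subtag is the language; the region is the
    first later subtag made of two letters or three digits.
    """
    subtags = tag.split("-")
    language = subtags[0].lower()
    region = None
    for sub in subtags[1:]:
        if len(sub) == 1:
            break  # singleton: only extension or private-use subtags follow
        if (len(sub) == 2 and sub.isalpha()) or (len(sub) == 3 and sub.isdigit()):
            region = sub.upper()
            break
    return language, region


german = base_lang_and_region("de-DE-1996")   # variant `1996` is ignored
serbian = base_lang_and_region("sr-Latn-RS")  # script `Latn` is skipped
```

Tags without a region, such as `en` or `zh-Hans`, yield `None` for the second component.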
2020-11-29	LaTeX reader: don't parse `\rule` with width 0 as horizontal rule.	John MacFarlane	1	-1/+11

2020-11-28	Fix a tiny typo in the CSV reader module	Tassos Manganaris	1	-1/+1
	Header comment in the CSV reader module says "RST" instead of "CSV".
2020-11-27	HTML reader tests: improve test coverage of new features	Albert Krewinkel	1	-1/+2

2020-11-27	HTML reader: support body headers, row head columns	Albert Krewinkel	1	-41/+61
	Closes: #6312
2020-11-26	Added some explicit imports.	John MacFarlane	1	-3/+3

2020-11-26	Docx writer: Fix bullets/lists indentation	cholonam	1	-3/+3
	Fix appearance of bullets/numbered lists (the first level is slightly indented to the right instead of right on the margin). New golden files have been tested using Word 2010 on Windows 10.