pandoc - Conversion between markup formats

Age	Commit message (Collapse)	Author	Files	Lines
2020-12-13	Merge pull request #6941 from tarleb/docx-raw	John MacFarlane	1	-59/+78
	Docx writer: keep raw openxml strings verbatim
2020-12-13	ImageSize: use JuicyPixels to extract size...	John MacFarlane	1	-305/+8
	...for png, jpeg, gif, instead of doing our own binary parsing. See #6936.
2020-12-13	ImageSize: use JuicyPixels to determine png size.	John MacFarlane	1	-31/+19

2020-12-13	Docx writer: keep raw openxml strings verbatim.	Albert Krewinkel	1	-2/+5
	Closes: #6933
2020-12-13	Docx writer: use Content instead of Element.	Albert Krewinkel	1	-59/+75

2020-12-12	Merge pull request #6946 from mb21/icml-image-fit	John MacFarlane	1	-1/+6
	ICML writer: fix image bounding box for custom widths/heights
2020-12-12	LaTeX writer: extract table handling into separate module.	Albert Krewinkel	5	-237/+355

2020-12-12	ICML writer: fix image bounding box for custom widths/heights	mb21	1	-1/+6
	fixes #6936
2020-12-10	HTML reader: pay attention to lang attributes on body.	John MacFarlane	1	-3/+6
	These (as well as lang attributes on html) should update lang in metadata. See #6938.
2020-12-10	HTML reader: retain attribute prefixes and avoid duplicates.	John MacFarlane	2	-24/+24
	Previously we stripped attribute prefixes, reading `xml:lang` as `lang` for example. This resulted in two duplicate `lang` attributes when `xml:lang` and `lang` were both used. This commit causes the prefixes to be retained, and also avoids invald duplicate attributes. Closes #6938.
2020-12-10	Add sourcepos extension for commonmarke	John MacFarlane	3	-5/+12
	* Add `Ext_sourcepos` constructor for `Extension`. * Add `sourcepos` extension (only for commonmark). * Bump to 2.11.3 With the `sourcepos` extension set set, `data-pos` attributes are added to the AST by the commonmark reader. No other readers are affected. The `data-pos` attributes are put on elements that accept attributes; for other elements, an enlosing Div or Span is added to hold the attributes. Closes #4565.
2020-12-10	Commonmark reader: refactor specFor, set input name to "".	John MacFarlane	1	-2/+8

2020-12-07	Parsing: Small code improvements.	John MacFarlane	1	-3/+4

2020-12-07	Parsing: More minor performance improvements.	John MacFarlane	1	-10/+13

2020-12-07	Small efficiency improvement in uri parser	John MacFarlane	1	-1/+14

2020-12-07	Bibtex parser: avoid noneOf.	John MacFarlane	1	-2/+2

2020-12-07	Parsing: in nonspaceChar use satisfy instead of oneOf.	John MacFarlane	1	-1/+7
	For efficiency.
2020-12-07	Dokuwiki reader: handle unknown interwiki links better.	John MacFarlane	1	-1/+1
	DokuWiki lets the user define his own Interwiki links. Previously pandoc reacted to these by emitting a google search link, which is not helpful. Instead, we now just emit the full URL including the wikilink prefix, e.g. `faquk>FAQ-mathml`. This at least gives users the ability to modify the links using filters. Closes #6932.
2020-12-07	Merge pull request #6922 from jtojnar/db-writer-admonitions	John MacFarlane	1	-19/+45
	Docbook writer: handle admonitions
2020-12-07	Docbook writer: Handle admonition titles from Markdown reader	Jan Tojnar	1	-0/+2
	Docbook reader produces a `Div` with `title` class for `<title>` element within an “admonition” element. Markdown writer then turns this into a fenced div with `title` class attribute. Since fenced divs are block elements, their content is recognized as a paragraph by the Markdown reader. This is an issue for Docbook writer because it would produce an invalid DocBook document from such AST – the `<title>` element can only contain “inline” elements. Let’s handle this invalid special case separately by unwrapping the paragraph before creating the `<title>` element.
2020-12-07	Docbook writer: Use correct id attribute consistently	Jan Tojnar	1	-10/+16
	DocBook5 should always use xml:id instead of id so let’s use it everywhere.
2020-12-07	Docbook writer: handle admonitions	Jan Tojnar	1	-12/+30
	Similarly to https://github.com/jgm/pandoc/commit/d6fdfe6f2bba2a8ed25d6c9f11861774001f7a91, we should handle admonitions.
2020-12-05	Org reader: preserve targets of spurious links	Albert Krewinkel	1	-5/+4
	Links with (internal) targets that the reader doesn't know about are converted into emphasized text. Information on the link target is now preserved by wrapping the text in a Span of class `spurious-link`, with an attribute `target` set to the link's original target. This allows to recover and fix broken or unknown links with filters. See: #6916
2020-12-05	OpenDocument writer: Allow references for internal links (#6774)	Nils Carlson	2	-18/+77
	This commit adds two extensions to the OpenDocument writer, `xrefs_name` and `xrefs_number`. Links to headings, figures and tables inside the document are substituted with cross-references that will use the name or caption of the referenced item for `xrefs_name` or the number for `xrefs_number`. For the `xrefs_number` to be useful heading numbers must be enabled in the generated document and table and figure captions must be enabled using for example the `native_numbering` extension. In order for numbers and reference text to be updated the generated document must be refreshed. Co-authored-by: Nils Carlson <nils.carlson@ludd.ltu.se>
2020-12-05	LaTeX reader: don't apply theorem default styling to a figure inside.	John MacFarlane	1	-0/+1
	If we put an image in italics, then when rendering to Markdown we no longer get an implicit figure. Closes #6925.
2020-12-04	Docbook writer: add XML namespaces to top-level elements (#6923)	Jan Tojnar	1	-8/+20
	Previously, we only added xmlns attributes to chapter elements, even when running with --top-level-division=section. Let’s add the namespaces to part and section elements too, when they are the selected top-level divisions. We do not need to add namespaces to documents produced with --standalone flag, since those will already have xmlns attribute on the root element in the template.
2020-12-04	Markdown writer: ensure that a new csl-block begins on a new line.	John MacFarlane	1	-1/+6
	This just looks better and doesn't affect the semantics. See #6921.
2020-12-04	LaTeX writer: Fix bug with nested csl- display Spans.	John MacFarlane	1	-36/+32
	See #6921.
2020-12-04	HTML writer: Fix handling of nested csl- display spans.	John MacFarlane	1	-20/+12
	Previously inner Spans used to represent CSL display attributes were not rendered as div tags. See #6921.
2020-12-03	EPUB writer: include title page in landmarks.	John MacFarlane	1	-2/+7
	Closes #6919. Note that the toc is also included if `--toc` is specified.
2020-12-03	EPUB writer: add frontmatter type on body element for nav.xhtml.	John MacFarlane	1	-1/+3
	Closes #6918.
2020-12-03	Docx writer: Support bold and italic in "complex script."	John MacFarlane	1	-2/+6
	Previously bold and italics didn't work properly in LTR text. This commit causes the w:bCs and w:iCs attributes to be used, in addition to w:b and w:i, for bold and italics respectively. Closes #6911.
2020-12-02	Citeproc: ensure that BCP47 lang codes can be used.	John MacFarlane	1	-2/+17
	We ignore the variants and just use the base lang code and country code when passing off to citeproc.
2020-11-29	LaTeX reader: don't parse `\rule` with width 0 as horizontal rule.	John MacFarlane	1	-1/+11

2020-11-28	Fix a tiny Typo in the CSV reader module	Tassos Manganaris	1	-1/+1
	Header comment in the CSV reader module says "RST" instead of "CSV".
2020-11-27	HTML reader tests: improve test coverage of new features	Albert Krewinkel	1	-1/+2

2020-11-27	HTML reader: support body headers, row head columns	Albert Krewinkel	1	-41/+61
	Closes: #6312
2020-11-26	Added some explicit imports.	John MacFarlane	1	-3/+3

2020-11-26	Docx writer: Fix bullets/lists indentation	cholonam	1	-3/+3
	Fix appearance of bullets/numbered lists (the first level is slightly indented to the right instead of right on the margin). New golden files have been tested using Word 2010 on Windows 10.
2020-11-26	LaTeX reader: preserve center environment (#6852)	Igor Pashev	1	-1/+1
	The contents of the `center` environment are put in a `Div` with class `center`.
2020-11-26	HTML reader: improve support for table headers, footer, attributes	Albert Krewinkel	4	-118/+244
	- `<tfoot>` elements are no longer added to the table body but used as table footer. - Separate `<tbody>` elements are no longer combined into one. - Attributes on `<thead>`, `<tbody>`, `<th>`/`<td>`, and `<tfoot>` elements are preserved.
2020-11-26	HTML reader: allow finer grained options for tag omission	Albert Krewinkel	3	-13/+26

2020-11-25	LaTeX writer: improve longtable output.	John MacFarlane	1	-40/+60
	- Don't create minipages for regular paragraphs. - Put width and alignment information in the longtable column descriptors. - Closes #6883.
2020-11-25	LaTeX tables: Fix calculation of column spacing.	John MacFarlane	1	-1/+1
	See #6883.
2020-11-25	Fix truncation of `[Citation]` list in `Cite` inside footnotes...	John MacFarlane	1	-2/+2
	This affected author-in-text citations in footnotes. It didn't cause problems for the printed output, but for filters that expected the citation id and other information. Closes #6890.
2020-11-25	HTML reader: simplify list attribute handling	Albert Krewinkel	2	-27/+12
	This removes the `foldOrElse` function from the internal Text.Pandoc.CSS module.
2020-11-24	HTML reader: support row or column-spanning table cells	Albert Krewinkel	2	-28/+26

2020-11-24	HTML reader: support blocks in caption	Albert Krewinkel	2	-6/+6

2020-11-24	HTML reader: extract table parsing into separate module	Albert Krewinkel	3	-95/+140

2020-11-23	ImageSize: default to DPI 72 if the format specifies DPI of 0.	John MacFarlane	1	-1/+7
	This shouldn't happen, in general, but it can happen with JPEGs that don't conform to the spec. Having a DPI of 0 will blow up size calculations (division by 0). Closes #6880.