pandoc - Conversion between markup formats

Age	Commit message (Collapse)	Author	Files	Lines
2021-01-02	LaTeX reader: put contents of unknown environments in a Div...	John MacFarlane	1	-1/+1
	when `raw_tex` is not enabled. (When `raw_tex` is enabled, the whole environment is parsed as a raw block.) The class name is the name of the environment. Previously, we just included the contents without the surrounding Div, but having a record of the environment's boundaries and name can be useful. Closes #6997.
2021-01-02	LaTeX writer: revert table line height increase in 2.11.3.	John MacFarlane	1	-1/+1
	In 2.11.3 we started adding `\addlinespace`, which produced less dense tables. This wasn't an intentional change; I misunderstood a comment in the discussion leading up to the change. This commit restores the earlier default table appearance. Note that if you want a less dense table, you can use something like `\def\arraystretch{1.5}` in your header. Closes #6996.
2021-01-01	Org reader: restructure output of captioned code blocks	Albert Krewinkel	1	-14/+12
	The Div wrapper of code blocks with captions now has the class "captioned-content". The caption itself is added as a Plain block inside a Div of class "caption". This makes it easier to write filters which match on captioned code blocks. Existing filters will need to be updated. Closes: #6977
2020-12-30	Mediawiki reader: allow space around storng/emph delimiters.	John MacFarlane	1	-6/+4
	Closes #6993.
2020-12-30	Undo the "Use fromRight" hlint hint.	John MacFarlane	1	-2/+1

2020-12-30	Hlint fixes	John MacFarlane	2	-2/+3

2020-12-30	Ms writer: don't justify inside table cells.	John MacFarlane	1	-1/+3

2020-12-29	Improve fix to #6983.	John MacFarlane	1	-1/+3
	If we have a paragraph then a bookmarkEnd, we don't need to insert the empty paragraph (and in fact it alters the spacing). Closes #6983.
2020-12-28	Docx writer: fix nested tables with captions.	John MacFarlane	1	-4/+6
	Previously we got unreadable content, because docx seems to want a `<w:p>` element (even an empty one) at the end of every table cell. Closes #6983.
2020-12-28	HTML reader: use renderTags' from Text.Pandoc.Shared.	Albert Krewinkel	1	-25/+3
	The `renderTags'` function was duplicated when the reader used `Text` as its string type. The duplication is no longer necessary. A side effect of this change is that empty `<col>` elements are written as self-closing tags in raw HTML blocks.
2020-12-27	Use meta-description instead of description in templates.	John MacFarlane	1	-0/+3
	Since this is an attribute value, we need to prepare it in the writer.
2020-12-27	Add support for writing nested tables to asciidoc (#6972)	timo-a	1	-7/+32
	Added field to WriterState that denotes the current nesting level for traversing tables. Depending on the value of that field nested tables are recognized and written. Asciidoc supports one level of nesting. If deeper tables are to be written, they are omitted and a warning is issued.
2020-12-27	Powerpoint writer: allow arbitrary OOXML in raw inline elements	Albert Krewinkel	1	-22/+27
	The raw text is now included verbatim in the output. Previously is was parsed into XML elements, which prevented the inclusion of partial XML snippets.
2020-12-24	Citeproc: fix handling of empty URL variables (`DOI`, etc.).	John MacFarlane	1	-1/+3
	The `linkifyVariables` function was changing these to links which then got treated as non-empty by citeproc, leading to wrong results (e.g. ignoring nonempty URL when empty DOI is present). Addresses part 2 of jgm/citeproc#41.
2020-12-20	HTML writer: don't include p tags in CSL bibliography entries.	John MacFarlane	1	-2/+7
	Fixes a regression in 2.11.3. Closes #6966
2020-12-20	LaTeX writer: support colspans and rowspans in tables. (#6950)	Albert Krewinkel	4	-96/+236
	Note that the multirow package is needed for rowspans. It is included in the latex template under a variable, so that it won't be used unless needed for a table.
2020-12-16	Fix citeproc regression with duplicate references.	John MacFarlane	1	-1/+2
	- Use dev version of citeproc, which handles duplicate ids better, preferring the last one in the list and discarding the rest. - Ensure that inline citations take priority over external ones. See jgm/citeproc#36. This restores the behavior of pandoc-citeproc.
2020-12-16	Support Lua marshalling of doctemplates BoolVal.	John MacFarlane	1	-0/+1
	This updates T.P.Lua.Marshaling.Context for doctemplates >= 0.9.
2020-12-15	Properly handle boolean values in writing YAML metadata.	John MacFarlane	2	-2/+3
	(Markdown writer.) This requires doctemplates >= 0.9. Closes #6388.
2020-12-15	Use fetchItem to get external bibliography.	John MacFarlane	1	-8/+7
	This means that: - a URL may be provided, and pandoc will fetch the resource. - Pandoc will search the resource path for the bibliography if it is not found relative to the working directory. Closes #6940.
2020-12-15	Allow both inline and external references to be used	John MacFarlane	1	-14/+15
	with `--citeproc`. This fixes a regression, since pandoc-citeproc allowed these to be combined. Closes #6951.
2020-12-14	ImageSize: use exif width and height when available.	John MacFarlane	1	-0/+13
	After the move to JuicyPixels, we were getting incorrect width and heigh information for some images (see #6936, test-3.jpg). The correct information was encoded in Exif tags that JuicyPixels seemed to ignore. So we check these first before looking at the Width and Height identified by JuicyPixels. Closes #6936.
2020-12-13	RST writer: better image handling.	John MacFarlane	1	-9/+21
	- An image alone in its paragraph (but not a figure) is now rendered as an independent image, with an `alt` attribute if a description is supplied. - An inline image that is not alone in its paragraph will be rendered, as before, using a substitution. Such an image cannot have a "center", "left", or "right" alignment, so the classes `align-center`, `align-left`, or `align-right` are ignored. However, `align-top`, `align-middle`, `align-bottom` will generate a corresponding `align` attribute. Closes #6948.
2020-12-13	Merge pull request #6941 from tarleb/docx-raw	John MacFarlane	1	-59/+78
	Docx writer: keep raw openxml strings verbatim
2020-12-13	ImageSize: use JuicyPixels to extract size...	John MacFarlane	1	-305/+8
	...for png, jpeg, gif, instead of doing our own binary parsing. See #6936.
2020-12-13	ImageSize: use JuicyPixels to determine png size.	John MacFarlane	1	-31/+19

2020-12-13	Docx writer: keep raw openxml strings verbatim.	Albert Krewinkel	1	-2/+5
	Closes: #6933
2020-12-13	Docx writer: use Content instead of Element.	Albert Krewinkel	1	-59/+75

2020-12-12	Merge pull request #6946 from mb21/icml-image-fit	John MacFarlane	1	-1/+6
	ICML writer: fix image bounding box for custom widths/heights
2020-12-12	LaTeX writer: extract table handling into separate module.	Albert Krewinkel	5	-237/+355

2020-12-12	ICML writer: fix image bounding box for custom widths/heights	mb21	1	-1/+6
	fixes #6936
2020-12-10	HTML reader: pay attention to lang attributes on body.	John MacFarlane	1	-3/+6
	These (as well as lang attributes on html) should update lang in metadata. See #6938.
2020-12-10	HTML reader: retain attribute prefixes and avoid duplicates.	John MacFarlane	2	-24/+24
	Previously we stripped attribute prefixes, reading `xml:lang` as `lang` for example. This resulted in two duplicate `lang` attributes when `xml:lang` and `lang` were both used. This commit causes the prefixes to be retained, and also avoids invald duplicate attributes. Closes #6938.
2020-12-10	Add sourcepos extension for commonmarke	John MacFarlane	3	-5/+12
	* Add `Ext_sourcepos` constructor for `Extension`. * Add `sourcepos` extension (only for commonmark). * Bump to 2.11.3 With the `sourcepos` extension set set, `data-pos` attributes are added to the AST by the commonmark reader. No other readers are affected. The `data-pos` attributes are put on elements that accept attributes; for other elements, an enlosing Div or Span is added to hold the attributes. Closes #4565.
2020-12-10	Commonmark reader: refactor specFor, set input name to "".	John MacFarlane	1	-2/+8

2020-12-07	Parsing: Small code improvements.	John MacFarlane	1	-3/+4

2020-12-07	Parsing: More minor performance improvements.	John MacFarlane	1	-10/+13

2020-12-07	Small efficiency improvement in uri parser	John MacFarlane	1	-1/+14

2020-12-07	Bibtex parser: avoid noneOf.	John MacFarlane	1	-2/+2

2020-12-07	Parsing: in nonspaceChar use satisfy instead of oneOf.	John MacFarlane	1	-1/+7
	For efficiency.
2020-12-07	Dokuwiki reader: handle unknown interwiki links better.	John MacFarlane	1	-1/+1
	DokuWiki lets the user define his own Interwiki links. Previously pandoc reacted to these by emitting a google search link, which is not helpful. Instead, we now just emit the full URL including the wikilink prefix, e.g. `faquk>FAQ-mathml`. This at least gives users the ability to modify the links using filters. Closes #6932.
2020-12-07	Merge pull request #6922 from jtojnar/db-writer-admonitions	John MacFarlane	1	-19/+45
	Docbook writer: handle admonitions
2020-12-07	Docbook writer: Handle admonition titles from Markdown reader	Jan Tojnar	1	-0/+2
	Docbook reader produces a `Div` with `title` class for `<title>` element within an “admonition” element. Markdown writer then turns this into a fenced div with `title` class attribute. Since fenced divs are block elements, their content is recognized as a paragraph by the Markdown reader. This is an issue for Docbook writer because it would produce an invalid DocBook document from such AST – the `<title>` element can only contain “inline” elements. Let’s handle this invalid special case separately by unwrapping the paragraph before creating the `<title>` element.
2020-12-07	Docbook writer: Use correct id attribute consistently	Jan Tojnar	1	-10/+16
	DocBook5 should always use xml:id instead of id so let’s use it everywhere.
2020-12-07	Docbook writer: handle admonitions	Jan Tojnar	1	-12/+30
	Similarly to https://github.com/jgm/pandoc/commit/d6fdfe6f2bba2a8ed25d6c9f11861774001f7a91, we should handle admonitions.
2020-12-05	Org reader: preserve targets of spurious links	Albert Krewinkel	1	-5/+4
	Links with (internal) targets that the reader doesn't know about are converted into emphasized text. Information on the link target is now preserved by wrapping the text in a Span of class `spurious-link`, with an attribute `target` set to the link's original target. This allows to recover and fix broken or unknown links with filters. See: #6916
2020-12-05	OpenDocument writer: Allow references for internal links (#6774)	Nils Carlson	2	-18/+77
	This commit adds two extensions to the OpenDocument writer, `xrefs_name` and `xrefs_number`. Links to headings, figures and tables inside the document are substituted with cross-references that will use the name or caption of the referenced item for `xrefs_name` or the number for `xrefs_number`. For the `xrefs_number` to be useful heading numbers must be enabled in the generated document and table and figure captions must be enabled using for example the `native_numbering` extension. In order for numbers and reference text to be updated the generated document must be refreshed. Co-authored-by: Nils Carlson <nils.carlson@ludd.ltu.se>
2020-12-05	LaTeX reader: don't apply theorem default styling to a figure inside.	John MacFarlane	1	-0/+1
	If we put an image in italics, then when rendering to Markdown we no longer get an implicit figure. Closes #6925.
2020-12-04	Docbook writer: add XML namespaces to top-level elements (#6923)	Jan Tojnar	1	-8/+20
	Previously, we only added xmlns attributes to chapter elements, even when running with --top-level-division=section. Let’s add the namespaces to part and section elements too, when they are the selected top-level divisions. We do not need to add namespaces to documents produced with --standalone flag, since those will already have xmlns attribute on the root element in the template.
2020-12-04	Markdown writer: ensure that a new csl-block begins on a new line.	John MacFarlane	1	-1/+6
	This just looks better and doesn't affect the semantics. See #6921.