pandoc - Conversion between markup formats

Age	Commit message (Collapse)	Author	Files	Lines
2020-07-12	Merge pull request #6509 from lierdakil/docx-smush-inlines-refactor	John MacFarlane	1	-62/+39
	[Docx Reader] Refactor/update Text.Pandoc.Readers.Docx.Combine.smushInlines
2020-07-12	Ms writer: fix code highlighting with blank lines.	John MacFarlane	1	-5/+5
	Previously blank lines were simply omitted from highligted code.
2020-07-12	RST reader: fix spurious newlines in some attributes from directives.	John MacFarlane	1	-1/+2

2020-07-12	RST reader: avoid extra newline in included code blocks.	John MacFarlane	1	-2/+2

2020-07-08	Escape starting periods in ms writer code blocks	Michael Hoffmann	1	-1/+3
	If a line of ms code block output starts with a period (.), it should be prepended by '\&' so that it is not interpreted as a roff command. Fixes #6505
2020-07-07	[Docx Reader] Use null instead of isEmpty in Readers.Docx.Combine	Nikolay Yakimov	1	-9/+5

2020-07-07	[Docx Reader] Remove unused LANGUAGE from Readers.Docx.Combine	Nikolay Yakimov	1	-2/+0

2020-07-07	[Docx Reader] Remove no-op stack/unstackInlines in Readers.Docx.Combine	Nikolay Yakimov	1	-6/+4

2020-07-07	[Docx Reader] Get rid of unused NullModifier in Readers.Docx.Combine	Nikolay Yakimov	1	-18/+15

2020-07-07	[Docx Reader] Refactor/update smushInlines	Nikolay Yakimov	1	-44/+32

2020-07-02	Revert "Ipnyb: allow lossless round-tripping of markdown cell content."	John MacFarlane	2	-7/+3
	This reverts commit efbc2050315b60c8a753dee6255465f1083019ab.
2020-07-02	Revert "Ipynb reader: fix duplication of 'source' attribute."	John MacFarlane	1	-1/+1
	This reverts commit 2d009366cef2358ec2c99612ae2c73068841306c.
2020-07-02	Ipynb reader: fix duplication of 'source' attribute.	John MacFarlane	1	-1/+1
	See #5408.
2020-07-01	HTML writer: improve alt-text/caption handling for HTML5	Albert Krewinkel	1	-2/+10
	Screen readers read an image's `alt` attribute and the figure caption, both of which come from the same source in pandoc. The figure caption is hidden from screen readers with the `aria-hidden` attribute. This improves accessibility. For HTML4, where `aria-hidden` is not allowed, pandoc still uses an empty `alt` attribute to avoid duplicate contents. Closes: #6491
2020-07-01	Org reader: respect tables-excluding export setting	Albert Krewinkel	3	-2/+7
	Tables can be removed from the final document with the `#+OPTION: \|:nil` export setting.
2020-06-30	Org reader: respect export setting disabling footnotes	Albert Krewinkel	3	-2/+7
	Footnotes can be removed from the final document with the `#+OPTION: f:nil` export setting.
2020-06-30	Ipnyb: allow lossless round-tripping of markdown cell content.	John MacFarlane	2	-3/+7
	The reader now parses the contents of the markdown cell to a Pandoc structure, but also stores the raw markdown in a `source` attribute on the cell Div. When we convert back to markdown, this attribute is stripped off and the original source is used. When we convert to other formats, the attribute is usually ignored (though it will come through in HTML as a `data-source` attribute, not unhelpfully). I'll note some potential drawbacks of this approach: - It makes it impossible to use pandoc to clean up or change the contents of markdown cells, e.g. going from `+smart` to `-smart`. - There may be formats where the addition of the `source` attribute is problematic. I can't think of any, though. Closes #5408.
2020-06-30	Org reader: respect export setting which disables entities	Albert Krewinkel	3	-6/+16
	MathML-like entities, e.g., `\alpha`, can be disabled with the `#+OPTION: e:nil` export setting.
2020-06-29	Merge pull request #6328 from lierdakil/defaults-meta-parse	John MacFarlane	3	-57/+45
	Unify defaults metadata and markdown metadata parsers
2020-06-29	Org reader: keep unknown keyword lines as raw org	Albert Krewinkel	2	-2/+13
	The lines of unknown keywords, like `#+SOMEWORD: value` are no longer read as metadata, but kept as raw `org` blocks. This ensures that more information is retained when round-tripping org-mode files; additionally, this change makes it possible to support non-standard org extensions via filters.
2020-06-29	Org reader: unify keyword handling	Albert Krewinkel	1	-75/+67
	Handling of export settings and other keywords (like `#+LINK`) has been combined and unified.
2020-06-29	Org reader: support LATEX_HEADER_EXTRA and HTML_HEAD_EXTRA settings	Albert Krewinkel	1	-5/+9
	These export settings are treated like their non-extra counterparts, i.e., the values are added to the `header-includes` metadata list.
2020-06-29	Org reader: allow multiple #+SUBTITLE export settings	Albert Krewinkel	1	-0/+1
	The values of all lines are read as inlines and collected in the `subtitle` metadata field.
2020-06-29	Clean up T.P.R.Metadata	Nikolay Yakimov	2	-41/+25

2020-06-29	Handle errors in yamlToMeta	Nikolay Yakimov	2	-11/+9

2020-06-29	Unify defaults and markdown metadata parsers	Nikolay Yakimov	3	-29/+35

2020-06-28	Remove obsolete RelaxedPolyRec extension (#6487)	Nikolay Yakimov	5	-7/+0

2020-06-28	PDF: all verbose output now goes to stderr, not stdout.	John MacFarlane	1	-21/+21
	Closes #6483.
2020-06-28	JATS reader: parse abstract element into metadata field of same name (#6482)	Albert Krewinkel	1	-0/+9
	Closes: #6480
2020-06-28	Org reader: read `#+INSTITUTE` values as text with markup	Albert Krewinkel	1	-7/+13
	The value is stored in the `institute` metadata field and used in the default beamer presentation template.
2020-06-28	Org reader: update behavior of author, keywords export settings	Albert Krewinkel	1	-19/+9
	The behavior of the `#+AUTHOR` and `#+KEYWORD` export settings has changed: Org now allows multiple such lines and adds a space between the contents of each line. Pandoc now always parses these settings as meta inlines; setting values are no longer treated as comma-separated lists. Note that a Lua filter can be used to restore the previous behavior.
2020-06-28	Org reader: refactor export setting handling	Albert Krewinkel	1	-79/+67

2020-06-27	Org reader: read description lines as inlines	Albert Krewinkel	1	-10/+46
	`#+DESCRIPTION` lines are treated as text with markup. If multiple such lines are given, then all lines are read and separated by soft linebreaks. Closes: #6485
2020-06-25	Org reader: honor tex export option	Albert Krewinkel	4	-30/+75
	The `tex` export option can be set with `#+OPTION: tex:nil` and allows three settings: - `t` causes LaTeX fragments to be parsed as TeX or added as raw TeX, - `nil` removes all LaTeX fragments from the document, and - `verbatim` treats LaTeX as text. The default is `t`. Closes: #4070
2020-06-23	Remove redundant pattern match in pptx writer.	John MacFarlane	1	-3/+0

2020-06-23	LaTeX reader: Retain the Div around tables with attributes.	John MacFarlane	1	-1/+8
	We'll need this to store table attributes until all writers are adjusted to react to attributes on the Table element.
2020-06-23	Markdown reader: Don't require blank line after grid table.	John MacFarlane	1	-2/+2
	This fixes #6481, allowing grid tables to be enclosed in fenced divs with no intervening blank lines.
2020-06-22	Handle native Underline in Powerpoint writer.	John MacFarlane	1	-1/+1
	(Instead of old Span with underline class. Spans with `underline` will no longer be rendered as underlined text.)
2020-06-22	Use native Underline instead of Span in Jira	John MacFarlane	2	-5/+2

2020-06-22	Use --enable-local-file-access in invoking wkhtmltopdf.	John MacFarlane	1	-1/+2
	wkhtmltopdf changed in recent versions to require this for access to local files. This fixes PDF via HTML5 with `--css`. Closes #6474.
2020-06-20	Recognize images with uppercase extensions	Albert Krewinkel	1	-1/+2
	Fixes: #6472
2020-06-17	LaTeX writer: escape `^` specially for listings.	John MacFarlane	1	-1/+1
	Closes #6460.
2020-06-17	RST reader: pass arbitrary attributes through in code blocks.	John MacFarlane	1	-12/+12
	Exceptions: name (which becomes the id), class (which becomes the classes), and number-lines (which is treated specially to fit with pandoc highlighting). Closes #6465.
2020-06-17	Fix MIME type for TrueType fonts in EPUBs (#6464)	Michael Reed	1	-1/+1
	Per the EPUB 3.2 spec, "application/x-font-truetype" is no longer a valid identifier for TrueType (.ttf) fonts [1]. This fixes warnings when validating pandoc-generated EPUBs using `epubcheck` [2]. References [3]. [1]: https://www.w3.org/publishing/epub3/epub-spec.html#sec-core-media-types [2]: https://github.com/w3c/epubcheck
2020-06-14	Docbook reader: implement <procedure> (#6442)	Mathieu Boespflug	1	-4/+6
	A `<procedure>` contains a sequence of `<step>`'s, or `<substeps>` that themselves contain `<step>`'s.
2020-06-14	Docbook reader: implement <phrase> (#6438)	Mathieu Boespflug	1	-1/+7
	A `<phrase>` has no semantic meaning. It is only useful to hang an `id` or other attributes around a piece of text.
2020-06-14	Docbook reader: treat envar and systemitem like code (#6435)	Mathieu Boespflug	1	-2/+4

2020-06-14	Docbook: implement <replaceable> (#6437)	Mathieu Boespflug	1	-1/+3
	A `<replaceable>` is a placeholder that a user is instructed to replace with a value of their own, like `<replaceable>prefix</replacable>/bin/foo`. In the standard Docbook toolchain, this typically appears emphasized, and no other adornement. But a `<replaceable>` is nearly always in a code element, where emphasis won't work. So we do the same thing as for `<optional>`: decorate the content with brackets.
2020-06-14	Docbook: map <simplesect> to unnumbered section (#6436)	Mathieu Boespflug	1	-15/+19
	A <simplesect> is a section like any other, except that it never contains an subsection, and is typically rendered unnumbered.
2020-06-14	Distinguish between single and double quotes when using enquote package (#6457)	dbecher-ito	1	-1/+3