pandoc - Conversion between markup formats

Age	Commit message	Author	Files	Lines
2021-08-11	LaTeX reader: improve handling of plain TeX macro primitives.	John MacFarlane	2	-6/+29
	- Fixed semantics for `\let`. - Implement `\edef`, `\gdef`, and `\xdef`. - Add comment noting that currently `\def` and `\edef` set global macros (so are equivalent to `\gdef` and `\xdef`). This should be fixed by scoping macro definitions to groups, in a future commit. Closes #7474.
2021-08-10	HTML reader: treat comments as blank when parsing.	John MacFarlane	1	-5/+7
	This modifies pBlank. Previously comments could sometimes flummox the parser. Closes #7482.
2021-08-10	Fix RTF table parsing bug that created undesired nested tables.	John MacFarlane	1	-1/+1
	Closes #7488.
2021-08-10	Add RTF reader.	John MacFarlane	2	-0/+1336
	- `rtf` is now supported as an input format as well as output. - New module Text.Pandoc.Readers.RTF (exporting `readRTF`). [API change] Closes #3982.
2021-08-08	Allow `--slide-level=0`.	John MacFarlane	1	-2/+2
	When the slide level is set to 0, headings won't be used at all in splitting the document into slides. Horizontal rules must be used to separate slides. Closes #7476.
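As a rough illustration of the behaviour described in this commit (not pandoc's actual Haskell implementation), the following Python sketch splits a flat list of blocks into slides at horizontal rules only, so headings never start a new slide. The `split_slides` function, the tuple-based block representation, and the choice to drop empty slides are all assumptions made for the example.

```python
def split_slides(blocks):
    """Split blocks into slides at horizontal rules; headings do not split."""
    slides = [[]]
    for kind, text in blocks:
        if kind == "HorizontalRule":
            # A rule starts a new slide and is not itself kept.
            slides.append([])
        else:
            slides[-1].append((kind, text))
    # Assumption for this sketch: empty slides are discarded.
    return [slide for slide in slides if slide]


blocks = [
    ("Header", "Intro"),
    ("Para", "First slide text."),
    ("HorizontalRule", ""),
    ("Header", "Details"),
    ("Para", "Second slide text."),
]
slides = split_slides(blocks)
```

Here the two headings end up on separate slides only because a horizontal rule sits between them.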
2021-08-04	RTF writer: emit \outlinelevel for section headings.	John MacFarlane	1	-1/+2

2021-08-03	Stop using the HTTP package. (#7456)	mt_caret	6	-11/+29
	We only depend on the urlEncode function in the package, which is also provided by http-types. The HTTP package also depends on the network package, which has difficulty building on ghcjs. Add internal module Text.Pandoc.Network.HTTP, exporting `urlEncode`.
2021-08-03	LaTeX table writer: Increase column width precision (#7466)	Peter Fabinski	1	-1/+1
	In some cases, the rounding performed by the LaTeX table writer would introduce visible overrun outside the text area. This adds two more decimal places to the width values.
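To see why coarse rounding can push a table past the text width, the Python sketch below sums column widths (as fractions of the line width) after rounding. The commit only says that two more decimal places were added; the specific precisions used here (2 and 4 places) and the `total_width` helper are assumptions for illustration.

```python
def total_width(fractions, places):
    """Sum column widths after rounding each one to `places` decimals."""
    return sum(round(f, places) for f in fractions)


columns = [1 / 6] * 6             # six equal columns filling the line
coarse = total_width(columns, 2)  # each width rounds up to 0.17
fine = total_width(columns, 4)    # each width rounds up to 0.1667
```

With two decimals the six columns add up to roughly 2% more than the line width; with four decimals the excess shrinks to roughly 0.02%.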
2021-08-01	RTF writer: omit `\bin` in `\pict`.	John MacFarlane	1	-1/+1
	According to the spec, this is not needed or wanted when the data is in hexadecimal format, as it is here.
2021-07-29	parseFromString: preserve at least the source directory.	John MacFarlane	1	-1/+1
	Previously we just set the source name to "chunk" when parsing from strings, to avoid misleading source positions. This had the side effect that `rebase_relative_paths` would break inside sections that were parsed as strings. So, now we use "ORIGINAL_SOURCE_PATH_chunk" instead of just "chunk". Closes #7464.
2021-07-22	LaTeX writer: Use ulem for underline.	John MacFarlane	1	-1/+3
	ulem is conditionally included already when the `strikeout` variable is set, so we set this when there is underlined text, and use `\uline` instead of `\underline`. This fixes wrapping for underlined text. Closes #7351.
2021-07-22	MIME: use image/x-xcf instead of application/x-xcf.	John MacFarlane	1	-1/+1
	Closes #7454.
2021-07-17	LaTeX reader: avoid trailing hyphen in translating languages.	John MacFarlane	1	-2/+2
	Previously `\foreignlanguage{english}` turned into `<span lang="en-">`. The same issue affected Arabic. Closes #7447.
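The trailing hyphen is the kind of result you get when a language code and an empty region are joined unconditionally. A minimal Python sketch of the safer approach, assuming a hypothetical `lang_tag` helper (pandoc's own code is Haskell and may be structured differently):

```python
def lang_tag(language, region=""):
    """Join the non-empty parts of a language tag with hyphens."""
    parts = [part for part in (language, region) if part]
    return "-".join(parts)


naive = "en" + "-" + ""         # unconditional join leaves a trailing hyphen
plain = lang_tag("en")          # no region, so no hyphen is added
british = lang_tag("en", "GB")  # region present, joined with a hyphen
```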
2021-07-16	DocBook reader: handle images with imageobjectco elements.	John MacFarlane	1	-3/+3
	Closes #7440.
2021-07-16	LaTeX reader: Support `\cline` in LaTeX tables.	John MacFarlane	1	-0/+1
	Closes #7442.
2021-07-16	PDF: Fix svgIn path error.	John MacFarlane	1	-1/+1
	We were duplicating the temp directory; this didn't show up on macOS or Linux because there we use absolute paths for the temp directory. Closes #7431.
2021-07-11	DocBook reader: add support for citerefentry (#7437)	Jan Tojnar	1	-1/+5
	citerefentry was originally intended for referring to UNIX manual pages, either part of the same DocBook document as a refentry element, or external (hence the manvolnum element). These days, refentry is more general; for example, the element documentation pages linked below are each a refentry. As per the Processing expectations section of citerefentry, the element is supposed to be a hyperlink to a refentry (when in the same document), but pandoc does not support the refentry tag at the moment, so that is moot. https://tdg.docbook.org/tdg/5.1/citerefentry.html https://tdg.docbook.org/tdg/5.1/manvolnum.html https://tdg.docbook.org/tdg/5.1/refentry.html This roughly corresponds to a `manpage` role in rST syntax, which produces a `Code` AST node with attributes `.interpreted-text role=manpage`, but that does not fit the DocBook parser. https://www.sphinx-doc.org/en/master/usage/restructuredtext/roles.html#role-manpage
2021-07-11	Improved parsing of raw LaTeX from Text streams (rawLaTeXParser).	John MacFarlane	2	-11/+37
	We now use source positions from the token stream to tell us how much of the text stream to consume. Getting this to work required a few other changes to make token source positions accurate. Closes #7434.
2021-07-09	Always use / when adding directory to image path with extractMedia.	John MacFarlane	1	-1/+1
	Even on Windows. May help with #7431.
2021-07-09	RST reader: fix regression with code includes.	John MacFarlane	1	-1/+5
	With the recent changes to include infrastructure, included code blocks were getting an extra newline. Closes #7436. Added regression test.
2021-07-07	Don't incorporate externally linked images in EPUB documents (#7430)	Michael Hoffmann	1	-1/+2
	Just like it is possible to avoid incorporating an image in EPUB by passing `data-external="1"` to a raw HTML snippet, this makes the same possible for native Images, by looking for an associated `external` attribute.
2021-07-06	Recognize data-external when reading HTML img tags (#7429)	Michael Hoffmann	1	-8/+3
	Preserve all attributes in img tags. If attributes have a `data-` prefix, it will be stripped. In particular, this preserves a `data-external` attribute as an `external` attribute in the pandoc AST.
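The attribute handling described here can be sketched in Python as follows. This is an illustration only: `strip_data_prefix` and the list-of-pairs attribute representation are hypothetical stand-ins, not pandoc's Haskell API.

```python
def strip_data_prefix(attrs):
    """Keep every attribute, dropping a leading 'data-' from its name."""
    prefix = "data-"
    return [
        (name[len(prefix):] if name.startswith(prefix) else name, value)
        for name, value in attrs
    ]


img_attrs = [("width", "300"), ("data-external", "1")]
cleaned = strip_data_prefix(img_attrs)
```

After this step `data-external="1"` is carried as an `external` attribute, which is the name the EPUB change of 2021-07-07 looks for.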
2021-07-06	T.P.PDF, convertImage: normalize paths.	John MacFarlane	1	-3/+3
	This will avoid paths on Windows with mixed path separators, which may cause problems with SVG conversion. See #7431.
2021-07-06	Markdown reader: don't try to read contents in self-closing HTML tag.	John MacFarlane	1	-1/+4
	Previously we had problems parsing raw HTML with self-closing tags like `<col/>`. The problem was that pandoc would look for a closing tag to close the markdown contents, but the closing tag had, in effect, already been parsed by `htmlTag`. This fixes the issue described in <https://groups.google.com/d/msgid/pandoc-discuss/297bc662-7841-4423-bcbb-534e99bbba09n%40googlegroups.com>.
2021-07-06	HTML reader: add col, colgroup to 'closes' definitions	John MacFarlane	1	-1/+3

2021-07-05	Add command test for #7394.	John MacFarlane	1	-0/+1
	And fix a small bug in handling of citations in notes, which led to commas at the end of sentences in some cases.
2021-07-05	Citeproc: cleanup and efficiency improvement in deNote.	John MacFarlane	1	-15/+21

2021-07-05	Revamp note citation handling.	John MacFarlane	1	-14/+30
	Use latest citeproc, which uses a Span with a class rather than a Note for notes. This helps us distinguish between user notes and citation notes. Don't put citations at the beginning of a note in parentheses. (Closes #7394.)
2021-07-02	HTML5 writer: remove aria-hidden when explicit alt text is provided.	Aner Lucero	1	-4/+7

2021-06-29	Docx writer: Add table numbering for captioned tables.	John MacFarlane	2	-3/+30
	The numbers are added using fields, so that Word can create a list of tables that will update automatically.
2021-06-29	Docx writer: Fixed a couple bugs in Figure numbering.	John MacFarlane	1	-4/+3

2021-06-29	Docx writer: support figure numbers.	John MacFarlane	2	-3/+21
	These are set up in such a way that they will work with Word's automatic table of figures. Closes #7392.
2021-06-29	Remove duplicated alt text in HTML output.	Aner Lucero	1	-2/+3

2021-06-28	Improve punctuation moving with `--citeproc`.	John MacFarlane	1	-14/+15
	Previously, using `--citeproc` could cause punctuation to move in quotes even when there are no citations. This has been changed; now, punctuation moving is limited to citations. In addition, we only move footnotes around punctuation if the style is a note style, even if `notes-after-punctuation` is `true`.
2021-06-28	Allow `$` characters in bibtex keys.	John MacFarlane	1	-1/+1
	Closes #7409.
2021-06-28	Text.Pandoc.Error: fix line calculations in reporting parsec errors.	John MacFarlane	1	-3/+3
	Also remove a spurious initial newline in the error report.
2021-06-28	Set proper initial source name in parsing BibTeX.	John MacFarlane	1	-1/+3
	(For better error messages.)
2021-06-28	Markdown writer: put space between Plain and following fenced Div.	John MacFarlane	1	-0/+3
	Closes #4465.
2021-06-23	ImageSize: Add Tiff constructor for ImageType.	John MacFarlane	3	-1/+7
	[Minor API change] This allows pandoc to get size information from tiff images. Closes #7405.
2021-06-23	reveal.js writer: Go back to setting boolean values for variables.	John MacFarlane	1	-30/+26
	In a previous commit we used strings because boolean False wouldn't render as `false`. This is changed in the dev version of doctemplates, so we can go back to the more straightforward approach.
2021-06-22	Fix regression with comment-only YAML metadata blocks.	John MacFarlane	1	-0/+3
	Closes #7400.
2021-06-22	Fix unneeded import	John MacFarlane	1	-1/+1

2021-06-21	LaTeX writer: add strut at end of minipage if it contains...	John MacFarlane	1	-2/+5
	line breaks. Without them, the last line is shorter than it should be, at least in some cases.
2021-06-21	Revert "LaTeX writer: put a strut after a line break (`\\`)."	John MacFarlane	1	-1/+1
	This reverts commit e2a7ecb5f73b12c8141ebf873a494652fc53babd.
2021-06-21	LaTeX writer: put a strut after a line break (`\\`).	John MacFarlane	1	-1/+1
	This ensures that we have proper spacing before the next line (which might e.g. be a table bottom border). This gives better results in cases like test/command/7272.md.
2021-06-21	Improve emailAddress in Text.Pandoc.Parsing.	John MacFarlane	2	-5/+24
	Previously the parser would accept characters that are illegal in domain names, and this sometimes caused it to gobble bits of the following text. Closes #7398. Note that this change, by itself, caused some txt2tags reader tests to fail. txt2tags allows bare email addresses with a following form query. So, in addition to the change to emailAddress, we modify the txt2tags parser so it can still handle these cases.
2021-06-21	LaTeX writer: always use a minipage for cells with line breaks...	John MacFarlane	1	-2/+7
	if width information is available. Otherwise the way we treat them can lead to content that overflows a cell. Closes #7393.
2021-06-21	LaTeX writer: Use `\strut` instead of `~` before `\\` in empty line.	John MacFarlane	1	-1/+1

2021-06-21	reveal.js writer: better handling of options.	John MacFarlane	1	-0/+50
	Previously it was impossible to specify false values for options that default to true; setting the option to false just caused the portion of the template setting the option to be omitted. Now we prepopulate all the variables with their default values, including them unconditionally and allowing them to be overridden.
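The "prepopulate defaults, then let the user override" approach can be sketched in Python as below. The option names and default values are illustrative assumptions, not the actual list used by the writer, and `effective_options` is a hypothetical helper.

```python
# Illustrative defaults only; the real writer covers many more options.
DEFAULTS = {"controls": True, "progress": True, "slideNumber": False}


def effective_options(user_options):
    """Start from every default, then apply the user's explicit settings."""
    merged = dict(DEFAULTS)
    merged.update(user_options)
    return merged


opts = effective_options({"controls": False})
```

Because every option is always present in the merged result, an explicit `False` is emitted to the template rather than causing the setting to be omitted.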
2021-06-21	Markdown writer: Fix regression in code blocks with attributes.	John MacFarlane	1	-3/+3
	Code blocks with a single class but nonempty attributes were having their attributes dropped as a result of #7242. Closes #7397.