pandoc - Conversion between markup formats

Age	Commit message (Collapse)	Author	Files	Lines
2020-02-03	Allow & in LaTeX citation keys.	John MacFarlane	1	-1/+1
	Closes #6110.
2020-02-03	Swap suboptimal uses of maybe and fromMaybe (#6111)	Joseph C. Sible	2	-3/+3
	Anywhere "maybe" is used with "id" as its second argument, using "fromMaybe" instead will simplify the code. Conversely, anywhere "fromMaybe" is used with the result of "fmap" or "<$>" as its second argument, using "maybe" instead will simplify the code.
2020-02-03	Clean up a confusing triple negative (#6102)	Joseph C. Sible	1	-5/+5
	There's currently `unless`, `not`, and `notParaOrPlain` in the same expression, which is a rather confusing triple negative. Replace `notParaOrPlain` with `paraOrPlain` and switch to `all` from `any` to clean this up.
2020-02-01	Text.Pandoc.Readers.CSV - reuse CSV parser from Text.Pandoc.CSV.	John MacFarlane	1	-65/+5

2020-01-31	csv reader: allow empty cells.	John MacFarlane	1	-7/+5

2020-01-31	Add Text.Pandoc.Readers.CSV (readCSV).	John MacFarlane	1	-0/+108
	This adds csv as an input format. The CSV table is converted into a pandoc simple table. Closes #6100.
2020-01-28	Added a try that was needed for the commit fc78be1.	John MacFarlane	1	-1/+1
	The intent of that commit was to parse unknown LaTeX enivronments as verbatim if they can't be parsed normally, avoiding crashes on environments that allow unescaped underscores and the like. But the fix didn't completely work: it worked for raw TeX in markdown but not when reading LaTeX. This change fixes that. See #6034. Closes #6093.
2020-01-10	LaTeX reader: allow beamer overlays for all commands in all raw tex.	John MacFarlane	1	-10/+10
	This affecs parsing of raw tex in LaTeX and in Markdown and other formats. Closes #6043.
2020-01-08	LaTeX reader: improve parsing of raw environments.	John MacFarlane	1	-1/+1
	If parsing fails in a raw environment (e.g. due to special characters like unescaped `_`), try again as a verbatim environment, which is less sensitive to special characters. This allows us to capture special environments that change catcodes as raw tex when `-f latex+raw_tex` is used. Closes #6034.
2019-12-27	Fix parsing bug affected indented code after raw HTML.	John MacFarlane	1	-8/+10
	Closes #6009, #5360.
2019-12-27	Add a needed try.	John MacFarlane	1	-2/+3

2019-12-19	Org reader: report errors properly	Albert Krewinkel	1	-2/+1
	Errors during parsing are now returned in full and no longer replaced by a custom message.
2019-12-19	Org reader: fix parsing problem for colons in headline	Albert Krewinkel	2	-11/+27
	Fixed a problem where words surrounded by colons could causing parse failures in some cases when they occurred in headers. Fixes: #5993
2019-12-18	Org reader: wrap named table in div, using name as id	Albert Krewinkel	1	-12/+10
	Closes: #5984
2019-12-17	Add jira reader (#5913)	Albert Krewinkel	1	-0/+173
	Closes #5556
2019-12-17	HTML reader: Add "nav" to list of block-level tags.	John MacFarlane	1	-1/+2

2019-12-13	Org reader: add table labels to caption if both are present	Albert Krewinkel	1	-3/+11
	The table `#+NAME:` or `#+LABEL:` is added to the table's caption in the form of an empty span with the label set as the span's ID. Closes: #5984
2019-12-05	Avoid deprecation warning for minimumDef using CPP.	John MacFarlane	1	-1/+6

2019-11-24	Add unexported Text.Pandoc.Readers.Metadata.	John MacFarlane	2	-104/+161
	For YAML metadata parsing. A step in the direction of #5914. No API change.
2019-11-21	LaTeX reader: parse \micro siunitx unit command (#5921)	Jose Luis Duran	1	-0/+1
	This was somehow missed in 884aef31c55e375cd62fcb55a71829d005087cae.
2019-11-20	Fix typos (#5919)	Brian Wignall	1	-1/+1

2019-11-18	DokuWiki reader: parse markup inside monospace ('') (#5917)	Alexander Krotov	1	-2/+2
	Fixes #5916
2019-11-15	LaTeX Reader: Add KOMA-Script metadata commands (#5910)	Andrew Dunning	1	-1/+8
	Add all titling commands to existing definition for `\dedication`.
2019-11-14	Markdown reader: use take1WhileP for table row.	John MacFarlane	1	-1/+1

2019-11-14	Markdown reader: Use take1WhileP for str.	John MacFarlane	1	-1/+3
	This yields a small but measurable performance improvement.
2019-11-13	Fix regression introduced by last commit.	John MacFarlane	1	-1/+2

2019-11-13	Markdown reader: don't parse footnote body unless extension enabled.	John MacFarlane	1	-18/+20

2019-11-12	Switch to new pandoc-types and use Text instead of String [API change].	despresc	47	-2307/+2427
	PR #5884. + Use pandoc-types 1.20 and texmath 0.12. + Text is now used instead of String, with a few exceptions. + In the MediaBag module, some of the types using Strings were switched to use FilePath instead (not Text). + In the Parsing module, new parsers `manyChar`, `many1Char`, `manyTillChar`, `many1TillChar`, `many1Till`, `manyUntil`, `mantyUntilChar` have been added: these are like their unsuffixed counterparts but pack some or all of their output. + `glob` in Text.Pandoc.Class still takes String since it seems to be intended as an interface to Glob, which uses strings. It seems to be used only once in the package, in the EPUB writer, so that is not hard to change.
2019-11-11	Markdown reader: fix small super/subscript issue.	John MacFarlane	1	-2/+6
	Superscripts and subscripts cannot contain spaces, but newlines were previously allowed (unintentionally). This led to bad interactions in some cases with footnotes. E.g. ``` foo^[note] bar^[note] ``` With this change newlines are also not allowed inside super/subscripts. Closes #5878.
2019-11-11	Change the implementation of `htmlSpanLikeElements` and implement `<dfn>` ↵	Florian Beeres	1	-4/+11
	(#5882) * Add HTML Reader support for `<dfn>`, parsing this as a Span with class `dfn`. * Change `htmlSpanLikeElements` implementation to retain classes, attributes and inline content.
2019-11-07	DocBook reader: Fix bug with entities in mathphrase element.	John MacFarlane	1	-4/+2
	Closes #5885.
2019-11-07	Change merge behavior for metadata.	John MacFarlane	1	-1/+3
	Previously, if a document contained two YAML metadata blocks that set the same field, the conflict would be resolved in favor of the first. Now it is resolved in favor of the second (due to a change in pandoc-types). This makes the behavior more uniform with other things in pandoc (such as reference links and `--metadata-file`).
2019-11-04	Removed an unnecessary unpack.	John MacFarlane	1	-1/+1

2019-11-04	HTML Reader/Writer - Add support for <var> and <samp> (#5861)	Amogh Rathore	1	-5/+7
	Closes #5799
2019-11-03	Docx reader: Only use LTR when it is overriding BiDi setting	Jesse Rosenthal	3	-2/+14
	The left-to-right direction setting in docx is used in the spec only for overriding an explicit right-to-left setting. We only process it when it happens in a paragraph set with BiDi. This is especially important for docs exported from Google Docs, which explicitly (and unnecessarily) set "rtl=0" for every paragraph. Closes: #5723
2019-11-03	Docx reader: fix list number resumption for sublists. Closes #4324.	John MacFarlane	1	-1/+8
	The first list item of a sublist should not resume numbering from the number of the last sublist item of the same level, if that sublist was a sublist of a different list item. That is, we should not get: ``` 1. one 1. sub one 2. sub two 2. two 3. sub one ```
2019-11-02	RST reader: avoid spurious warning...	John MacFarlane	1	-1/+1
	when resolving links to internal anchors ending with `_`. Closes #5763.
2019-11-02	LaTeX reader: Fixed dollar-math parsing...	John MacFarlane	1	-9/+9
	...to ensure that space is left between a control seq and a following word that would otherwise change its meaning. Closes #5836.
2019-11-02	LaTeX untokenize: Ensure space between control sequence and following letter.	John MacFarlane	2	-2/+15
	Closes #5836.
2019-11-02	LaTeX reader: Don't omit macro definitions defined in the preamble.	John MacFarlane	1	-6/+7
	These were formerly omitted (though they still affected macro resolution if `latex_macros` was set). Now they are included in the document.
2019-11-02	LaTeX reader: parse macro defs as raw latex...	John MacFarlane	1	-8/+13
	when `latex_macros` is disabled. (When `latex_macros` is enabled, we omit them, since pandoc is applying the macros itself.) Previously, it was documented that the macro definitions got passed through as raw latex regardless of whether `latex_macros` was set -- but in fact they never got passed through.
2019-11-02	LaTeX reader: fixed a hang/memory leak in certain circumstances.	John MacFarlane	1	-3/+3
	We were using `grouped blocks` instead of `grouped block`. This caused the reader to hang in an infinite loop (with a memory leak) on e.g. `\parbox{1em}{#1}`. Closes #5845.
2019-10-30	docbook reader: fix nesting of chapters and sections (#5864)	Florian Klink	1	-1/+1
	* Set dbBook to true when traversing a chapter too. Currently, a `<title/>` in a chapter and in a `<section/>` below that chapter have the same level if they're not inside a `<book/>`. This can happen in a multi-file book project. Also see the example at https://tdg.docbook.org/tdg/4.5/chapter.html Co-authored-by: Félix Baylac-Jacqué <felix@alternativebit.fr> * Add docbook-chapter test This tests nested `<section/>` and makes sure `<title/>` in the first `<section/>` below `<chapter/>` is one level deeper than the `<chapter/>`'s `<title/>`, also when not inside a `<book/>`. Co-authored-by: Félix Baylac-Jacqué <felix@alternativebit.fr>
2019-10-27	Org reader: fix parsing of empty comment lines	Albert Krewinkel	1	-1/+3
	Comment lines in Org-mode can be completely empty; both of these line should produce no output: # a comment # The reader used to produce a wrong result for the latter, but ignores that line as well now. Fixes: #5856
2019-10-24	HTML reader/writer: Better handling of <q> with cite attribute (#5837)	Ole Martin Ruud	1	-23/+34
	* HTML reader: Handle cite attribute for quotes. If a `<q>` tag has a `cite` attribute, we interpret it as a Quoted element with an inner Span. Closes #5798 * Refactor url canonicalization into a helper function * Modify HTML writer to handle quote with cite. [0]: https://developer.mozilla.org/en-US/docs/Web/HTML/Element/q
2019-10-23	T.P.Readers.LaTeX.Parsing: add `[Tok]` parameter to rawLaTeXParser.	John MacFarlane	2	-10/+16
	This allows us to avoid retokenizing multiple times in e.g. rawLaTeXBlock. (Unexported module, so not an API change.)
2019-10-23	Add Reader support for HTML <samp> element (#5843)	Amogh Rathore	1	-0/+9
	The `<samp>` element is parsed as a Span with class `sample`. Closes #5792.
2019-10-15	Add support for reading and writing <kbd> elements	Daniele D'Orazio	1	-1/+9
	* Text.Pandoc.Shared: export `htmlSpanLikeElements` [API change] This commit also introduces a mapping of HTML span like elements that are internally represented as a Span with a single class, but that are converted back to the original element by the html writer. As of now, only the kbd element is handled this way. Ideally these elements should be handled as plain AST values, but since that would be a breaking change with a large impact, we revert to this stop-gap solution. Fixes https://github.com/jgm/pandoc/issues/5796.
2019-10-15	Muse reader: do not allow closing asterisks to be followed by "*"	Alexander Krotov	1	-2/+7

2019-10-15	Muse reader: do not split series of asterisks into symbols and emphasis	Alexander Krotov	1	-0/+7
	Fixes #5821