pandoc - Conversion between markup formats

Age	Commit message	Author	Files	Lines
2019-11-02	LaTeX reader: Don't omit macro definitions defined in the preamble.	John MacFarlane	1	-6/+7
	These were formerly omitted (though they still affected macro resolution if `latex_macros` was set). Now they are included in the document.
2019-11-02	LaTeX reader: parse macro defs as raw latex...	John MacFarlane	1	-8/+13
	when `latex_macros` is disabled. (When `latex_macros` is enabled, we omit them, since pandoc is applying the macros itself.) Previously, it was documented that the macro definitions got passed through as raw latex regardless of whether `latex_macros` was set -- but in fact they never got passed through.
2019-11-02	LaTeX reader: fixed a hang/memory leak in certain circumstances.	John MacFarlane	1	-3/+3
	We were using `grouped blocks` instead of `grouped block`. This caused the reader to hang in an infinite loop (with a memory leak) on e.g. `\parbox{1em}{#1}`. Closes #5845.
2019-10-30	docbook reader: fix nesting of chapters and sections (#5864)	Florian Klink	1	-1/+1
	* Set dbBook to true when traversing a chapter too.
	  Currently, a `<title/>` in a chapter and in a `<section/>` below that chapter have the same level if they're not inside a `<book/>`. This can happen in a multi-file book project. Also see the example at https://tdg.docbook.org/tdg/4.5/chapter.html
	  Co-authored-by: Félix Baylac-Jacqué <felix@alternativebit.fr>
	* Add docbook-chapter test
	  This tests nested `<section/>` and makes sure `<title/>` in the first `<section/>` below `<chapter/>` is one level deeper than the `<chapter/>`'s `<title/>`, also when not inside a `<book/>`.
	  Co-authored-by: Félix Baylac-Jacqué <felix@alternativebit.fr>
2019-10-27	Org reader: fix parsing of empty comment lines	Albert Krewinkel	1	-1/+3
	Comment lines in Org-mode can be completely empty; both of these lines should produce no output:
	    # a comment
	    #
	The reader used to produce a wrong result for the latter, but now ignores that line as well. Fixes: #5856
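The rule in this commit can be illustrated with a small Python sketch. This is not the Org reader's Haskell code: the function name `is_org_comment_line` is hypothetical, and the regular expression encodes the assumption that a comment line is a `#` followed either by end of line or by whitespace.

```python
import re

# Assumption: a comment line is optional indentation, then "#", then
# either nothing at all or a whitespace character and arbitrary text.
_COMMENT_RE = re.compile(r"^\s*#(\s.*)?$")


def is_org_comment_line(line: str) -> bool:
    """Return True if the line would be dropped as a comment."""
    return _COMMENT_RE.match(line) is not None
```

Under these assumptions both `# a comment` and a bare `#` count as comments, while a keyword line such as `#+BEGIN_SRC` does not, because the character after `#` is not whitespace.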
2019-10-24	HTML reader/writer: Better handling of <q> with cite attribute (#5837)	Ole Martin Ruud	1	-23/+34
	* HTML reader: Handle cite attribute for quotes. If a `<q>` tag has a `cite` attribute, we interpret it as a Quoted element with an inner Span. Closes #5798
	* Refactor url canonicalization into a helper function
	* Modify HTML writer to handle quote with cite.
	[0]: https://developer.mozilla.org/en-US/docs/Web/HTML/Element/q
2019-10-23	T.P.Readers.LaTeX.Parsing: add `[Tok]` parameter to rawLaTeXParser.	John MacFarlane	2	-10/+16
	This allows us to avoid retokenizing multiple times in e.g. rawLaTeXBlock. (Unexported module, so not an API change.)
2019-10-23	Add Reader support for HTML <samp> element (#5843)	Amogh Rathore	1	-0/+9
	The `<samp>` element is parsed as a Span with class `sample`. Closes #5792.
2019-10-15	Add support for reading and writing <kbd> elements	Daniele D'Orazio	1	-1/+9
	* Text.Pandoc.Shared: export `htmlSpanLikeElements` [API change]
	This commit also introduces a mapping of HTML span-like elements that are internally represented as a Span with a single class, but that are converted back to the original element by the HTML writer. As of now, only the kbd element is handled this way. Ideally these elements should be handled as plain AST values, but since that would be a breaking change with a large impact, we revert to this stop-gap solution. Fixes https://github.com/jgm/pandoc/issues/5796.
2019-10-15	Muse reader: do not allow closing asterisks to be followed by "*"	Alexander Krotov	1	-2/+7

2019-10-15	Muse reader: do not split series of asterisks into symbols and emphasis	Alexander Krotov	1	-0/+7
	Fixes #5821
2019-10-15	Muse reader: do not terminate emphasis on "*" not followed by space	Alexander Krotov	1	-2/+1

2019-10-04	hlint FB2 reader	Alexander Krotov	1	-1/+1

2019-10-04	Fix all hlint warnings in Muse reader	Alexander Krotov	1	-2/+2

2019-10-03	Minor ghc 8.8 fixups.	John MacFarlane	1	-2/+6

2019-09-29	RST reader: don't strip final underscore from absolute URI.	John MacFarlane	1	-3/+7
	Partially addresses #5763.
2019-09-28	Use throwError instead of fail when appropriate.	John MacFarlane	1	-2/+3

2019-09-28	Use Prelude.fail to avoid ambiguity with fail from GHC.Base.	John MacFarlane	9	-20/+20

2019-09-24	LaTeX reader: Add 'tikzcd' to list of special environments.	Eigil Rischel	1	-0/+1
	This allows it to be processed by filters, in the same way that one can do for 'tikzpicture'
2019-09-22	RST reader: Fixed parsing of indented blocks.	John MacFarlane	1	-6/+9
	We were requiring consistent indentation, but this isn't required by RST, as long as each nonblank line of the block has some indentation. Closes #5753.
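A minimal Python sketch of the relaxed rule, not the RST reader's actual code: a block continues as long as every nonblank line has some indentation, whatever its width. The name `take_indented_block` is hypothetical, and stripping the smallest common indentation from the result is an assumption of this sketch.

```python
def take_indented_block(lines):
    """Split lines into (block, rest).

    The block is the leading run of lines that are blank or start with
    whitespace. Indentation does not have to be consistent.
    """
    block = []
    rest = []
    for i, line in enumerate(lines):
        if line.strip() and not line[0].isspace():
            rest = lines[i:]  # first unindented nonblank line ends the block
            break
        block.append(line)
    # Trailing blank lines are dropped from the block (and not kept in rest).
    while block and not block[-1].strip():
        block.pop()
    # Assumption: remove the smallest indentation found on a nonblank line.
    indents = [len(l) - len(l.lstrip()) for l in block if l.strip()]
    strip = min(indents) if indents else 0
    return [l[strip:] if l.strip() else "" for l in block], rest
```

For example, lines indented by three, one, and five columns all land in the same block, with one column removed from each.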
2019-09-22	[Docx Writer] Re-use Readers.Docx.Parse for StyleMap (#5766)	Nikolay Yakimov	3	-380/+307
	* [Docx Parser] Move style-parsing-specific code to a new module
	* [Docx Writer] Re-use Readers.Docx.Parse.Styles for StyleMap
	* [Docx Writer] Move Readers.Docx.StyleMap to Writers.Docx.StyleMap
	  It's never used outside of writer code, so it makes more sense to scope it under writers really.
2019-09-22	Use HsYAML-0.2.0.0	John MacFarlane	1	-11/+12
	Most of this is due to @vijayphoenix (#5704), but it needed some revisions to integrate with current master, and to use the released HsYAML. Closes #5704.
2019-09-21	[Docx Reader] Use style names, not ids, for assigning semantic meaning	Nikolay Yakimov	3	-183/+287
	Motivating issues: #5523, #5052, #5074
	* Style name comparisons are case-insensitive, since those are case-insensitive in Word.
	* w:styleId will be used as style name if w:name is missing (this should only happen for malformed docx and is kept as a fallback to avoid failing altogether on malformed documents).
	* Block quote detection code moved from Docx.Parser to Readers.Docx.
	* Code styles, i.e. "Source Code" and "Verbatim Char", now honor style inheritance.
	* Docx Reader now honors "Compact" style (used in Pandoc-generated docx). The side effect is that "Compact" style no longer shows up in docx+styles output. Styles inherited from "Compact" will still show up.
	* Removed obsolete list-item style from divsToKeep. That didn't really do anything for a while now.
	* Add newtypes to differentiate between style names, ids, and different style types (that is, paragraph and character styles).
	* Since docx style names can have spaces in them, and pandoc-markdown classes can't, wherever a style name is used as a class name, spaces are replaced with ASCII dashes `-`.
	* Get rid of extraneous intermediate types carrying styleId information. Instead, styleId is saved with other style data.
	* Use RunStyle for inline style definitions only (lacking styleId and styleName); for Character Styles use the CharStyle type (which is basically RunStyle with styleId and StyleName bolted onto it).
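Three of the naming rules described in this commit can be sketched in Python. The function names are hypothetical and this is not the Docx reader's code; it only mirrors the stated behavior: case-insensitive name comparison, fallback to the style id when the name is missing, and spaces replaced with dashes when a style name becomes a class name.

```python
from typing import Optional


def same_style_name(a: str, b: str) -> bool:
    """Compare two style names without regard to case."""
    return a.casefold() == b.casefold()


def effective_style_name(name: Optional[str], style_id: str) -> str:
    """Use the style id as a fallback when no name is present."""
    return name if name else style_id


def style_class_name(style_name: str) -> str:
    """Turn a style name into a class name by replacing spaces with dashes."""
    return style_name.replace(" ", "-")
```

Whether any further normalization is applied to class names is not stated in the commit message, so the sketch does nothing beyond the space replacement.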
2019-09-21	[Docx Reader] Code clean-up	Nikolay Yakimov	2	-63/+39
	Reduce code duplication, remove redundant brackets, use newtype instead of data where appropriate
2019-09-19	MediaWiki: skip optional {{table}} template.	John MacFarlane	1	-0/+1
	See https://en.wikipedia.org/wiki/Template:Table Closes #5757.
2019-09-09	LaTeX reader: Fix parsing of optional arguments that contain braced text.	John MacFarlane	1	-4/+3
	Closes #5740.
2019-09-08	Org reader: modify handling of example blocks. (#5717)	Brian Leung	2	-14/+43
	* Org reader: allow the `-i` switch to ignore leading spaces.
	* Org reader: handle awkwardly-aligned code blocks within lists. Code blocks in Org lists must have their #+BEGIN_ aligned in a reasonable way, but their other components can be positioned otherwise.
2019-09-05	Roff reader: Better support for 'while'.	John MacFarlane	1	-0/+3

2019-09-05	Roff reader: improve handling of groups.	John MacFarlane	1	-4/+2

2019-09-04	Roff reader: Fix problem parsing comments before macro.	John MacFarlane	1	-2/+0

2019-09-04	Roff reader: more improvements in parsing conditionals.	John MacFarlane	1	-3/+4

2019-09-04	Roff readers: better parsing of groups.	John MacFarlane	1	-9/+5
	We now allow groups where the closing `\\}` isn't at the beginning of a line. Closes #5410.
2019-09-02	LaTeX reader: don't try to parse includes if raw_tex is set.	John MacFarlane	1	-5/+13
	When the `raw_tex` extension is set, we just carry through `\usepackage`, `\input`, etc. verbatim as raw LaTeX. Closes #5673.
2019-09-02	LaTeX reader: properly handle optional arguments for macros.	John MacFarlane	2	-2/+2
	Closes #5682.
2019-08-27	LaTeX reader: fix `\\` in `\parbox` inside a table cell.	John MacFarlane	1	-3/+18
	Closes #5711.
2019-08-27	Markdown reader: Headers: don't parse content over newline boundary.	John MacFarlane	1	-4/+15
	Closes #5714.
2019-08-26	Use parseFromString' in Muse reader.	John MacFarlane	1	-1/+1
	Now that it is polymorphic, this is possible, and it's a better choice because it resets last string pos.
2019-08-26	Fix inline parsing in grid table cells.	John MacFarlane	2	-2/+2
	* T.P.Parsing: Change type of `setLastStrPos` so it takes a `Maybe SourcePos` rather than a `SourcePos`. [API change]
	* T.P.Parsing: Make `parseFromString'` and `gridTableWith` and `gridTableWith'` polymorphic in the parser state, constraining it with `HasLastStrPosition`. [API change]
	Closes #5708.
2019-08-23	RST reader: use title, not admonition-title, for admonition title.	John MacFarlane	1	-1/+1
	This puts RST reader into alignment with docbook reader.
2019-08-23	docbook: richer parse for admonitions (#5593)	Michael Peyton Jones	1	-16/+27
	Fixes #1234. This parses admonitions not as a blockquote, but rather as a div with an appropriate class. We also handle titles for admonitions as a nested div with the "title" class. (I followed the behaviour of other docbook-to-html converters in this - there are clearly other ways you could encode it.) In general, the handling of elements with nested title elements is very inconsistent. I think we should make it consistent, but I'm leaving that for later to make this a small change. Example:
	```docbook
	<warning xml:id="someId">
	  <title>My title</title>
	  <simpara>An admonition block</simpara>
	</warning>
	```
	goes to
	```html
	<div id="someId" class="warning">
	  <div class="title">My title</div>
	  <p>An admonition block</p>
	</div>
	```
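The mapping in this example can be mimicked with a short Python sketch built on the standard library's `xml.etree.ElementTree`. It is an illustration only, not the docbook reader: the function name `admonition_to_div` is hypothetical, the `ADMONITIONS` set is an assumption about which elements are covered, and only `title` and `simpara` children are translated.

```python
import xml.etree.ElementTree as ET

# ElementTree expands the predefined "xml:" prefix to this namespace.
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

# Assumption: the set of elements treated as admonitions.
ADMONITIONS = {"caution", "important", "note", "tip", "warning"}


def admonition_to_div(source: str) -> ET.Element:
    """Convert one admonition element into a div with a matching class."""
    elem = ET.fromstring(source)
    if elem.tag not in ADMONITIONS:
        raise ValueError("not an admonition: " + elem.tag)
    attrs = {}
    if XML_ID in elem.attrib:
        attrs["id"] = elem.attrib[XML_ID]
    attrs["class"] = elem.tag
    div = ET.Element("div", attrs)
    for child in elem:
        if child.tag == "title":
            out = ET.SubElement(div, "div", {"class": "title"})
        elif child.tag == "simpara":
            out = ET.SubElement(div, "p")
        else:
            continue  # other children are out of scope for this sketch
        out.text = child.text
    return div
```

Calling `ET.tostring(admonition_to_div(source), encoding="unicode")` on the `<warning>` example serializes the resulting tree for inspection.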
2019-08-14	LaTeX reader: improve withRaw so it can handle cases where...	John MacFarlane	1	-2/+3
	the token string is modified by a parser (e.g. accent when it only takes part of a Word token). Closes #5686. Still not ideal, because we get the whole `\t0BAR` and not just `\t0` as a raw latex inline command. But I'm willing to let this be an edge case, since you can easily work around this by inserting a space, braces, or raw attribute. The important thing is that we no longer drop the rest of the document after a raw latex inline command that gobbles only part of a Word token!
2019-08-14	Removed some needless lookaheads in Markdown reader.	John MacFarlane	1	-2/+0

2019-08-05	Treat `ly` as verbatim too (#5671)	Urs Liska	1	-0/+1
	According to https://github.com/jgm/pandoc/issues/4725#issuecomment-399772217 not only the `lilypond` environment but also `ly` should be included in the verbatim list. @jperon https://github.com/jperon/lyluatex/issues/203
2019-07-24	LaTeX reader: handle `\passthrough` macro used by latex writer.	John MacFarlane	1	-0/+2
	Closes #5659.
2019-07-22	LaTeX reader: support tex `\tt` command.	John MacFarlane	1	-0/+1
	Closes #5654.
2019-07-22	Org reader: accept ATTR_LATEX in block attributes	Albert Krewinkel	1	-3/+11
	Attributes for LaTeX output are accepted as valid block attributes; however, their values are ignored. Fixes: #5648
2019-07-20	LaTeX reader: search for image with list of extensions...	John MacFarlane	1	-6/+16
	like latex does, if an extension is not provided. Closes #4933.
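A hedged Python sketch of this kind of lookup: when a path has no extension, try a list of candidates in order and take the first file that exists. The name `resolve_image`, the contents of `DEFAULT_EXTENSIONS`, and the fallback of returning the path unchanged are all assumptions of the sketch, not details taken from the LaTeX reader.

```python
import os

# Assumption: an illustrative candidate list; the reader's real list and
# its ordering are not given in the commit message.
DEFAULT_EXTENSIONS = [".pdf", ".png", ".jpg", ".jpeg", ".eps"]


def resolve_image(path, extensions=DEFAULT_EXTENSIONS, exists=os.path.exists):
    """Return the path with the first extension for which a file exists.

    Paths that already carry an extension are returned unchanged, as are
    paths for which no candidate is found.
    """
    if os.path.splitext(path)[1]:
        return path
    for ext in extensions:
        candidate = path + ext
        if exists(candidate):
            return candidate
    return path
```

The `exists` parameter is injectable so the lookup can be exercised without touching the file system.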
2019-07-19	Markdown: Ensure that expanded latex macros end with space if original did.	John MacFarlane	1	-1/+10
	Closes #4442.
2019-07-16	LaTeX reader: handle \looseness command values better.	John MacFarlane	1	-5/+4
	Closes #4439.
2019-07-14	Muse: add RTL support	Alexander Krotov	1	-0/+12
	Closes #5551