pandoc - Conversion between markup formats

Age	Commit message	Author	Files	Lines
2019-02-07	Docx reader: Fix windows error	Jesse Rosenthal	1	-1/+2
	Try fixing a parsing error on windows by insisting that the parser use a Posix filepath library for splitting doc paths in a zipfile. (It might default on Windows to using a backslash as a separator, while it's always a forward-slash in zip archives.)
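A minimal sketch of the separator issue described above, written in Python for illustration rather than in pandoc's Haskell. It uses the standard `posixpath` and `ntpath` modules to show how a platform-specific path library can disagree with the forward-slash names stored in a zip archive. The member names are hypothetical examples.

```python
import ntpath
import posixpath

# Zip archive member names use forward slashes, whatever the host OS.
member = "word/_rels/document.xml.rels"

# POSIX rules split on "/" only, which matches the archive convention.
posix_dir, posix_name = posixpath.split(member)

# Joining with Windows rules introduces a backslash, so the result no
# longer matches any member name in the archive.
posix_joined = posixpath.join("word", "document.xml")
nt_joined = ntpath.join("word", "document.xml")

print(posix_dir, posix_name)
print(posix_joined)
print(nt_joined)
```

Pinning the path library to POSIX rules keeps lookups consistent with the archive regardless of the platform the parser runs on.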
2019-02-07	Docx reader: Some code cleanup	Jesse Rosenthal	1	-15/+25
	* Clarify function name. We had previously used `getDocumentPath`, but `Document` is an overdetermined term here. Use `getDocumentXmlPath` to make clear what we're doing.
	* Use field notation for setting ReaderEnv. As we've added (and continue to add) fields, the assignment by position has gotten harder to read.
	* Figure out document.xml path once at the beginning of parsing, and add it to the environment, so we can avoid repeated lookups.
2019-02-07	Docx reader: Extend dynamic xml location to detecting relationships	Jesse Rosenthal	1	-12/+19
	Getting the location used to depend on a hard-coded .rels file based on "word/document.xml". We now dynamically detect that file based on the document.xml file specified in "_rels/.rels"
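A rough sketch of deriving the relationships file from the main document path, in Python for illustration. It assumes the usual package layout in which the relationships for `dir/name` live at `dir/_rels/name.rels`; the function name `rels_path_for` is hypothetical and is not taken from pandoc.

```python
import posixpath


def rels_path_for(document_path):
    """Return the assumed .rels location for a document part in the archive."""
    # Split with POSIX rules, since zip member names use forward slashes.
    directory, name = posixpath.split(document_path)
    return posixpath.join(directory, "_rels", name + ".rels")


print(rels_path_for("word/document.xml"))
print(rels_path_for("word/document2.xml"))
```

Computing the location from the detected document path means a differently named main document still gets its own relationships file.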
2019-02-06	Docx reader: Dynamically determine document.xml path.	Jesse Rosenthal	1	-3/+12
	The desktop Word program places the main document file in "word/document.xml", but Word Online places it in "word/document2.xml". This file path is actually stated in the root "_rels/.rels" file, in the "Relationship" element with an "http://../officedocument" type. Closes #5277.
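The lookup described above might be sketched as follows, in Python for illustration rather than pandoc's Haskell. The sample XML, its placeholder `example.invalid` URLs, and the function names are assumptions made for this sketch. Only the trailing `officeDocument` segment of the relationship type is matched, since the log does not give the full URL.

```python
import xml.etree.ElementTree as ET

# Hypothetical "_rels/.rels" content. The namespace and Type URLs are
# placeholders, not the real schema URLs.
SAMPLE_RELS = """<Relationships xmlns="http://example.invalid/package/relationships">
  <Relationship Id="rId2" Type="http://example.invalid/relationships/extended-properties" Target="docProps/app.xml"/>
  <Relationship Id="rId1" Type="http://example.invalid/relationships/officeDocument" Target="word/document2.xml"/>
</Relationships>"""


def local_name(tag):
    """Strip an ElementTree '{namespace}' prefix from a tag name."""
    return tag.rsplit("}", 1)[-1]


def find_document_xml_path(rels_xml, default="word/document.xml"):
    """Return the Target of the officeDocument relationship, or a default."""
    root = ET.fromstring(rels_xml)
    for elem in root.iter():
        if local_name(elem.tag) != "Relationship":
            continue
        rel_type = elem.get("Type", "")
        if rel_type.lower().endswith("/officedocument"):
            # Assumption: a Target may carry a leading slash, which zip
            # member names do not have.
            return elem.get("Target", default).lstrip("/")
    return default


print(find_document_xml_path(SAMPLE_RELS))
```

Falling back to "word/document.xml" when no matching relationship exists is a choice made for this sketch, based on the desktop Word location mentioned above.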
2019-02-06	Handle Word files generated by Microsoft Word Online.	John MacFarlane	1	-0/+2
	For some reason, Word in Office 365 Online uses `document2.xml` for the content, instead of `document.xml`. This causes pandoc not to be able to parse docx. This quick fix has the parser check for both `document.xml` and `document2.xml`. Addresses #5277, but a more robust solution would be to get the name of the main document dynamically (who knows whether it might change again?).
2019-02-04	Add missing copyright notices and remove license boilerplate (#5112)	Albert Krewinkel	38	-70/+107
	Quite a few modules were missing copyright notices. This commit adds copyright notices everywhere via haddock module headers. The old license boilerplate comment is redundant with this and has been removed. Update copyright years to 2019. Closes #4592.
2019-02-04	Markdown reader: add newline when parsing blocks in YAML.	John MacFarlane	1	-9/+10
	Otherwise last block gets parsed as a Plain rather than a Para. This is a regression in pandoc 2.x. This patch restores pandoc 1.19 behavior. Closes #5271.
2019-02-02	ipynb reader: handle images referring to attachments.	John MacFarlane	1	-1/+9
	Previously we didn't strip off the attachment: prefix, so even though the attachment is available in the mediabag, pandoc couldn't find it.
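A small sketch of the prefix handling described above, in Python for illustration. The `media_bag` dictionary and the `media_key` helper are hypothetical stand-ins for the media bag lookup, not pandoc's actual API.

```python
ATTACHMENT_PREFIX = "attachment:"


def media_key(image_src):
    """Drop a leading 'attachment:' so the name matches the stored key."""
    if image_src.startswith(ATTACHMENT_PREFIX):
        return image_src[len(ATTACHMENT_PREFIX):]
    return image_src


# Hypothetical media bag keyed by bare attachment name.
media_bag = {"plot.png": b"placeholder bytes"}

src = "attachment:plot.png"
print(src in media_bag)             # lookup with the prefix still attached
print(media_key(src) in media_bag)  # lookup after stripping the prefix
```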
2019-01-31	LaTeX reader: don't let `\egroup` match `{`.	John MacFarlane	1	-3/+3
	`braced` now actually requires nested braces. Otherwise some legitimate command and environment definitions can break (see test/command/tex-group.md).
2019-01-30	Org reader: add support for #+SELECT_TAGS.	leungbk	4	-23/+78

2019-01-30	Org reader: separate filtering logic from conversion function.	leungbk	2	-8/+11

2019-01-25	MediaWiki reader: use `_` instead of `-` in auto-identifiers.	John MacFarlane	1	-1/+6
	Partially addresses #4731. We may still not be exactly matching mediawiki's algorithm for identifiers.
2019-01-24	Ipynb: Put all jupyter metadata under 'jupyter' key.	John MacFarlane	1	-1/+1

2019-01-24	Revert "Prepend `jupyter_` to jupyter metadata keys."	John MacFarlane	1	-6/+0
	This reverts commit 5eaff399d5d6dc30b0d453eff42c4101674d75ab.
2019-01-24	Prepend `jupyter_` to jupyter metadata keys.	John MacFarlane	1	-0/+6
	This avoids conflicts with things like 'toc'.
2019-01-22	Support ipynb (Jupyter notebook) as input and output format.	John MacFarlane	1	-0/+249
	[API change]
	* Depend on ipynb library.
	* Add `ipynb` as input and output format.
	* Added Text.Pandoc.Readers.Ipynb (supports both nbformat v3 and v4).
	* Added Text.Pandoc.Writers.Ipynb (supports nbformat v4).
	* Added ipynb readers and writers to T.P.Readers, T.P.Writers, and T.P.Extensions. Register the file extension .ipynb for this format.
	* Add `PandocIpynbDecodingError` constructor to Text.Pandoc.Error.Error.
	* Note: there is no template for ipynb.
2019-01-22	LaTeX reader: support `\endinput`. Closes #5233.	John MacFarlane	1	-0/+1

2019-01-22	Man reader: fix typo. (#5245)	Brian Leung	1	-3/+3

2019-01-21	HTML and markdown: treat textarea as a verbatim environment.	John MacFarlane	2	-8/+10
	We don't want to parse its contents as Markdown or HTML. Closes #5241.
2019-01-20	LaTeX reader: allow includes with dots like cc_by_4.0.	John MacFarlane	1	-3/+5
	Previously the `.0` was interpreted as a file extension, leading pandoc not to add `.tex` (and thus not to find the file). The new behavior matches tex more closely.
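To illustrate the difference, here is a sketch in Python rather than pandoc's Haskell. The log does not state the exact rule pandoc now applies, so `new_include_name` is only one plausible rule (append `.tex` unless the name already ends in `.tex`), and both function names are hypothetical.

```python
import posixpath


def old_include_name(name):
    """Earlier behavior as described: any extension suppresses '.tex'."""
    _, ext = posixpath.splitext(name)
    return name if ext else name + ".tex"


def new_include_name(name):
    """Assumed rule for this sketch: add '.tex' unless already present."""
    return name if name.endswith(".tex") else name + ".tex"


for name in ["chapter1", "chapter1.tex", "cc_by_4.0"]:
    print(name, old_include_name(name), new_include_name(name))
```

With the generic extension check, `cc_by_4.0` is left unchanged because `.0` looks like an extension, so the file `cc_by_4.0.tex` is never tried.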
2019-01-20	LaTeX reader: cleaned up 'input' code.	John MacFarlane	1	-10/+5

2019-01-09	RST reader: change treatment of `number-lines` directives. (#5207)	Brian Leung	1	-15/+15
	Directives of this type without numeric inputs should not have a `startFrom` attribute; with a blank value, the writers can produce extra whitespace.
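A minimal sketch of the attribute rule described above, in Python for illustration. The function name, the `numberLines` class, and the tuple representation of attributes are assumptions of this sketch and are not taken from the reader's source.

```python
import re


def code_attributes(number_lines_arg):
    """Build (classes, key-value pairs) for a `number-lines` directive."""
    classes = ["numberLines"]
    keyvals = []
    arg = number_lines_arg.strip()
    # Emit startFrom only for a numeric argument; a blank value is
    # omitted entirely instead of being passed on as an empty string.
    if re.fullmatch(r"[0-9]+", arg):
        keyvals.append(("startFrom", arg))
    return classes, keyvals


print(code_attributes("10"))
print(code_attributes(""))
```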
2019-01-08	Removed superfluous sourceCode class on code blocks.	John MacFarlane	3	-11/+7
	* These were added by the RST reader and, for literate Haskell, by the Markdown and LaTeX readers. There is no point to this class, and it is not applied consistently by all readers. See #5047.
	* Reverse order of `literate` and `haskell` classes on code blocks when parsing literate Haskell. Better if `haskell` comes first.
2019-01-08	RST reader: handle sourcecode directive as synonym for code.	John MacFarlane	1	-1/+1
	Closes #5204.
2019-01-07	Org reader: allow for case of :minlevel == 0.	John MacFarlane	1	-1/+3
	See #5190.
2019-01-07	Org reader: handle `minlevel` option differently. (#5190)	Brian Leung	1	-3/+1
	When `minlevel` exceeds the original minimum level observed in the file to be included, every heading should be shifted rightward.
2019-01-07	TWiki reader: fix performance issue with underscores.	John MacFarlane	1	-1/+3
	Underscore emphasis can't cross table cell boundaries, but the parser wasn't respecting this, leading to exponential behavior in documents with table cells containing underscores. This fixes the original sample; it's possible that there are other performance issues involving underscores. Closes #3921.
2019-01-06	Add DokuWiki reader (#5108)	Alexander	1	-0/+529
	Closes #1792
2019-01-02	Implement task lists (#5139)	Mauro Bieg	2	-3/+7
	Closes #3051
2019-01-01	Org reader: fix self-link parsing regression	Albert Krewinkel	1	-5/+6
	Fixes a regression introduced by the previous commit.
2019-01-01	Org reader: fix treatment of links to images	Albert Krewinkel	2	-13/+16
	Links with descriptions which are pointing to images are no longer read as inline images, but as proper links. Fixes: #5191
2019-01-01	Org reader: hlint	Albert Krewinkel	1	-2/+2

2018-12-31	Remove unused HasHeaderMap (#5175)	Alexander	6	-34/+3
	It is updated by some readers, but never actually used.
2018-12-31	Replace read with safeRead (#5186)	Mauro Bieg	1	-3/+6
	closes #5180
2018-12-17	LaTeX reader: Support `\DeclareMathOperator`.	John MacFarlane	1	-1/+11
	Closes #5149.
2018-12-17	Replace read with safeRead. Closes #5162.	John MacFarlane	3	-11/+8

2018-12-17	HTML reader: handle empty start attribute.	John MacFarlane	1	-4/+2
	See #5162.
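A sketch of the idea behind handling an empty `start` attribute with a safe read, in Python for illustration. `safe_read_int` mimics a parser that returns nothing on failure instead of raising. The fallback value of 1 is an assumption of this sketch, and both function names are hypothetical.

```python
def safe_read_int(text):
    """Return the parsed integer, or None if the text is not a number."""
    try:
        return int(text)
    except ValueError:
        return None


def list_start(attr_value, default=1):
    """Pick the list start number, falling back when parsing fails."""
    n = safe_read_int(attr_value)
    return default if n is None else n


print(list_start("5"))
print(list_start(""))     # an empty attribute no longer causes a crash
print(list_start("abc"))
```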
2018-12-10	Docx: handle level overrides.	Jesse Rosenthal	1	-6/+17
	There can be overrides for the definitions of certain levels in numbering definitions. This implements that behavior. Closes: #5134
2018-12-10	Docx: add a levelOverride type.	Jesse Rosenthal	1	-3/+20

2018-12-10	Docx writer: Make Level into a real type.	Jesse Rosenthal	2	-4/+6
	It had previously been an alias for a tuple.
2018-11-29	Man reader: use mapLeft from Shared.	John MacFarlane	1	-6/+1

2018-11-29	LaTeX reader: Support \inputminted (closes #5103).	John MacFarlane	1	-2/+21

2018-11-25	Fix parsing of citations and quotes after parentheses.	John MacFarlane	1	-6/+2
	Starting with pandoc 2.4, citations and quoted inlines were no longer recognized after parentheses. This is because of commit 9b0bd4ec6f5c9125efb3e36232e6d1f6ac08a728, which is reverted here. The point of that commit was to allow relocation of soft line breaks to before an abbreviation, so that a nonbreaking space could be added after the abbreviation. Now we simply leave the soft line break in place, even though this means that we won't get a nonbreaking space after "Mr." at the end of a line (and in LaTeX this may result in a longer intersentential space). Those who care about this issue should take care not to end lines with an abbreviation, or to insert nonbreaking spaces manually. Closes #5099.
2018-11-25	MediaWiki reader internals: remove unnecessary binding	Alexander Krotov	1	-2/+1

2018-11-22	Hlint suggestions.	John MacFarlane	2	-3/+1

2018-11-19	LaTeX reader: cleaned up handling of dimension arguments.	John MacFarlane	2	-6/+12
	Allow decimal points, preceding space. Also require text 1.1+.
2018-11-18	LaTeX reader: don't allow arguments for verbatim, etc.	John MacFarlane	1	-1/+0

2018-11-18	LaTeX reader: Allow space before bracketed options.	John MacFarlane	1	-1/+2

2018-11-18	LaTeX reader: allow optional arguments after \\ in tables.	John MacFarlane	1	-1/+2

2018-11-18	LaTeX reader: improve parsing of `\tiny`, `\scriptsize`, etc.	John MacFarlane	1	-4/+18
	Parse as raw, but know that these font changing commands take no arguments.