pandoc - Conversion between markup formats

Age	Commit message	Author	Files	Lines
2017-07-07	Rewrote LaTeX reader with proper tokenization.	John MacFarlane	12	-8/+103
	This rewrite is primarily motivated by the need to get macros working properly. A side benefit is that the reader is significantly faster (27s -> 19s in one benchmark, and there is a lot of room for further optimization). We now tokenize the input text, then parse the token stream. Macros modify the token stream, so they should now be effective in any context, including math. Thus, we no longer need the clunky macro processing capacities of texmath. A custom state LaTeXState is used instead of ParserState. This, plus the tokenization, will require some rewriting of the exported functions rawLaTeXInline, inlineCommand, rawLaTeXBlock.
	* Added Text.Pandoc.Readers.LaTeX.Types (new exported module). Exports Macro, Tok, TokType, Line, Column. [API change]
	* Text.Pandoc.Parsing: adjusted type of `insertIncludedFile` so it can be used with token parser.
	* Removed old texmath macro stuff from Parsing. Use Macro from Text.Pandoc.Readers.LaTeX.Types instead.
	* Removed texmath macro material from Markdown reader.
	* Changed types for Text.Pandoc.Readers.LaTeX's rawLaTeXInline and rawLaTeXBlock. (Both now return a String, and they are polymorphic in state.)
	* Added orgMacros field to OrgState. [API change]
	* Removed readerApplyMacros from ReaderOptions. Now we just check the `latex_macros` reader extension.
	* Allow `\newcommand\foo{blah}` without braces.
	Fixes #1390. Fixes #2118. Fixes #3236. Fixes #3779. Fixes #934. Fixes #982.
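The tokenize-then-expand idea described above can be sketched in a few lines. This is only an illustration in Python, not the reader's Haskell code: the token pattern, the `tokenize` and `expand` helpers, and the restriction to zero-argument macros are all assumptions made for the example.

```python
import re

# Hypothetical token pattern: a control sequence (\name or \<symbol>),
# a run of whitespace, or any other single character.
TOKEN_RE = re.compile(r"\\[A-Za-z]+|\\.|\s+|.", re.DOTALL)


def tokenize(text):
    """Split LaTeX-like source into a flat list of token strings."""
    return TOKEN_RE.findall(text)


def expand(tokens, macros, max_depth=100):
    """Replace zero-argument macros in a token list.

    The replacement text is tokenized and expanded again, so macros may
    refer to other macros. max_depth guards against runaway recursion.
    """
    out = []
    for tok in tokens:
        if tok in macros and max_depth > 0:
            out.extend(expand(tokenize(macros[tok]), macros, max_depth - 1))
        else:
            out.append(tok)
    return out


# Illustrative macro table (names chosen for the example only).
macros = {r"\foo": "blah", r"\R": r"\mathbb{R}"}
source = r"\foo and $x \in \R$"
result = "".join(expand(tokenize(source), macros))
```

Because expansion works on the token list rather than on parsed text, the `\R` inside the math span is replaced in exactly the same way as the `\foo` outside it, which is the property the commit message is after.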
2017-06-30	Updated jats tests for new texmath version.	John MacFarlane	1	-1/+1

2017-06-30	Markdown writer: Ensure that `+` and `-` are escaped properly...	John MacFarlane	1	-0/+14
	so they don't cause spurious lists. Previously they were only escaped if followed by a space, not if they were at the end of a line. Closes #3773.
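A minimal sketch of the escaping rule, assuming a plain-text, regex-based approach: the writer itself works on pandoc's document model, so the pattern and the `escape_list_markers` helper below are hypothetical and only show the condition (marker followed by a space or by the end of the line).

```python
import re

# A line that starts with '+' or '-' and is followed by a space OR by the
# end of the line could be read back as a bullet list item.
LIST_MARKER_RE = re.compile(r"^([+-])(?= |$)", re.MULTILINE)


def escape_list_markers(text):
    """Backslash-escape line-initial '+' and '-' that look like list markers."""
    return LIST_MARKER_RE.sub(r"\\\1", text)
```

A marker directly followed by other text, as in `-5 degrees`, is left alone because it cannot start a list item.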
2017-06-29	Added parameter for user data directory to runLuaFilter.	John MacFarlane	1	-2/+2
	in Text.Pandoc.Lua. Also to pushPandocModule. This change allows users to override pandoc.lua with a file in their local data directory, adding custom functions, etc. @tarleb, if you think this is a bad idea, you can revert this. But in general our data files are all overridable.
2017-06-29	data/pandoc.lua: regularize constructors.	John MacFarlane	2	-3/+3
	We now use Pandoc instead of Doc (though Doc remains a deprecated synonym), and we deprecate DoubleQuoted, SingleQuoted, InlineMath, and DisplayMath.
2017-06-29	Lua filters: Remove special treatment of Quoted, Math.	John MacFarlane	1	-2/+4
	No more SingleQuoted, DoubleQuoted, InlineMath, DisplayMath. This makes everything uniform and predictable, though it does open up a difference between Lua filters and custom writers.
2017-06-28	Muse reader: parse indented blockquotes (#3769)	Alexander Krotov	1	-1/+6

2017-06-27	RST reader: support anchors.	John MacFarlane	1	-0/+17
	E.g.
		`hello`
		.. _hello:
		paragraph
	This is supported by putting "paragraph" in a Div with id `hello`. Closes #262.
2017-06-27	RST reader: Handle chained link definitions.	John MacFarlane	1	-0/+9
	For example,
		.. _hello:
		.. _goodbye: example.com
	Here both `hello` and `goodbye` should link to `example.com`. Fixes the first part of #262.
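One way to picture the chaining rule is to carry target names that have no URL yet forward to the next definition that has one. The sketch below is an assumption-laden illustration in Python, not the reader's parser: `TARGET_RE` and `resolve_targets` are hypothetical names, and only single-line `.. _name: url` definitions are handled.

```python
import re

# Matches ".. _name:" optionally followed by a URL on the same line.
TARGET_RE = re.compile(r"^\.\. _([^:]+):\s*(.*)$")


def resolve_targets(lines):
    """Map target names to URLs, letting empty targets chain to the next URL."""
    targets = {}
    pending = []  # names seen so far that are still waiting for a URL
    for line in lines:
        m = TARGET_RE.match(line)
        if not m:
            pending = []  # any other line breaks the chain
            continue
        name, url = m.group(1), m.group(2).strip()
        pending.append(name)
        if url:
            for p in pending:
                targets[p] = url
            pending = []
    return targets
```

With the two lines from the commit message as input, both `hello` and `goodbye` end up mapped to `example.com`.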
2017-06-27	HTML reader: Use the lang value of <html> to set the lang meta value. (#3765)	bucklereed	1	-0/+6
	* HTML reader: Use the lang value of <html> to set the lang meta value. * Fix for pre-AMP environments.
2017-06-26	Muse writer: fix hlint errors (#3764)	Alexander Krotov	1	-2/+2

2017-06-26	Muse reader: fix horizontal rule parsing (#3762)	Alexander Krotov	1	-1/+12
	Do not parse 3 dashes as horizontal rule and allow whitespace after rule
2017-06-25	Muse reader: Require space before and after '=' for code (#3758)	Alexander Krotov	1	-0/+8

2017-06-23	Markdown reader: interpret YAML metadata as Inlines when possible.	John MacFarlane	1	-0/+23
	If the metadata field is all on one line, we try to interpret it as Inlines, and only try parsing as Blocks if that fails. If it extends over more than one line (including possibly the `|` or `>` character signaling an indented block), then we parse as Blocks. This was motivated by some German users finding that `date: '22. Juin 2017'` got parsed as an ordered list. Closes #3755.
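The inline-first strategy can be illustrated with two toy parsers. Everything here is hypothetical (the `parse_inlines`, `parse_blocks`, and `parse_metadata_value` names, and the tuple results standing in for pandoc's Inlines and Blocks); the point is only the order in which the two interpretations are tried.

```python
import re


def parse_blocks(text):
    """Toy block parser: an ordered-list item if the text starts with '1. ' etc."""
    stripped = text.strip()
    if re.match(r"\d+\.\s+", stripped):
        return ("OrderedList", stripped)
    return ("Para", stripped)


def parse_inlines(text):
    """Toy inline parser: succeeds only on single-line text."""
    stripped = text.strip()
    if "\n" in stripped:
        return None
    return ("Inlines", stripped)


def parse_metadata_value(text):
    """Try inlines first for one-line values; otherwise parse as blocks."""
    if "\n" not in text.strip():
        inlines = parse_inlines(text)
        if inlines is not None:
            return inlines
    return parse_blocks(text)
```

Under this ordering the date value from the commit message stays inline text, while a value spanning several lines still goes through the block parser and can become a list.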
2017-06-23	Text.Pandoc.Extensions: Added `Ext_raw_attribute`.	John MacFarlane	1	-0/+28
	Documented in MANUAL.txt. This is enabled by default in pandoc markdown and multimarkdown.
2017-06-20	Muse reader: check that headers start at the first column (#3749)	Alexander Krotov	1	-0/+17

2017-06-20	Fixed footnotes in table captions.	John MacFarlane	1	-0/+27
	Note that if the table has a first page header and a continuation page header, the notes will appear only on the first occurrence of the header. Closes #2378.
2017-06-19	Muse reader: fix list item continuation parsing (#3747)	Alexander Krotov	1	-0/+13

2017-06-19	Added Vimwiki reader (#3705).	Yuchen Pei	3	-0/+723
	* New module Text.Pandoc.Readers.Vimwiki, exporting readVimwiki [API change].
	* New input format `vimwiki`.
	* New data file, `data/vimwiki.css`, for displaying the HTML produced by this reader and pandoc's HTML writer in the style of vimwiki's own HTML export.
2017-06-19	Add Muse reader (#3620)	Alexander Krotov	2	-0/+266

2017-06-12	Markdown writer: don't allow soft break in header.	John MacFarlane	1	-0/+25
	Closes #3736.
2017-06-11	Don't allow backslash + newline to affect block structure.	John MacFarlane	3	-5/+22
	Note that as a result of this change, the following, which formerly produced a header with two lines separated by a line break, will now produce a header followed by a paragraph:
		# Hi\
		there
	This may affect some existing documents that relied on this undocumented and unintended behavior. This change makes pandoc more consistent with other Markdown implementations, and with itself (since the two-space version of a line break doesn't work inside ATX headers, and neither version works inside Setext headers). Closes #3730.
2017-06-11	Revert "Command tests: small change to try to fix appveyor failures."	John MacFarlane	1	-4/+3
	This reverts commit 0ab26ac9ebb0196691ec064820eac4e640f0d52c. Failed experiment.
2017-06-11	Command tests: small change to try to fix appveyor failures.	John MacFarlane	1	-3/+4

2017-06-11	Support for \faCheck and \faClose (#3727)	schrieveslaach	1	-0/+13

2017-06-11	Switched Writer types to use Text.	John MacFarlane	12	-19/+30
	* XML.toEntities: changed type to Text -> Text.
	* Shared.tabFilter -- fixed so it strips out CRs as before.
	* Modified writers to take Text.
	* Updated tests, benchmarks, trypandoc.
	[API change] Closes #3731.
2017-06-10	Changed all readers to take Text instead of String.	John MacFarlane	11	-576/+598
	Readers: Renamed StringReader -> TextReader. Updated tests. [API change]
2017-06-01	HTML writer: Avoid two class attributes when adding 'uri' class.	John MacFarlane	1	-0/+6
	Closes #3716.
2017-06-01	Markdown writer: Avoid inline surround-marking with empty content.	John MacFarlane	1	-0/+15
	E.g. we don't want `<strong></strong>` to become `****`. Similarly for emphasis, super/subscript, strikeout. Closes #3715.
2017-06-01	LaTeX reader: handle some width specifiers on table columns.	John MacFarlane	1	-2/+2
	Currently we only handle the form `0.9\linewidth`. Anything else would have to be converted to a percentage, using some kind of arbitrary assumptions about line widths. See #3709.
2017-05-31	Org reader: respect export option for tags	Albert Krewinkel	1	-0/+6
	Tags are appended to headlines by default, but will be omitted when the `tags` export option is set to nil. Closes: #3713
2017-05-31	Org reader: include tags in headlines	Albert Krewinkel	1	-17/+19
	The Emacs default is to include tags in the headline when exporting. Instead of just empty spans, which contain the tag name as attribute, tags are rendered as small caps and wrapped in those spans. Non-breaking spaces serve as separators for multiple tags.
2017-05-31	Org reader: recognize babel result blocks with attributes	Albert Krewinkel	1	-0/+44
	Babel result blocks can have block attributes like captions and names. Result blocks with attributes were not recognized and were parsed as normal blocks without attributes. Fixes: #3706
2017-05-29	LaTeX reader: handle escaped & inside table cell.	John MacFarlane	1	-0/+15
	Closes #3708.
2017-05-28	Org reader: Fix cite parsing behaviour	Herwig Stuetz	1	-0/+36
	Until now, org-ref cite keys included special characters also at the end. This caused problems when citations occur right before colons or at the end of a sentence. With this change, all non-alphanumeric characters at the end of a cite key are ignored. This also adds `,` to the list of special characters that are legal in cite keys to better mirror the behaviour of org-export.
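The trailing-character rule can be sketched with a regular expression. The set of special characters in `KEY_SPECIALS` is an assumption (the commit only confirms that `,` was added to pandoc's list), and `cite_key` is a hypothetical helper that handles just the `cite:` prefix.

```python
import re

# Assumed set of special characters allowed inside a key; only ',' is
# confirmed by the commit message.
KEY_SPECIALS = ".:_-,"
CITE_RE = re.compile(r"cite:([A-Za-z0-9" + re.escape(KEY_SPECIALS) + r"]+)")


def cite_key(text):
    """Return the first cite key in text, without trailing punctuation."""
    m = CITE_RE.search(text)
    if m is None:
        return None
    # Drop any non-alphanumeric characters at the end of the match.
    return re.sub(r"[^A-Za-z0-9]+$", "", m.group(1))
```

So a citation written right before a colon or a sentence-ending period yields the bare key, while special characters in the middle of a key are kept.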
2017-05-27	Markdown writer: changes to `--reference-links`.	John MacFarlane	1	-0/+60
	With `--reference-location` of `section` or `block`, pandoc will now repeat references that have been used in earlier sections. The Markdown reader has also been modified, so that exactly repeated references do not generate a warning, only references with the same label but different targets. The idea is that, with references after every block, one might want to repeat references sometimes. Closes #3701.
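The reader-side rule (warn only when a repeated label has a different target) might look like the following. This is a sketch under assumptions: `collect_references` is a hypothetical helper, labels are compared case-insensitively, and the first definition is kept on conflict, which the commit message does not specify.

```python
def collect_references(definitions):
    """Build a label -> target map from (label, target) pairs.

    Exact repeats are accepted silently; a repeated label with a
    different target produces a warning. Keeping the first definition
    is an assumption of this sketch.
    """
    refs = {}
    warnings = []
    for label, target in definitions:
        key = label.lower()
        if key in refs and refs[key] != target:
            warnings.append(f"Duplicate link reference '{label}'")
        refs.setdefault(key, target)
    return refs, warnings
```

This keeps output produced with references after every block readable by pandoc itself, since the repeated definitions it generates are identical.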
2017-05-27	Org reader: subject full doc tree to headline transformations	Albert Krewinkel	1	-2/+16
	Emacs parses org documents into a tree structure, which is then post-processed during exporting. The reader is changed to do the same, turning the document into a single tree of headlines starting at level 0. Fixes: #3695
2017-05-25	Allow em for image height/width in HTML, LaTeX.	John MacFarlane	3	-36/+48
	- Export `inEm` from ImageSize [API change].
	- Change `showFl` and `show` instance for `Dimension` so extra decimal places are omitted.
	- Added `Em` as a constructor of `Dimension` [API change].
	- Allow `em`, `cm`, `in` to pass through without conversion in HTML, LaTeX.
	Closes #3450.
2017-05-25	Added `spaced_reference_links` extension.	John MacFarlane	31	-211/+1082
	The extension allows whitespace between the two parts of a reference link, e.g.
		[a] [b]
		[b]: url
	In pandoc's Markdown this form is now forbidden by default; it is accepted only when the extension is enabled. Closes #2602.
2017-05-25	Update command tests to include stderr output.	John MacFarlane	7	-5/+23

2017-05-25	Test.Command: put stderr output at front, so it can be tested.	John MacFarlane	1	-3/+3

2017-05-25	MediaWiki reader: don't do curly quotes inside `<tt>` contexts.	John MacFarlane	1	-0/+16
	Even if `+smart`. See #3585.
2017-05-24	Markdown reader: fixed smart quotes after emphasis.	John MacFarlane	1	-0/+6
	E.g. in foo's 'foo' Closes #2228.
2017-05-24	Parsing: Provide parseFromString'.	John MacFarlane	1	-0/+8
	This is a version of parseFromString specialized to ParserState, which resets stateLastStrPos at the end. This is almost always what we want. This fixes a bug where `_hi_` wasn't treated as emphasis in the following, because pandoc got confused about the position of the last word:
		- [o] _hi_
	Closes #3690.
2017-05-23	RST Reader: parse list table directive (#3688)	keiichiro shikano	1	-0/+289
	Closes #3432.
2017-05-20	Improve SVG image size code.	Marc Schreiber	5	-0/+318
	The old code made some unwise assumptions about how the svg file would look. See #3580.
2017-05-19	RST writer: add empty comments when needed...	John MacFarlane	2	-0/+21
	to avoid including a blockquote in the indented content of a preceding block. Closes #3675.
2017-05-18	Org reader: fix smart parsing behavior	Albert Krewinkel	1	-2/+2
	Parsing of smart quotes and special characters can either be enabled via the `smart` language extension or the `'` and `-` export options. Smart parsing is active if either the extension or export option is enabled. Only smart parsing of special characters (like ellipses and en and em dashes) is enabled by default, while smart quotes are disabled. This means that all smart parsing features will be enabled by adding the `smart` language extension. Fine-grained control is possible by leaving the language extension disabled. In that case, smart parsing is controlled via the aforementioned export OPTIONS only. Previously, all smart parsing was disabled unless the language extension was enabled.
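The resulting decision logic is a simple "either switch enables it" rule. The sketch below uses hypothetical names (`ExportOptions`, `smart_quotes_active`, `special_strings_active`); the defaults mirror those stated in the commit message.

```python
from dataclasses import dataclass


@dataclass
class ExportOptions:
    # Defaults as described in the commit message: special characters on,
    # smart quotes off.
    smart_quotes: bool = False     # the `'` export option
    special_strings: bool = True   # the `-` export option


def smart_quotes_active(smart_extension, opts):
    """Smart quotes are parsed if the extension or the export option is on."""
    return smart_extension or opts.smart_quotes


def special_strings_active(smart_extension, opts):
    """Ellipses and dashes are parsed if the extension or the option is on."""
    return smart_extension or opts.special_strings
```

With the extension disabled and default options, only special strings are active; enabling the extension switches on both features regardless of the export options.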
2017-05-18	Test updates for latex template changes.	John MacFarlane	1	-13/+14

2017-05-18	Markdown: allow attributes in reference links to start on next line.	John MacFarlane	1	-0/+13
	This addresses a subsidiary issue in #3674.