pandoc - Conversion between markup formats

Age	Commit message (Collapse)	Author	Files	Lines
2014-04-11	HTML reader: Treat processing instructions & declarations as block.	John MacFarlane	1	-5/+9
	Previously these were treated as inline, and included in paragraph tags in HTML or DocBook output, which is generally not what is wanted. Closes #1233.
2014-04-11	Org reader: Fix parsing of sub-/superscript expressions	Albert Krewinkel	1	-10/+37
	This fixes the org-reader's handling of sub- and superscript expressions. Simple expressions (like `2^+10`), expressions in parentheses (`a_(n+1)`) and nested sexp (like `a_(nested()parens)`) are now read correctly.
2014-04-10	MediaWiki reader: Handle table rows containing just an HTML comment.	John MacFarlane	1	-1/+2
	Closes #1230.
2014-04-10	Org reader: Improve code by following HLint recommendations	Albert Krewinkel	1	-20/+24
	HLint's recommendations for better code are applied to the Org-mode reader code.
2014-04-10	Org reader: Support more inline/display math variants	Albert Krewinkel	1	-2/+26
	Support all of the following variants as valid ways to define inline or display math inlines: - `\[..\]` (display) - `$$..$$` (display) - `$..$` (inline) - `$..$` (inline) This closes #1223. Again.
2014-04-09	Merge pull request #1226 from tarleb/org-emphasis-reader	John MacFarlane	1	-120/+260
	Org reader: Precise rules for the recognition of markup
2014-04-09	Org reader: Precise rules for the recognition of markup	Albert Krewinkel	1	-120/+260
	The inline parsers have been rewritten using the org source code as a reference. This fixes a couple of bugs related to erroneous markup recognition.
2014-04-07	Textile reader: Improved link parsing.	John MacFarlane	1	-19/+15
	In particular we now pick up on attributes. Since pandoc links can't have attributes, we enclose the whole link in a span if there are attributes. Closes #1008.
2014-04-07	Merge pull request #1224 from tarleb/org-math	John MacFarlane	1	-37/+53
	Org reader: Read inline math, recognize definition lists
2014-04-07	Org reader: Support inline math (like $E=mc^2$)	Albert Krewinkel	1	-6/+16
	Closes #1223.
2014-04-06	LaTeX reader: handle @{} and p{length} in tabular.	John MacFarlane	1	-2/+3
	The length is not actually recorded, but at least we get a table. Closes #1180.
2014-04-06	Org reader: Add support for definition lists	Albert Krewinkel	1	-1/+16

2014-04-06	Org reader: Minor code clean-up	Albert Krewinkel	1	-30/+21

2014-04-05	HTML reader: Updated `closes` with rules from HTML5 spec.	John MacFarlane	1	-5/+12

2014-04-05	Textile reader: Better support for attributes.	John MacFarlane	1	-9/+12
	Instead of being ignored, attributes are now parsed and included in Span inlines. The output will be a bit different from stock textile: e.g. for `(foo)hi`, we'll get `<em><span class="foo">hi</span></em>` instead of `<em class="foo">hi</em>`. But at least the data is not lost.
2014-04-05	Textile reader: Improved treatment of HTML spans (%).	John MacFarlane	1	-5/+1
	Closes #1115.
2014-04-05	Removed whitespace at ends of lines.	John MacFarlane	1	-15/+15

2014-04-05	Org reader: Added type signature.	John MacFarlane	1	-0/+1

2014-04-05	Merge pull request #1219 from tarleb/org-images	John MacFarlane	1	-57/+127
	Org-reader: support inline images, clean-up code, fix bugs
2014-04-05	Org reader: Support inline images	Albert Krewinkel	1	-10/+24

2014-04-05	Org reader: Provide more language identifier translations	Albert Krewinkel	1	-1/+8
	Org-mode and Pandoc use different language identifiers, marking source code as being written in a certain programming language. This adds more translations from identifiers as used in Org to identifiers used in Pandoc. The full list of identifiers used in Org and Pandoc is available through http://orgmode.org/manual/Languages.html and `pandoc -v`, respectively.
2014-04-05	Org reader: Fix parsing of nested inlines	Albert Krewinkel	1	-7/+20
	Text such as /this/ was not correctly parsed as a strong, emphasised word. This was due to the end-of-word recognition being to strict as it did not accept markup chars as part of a word. The fix involves an additional parser state field, listing the markup chars which might be parsed as part of a word.
2014-04-05	Org reader: Use specialized org parser state	Albert Krewinkel	1	-7/+41
	The default pandoc ParserState is replaced with `OrgParserState`. This is done to simplify the introduction of new state fields required for efficient Org parsing.
2014-04-05	Org reader: Slight cleaning of table parsing code	Albert Krewinkel	1	-33/+35

2014-04-04	DocBook reader: Better treatment of formalpara.	John MacFarlane	1	-3/+3
	We now emit the title (if present) as a separate paragraph with boldface text. Closes #1215.
2014-04-04	DocBook reader: set metadata "author" not "authors"	John MacFarlane	1	-1/+1

2014-04-04	Removed trailing whitespace.	John MacFarlane	1	-15/+15

2014-04-04	DocBook reader: set "author" not "authors".	John MacFarlane	1	-3/+3

2014-04-04	Added recognition of authorgroup element and releaseinfo element to DocBook ↵	Matthew Pickering	1	-9/+16
	reader. Closes #1214
2014-04-04	Converted current meta information parsing in DocBook to a more extensible ↵	Matthew Pickering	1	-34/+48
	version which is aware of the more recent meta representation.
2014-04-01	MediaWiki reader: Fixed bug in certain nested lists.	John MacFarlane	1	-1/+2
	The bug: If a level 2 list was followed by a level 1 list, the first item of the level 1 list would be lost. Closes #1213.
2014-04-01	HTML reader: idiomatic rewriting for clarity.	John MacFarlane	1	-5/+4

2014-04-01	Changed the smart punctuation parser to return Inlines rather than an Inline ↵	Matthew Pickering	3	-5/+3
	element and updated files accordingly
2014-04-01	Converted HTML reader to use builder. Fixes #1162.	Matthew Pickering	1	-109/+126

2014-04-01	Bugfix for #1175 and convert textile reader to use builder.	Matthew Pickering	1	-134/+167
	The reader did not correctly parse inline markup. The behavoir is now as follows. (a) The markup must start at the start of a line, be inside previous inline markup or be preceeded by whitespace. (b) The markup can not span across paragraphs (delimited by \n\n) (c) The markup can not be followed by a alphanumeric character. (d) Square brackets can be placed around the markup to avoid having to have white space before it. In order to make these changes it was either necessary to convert the parser to return a list of inlines or to convert the whole reader to use the builder. The latter approach whilst more work makes a bit more sense as it becomes easy to arbitarily append and prepend elements without changing the type. Tests are accordingly updated in a later commit to reflect the different normalisation behavoir specified by the builder monoid.
2014-03-25	LaTeX reader: Better handling of figure and table with caption.	John MacFarlane	1	-11/+34
	We now look for a \caption inside the environment; if one is found, it is attached to the graphic or tabular found there. Closes #1204.
2014-03-25	Revert "LaTeX reader: Added LPState."	John MacFarlane	1	-18/+0
	This reverts commit 82ddec698e782fef83dcd1b1fba79cd3b698c717.
2014-03-25	LaTeX reader: Added LPState.	John MacFarlane	1	-0/+18
	Plan is to use this instead of ParserState in LP.
2014-03-25	Parsing: Added HasMacros, simplified other typeclasses.	John MacFarlane	1	-2/+2
	Removed updateHeaderMap, setHeaderMap, getHeaderMap, updateIdentifierList, setIdentifierList, getIdentifierList.
2014-03-25	API changes to HasReaderOptions, HasHeaderMap, HasIdentifierList.	John MacFarlane	1	-8/+8
	Previously these were typeclasses of monads. They've been changed to be typeclasses of states. This ismplifies the instance definitions and provides more flexibility. This is an API change! However, it should be backwards compatible unless you're defining instances of HasReaderOptions, HasHeaderMap, or HasIdentifierList. The old getOption function should work as before (albeit with a more general type). The function askReaderOption has been removed. extractReaderOptions has been added. getOption has been given a default definition. In HasHeaderMap, extractHeaderMap and updateHeaderMap have been added. Default definitions have been given for getHeaderMap, putHeaderMap, and modifyHeaderMap. In HasIdentifierList, extractIdentifierList and updateIdentifierList have been added. Default definitions have been given for getIdentifierList, putIdentifierList, and modifyIdentifierList. The ultimate goal here is to allow different parsers to use their own, tailored parser states (instead of ParserState) while still using shared functions.
2014-03-25	LaTeX reader: Better handling of "table" environment.	John MacFarlane	1	-0/+1
	Positioning options no longer rendered verbatim. Partially addresses #1204.
2014-03-24	Merge pull request #1068 from jaimeMF/mw-images-langs	John MacFarlane	1	-1/+5
	MediaWiki reader: Accept image links in more languages
2014-03-24	Markdown reader: Fixed regression on line breaks in strict mode.	John MacFarlane	1	-1/+1
	Closes #1203.
2014-03-04	Add a simple Emacs Org-mode reader	Albert Krewinkel	1	-0/+552
	The basic structure of org-mode documents is recognized; however, org-mode features like todo markers, tags etc. are not supported yet.
2014-02-26	Markdown reader: Improved parsing of nested divs.	John MacFarlane	1	-0/+2
	Formerly a closing div tag would be missed if it came right after other block-level tags.
2014-02-26	Markdown parser: avoid backtracking when closing `</div>` not found.	John MacFarlane	1	-6/+13

2014-02-26	Markdown reader: small efficiency improvement.	John MacFarlane	1	-1/+1
	Switched `notFollewdBy' rawHtmlBlocks` -> `notFollowedBy' (htmlTag isBlockTag)`, which is more efficient.
2014-02-25	Added readerTrace to ReaderOptions, --trace command line opt.	John MacFarlane	1	-1/+11
	This is to debug backtracking-related parsing bugs. So far it is only implemented for markdown, but it would be good to extend it to latex and html readers.
2014-02-21	Fixed bug in reference link parsing in markdown_mmd.	John MacFarlane	1	-1/+1
	The bug was triggered by: Link to [Google][]. Link to [twitter][]. [Google]: http://google.com [twitter]: http://twitter.com
2014-02-19	Make rst figures true figures. Closes #1168.	John MacFarlane	1	-1/+1
	Thanks to CasperVector.