pandoc - Conversion between markup formats

Age	Commit message (Collapse)	Author	Files	Lines
2014-09-25	HTML Reader: Recognise <br> tags inside <pre> blocks	mpickering	1	-1/+6
	Closes #1620
2014-08-18	HTML reader: improved handling of tags that can be block or inline.	John MacFarlane	1	-5/+13
	Previously a section like this would be enclosed in a paragraph, with RawInline for the video tags (since video is a tag that can be either block or inline): <video controls="controls"> <source src="../videos/test.mp4" type="video/mp4" /> <source src="../videos/test.webm" type="video/webm" /> <p> The videos can not be played back on your system.<br/> Try viewing on Youtube (requires Internet connection): <a href="http://youtu.be/etE5urBps_w">Relative Velocity on Youtube</a>. </p> </video> This change will cause the video and source tags to be parsed as RawBlock instead, giving better output. The general change is this: when we're parsing a "plain" sequence of inlines, we don't parse anything that COULD be a block-level tag.
2014-08-16	HTML reader: Parse appropriately styled span as SmallCaps.	John MacFarlane	1	-1/+6

2014-08-12	EPUB Reader: Ignore title pages	Matthew Pickering	1	-4/+10

2014-08-08	Added `native_divs` and `native_spans` extensions.	John MacFarlane	1	-1/+4
	This allows users to turn off the default pandoc behavior of parsing contents of div and span tags in markdown and HTML as native pandoc Div blocks and Span inlines. Setting of default epub extensions has been moved from the EPUB reader to Text.Pandoc.
2014-08-08	HTML EPUB exts: switch element can now be in either the inline or block position	Matthew Pickering	1	-9/+10

2014-08-07	HTML reader: Really ignore DOCTYPE and xml declarations.	John MacFarlane	1	-2/+2
	This actually does what d71b013841f3c9c8c595591e312a31df16a728cb said it did. Revised epub tests to remove the repeated DOCTYPE and xml tags.
2014-08-04	HTML reader: ignore <?xml..> and <DOCTYPE..> tags.	John MacFarlane	1	-1/+1
	Previously they were parsed as raw.
2014-08-04	Use texmath 0.7 interface.	John MacFarlane	1	-2/+2

2014-07-31	HTML Reader: Added ability to read MathML formatted <math> blocks	Matthew Pickering	1	-0/+16

2014-07-31	HTML Reader: Added support for anchors on links and list items	Matthew Pickering	1	-4/+22

2014-07-31	HTML Reader: Extended HTML Reader to recognise EPUB specific elements	Matthew Pickering	1	-28/+178

2014-07-26	Generalised more in Parsing.hs to enable the use of custom state	Matthew Pickering	1	-18/+61

2014-07-20	HTML reader: parse Div and Span elements even without `--parse-raw`.	John MacFarlane	1	-2/+0
	Closes #1434.
2014-07-11	Removed (>>~) function	Matthew Pickering	1	-3/+3
	This function is equivalent to the more general (<*) which is defined in Control.Applicative. This change makes pandoc code easier to understand for those not familar with the codebase.
2014-07-07	HTML reader: adjust `blockTags` and `eitherBlockOrInline`.	John MacFarlane	1	-9/+13
	- Added `audio` and `source` in `eitherBlockOrInline`. - Moved `video`, `svg`, `progress`, `script`, `noscript`, `svg` from `blockTags` to `eitherBlockOrInline`. - `map` and `object` were mistakenly in both lists; they have been removed from `blockTags`.
2014-06-20	HTML reader: Fix performance issue with malformed HTML tables.	John MacFarlane	1	-0/+2
	We let a `</table>` tag close an open `<tr>` or `<td>`. Closes #1167.
2014-06-20	Support --trace in HTML reader.	John MacFarlane	1	-1/+10

2014-06-19	HTML reader: Allow space between `<col>` and `</col>`.	John MacFarlane	1	-0/+1
	Test case: ``` <table border="1"> <colgroup> <col> </col> <col></col> </colgroup> <tbody> <tr> <td>X</td> <td>Y</td> </tr> <tr> <td>1</td> <td>2</td> </tr> </tbody> </table> ```
2014-06-16	HTML reader: Fixed major parsing problem with HTML tables.	John MacFarlane	1	-15/+11
	Table cells were being combined into one cell. Closes #1341.
2014-06-16	Moved extractSpaces to Shared.hs	mpickering	1	-13/+4
	Generalised and move the extractSpaces function from `HTML.hs` to `Shared.hs` so that the docx reader can also use it.
2014-05-09	Update copyright notices for 2014, add missing notices	Albert Krewinkel	1	-2/+2

2014-04-11	HTML reader: Treat processing instructions & declarations as block.	John MacFarlane	1	-5/+9
	Previously these were treated as inline, and included in paragraph tags in HTML or DocBook output, which is generally not what is wanted. Closes #1233.
2014-04-05	HTML reader: Updated `closes` with rules from HTML5 spec.	John MacFarlane	1	-5/+12

2014-04-01	HTML reader: idiomatic rewriting for clarity.	John MacFarlane	1	-5/+4

2014-04-01	Converted HTML reader to use builder. Fixes #1162.	Matthew Pickering	1	-109/+126

2014-01-20	HTML reader: Fixed bug reading inline math with `$$`.	John MacFarlane	1	-2/+2
	See #225.
2014-01-01	HTML reader: Parse name/content pairs from meta tags as metadata.	John MacFarlane	1	-1/+10
	Closes #1106.
2013-12-19	HLint: use fromMaybe	Henry de Valence	1	-2/+2
	Replace uses of `maybe x id` with `fromMaybe x`.
2013-12-06	HTML reader: Parse LaTeX math if appropriate options are set.	John MacFarlane	1	-1/+8
	* Moved inlineMath, displayMath from Markdown reader to Parsing. * Export them from Parsing. (API change.) * Generalize their types.
2013-11-07	recognize svg tag in HTML Reader	MinRK	1	-1/+1
	avoids adding lots of `<p>` tags in embedded SVG content, for instance in markdown to HTML.
2013-11-03	HTML reader: Use pandoc Div and Span for raw "<div>", "<span>".	John MacFarlane	1	-10/+25
	Only if --parse-raw.
2013-08-10	Adjustments for new Format newtype.	John MacFarlane	1	-2/+2

2013-07-16	HTML reader: read widths from col tags if present.	John MacFarlane	1	-6/+23
	Closes #893.
2013-07-16	HTML reader: Handle non-simple tables (#893).	John MacFarlane	1	-3/+9
	Column widths are divided equally. TODO: Get column widths from col tags if present.
2013-07-16	HTML reader: Generalized table parser.	John MacFarlane	1	-4/+9
	This commit doesn't change the present behavior at all, but it will make it easier to support non-simple tables in the future.
2013-06-24	Use new flexible metadata type.	John MacFarlane	1	-23/+20
	* Depend on pandoc 1.12. * Added yaml dependency. * `Text.Pandoc.XML`: Removed `stripTags`. (API change.) * `Text.Pandoc.Shared`: Added `metaToJSON`. This will be used in writers to create a JSON object for use in the templates from the pandoc metadata. * Revised readers and writers to use the new Meta type. * `Text.Pandoc.Options`: Added `Ext_yaml_title_block`. * Markdown reader: Added support for YAML metadata block. Note that it must come at the beginning of the document. * `Text.Pandoc.Parsing.ParserState`: Replace `stateTitle`, `stateAuthors`, `stateDate` with `stateMeta`. * RST reader: Improved metadata. Treat initial field list as metadata when standalone specified. Previously ALL fields "title", "author", "date" in field lists were treated as metadata, even if not at the beginning. Use `subtitle` metadata field for subtitle. * `Text.Pandoc.Templates`: Export `renderTemplate'` that takes a string instead of a compiled template.. * OPML template: Use 'for' loop for authors. * Org template: '#+TITLE:' is inserted before the title. Previously the writer did this.
2013-03-28	Parsing: Better error reporting in readWith.	John MacFarlane	1	-1/+4
	- Specialize readWith to String input. - On error have it print the line in which the error occurred, with a caret pointing to the column. - This should help diagnose parsing problems in LaTeX especially.
2013-02-16	HTML reader: Preserve all header attributes.	John MacFarlane	1	-2/+4

2013-01-30	HTML reader: Handle colgroup tag.	John MacFarlane	1	-1/+2

2013-01-12	HTML reader: Added html5 tags to list of block-level tags.	John MacFarlane	1	-5/+8

2013-01-09	Added Attr field to Header.	John MacFarlane	1	-2/+4
	Previously header ids were autogenerated by the writers. Now they are generated (unless supplied explicitly) in the markdown parser, if the `header_identifiers` extension is selected. In addition, the textile reader now supports id attributes on headers.
2012-09-15	HTML reader: Modified htmlTag for fewer false positives.	John MacFarlane	1	-1/+1
	A tag must start with `<` followed by `!`,`?`, `/`, or a letter. This makes it more useful in the wikimedia and markdown parsers.
2012-09-13	MediaWiki reader: Use MWState instead of ParserState.	John MacFarlane	1	-1/+1

2012-09-09	HTML reader: Handle nested `<q>` tags properly.	John MacFarlane	1	-1/+9

2012-09-09	HTML reader: Parse <q> as Quoted DoubleQuote.	John MacFarlane	1	-0/+4

2012-08-15	Moved renderTags' from HTML reader & SelfContained to Shared.	John MacFarlane	1	-13/+1
	Improved removal of markdown="1" attribute in Markdow reader.
2012-07-26	Fixed whitespace errors.	John MacFarlane	1	-5/+5

2012-07-26	Use readerExtensions instead of readerStrict in readers.	John MacFarlane	1	-26/+19
	Test individually for the extensions.
2012-07-25	Changed reader parameters from ParserState to ReaderOptions.	John MacFarlane	1	-3/+3