pandoc - Conversion between markup formats

Age	Commit message (Collapse)	Author	Files	Lines
2016-02-20	HTML reader: rewrote htmlInBalanced.	John MacFarlane	1	-10/+39
	This version avoids an exponential performance problem with `<script>` tags, and it should be faster in general. Closes #2730.
2016-02-16	HTML reader: properly handle an empty cell in a simple table.	John MacFarlane	1	-0/+1
	Closes #2718.
2016-01-29	HTML reader: handle multiple meta tags with same name.	John MacFarlane	1	-2/+6
	Put them in a list in the metadata so they are all preserved, rather than (as before) throwing out all but one..
2016-01-22	Changed type of Shared.uniqueIdent argument from [String] to Set String.	John MacFarlane	1	-3/+3
	This avoids performance problems in documents with many identically named headers. Closes #2671.
2015-12-12	Modified readers to emit SoftBreak when appropriate.	John MacFarlane	1	-1/+4

2015-11-19	Merge branch 'new-image-attributes' of https://github.com/mb21/pandoc into ↵	John MacFarlane	1	-15/+11
	mb21-new-image-attributes * Bumped version to 1.16. * Added Attr field to Link and Image. * Added `common_link_attributes` extension. * Updated readers for link attributes. * Updated writers for link attributes. * Updated tests * Updated stack.yaml to build against unreleased versions of pandoc-types and texmath. * Fixed various compiler warnings. Closes #261. TODO: * Relative (percentage) image widths in docx writer. * ODT/OpenDocument writer (untested, same issue about percentage widths). * Update pandoc-citeproc.
2015-11-09	Restored Text.Pandoc.Compat.Monoid.	John MacFarlane	1	-1/+1
	Don't use custom prelude for latest ghc. This is a better approach to making 'stack ghci' and 'cabal repl' work. Instead of using NoImplicitPrelude, we only use the custom prelude for older ghc versions. The custom prelude presents a uniform API that matches the current base version's prelude. So, when developing (presumably with latest ghc), we don't use a custom prelude at all and hence have no trouble with ghci. The custom prelude no longer exports (<>): we now want to match the base 4.8 prelude behavior.
2015-11-09	Revert "Use -XNoImplicitPrelude and 'import Prelude' explicitly."	John MacFarlane	1	-1/+0
	This reverts commit c423dbb5a34c2d1195020e0f0ca3aae883d0749b.
2015-11-08	Use -XNoImplicitPrelude and 'import Prelude' explicitly.	John MacFarlane	1	-0/+1
	This is needed for ghci to work with pandoc, given that we now use a custom prelude. Closes #2503.
2015-10-22	Fixed over-eager raw HTML inline parsing.	John MacFarlane	1	-0/+1
	Tightened up the inline HTML parser so it disallows TagWarnings. This only affects the markdown reader when the `markdown_in_html_blocks` option is disabled. Closes #2469.
2015-10-14	Use custom Prelude to avoid compiler warnings.	John MacFarlane	1	-2/+2
	- The (non-exported) prelude is in prelude/Prelude.hs. - It exports Monoid and Applicative, like base 4.8 prelude, but works with older base versions. - It exports (<>) for mappend. - It hides 'catch' on older base versions. This allows us to remove many imports of Data.Monoid and Control.Applicative, and remove Text.Pandoc.Compat.Monoid. It should allow us to use -Wall again for ghc 7.10.
2015-10-11	HTML reader/writer: better handling of "section" elements.	John MacFarlane	1	-3/+10
	Previously `<section>` tags were just parsed as raw HTML blocks. With this change, section elements are parsed as Div elements with the class "section". The HTML writer will use `<section>` tags to render these Divs in HTML5; otherwise they will be rendered as `<div class="section">`. Closes #2438.
2015-08-08	HTML reader: add auto identifiers if not present on headers.	John MacFarlane	1	-7/+17
	This makes TOC linking work properly. The same thing needs to be done to the org reader to fix #2354; in addition, `Ext_auto_identifiers` should be added to the list of default extensions for org in Text.Pandoc.
2015-08-07	Updated readers, writers and README for link attribute	mb21	1	-14/+4

2015-08-07	Updated readers and writers for new image attribute parameter.	John MacFarlane	1	-1/+7
	(mb21)
2015-07-27	HTML Reader: Detect font-variant with pickStyleAttrProps	Ophir Lifshitz	1	-6/+5

2015-07-24	HTML Reader: Parse <ol> type, class, and inline list-style(-type) CSS	Ophir Lifshitz	1	-17/+30

2015-07-21	Fix regression: allow HTML comments containing `--`.	John MacFarlane	1	-4/+4
	Technically this isn't allowed in an HTML comment, but we've always allowed it, and so do most other implementations. It is handy if e.g. you want to put command line arguments in HTML comments.
2015-07-21	HTML reader: handle type attribute on ol.	John MacFarlane	1	-1/+8
	E.g. `<ol type="i">`. Closes #2313.
2015-07-10	Avoid parsing partial URLs as HTML tags.	John MacFarlane	1	-1/+8
	Closes #2277.
2015-06-04	HTML reader: allow `<body>` to close `<head>`.	John MacFarlane	1	-0/+1

2015-05-13	HTML reader: Support base tag.	John MacFarlane	1	-7/+28
	We only support the href attribute, as there's no place for "target" in the Pandoc document model for links. Added HTML reader test module, with tests for this feature. Closes #1751.
2015-05-11	HTML reader: Fixed detection of self-closing tags.	John MacFarlane	1	-2/+2
	Earlier versions had a bug and would wrongly think opening tags containing attributes with slashes in them were self-closing. Closes #2146.
2015-04-29	HTML reader: Allow multiple colgroups in table.	John MacFarlane	1	-1/+1
	Closes #2122.
2015-04-26	Updated copyright notices to -2015. Closes #2111.	John MacFarlane	1	-2/+2

2015-04-17	More principled fix for #1820.	John MacFarlane	1	-5/+7
	If the tag parses as a comment, we check to see if the input starts with `<!--`. If not, it's bogus comment mode and we fail htmlTag. Includes test case. Closes #1820.
2015-04-17	Fixed `htmlTag` in HTML reader.	John MacFarlane	1	-1/+1
	Require that `<!` or `<?` be followed by nonspace. This prevents `</ div>` from being parsed as a comment. Closes #1820.
2015-02-18	Move utility error functions to Text.Pandoc.Shared	Matthew Pickering	1	-1/+1

2015-02-18	Change return type of HTML reader	Matthew Pickering	1	-5/+12

2015-01-25	fixes #1859 HTML Reader table parsing	mb21	1	-11/+22

2014-11-16	Make `embed` tag either block or inline.	John MacFarlane	1	-2/+2
	Closes #1756.
2014-09-25	HTML Reader: Recognise <br> tags inside <pre> blocks	mpickering	1	-1/+6
	Closes #1620
2014-08-18	HTML reader: improved handling of tags that can be block or inline.	John MacFarlane	1	-5/+13
	Previously a section like this would be enclosed in a paragraph, with RawInline for the video tags (since video is a tag that can be either block or inline): <video controls="controls"> <source src="../videos/test.mp4" type="video/mp4" /> <source src="../videos/test.webm" type="video/webm" /> <p> The videos can not be played back on your system.<br/> Try viewing on Youtube (requires Internet connection): <a href="http://youtu.be/etE5urBps_w">Relative Velocity on Youtube</a>. </p> </video> This change will cause the video and source tags to be parsed as RawBlock instead, giving better output. The general change is this: when we're parsing a "plain" sequence of inlines, we don't parse anything that COULD be a block-level tag.
2014-08-16	HTML reader: Parse appropriately styled span as SmallCaps.	John MacFarlane	1	-1/+6

2014-08-12	EPUB Reader: Ignore title pages	Matthew Pickering	1	-4/+10

2014-08-08	Added `native_divs` and `native_spans` extensions.	John MacFarlane	1	-1/+4
	This allows users to turn off the default pandoc behavior of parsing contents of div and span tags in markdown and HTML as native pandoc Div blocks and Span inlines. Setting of default epub extensions has been moved from the EPUB reader to Text.Pandoc.
2014-08-08	HTML EPUB exts: switch element can now be in either the inline or block position	Matthew Pickering	1	-9/+10

2014-08-07	HTML reader: Really ignore DOCTYPE and xml declarations.	John MacFarlane	1	-2/+2
	This actually does what d71b013841f3c9c8c595591e312a31df16a728cb said it did. Revised epub tests to remove the repeated DOCTYPE and xml tags.
2014-08-04	HTML reader: ignore <?xml..> and <DOCTYPE..> tags.	John MacFarlane	1	-1/+1
	Previously they were parsed as raw.
2014-08-04	Use texmath 0.7 interface.	John MacFarlane	1	-2/+2

2014-07-31	HTML Reader: Added ability to read MathML formatted <math> blocks	Matthew Pickering	1	-0/+16

2014-07-31	HTML Reader: Added support for anchors on links and list items	Matthew Pickering	1	-4/+22

2014-07-31	HTML Reader: Extended HTML Reader to recognise EPUB specific elements	Matthew Pickering	1	-28/+178

2014-07-26	Generalised more in Parsing.hs to enable the use of custom state	Matthew Pickering	1	-18/+61

2014-07-20	HTML reader: parse Div and Span elements even without `--parse-raw`.	John MacFarlane	1	-2/+0
	Closes #1434.
2014-07-11	Removed (>>~) function	Matthew Pickering	1	-3/+3
	This function is equivalent to the more general (<*) which is defined in Control.Applicative. This change makes pandoc code easier to understand for those not familar with the codebase.
2014-07-07	HTML reader: adjust `blockTags` and `eitherBlockOrInline`.	John MacFarlane	1	-9/+13
	- Added `audio` and `source` in `eitherBlockOrInline`. - Moved `video`, `svg`, `progress`, `script`, `noscript`, `svg` from `blockTags` to `eitherBlockOrInline`. - `map` and `object` were mistakenly in both lists; they have been removed from `blockTags`.
2014-06-20	HTML reader: Fix performance issue with malformed HTML tables.	John MacFarlane	1	-0/+2
	We let a `</table>` tag close an open `<tr>` or `<td>`. Closes #1167.
2014-06-20	Support --trace in HTML reader.	John MacFarlane	1	-1/+10

2014-06-19	HTML reader: Allow space between `<col>` and `</col>`.	John MacFarlane	1	-0/+1
	Test case: ``` <table border="1"> <colgroup> <col> </col> <col></col> </colgroup> <tbody> <tr> <td>X</td> <td>Y</td> </tr> <tr> <td>1</td> <td>2</td> </tr> </tbody> </table> ```