pandoc - Conversion between markup formats

Age	Commit message	Author	Files	Lines
2014-06-24	Docx reader: remove T.P.Generic import.	Jesse Rosenthal	1	-1/+0
	This marks the removal of the final tree-walk in the code. (Though there is still one in the Lists module.)
2014-06-24	Docx reader: pass definition test.	Jesse Rosenthal	1	-8/+13
	This commit also fixes a problem with the previous code pushes, which wouldn't allow code blocks to share a div.
2014-06-24	Docx Reader: add failing definition list tests.	Jesse Rosenthal	3	-0/+11

2014-06-24	Docx reader: pass code tests.	Jesse Rosenthal	1	-33/+47

2014-06-24	Docx reader: add failing tests for inline code and code blocks.	Jesse Rosenthal	5	-0/+13

2014-06-23	Merge pull request #1367 from jkr/reducible-copyright	John MacFarlane	1	-0/+31
	Add copyright block to T.P.R.Docx.Reducible.
2014-06-23	Add copyright block to T.P.R.Docx.Reducible.	Jesse Rosenthal	1	-0/+31

2014-06-23	Merge pull request #1366 from jkr/reducible3	John MacFarlane	7	-276/+289
	Docx rewrite and cleanup (in terms of Reducible typeclass)
2014-06-23	Add test for correctly trimming spaces in formatting.	Jesse Rosenthal	3	-0/+5
	This used to be fixed in the tree-walking. We need to make sure we're doing it right now.
2014-06-23	Add Reducible to cabal file.	Jesse Rosenthal	1	-0/+1

2014-06-23	Use Reducible in docx reader.	Jesse Rosenthal	1	-273/+111
	This cleans up the implementation, and cuts down on tree-walking. Anecdotally, I've seen about a 3-fold speedup.
2014-06-23	Move some of the clean-up logic into List module.	Jesse Rosenthal	1	-3/+22
	This will allow us to get rid of more general functions we no longer need in the main reader.
2014-06-23	Add new typeclass, Reducible	Jesse Rosenthal	1	-0/+150
	This defines a typeclass `Reducible` which allows us to "reduce" pandoc Inlines and Blocks, like so: `Emph [Strong [Str "foo", Space]] <++> Strong [Emph [Str "bar"]], Str "baz"] = [Strong [Emph [Str "foo", Space, Str "bar"], Space, Str "baz"]]`. This way, adjacent formattings and strings are appropriately grouped. Another set of operators for `(Reducible a) => (Many a)` is also included.
2014-06-23	LaTeX writer: Use `\textquotesingle` for `'` in inline code.	John MacFarlane	2	-0/+3
	Otherwise we get curly quotes in the PDF output. Closes #1364.
2014-06-23	Markdown reader: Combine consecutive latex environments.	John MacFarlane	1	-2/+4
	This helps when you have two minipages which can't have blank lines between them. See #690, #1196.
2014-06-21	Merge pull request #1363 from jkr/newNormalize	John MacFarlane	6	-12/+74
	Improve normalization
2014-06-22	Docx reader tests: add tests for normalization deep in blocks.	Jesse Rosenthal	3	-0/+10

2014-06-22	Docx reader tests: Correct normalize test.	Jesse Rosenthal	1	-1/+1

2014-06-22	Docx reader: Fix spacing in formatting.	Jesse Rosenthal	1	-1/+1
	The normalizing tests revealed a problem with unformatted spaces, brought about by `spanTrim`. This fixes it by not trimming the spaces out of spans until they are in their final form.
2014-06-22	Add normalization test.	Jesse Rosenthal	3	-0/+6
	Add torture-test for new normalization functions. One problem that this test demonstrates is that Word has a tendency to turn off formatting at a space, and then turn it back on after. I'm not sure yet whether this is something we should fix.
2014-06-22	Implement new normalization.	Jesse Rosenthal	1	-11/+57
	There were some problems with the old str normalization. This fixes those problems. Also, since it drills down on its own, it only needs to be mapped over the blocks, not walked over the tree.
2014-06-21	Fixed compiler warnings.	John MacFarlane	1	-2/+0

2014-06-20	Filters: don't print misleading error message.	John MacFarlane	1	-4/+1
	Previously pandoc would say that a filter was not found, even in a case where the filter had a syntax error.
2014-06-20	Merge pull request #1361 from jkr/testNormalize	John MacFarlane	1	-2/+27
	Docx reader tests: Introduce NoNormPandoc type.
2014-06-20	Docx reader tests: Introduce NoNormPandoc type.	Jesse Rosenthal	1	-2/+27
	This is just a wrapper around Pandoc that doesn't normalize with `toString`. We want to make sure that our own normalization process works. If, in the future, we are able to hook into the builder's normalization, this will be removed.
2014-06-20	Markdown reader: Support smallcaps through span.	John MacFarlane	2	-1/+14
	`<span style="font-variant:small-caps;">foo</span>` will be parsed as a `SmallCaps` inline, and will work in all output formats that support small caps. Closes #1360.
2014-06-20	MediaWiki reader: Tightened up template parsing.	John MacFarlane	1	-0/+1
	The opening "{{" must be followed by an alphanumeric or ':'. This prevents the exponential slowdown in #1033. Closes #1033.
2014-06-20	MediaWiki reader: Support --trace.	John MacFarlane	1	-1/+10

2014-06-20	LaTeX writer: Correctly handle figures in notes.	John MacFarlane	1	-5/+7
	Notes can't contain figures in LaTeX, so we fake it to avoid an error. Closes #1053.
2014-06-20	Markdown reader: Prevent spurious line breaks after list items.	John MacFarlane	1	-1/+2
	When the `hard_line_breaks` option was specified, pandoc would produce a spurious line break after a tight list item. This patch solves the problem. Closes #1137.
2014-06-20	ImageSize: Use default instead of failing if image size not found	John MacFarlane	1	-1/+6
	in exif header. Closes #1358.
2014-06-20	HTML reader: Fix performance issue with malformed HTML tables.	John MacFarlane	1	-0/+2
	We let a `</table>` tag close an open `<tr>` or `<td>`. Closes #1167.
2014-06-20	Support --trace in HTML reader.	John MacFarlane	1	-1/+10

2014-06-20	LaTeX writer: Fixed strikeout + highlighted code. Closes #1294.	John MacFarlane	2	-2/+21
	Previously strikeout highlighted code caused an error.
2014-06-20	Merge pull request #1357 from jkr/bottomUpStrNormalize	John MacFarlane	1	-5/+5
	Make strNormalize go bottomUp.
2014-06-20	Make strNormalize go bottomUp.	Jesse Rosenthal	1	-5/+5
	This was how it used to be before it was folded into blockNormalize.
2014-06-20	Merge pull request #1355 from jkr/normalizeFixes	John MacFarlane	1	-9/+15
	Docx reader: Fixes to block Normalization
2014-06-20	Docx reader: Add a comment explaining strNormalize	Jesse Rosenthal	1	-0/+4
	`normalize` from Text.Pandoc.Shared is more general. In tests, though, it more than doubles the run time. `strNormalize` does less, but it does what we need. This comment is added for future maintainability.
2014-06-20	Docx Reader: Normalize DefinitionLists	Jesse Rosenthal	1	-0/+2
	Previously DefinitionList had been left out of `blockNormalize`. Now it is included.
2014-06-20	Docx reader: simplify blockNormalize	Jesse Rosenthal	1	-10/+8
	Use a function `stripSpaces` instead of recursion. This makes it a bit easier to read and maintain, and simplifies normalizing DefinitionList, which was left out the first time.
2014-06-20	Docx reader: Fix hdr handling in block norm	Jesse Rosenthal	1	-0/+2
	`blockNormalize` previously forgot to account for the case in which a Header's inlines did not start with a space.
2014-06-19	Docx writer: Use Compact style for empty table cells.	John MacFarlane	1	-1/+3
	Otherwise we get overly tall lines when there are empty table cells and the other cells are compact. Closes #1353.
2014-06-19	HTML reader: Allow space between `<col>` and `</col>`.	John MacFarlane	1	-0/+1
	Test case:
	```
	<table border="1">
	<colgroup>
	<col> </col>
	<col></col>
	</colgroup>
	<tbody>
	<tr>
	<td>X</td>
	<td>Y</td>
	</tr>
	<tr>
	<td>1</td>
	<td>2</td>
	</tr>
	</tbody>
	</table>
	```
2014-06-19	Merge pull request #1354 from jkr/literalTab	John MacFarlane	6	-4/+29
	Parse literal tabs in docx
2014-06-19	Add tabs tests.	Jesse Rosenthal	3	-1/+8

2014-06-19	Fix notes test.	Jesse Rosenthal	1	-1/+1
	This previously allowed spaces at the beginning of a paragraph.
2014-06-19	Introduce blockNormalize	Jesse Rosenthal	1	-1/+14
	This will help take care of spaces introduced at the beginning of strings.
2014-06-19	Have Docx reader properly interpret tabs.	Jesse Rosenthal	1	-0/+2

2014-06-19	Add literal tabs to parser.	Jesse Rosenthal	1	-1/+4

2014-06-19	ImageSize: ignore unknown exif header tag rather than crashing.	John MacFarlane	1	-1/+2
	Some images seem to have tag type of 256, which was causing a runtime error.