pandoc - Conversion between markup formats

Age	Commit message (Collapse)	Author	Files	Lines
2014-08-20	Txt2Tags reader: Header is now parsed only if standalone flag is set	mpickering	1	-1/+4

2014-08-18	Merge pull request #1547 from jkr/styleparse	John MacFarlane	2	-36/+110
	Docx reader: parsing styles
2014-08-18	HTML reader: improved handling of tags that can be block or inline.	John MacFarlane	1	-5/+13
	Previously a section like this would be enclosed in a paragraph, with RawInline for the video tags (since video is a tag that can be either block or inline): <video controls="controls"> <source src="../videos/test.mp4" type="video/mp4" /> <source src="../videos/test.webm" type="video/webm" /> <p> The videos can not be played back on your system.<br/> Try viewing on Youtube (requires Internet connection): <a href="http://youtu.be/etE5urBps_w">Relative Velocity on Youtube</a>. </p> </video> This change will cause the video and source tags to be parsed as RawBlock instead, giving better output. The general change is this: when we're parsing a "plain" sequence of inlines, we don't parse anything that COULD be a block-level tag.
2014-08-17	Docx reader: whitespace fix.	Jesse Rosenthal	1	-6/+6

2014-08-17	Docx reader: remove emph styles and strong styles list.	Jesse Rosenthal	1	-6/+0
	We no longer need the explicit lists since we're deriving them from the ground up.
2014-08-17	Docx reader: Add "Hyperlink" to blacklisted styles.	Jesse Rosenthal	1	-2/+2
	This is the only one so far. We'll add others as they show up.
2014-08-17	Docx reader: Use style resolver.	Jesse Rosenthal	1	-23/+9
	We now no longer check against explicit styles.
2014-08-17	Docx Reader: Introduce function for resolving dependent run styles.	Jesse Rosenthal	1	-0/+31
	We always favor an explicit positive or negative in a style in a descendent, and only turn to the ancestor if nothing is set. We also introduce an (empty) list of styles that are black-listed. We won't check them. (Think underlines in hyperlinks).
2014-08-17	Docx Parse: build a bottom-up style tree.	Jesse Rosenthal	1	-6/+31
	Two points here: (1) We're going bottom-up, from styles not based on anything, to avoid circular dependencies or any other sort of maliciousness/incompetence. And (2) each style points to its parent. That way, we don't need the whole tree to pass a style over to Docx.hs
2014-08-17	Update Reader.EPUB to use `MimeType`.	Artyom Kazak	1	-8/+7

2014-08-17	Alias string and runStyle to CharStyle type.	Jesse Rosenthal	1	-7/+10

2014-08-17	Docx Style parser: Basic one now just takes a parent style.	Jesse Rosenthal	1	-13/+15
	This will make it easier to build the style map from the bottom up (to avoid any infinite references).
2014-08-17	Docx reader: work with new rStyle.	Jesse Rosenthal	1	-4/+4
	Just discards info at the moment, so at least it works the same.
2014-08-17	Parser: Framework for parsing styles.	Jesse Rosenthal	1	-11/+44
	We want to be able to read user-defined styles. Eventually we'll be able to figure out styles in terms of inheritance as well. The actual cascading will happen in the docx reader.
2014-08-17	Docx reader: Change behavior of Super/Subscript	Jesse Rosenthal	2	-16/+17
	In docx, super- and subscript are attributes of Vertalign. It makes more sense to follow this, and have different possible values of Vertalign in runStyle. This is mainly a preparatory step for real style parsing, since it can distinguish between vertical align being explicitly turned off and it not being set. In addition, it makes parsing a bit clearer, and makes sure we don't do docx-impossible things like being simultaneously super and sub.
2014-08-16	HTML reader: Parse appropriately styled span as SmallCaps.	John MacFarlane	1	-1/+6

2014-08-16	Docx reader: Remove unnecessary plural functions	Jesse Rosenthal	1	-11/+5
	functions like runElemsToInlines and parPartsToInlines are just defined in terms of concatting and mapping their singular version (e.g. `runElemToInlines`). Having two functions with almost identical names makes it easier to introduce errors. It's easy enough to just concat and map inline, and it makes it clearer what is going on in the code.
2014-08-16	Docx reader: Fix bug in character styles.	Jesse Rosenthal	1	-2/+2
	Style handling has been cleaned up, but introduced a bug here. There wasn't previously a test to catch it.
2014-08-16	Rewrite Docx.hs and Reducible to use Builder.	Jesse Rosenthal	2	-415/+368
	The big news here is a rewrite of Docx to use the builder functions. As opposed to previous attempts, we now see a significant speedup -- times are cut in half (or more) in a few informal tests. Reducible has also been rewritten. It can doubtless be simplified and clarified further. We can consider this, at the moment, a reference for correct behavior.
2014-08-14	Markdown reader: Better handle quote characters in inline links.	John MacFarlane	1	-2/+4
	This was previously failing to be recognized as a link: [Test](http://en.wikipedia.org/wiki/Ward's_method) Closes #1534.
2014-08-13	Docx reader: Interpret "Strong" and Emphasis run styles.	Jesse Rosenthal	1	-2/+6

2014-08-13	Docx: Reducible forgot about smallcaps	Jesse Rosenthal	1	-0/+2

2014-08-12	Docx Reader: Trim line breaks from the beginning and end of Section	Jesse Rosenthal	1	-2/+10
	Headers. We might also want to do this elsewhere (for pars, for example).
2014-08-12	Docx: More robust handling of multiple bookmarks in header.	Jesse Rosenthal	1	-6/+8

2014-08-12	Docx reader: Check for null-id'd anchors too.	Jesse Rosenthal	1	-1/+0
	Otherwise they get left dangling in the document.
2014-08-12	Docx reader: accept explicit "Italic" and "Bold" rStyles.	Jesse Rosenthal	2	-18/+31
	Note that "Italic" can be on, and, from the last commit, `<w:i>` can be present, but be turned off. In that case, the turned-off tag takes precedence. So, we have to distinguish between something being off and something not being there. Hence, isItalic, isBold, isStrike, and isSmallCaps have become Maybes.
2014-08-12	Docx reader: Add "BlockQuotation" to divs list.	Jesse Rosenthal	1	-1/+1

2014-08-12	Docx Reader: Fix font style parsing.	Jesse Rosenthal	1	-12/+27
	Before we just checked for the existence of a tag. Now, we make sure to check for its on/off value.
2014-08-12	Merge pull request #1528 from mpickering/epubtitlepage	John MacFarlane	1	-4/+10
	EPUB Reader: Ignores titlepage attribute
2014-08-12	EPUB Reader: Ignore title pages	Matthew Pickering	1	-4/+10

2014-08-12	DocBook: Support equations with mathml.	John MacFarlane	1	-4/+16
	equation, informalequation, inlineequation and mml:math elements.
2014-08-12	Merge pull request #1524 from jkr/dropCap3	John MacFarlane	2	-3/+11
	Docx reader: move dropcap combining logic to Reducible
2014-08-12	Markdown reader: Improved parsing of indented code in list items.	John MacFarlane	1	-25/+42
	Indented code at the beginning of a list item must be indented eight spaces from the margin (or from the edge of the container), or four spaces past the list marker, whichever is farther. Some examples in `tests/markdown-reader-more.txt`.
2014-08-12	Docx reader: move combining logic to Reducible	Jesse Rosenthal	2	-3/+11
	Introduces a new function in Reducibles, concatR. The idea is that if we have two list of Reducibles (blocks or inlines), we can combine them and just perform the reduction on the joining parts (the last element of the first list, the first element of the second list). This is useful in cases where the two lists are already reduced, and we're only worried about the joining elements. This actually improves the efficiency a bit further, because concatR can be smart about empty lists.
2014-08-12	Docx reader: Make dropcap combining more efficient.	Jesse Rosenthal	1	-1/+3
	Before, we had to run reduceList on the whole combined paragraph, which was redundant, and could take some time for long paragraphs. We only need to combine the drop cap with the first inline of the next paragraph.
2014-08-11	Docx reader: combine inlines properly in dropcaps.	Jesse Rosenthal	1	-1/+1
	Make sure that adjacent inlines are combined properly in dropcaps. This updates the test results as well.
2014-08-11	Docx reader: Use dropcap state.	Jesse Rosenthal	1	-9/+17
	If we get to a dropcap, we keep hold the inlines until the next paragraph, and combine it there.
2014-08-11	Add dropCap to paragraph style.	Jesse Rosenthal	1	-2/+12

2014-08-11	EPUB reader: use walk instead of bottomUp.	John MacFarlane	1	-2/+1
	This should be more efficient.
2014-08-11	Merge pull request #1521 from jkr/emptyEmph	John MacFarlane	1	-5/+6
	Discard empty formatters
2014-08-11	Merge pull request #1519 from mpickering/more	John MacFarlane	1	-1/+1
	EPUB Normalisation and anchors for div blocks in tex
2014-08-11	Textile reader: list and HTML block parsing improvements.	John MacFarlane	1	-16/+13
	Closes #1513. Lists can now start without an intervening blank line. Also, html block-level tags that don't start a line are parsed as RawInline and don't interrupt paragraphs, as in RedCloth.
2014-08-11	Docx reader: handle empty reducibles.	Jesse Rosenthal	1	-5/+6

2014-08-11	EPUB Reader: Fixed another normalisation problem..	Matthew Pickering	1	-1/+1

2014-08-11	Merge pull request #1516 from mpickering/epubmetadata	John MacFarlane	1	-6/+7
	EPUB improvements
2014-08-11	Docx Parse: Improved font recognition when specified in rFonts element	Matthew Pickering	1	-8/+27

2014-08-11	Docx Fonts: Derives Show and Eq	Matthew Pickering	1	-0/+1

2014-08-11	EPUB Reader: Can now parse multiple meta data fields	Matthew Pickering	1	-2/+2

2014-08-11	EPUB reader: Fixed bug where filepaths weren't sufficiently normalised	Matthew Pickering	1	-4/+5

2014-08-10	Merge pull request #1510 from jkr/spacefix	John MacFarlane	1	-10/+12
	Docx reader: Fix spacing issue.