pandoc - Conversion between markup formats

Age	Commit message	Author	Files	Lines
2014-10-20	Org reader: parse LaTeX-style MathML entities	Albert Krewinkel	1	-1/+8
	Org supports special symbols which can be included using LaTeX syntax, but are actually MathML entities. Examples of this are `\nbsp` (non-breaking space), `\Aacute` (the letter A with accent acute) or `\copy` (the copyright sign ©). This fixes #1657.
2014-10-18	Markdown reader: allow `startnum` to work without `fancy_lists`.	John MacFarlane	1	-2/+2
	Formerly `pandoc -f markdown-fancy_lists+startnum` did not work properly.
2014-10-18	Merge pull request #1680 from shelf/master	John MacFarlane	1	-7/+26
	Respect indent when parsing Org bullet lists
2014-10-18	Merge pull request #1700 from tarleb/org-emphasis-fix	John MacFarlane	1	-5/+5
	Org reader: fix rules for emphasis recognition
2014-10-18	Org reader: Drop COMMENT document trees	Albert Krewinkel	1	-1/+26
	Document trees under a header starting with the word `COMMENT` are comment trees and should not be exported. Those trees are dropped silently. This closes #1678.
2014-10-18	Org reader: fix rules for emphasis recognition	Albert Krewinkel	1	-5/+5
	Things like `/hello,/` or `/hi'/` were falsely recognized as emphasized strings. This is wrong, as `,` and `'` are forbidden border chars and may not occur on the inner border of emphasized text. This patch makes the reader match the reference implementation in that it reads the above strings as plain text.
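The border rule in this message can be sketched in Python. This is an illustration only, not pandoc's Haskell code: `is_emphasis` and `FORBIDDEN_BORDER_CHARS` are hypothetical names, and only the two characters named in the message are modeled, so any further rules of the reference implementation are left out.

```python
# Forbidden inner-border characters, as named in the commit message.
# Assumption: other Org emphasis rules are deliberately not modeled here.
FORBIDDEN_BORDER_CHARS = {",", "'"}


def is_emphasis(token: str, marker: str = "/") -> bool:
    """Return True if `token` is marker-delimited text with valid inner borders."""
    if not (token.startswith(marker) and token.endswith(marker)):
        return False
    inner = token[len(marker):len(token) - len(marker)]
    if not inner:
        return False
    # The first and last characters inside the markers form the inner border.
    return (inner[0] not in FORBIDDEN_BORDER_CHARS
            and inner[-1] not in FORBIDDEN_BORDER_CHARS)
```

Under this sketch `/hello/` counts as emphasis, while `/hello,/` and `/hi'/` are left as plain text. A comma in the middle, as in `/a,b/`, is still accepted because it does not touch the border.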
2014-10-17	Fix indent issue for definition lists	Timothy Humphries	1	-14/+25
	Tidy up fix for #1650, #1698 as per comments in #1680. Fix same issue for definition lists with the same method.
2014-10-12	Respect indent when parsing Org bullet lists	Timothy Humphries	1	-2/+10
	Fixes issue with top-level bullet list parsing. Previously we would use `many1 spaceChars` rather than respecting the list's indent level. We also permitted `*` bullets on unindented lists, which should unambiguously parse as `header 1`. Combined, this meant headers at a different indent level were being unwittingly slurped into preceding bullet lists, as per Issue #1650.
2014-09-27	Merge pull request #1601 from jkr/windowsfix	John MacFarlane	1	-1/+1
	Fix path-slashes inside archive for windows
2014-09-27	Org Reader: Parse multi-inline terms correctly in definition list	Matthew Pickering	1	-1/+1
	Closes #1649
2014-09-25	HTML Reader: Recognise <br> tags inside <pre> blocks	mpickering	1	-1/+6
	Closes #1620
2014-09-06	Docx Reader: Remove header class properly in other langs	Jesse Rosenthal	1	-4/+4
	When we encounter one of the polyglot header styles, we want to remove that from the par styles after we convert to a header. To do that, we have to keep track of the style name, and remove it appropriately.
2014-09-05	Docx reader: Use polyglot header list.	Jesse Rosenthal	1	-7/+7
	We're just keeping a list of header formats that different languages use as their default styles. At the moment, we have English, German, Danish, and French. We can continue to add to this. This is simpler than parsing the styles file, and perhaps less error-prone, since there seem to be some variations, even within a language, of how a style file will define headers.
2014-09-05	Docx Reader: Start list of polyglot section headers.	Jesse Rosenthal	1	-0/+7

2014-09-04	Org reader: Added state changing blanklines.	Jesse Rosenthal	1	-1/+8
	This allows us to emphasize at the beginning of a new paragraph (or, in general, after blank lines).
2014-09-04	Docx reader: Rewrite rewriteLink to work with new headers.	Jesse Rosenthal	1	-9/+4
	There could be new top-level headers after making lists, so we have to rewrite links after that.
2014-09-04	Docx reader: Single-item headers in ordered lists are headers.	Jesse Rosenthal	1	-4/+6
	When users number their headers, Word understands that as a single-item enumerated list. We make the assumption that such a list is, in fact, a header.
2014-09-02	Docx reader: Fix Windows path for image lookup.	Jesse Rosenthal	1	-1/+1
	Don't use os-sensitive "combine", since we always want the paths in our zip-archive to use forward-slashes.
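The same idea can be shown in Python, as an illustration only and not pandoc's Haskell code. `archive_member_path` is a hypothetical helper, and the example path components are made up. The standard `posixpath` and `ntpath` modules show the difference between always joining with `/` and joining with the Windows separator.

```python
import ntpath
import posixpath


def archive_member_path(*parts: str) -> str:
    """Join path components for use as a zip archive member name."""
    # posixpath.join uses "/" on every platform, unlike os.path.join,
    # which follows the conventions of the host operating system.
    return posixpath.join(*parts)


print(archive_member_path("word", "media", "image1.png"))  # word/media/image1.png
print(ntpath.join("word", "media", "image1.png"))          # word\media\image1.png
```

The second call shows what an OS-sensitive join produces on Windows, which would not match the member names stored in the archive.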
2014-08-31	Markdown reader: better handling of paragraph in div.	John MacFarlane	1	-0/+7
	Previously text that ended a div would be parsed as Plain unless there was a blank line before the closing div tag. Test case: <div class="first"> This is a paragraph. This is another paragraph. </div> Closes #1591.
2014-08-30	Merge branch 'mime' of https://github.com/Aelve/John into Aelve-mime	John MacFarlane	1	-8/+7
	Conflicts: src/Text/Pandoc/Writers/Docx.hs
2014-08-28	Docx Reader: Read single para in table cell as plain	Jesse Rosenthal	1	-1/+12
	This makes the docx reader's native output fit with the way the markdown reader understands its markdown output. I.e., as far as table cells go: docx -> native == docx -> native -> markdown -> native (This identity isn't true for other things outside of table cells, of course).
2014-08-26	Fixed exampleLine parser to accept example lines which have indentation at the start of the line.	Calvin Beck	1	-1/+1

2014-08-21	Txt2Tags Reader: Fixed crash when reading from stdin	mpickering	1	-3/+5

2014-08-21	Txt2Tags Reader: Corrected formatting of %%mtime macro	mpickering	1	-1/+1

2014-08-21	Txt2Tags Reader: Parse Meta information	mpickering	1	-10/+36
	The header is now parsed as meta information. The first line is the `title`, the second is the `author` and third line is the `date`.
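A minimal Python sketch of the mapping described here, as an illustration only and not the reader's Haskell code. `parse_header` is a hypothetical name, and padding missing lines with empty strings is an assumption of the sketch.

```python
def parse_header(text: str) -> dict:
    """Map the first three lines of a document to title, author and date."""
    lines = text.split("\n")
    # Pad with empty strings so that short inputs still yield three fields.
    title, author, date = (lines + ["", "", ""])[:3]
    return {
        "title": title.strip(),
        "author": author.strip(),
        "date": date.strip(),
    }
```

For an input whose first three lines are `My Title`, `Jane Doe` and `2014-08-21`, the sketch returns those values under `title`, `author` and `date` and ignores everything after the third line.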
2014-08-20	Txt2Tags reader: Header is now parsed only if standalone flag is set	mpickering	1	-1/+4

2014-08-18	Merge pull request #1547 from jkr/styleparse	John MacFarlane	2	-36/+110
	Docx reader: parsing styles
2014-08-18	HTML reader: improved handling of tags that can be block or inline.	John MacFarlane	1	-5/+13
	Previously a section like this would be enclosed in a paragraph, with RawInline for the video tags (since video is a tag that can be either block or inline): <video controls="controls"> <source src="../videos/test.mp4" type="video/mp4" /> <source src="../videos/test.webm" type="video/webm" /> <p> The videos can not be played back on your system.<br/> Try viewing on Youtube (requires Internet connection): <a href="http://youtu.be/etE5urBps_w">Relative Velocity on Youtube</a>. </p> </video> This change will cause the video and source tags to be parsed as RawBlock instead, giving better output. The general change is this: when we're parsing a "plain" sequence of inlines, we don't parse anything that COULD be a block-level tag.
2014-08-17	Docx reader: whitespace fix.	Jesse Rosenthal	1	-6/+6

2014-08-17	Docx reader: remove emph styles and strong styles list.	Jesse Rosenthal	1	-6/+0
	We no longer need the explicit lists since we're deriving them from the ground up.
2014-08-17	Docx reader: Add "Hyperlink" to blacklisted styles.	Jesse Rosenthal	1	-2/+2
	This is the only one so far. We'll add others as they show up.
2014-08-17	Docx reader: Use style resolver.	Jesse Rosenthal	1	-23/+9
	We now no longer check against explicit styles.
2014-08-17	Docx Reader: Introduce function for resolving dependent run styles.	Jesse Rosenthal	1	-0/+31
	We always favor an explicit positive or negative in a style in a descendant, and only turn to the ancestor if nothing is set. We also introduce an (empty) list of styles that are black-listed. We won't check them. (Think underlines in hyperlinks).
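The resolution rule can be sketched in Python, as an illustration only and not pandoc's Haskell code. `CharStyle`, `resolve` and `BLACKLISTED_STYLES` are hypothetical names. Two assumptions go beyond the message: a black-listed style is treated as contributing nothing, along with its ancestors, and "Hyperlink" (the style named in a later entry of this log) is used as the example entry.

```python
from dataclasses import dataclass
from typing import Optional

# Assumption: styles listed here are never consulted during resolution.
BLACKLISTED_STYLES = {"Hyperlink"}


@dataclass
class CharStyle:
    name: str
    bold: Optional[bool] = None      # None means "not set"
    italic: Optional[bool] = None    # None means "not set"
    parent: Optional["CharStyle"] = None


def resolve(style: Optional[CharStyle]) -> dict:
    """Resolve properties, preferring explicit values in descendants."""
    if style is None or style.name in BLACKLISTED_STYLES:
        return {"bold": None, "italic": None}
    inherited = resolve(style.parent)
    return {
        # An explicit True or False wins; only None falls back to the ancestor.
        "bold": style.bold if style.bold is not None else inherited["bold"],
        "italic": style.italic if style.italic is not None else inherited["italic"],
    }
```

A child style with `bold=False` whose parent sets `bold=True, italic=True` resolves to bold off and italic on: the explicit negative in the descendant wins, and the unset property is taken from the ancestor.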
2014-08-17	Docx Parse: build a bottom-up style tree.	Jesse Rosenthal	1	-6/+31
	Two points here: (1) We're going bottom-up, from styles not based on anything, to avoid circular dependencies or any other sort of maliciousness/incompetence. And (2) each style points to its parent. That way, we don't need the whole tree to pass a style over to Docx.hs
2014-08-17	Update Reader.EPUB to use `MimeType`.	Artyom Kazak	1	-8/+7

2014-08-17	Alias string and runStyle to CharStyle type.	Jesse Rosenthal	1	-7/+10

2014-08-17	Docx Style parser: Basic one now just takes a parent style.	Jesse Rosenthal	1	-13/+15
	This will make it easier to build the style map from the bottom up (to avoid any infinite references).
2014-08-17	Docx reader: work with new rStyle.	Jesse Rosenthal	1	-4/+4
	Just discards info at the moment, so at least it works the same.
2014-08-17	Parser: Framework for parsing styles.	Jesse Rosenthal	1	-11/+44
	We want to be able to read user-defined styles. Eventually we'll be able to figure out styles in terms of inheritance as well. The actual cascading will happen in the docx reader.
2014-08-17	Docx reader: Change behavior of Super/Subscript	Jesse Rosenthal	2	-16/+17
	In docx, super- and subscript are attributes of Vertalign. It makes more sense to follow this, and have different possible values of Vertalign in runStyle. This is mainly a preparatory step for real style parsing, since it can distinguish between vertical align being explicitly turned off and it not being set. In addition, it makes parsing a bit clearer, and makes sure we don't do docx-impossible things like being simultaneously super and sub.
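The distinction between "explicitly turned off" and "not set" can be sketched in Python, as an illustration only and not pandoc's Haskell code. `VertAlign` and `describe` are hypothetical names. A single optional enum value replaces two independent flags, so a run cannot be superscript and subscript at once.

```python
from enum import Enum
from typing import Optional


class VertAlign(Enum):
    BASELINE = "baseline"
    SUPERSCRIPT = "superscript"
    SUBSCRIPT = "subscript"


def describe(vert_align: Optional[VertAlign]) -> str:
    """Explain what a run's vertical-alignment setting means."""
    if vert_align is None:
        # Nothing was specified on this style; a parent style may decide.
        return "not set"
    if vert_align is VertAlign.BASELINE:
        return "explicitly turned off"
    return f"explicitly {vert_align.value}"
```

With two booleans, "both false" could mean either case; here `None` and `VertAlign.BASELINE` keep them apart.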
2014-08-16	HTML reader: Parse appropriately styled span as SmallCaps.	John MacFarlane	1	-1/+6

2014-08-16	Docx reader: Remove unnecessary plural functions	Jesse Rosenthal	1	-11/+5
	Functions like `runElemsToInlines` and `parPartsToInlines` are just defined in terms of concatting and mapping their singular version (e.g. `runElemToInlines`). Having two functions with almost identical names makes it easier to introduce errors. It's easy enough to just concat and map inline, and it makes it clearer what is going on in the code.
2014-08-16	Docx reader: Fix bug in character styles.	Jesse Rosenthal	1	-2/+2
	Style handling has been cleaned up, but the cleanup introduced a bug here. There wasn't previously a test to catch it.
2014-08-16	Rewrite Docx.hs and Reducible to use Builder.	Jesse Rosenthal	2	-415/+368
	The big news here is a rewrite of Docx to use the builder functions. As opposed to previous attempts, we now see a significant speedup -- times are cut in half (or more) in a few informal tests. Reducible has also been rewritten. It can doubtless be simplified and clarified further. We can consider this, at the moment, a reference for correct behavior.
2014-08-14	Markdown reader: Better handle quote characters in inline links.	John MacFarlane	1	-2/+4
	This was previously failing to be recognized as a link: [Test](http://en.wikipedia.org/wiki/Ward's_method) Closes #1534.
2014-08-13	Docx reader: Interpret "Strong" and Emphasis run styles.	Jesse Rosenthal	1	-2/+6

2014-08-13	Docx: Reducible forgot about smallcaps	Jesse Rosenthal	1	-0/+2

2014-08-12	Docx Reader: Trim line breaks from the beginning and end of Section Headers.	Jesse Rosenthal	1	-2/+10
	We might also want to do this elsewhere (for pars, for example).
2014-08-12	Docx: More robust handling of multiple bookmarks in header.	Jesse Rosenthal	1	-6/+8

2014-08-12	Docx reader: Check for null-id'd anchors too.	Jesse Rosenthal	1	-1/+0
	Otherwise they get left dangling in the document.