pandoc - Conversion between markup formats

Age	Commit message	Author	Files	Lines
2014-07-27	Added txt2tags reader	Matthew Pickering	1	-0/+507
	http://txt2tags.org/ There are two points which currently do not match the official implementation. 1. In the official implementation, lists cannot be nested as in the following example, but this reader interprets it as a bullet list whose first item is a numbered list. ``` - + This is not a list ``` 2. The specification describes how URIs automatically become links. Unfortunately, as is often the case, their definition of a URI is not clear. I tried three solutions but was unsure about which to adopt. * Using isURI from Network.URI: this matches far too many strings and is therefore unsuitable. * Using uri from Text.Pandoc.Shared: this doesn't match all strings that the reference implementation matches. * Trying to simulate the regex which is used in the native code. I went with the third approach, but it is not perfect; for example, trailing punctuation is captured in URLs.
2014-07-26	Generalised more in Parsing.hs to enable the use of custom state	Matthew Pickering	2	-58/+114

2014-07-25	Fixed runtime error with compactify'DL on certain lists.	John MacFarlane	1	-11/+13
	Closes #1452. Added test.
2014-07-23	DocBook reader: Better handle elements inside code environments.	John MacFarlane	1	-1/+6
	Of course, we can't include structure in the code block, but this way we at least preserve the text. Closes #1449.
2014-07-22	Exported runParserT and Stream	Matthew Pickering	2	-2/+3

2014-07-22	Generalised readWith to readWithM	Matthew Pickering	1	-10/+19

2014-07-21	Revert "Shared.hierarchicalize: Don't number subsections of unnumbered sections."	John MacFarlane	1	-25/+18
	This reverts commit 2a46042661a088096ac54097db5cd3674438bb63.
2014-07-21	Shared.hierarchicalize: Don't number subsections of unnumbered sections.	John MacFarlane	1	-18/+25
	They were previously numbered, starting from the previous numbered section, which was wrong.
2014-07-21	Markdown writer: Avoid wrapping that might start a list.	John MacFarlane	1	-1/+5
	Or a blockquote or header. Closes #1013.
2014-07-20	EPUB writer: Avoid excess whitespace in nav.xhtml.	John MacFarlane	1	-1/+1
	This should improve TOC view in iBooks. Closes #1392.
2014-07-20	HTML reader: parse Div and Span elements even without `--parse-raw`.	John MacFarlane	1	-2/+0
	Closes #1434.
2014-07-20	Fix behavior of `markdown_attribute` extension.	John MacFarlane	2	-4/+17
	It now works as in PHP markdown extra. Setting `markdown="1"` on an outer tag affects all contained tags until it is reversed with `markdown="0"`. Closes #1378. Added `stateMarkdownAttribute` to `ParserState`.
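The inheritance rule described in this commit can be sketched as follows. This is an illustrative Python model, not pandoc's Haskell implementation: the tuple-based tree and the function name `markdown_enabled` are assumptions made for the example, as is the choice to start with markdown parsing switched off.

```python
def markdown_enabled(node, inherited=False):
    """Return (tag, enabled) pairs for a node and all of its descendants.

    `node` is a hypothetical (tag, attrs, children) tuple. A tag with
    markdown="1" switches parsing on for itself and everything it contains;
    markdown="0" switches it off again; any other tag inherits the setting
    of its parent.
    """
    tag, attrs, children = node
    value = attrs.get("markdown")
    if value == "1":
        enabled = True
    elif value == "0":
        enabled = False
    else:
        enabled = inherited
    result = [(tag, enabled)]
    for child in children:
        result.extend(markdown_enabled(child, enabled))
    return result


tree = (
    "div", {"markdown": "1"}, [
        ("p", {}, []),
        ("section", {"markdown": "0"}, [("span", {}, [])]),
    ],
)
for tag, enabled in markdown_enabled(tree):
    print(tag, enabled)
```

Here `p` inherits the setting from the outer `div`, while `section` and the `span` inside it are switched off by `markdown="0"`.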
2014-07-20	Markdown reader: Fixed small bug in HTML parsing with markdown_attribute.	John MacFarlane	1	-3/+4
	Test case: <aside markdown="1"> hi </aside> Previously gave: <article markdown="1"> <p><em>hi</em> </article></p>
2014-07-20	Markdown reader: revised definition list syntax (closes #1429).	John MacFarlane	2	-23/+40
	* This change brings pandoc's definition list syntax into alignment with that used in PHP markdown extra and multimarkdown (with the exception that pandoc is more flexible about the definition markers, allowing tildes as well as colons). * Lazily wrapped definitions are now allowed; blank space is required between list items; and the space before definition is used to determine whether it is a paragraph or a "plain" element. * For backwards compatibility, a new extension, `compact_definition_lists`, has been added that restores the behavior of pandoc 1.12.x, allowing tight definition lists with no blank space between items, and disallowing lazy wrapping.
2014-07-20	readWith: reverted generalization from f201bdcb.	John MacFarlane	1	-8/+8
	We need input to be a string so we can print the offending line on an error.
2014-07-20	Org reader: text adjacent to a list yields a Plain, not Para.	John MacFarlane	1	-3/+7
	This gives better results for tight lists. Closes #1437. An alternative solution would be to use Para everywhere, and never Plain. I am not sufficiently familiar with org to know which is best. Thoughts, @tarleb?
2014-07-20	AsciiDoc writer: Double markers in intraword emphasis.	John MacFarlane	1	-11/+46
	Closes #1441.
2014-07-19	Renamed readTeXMath' to avoid name conflict with texmath 0.6.7	Matthew Pickering	9	-26/+17
	Also removed deprecated readTeXMath.
2014-07-17	Org reader: Respect :exports header arguments on code blocks	Craig S. Bosma	1	-5/+27
	Adds support to the org reader for conditionally exporting either the code block, results block immediately following, both, or neither, depending on the value of the `:exports` header argument. If no such argument is supplied, the default org behavior (for most languages) of exporting code is used.
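The selection logic described above can be sketched roughly as follows. This is a Python illustration rather than the reader's actual Haskell code: the function name `exported_blocks` is hypothetical, and treating an unrecognized value like a missing argument is an assumption the commit message does not confirm.

```python
def exported_blocks(exports, code, results=None):
    """Choose which blocks to keep for a given `:exports` value.

    `exports` is the header argument value, or None when it is absent.
    `results` is the results block immediately following the code block,
    or None when there is none.
    """
    selection = {
        "code": [code],
        "results": [results],
        "both": [code, results],
        "none": [],
    }
    # Assumption: a missing or unrecognized value falls back to exporting code.
    chosen = selection.get(exports, [code])
    # Drop a results block that does not exist.
    return [block for block in chosen if block is not None]


print(exported_blocks(None, "CODE", "RESULTS"))
print(exported_blocks("both", "CODE", "RESULTS"))
print(exported_blocks("none", "CODE", "RESULTS"))
```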
2014-07-16	Remove unused import.	John MacFarlane	1	-1/+0

2014-07-16	Custom writers now work with `--template`.	John MacFarlane	1	-2/+14
	Removed HTML header scaffolding from data/sample.lua.
2014-07-16	Made Citation information available in lua custom writer.	John MacFarlane	1	-2/+17

2014-07-16	Removed redundant clause in markdown parser.	John MacFarlane	1	-2/+1
	Thanks @dubiousjim. Close #1431.
2014-07-15	Merge pull request #1430 from jkr/anchor-fix-2	John MacFarlane	1	-27/+31
	Fix auto identified headers when already auto-id'ed
2014-07-16	Docx Reader: Fix hdr auto-id when already auto-id.	Jesse Rosenthal	1	-11/+19
	If header anchors (bookmarks in a header paragraph) already have an auto-id, which will happen if they're generated by pandoc, we don't want to rename it twice, and thus end up with an unnecessary number at the end. So we add a state value to check if we're in a header. If we are, we don't rename the bookmark -- wait until we rename it in our header handling.
2014-07-16	Docx Reader: Change state handling.	Jesse Rosenthal	1	-16/+12
	We don't need `updateDState` -- the built-in `modify` works just fine. And we redefine `withDState` to use modify.
2014-07-15	HTML writer: Removed useless clause.	John MacFarlane	1	-4/+0

2014-07-15	LaTeX writer: Use \nolinkurl in email autolinks.	John MacFarlane	1	-2/+9
	This allows them to be styled using `\urlstyle{tt}`. Thanks to Ulrike Fischer for the solution.
2014-07-15	EPUB writer: Keep newlines between block elements.	John MacFarlane	1	-1/+1
	This allows easier diff-ability. Closes #1424.
2014-07-15	Shared.fetchItem: unescape URI encoding before reading local file.	John MacFarlane	1	-1/+1
	Close #1427.
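As a rough illustration of why the unescaping matters: a reference such as `my%20image.png` names a file called `my image.png` on disk, so the percent-encoding has to be undone before the path is opened. The sketch below uses Python's standard `urllib.parse.unquote`; the helper name `local_path_from_reference` is hypothetical and does not correspond to anything in pandoc.

```python
from urllib.parse import unquote


def local_path_from_reference(reference):
    """Turn a percent-encoded reference into the file name to look up on disk."""
    return unquote(reference)


print(local_path_from_reference("my%20image.png"))
```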
2014-07-13	RTF writer: Avoid extra paragraph tags in metadata.	John MacFarlane	1	-2/+10
	Closes #1421.
2014-07-13	Use raw HTML for complex block quotes.	John MacFarlane	1	-1/+7
	As far as I can see, dokuwiki markup is pretty limited in what can go in a `>` block quote: just a single line of paragraph text. (#1398)
2014-07-13	DokuWiki writer: Use raw HTML for complex lists...	John MacFarlane	1	-13/+40
	as in the mediawiki writer. The dokuwiki markup isn't able to handle multiple block-level items within a list item, except in a few special cases (e.g. code blocks, and these must be started on the same line as the preceding paragraph). So we fall back to raw HTML for these. Perhaps there is a better solution. We can "fake" multiple paragraphs within list items using hard line breaks (`\\`), but we must keep everything on one line. (#1398)
2014-07-13	DokuWiki writer: Normalize to collapse adjacent raw HTML blocks.	John MacFarlane	1	-1/+1

2014-07-13	DokuWiki writer: More tweaks to email links. (#1398)	John MacFarlane	1	-4/+3

2014-07-13	DokuWiki writer: Use pointy brackets for email links.	John MacFarlane	1	-0/+2
	(#1398)
2014-07-13	Dokuwiki writer: More idiomatic code for escaping.	John MacFarlane	1	-2/+4

2014-07-13	DokuWiki writer: More raw HTML fixes. (#1398)	John MacFarlane	1	-2/+4
	* Use uppercase HTML tags for block-level content, lowercase for inline. * Newline before closing HTML tag.
2014-07-13	DokuWiki writer: Fix raw inlines and blocks.	John MacFarlane	1	-6/+6
	* mediawiki > dokuwiki * ignore raw content other than html or dokuwiki. (#1398)
2014-07-13	Markdown writer: Use span with style for SmallCaps. (#1360)	John MacFarlane	1	-1/+8

2014-07-13	Markdown writer: use Span instead of (hackish) SmallCaps in plainify.	John MacFarlane	1	-9/+10

2014-07-13	EPUB writer: Use stringify instead of custom plainify.	John MacFarlane	1	-16/+10
	As far as I can tell, it does about the same thing.
2014-07-13	Better comment on removeFormatting.	John MacFarlane	1	-1/+1

2014-07-13	Shared: Generalized type of removeFormatting.	John MacFarlane	1	-1/+1

2014-07-13	Merge branch 'claremacrae-dokuwiki'.	John MacFarlane	1	-2/+4
	Use removeFormatting from Shared instead of the custom unfancy function.
2014-07-13	Shared: Added removeFormatting.	John MacFarlane	1	-0/+14
	API change (addition of exported function).
2014-07-13	Use renderTags' for all tag rendering.	John MacFarlane	3	-5/+5
	This properly handles tags that should be self-closing. Previously `<hr/>` would appear in EPUB output as `<hr></hr>`. Closes #1420.
2014-07-12	Fixed typo in module header for Asciify.	John MacFarlane	1	-1/+1
	Thanks to @dubiousjim. Closes #1419.
2014-07-12	Parsing: Simplified dash and ellipsis.	John MacFarlane	1	-40/+13
	This originated with @dubiousjim's observation in #1419 that there was a typo in the definition of enDash. It returned an em dash character instead of an en dash. I thought about why this had not been noticed before, and realized that en dashes were just being parsed as regular symbols. That made me realize that, now that we no longer have dedicated EnDash, EmDash, and Ellipses inline elements, as we used to in pandoc, we no longer need to parse the Unicode characters specially. This allowed a considerable simplification of the code. Partially resolves #1419.
2014-07-12	Removed space at ends of lines in source.	John MacFarlane	11	-95/+95