pandoc - Conversion between markup formats

Age	Commit message (Collapse)	Author	Files	Lines
2011-01-04	Moved 'macro' and 'applyMacros'' from markdown reader to Parsing.	John MacFarlane	2	-26/+27

2011-01-01	Fixed regression in markdown reader.	John MacFarlane	1	-3/+3
	'(_hi_)' was being parsed with literal underscores (no emphasis). The fix: the 'str' parser now only parses alphanumerics and embedded underscores. All other symbols are handled by the 'symbol' parser. This has a slight effect on the AST, since you'll get [Str "hi",Str ":"] insntead of [Str "hi:"]. But there should not be a visible effect in any of the writers. Thanks to gwern for pointing out the regression.
2011-01-01	Updated copyright notices.	John MacFarlane	1	-1/+1

2010-12-30	LaTeX reader: Allow ignored comments after \end{document}.	John MacFarlane	1	-3/+1

2010-12-30	HTML reader: Fixed some parsing bugs.	John MacFarlane	1	-22/+28

2010-12-30	Added support for listings package code blocks and inline code.	Puneeth Chaganti	1	-2/+9

2010-12-30	Textile reader: Slight speed improvement.	John MacFarlane	1	-5/+5

2010-12-30	New HTML reader using tagsoup as a lexer.	John MacFarlane	4	-628/+424
	* The new reader is faster and more accurate. * API changes for Text.Pandoc.Readers.HTML: - removed rawHtmlBlock, anyHtmlBlockTag, anyHtmlInlineTag, anyHtmlTag, anyHtmlEndTag, htmlEndTag, extractTagType, htmlBlockElement, htmlComment - added htmlTag, htmlInBalanced, isInlineTag, isBlockTag, isTextTag * tagsoup is a new dependency. * Text.Pandoc.Parsing: Generalized type on readWith. * Benchmark.hs: Added length calculation to force full evaluation. * Updated HTML reader tests. * Updated markdown and textile readers to use the functions from the HTML reader. * Note: The markdown reader now correctly handles some cases it did not before. For example: <hr/> is reproduced without adding a space. <script> a = '<b>'; </script> is parsed correctly.
2010-12-26	normalize: Don't reduce [Space] to [].	John MacFarlane	1	-4/+1

2010-12-26	Improved 'normalize'.	John MacFarlane	1	-41/+44
	Now normalizeInlines is split into consolidateInlines and removeEmptyInlines. We need to remove empties before consolidating.
2010-12-26	Markdown writer: Fixed bug in Image.	John MacFarlane	1	-1/+1
	URI was getting unescaped twice!
2010-12-25	Improved normalize.	John MacFarlane	1	-0/+15

2010-12-24	Use functions from Text.Pandoc.Generic instead of processWith(M).	John MacFarlane	9	-17/+28

2010-12-22	HTML reader: Simplified parsing of <script> sections.	John MacFarlane	1	-24/+1
	I had previously assumed that we needed to ignore </script> occuring in a string literal or javascript comment. It turns out, though, that browsers aren't that smart.
2010-12-22	Made --smart work with HTML reader.	John MacFarlane	1	-4/+13
	It did not work before, because - and quotes were gobbled up by the str parser.
2010-12-22	RST reader: Added unicode quote characters to specialChars.	John MacFarlane	1	-1/+1
	(So they can trigger Quoted environments.)
2010-12-22	RST reader: recouped speed loss due to addition of --smart.	John MacFarlane	1	-4/+4
	This was achieved by rearranging the parsers in inline. Benchmarks went from 500ms to 307ms -- not quite back to the 279ms we had in 1.6, before supporting smart punctuation and footnotes, but close.
2010-12-22	ODT writer: Don't wrap text in opendocument.	John MacFarlane	1	-1/+1

2010-12-22	Removed all dependencies on 'pretty' package.	John MacFarlane	1	-4/+0

2010-12-22	Texinfo writer: Updated to use Pretty.	John MacFarlane	1	-56/+37

2010-12-22	Shared: Removed unneeded prettyprinting functions:	John MacFarlane	1	-75/+0
	wrapped, wrapIfNeeded, wrappedTeX, wrapTeXIfNeeded, hang'.
2010-12-22	Shared: Removed BlockWrapper, wrappedBlocksToDoc.	John MacFarlane	1	-13/+1
	These are no longer needed with the new Pretty module.
2010-12-22	Pretty: Added quote, doubleQuote.	John MacFarlane	1	-0/+10

2010-12-22	Man writer: updated to use Pretty.	John MacFarlane	1	-18/+22

2010-12-21	OpenDocument writer: Updated to use Pretty.	John MacFarlane	1	-8/+12

2010-12-21	XML: don't use breaking spaces in attribute lists.	John MacFarlane	1	-4/+5

2010-12-21	Docbook writer: Updated to use Pretty.	John MacFarlane	1	-21/+20

2010-12-21	Pretty: don't print a breaking space before a newline.	John MacFarlane	1	-0/+4

2010-12-21	Shared: Made splitBy take a test instead of an element.	John MacFarlane	4	-9/+9

2010-12-21	XML: Replaced escapeStringAsXML with a faster version.	John MacFarlane	1	-9/+1
	Benchmarked with criterion, it's about 8x faster than the old version. This speeds up docbook, opendocument, and html writers.
2010-12-20	Markdown writer: use \ for newline instead of two spaces at eol.	John MacFarlane	1	-1/+4
	(Unless --strict.)
2010-12-20	Markdown writer: Use delimited code block if there are attributes.	John MacFarlane	1	-2/+21
	(Unless in strict mode.)
2010-12-20	Plain writer: set stateStrictMarkdown automatically.	John MacFarlane	1	-3/+4

2010-12-20	ConTeXt writer: Updated to use Text.Pandoc.Pretty.	John MacFarlane	1	-73/+71

2010-12-20	Renamed 'enclosed' to 'inside'.	John MacFarlane	1	-7/+7
	This avoids conflict with 'enclosed' in Text.Pandoc.Parsing.
2010-12-19	Pretty: Fixed parens.	John MacFarlane	1	-1/+1

2010-12-19	Pretty: Added enclosed, parens.	John MacFarlane	1	-2/+13

2010-12-19	LaTeX writer: A bit of code polish.	John MacFarlane	1	-29/+28

2010-12-19	LaTeX writer: Modified to use Pretty.	John MacFarlane	1	-34/+30
	Improved footnote formatting, removed spurious blank lines.
2010-12-19	Shared: Use stringify to simplify inlineListToIdentifier.	John MacFarlane	1	-28/+11

2010-12-19	Pretty: Added braces and brackets.	John MacFarlane	1	-0/+9

2010-12-18	LaTeX writer: Use \paragraph, \subparagraph for level 4,5 headers.	John MacFarlane	1	-9/+10

2010-12-17	Added new prettyprinting module.	John MacFarlane	7	-451/+708
	* Added Text.Pandoc.Pretty. This is better suited for pandoc than the 'pretty' package. One advantage is that we now get proper wrapping; Emph [Inline] is no longer treated as a big unwrappable unit. Previously we only got breaks for spaces at the "outer level." We can also more easily avoid doubled blank lines. Performance is significantly better as well. * Removed Text.Pandoc.Blocks. Text.Pandoc.Pretty allows you to define blocks and concatenate them. * Modified markdown, RST, org readers to use Text.Pandoc.Pretty instead of Text.PrettyPrint.HughesPJ. * Text.Pandoc.Shared: Added writerColumns to WriterOptions. * Markdown, RST, Org writers now break text at writerColumns. * Added --columns command-line option, which sets stColumns and writerColumns. * Table parsing: If the size of the header > stColumns, use the header size as 100% for purposes of calculating relative widths of columns.
2010-12-15	HTML reader: allow : in tags.	John MacFarlane	1	-2/+6
	Resolves Issue #274.
2010-12-15	Use top-level header at end as bibliography title for natbib and biblatex ↵	Nathan Gass	1	-4/+13
	output.
2010-12-15	Remove punctuation at start of suffix for natbib and biblatex output.	Nathan Gass	1	-2/+6
	This is necessary as the latex citation commands include there own punctuation, which resulted in doubled commas for markdown documents where citeproc output works correctly.
2010-12-15	Support multiple bibliography files with natbib and biblatex output.	Nathan Gass	2	-3/+4

2010-12-14	Added 'normalize' to Text.Pandoc.Shared.	John MacFarlane	1	-1/+53

2010-12-14	Fixed preamble parsing in LaTeX reader.	John MacFarlane	1	-2/+8

2010-12-14	Fixed regression in parsing _emph_	John MacFarlane	1	-1/+1
	There was a bug in parsing '_emph_, ...': when followed by a comma, underscore emphasis did not register. (Thanks to gwern for pointing this out.) This bug was introduced by the change in c66921f2acea456af527b93e2daa1d8594798642