pandoc - Conversion between markup formats

Age	Commit message (Collapse)	Author	Files	Lines
2011-01-22	Markdown reader: slight speedup by moving whitespace parser.	John MacFarlane	1	-2/+2

2011-01-19	Replaced more noneOf/oneOf parsers.	John MacFarlane	1	-5/+11

2011-01-19	Replaced uses of oneOf with more efficient parsers.	John MacFarlane	1	-12/+19
	This speeds up the markdown reader.
2011-01-04	Markdown reader: Removed unneeded definitions.	John MacFarlane	1	-10/+8
	specialChars, strChar, specialCharsMinusLt.
2011-01-04	Moved 'macro' and 'applyMacros'' from markdown reader to Parsing.	John MacFarlane	1	-24/+0

2011-01-01	Fixed regression in markdown reader.	John MacFarlane	1	-3/+3
	'(_hi_)' was being parsed with literal underscores (no emphasis). The fix: the 'str' parser now only parses alphanumerics and embedded underscores. All other symbols are handled by the 'symbol' parser. This has a slight effect on the AST, since you'll get [Str "hi",Str ":"] insntead of [Str "hi:"]. But there should not be a visible effect in any of the writers. Thanks to gwern for pointing out the regression.
2010-12-30	New HTML reader using tagsoup as a lexer.	John MacFarlane	1	-28/+27
	* The new reader is faster and more accurate. * API changes for Text.Pandoc.Readers.HTML: - removed rawHtmlBlock, anyHtmlBlockTag, anyHtmlInlineTag, anyHtmlTag, anyHtmlEndTag, htmlEndTag, extractTagType, htmlBlockElement, htmlComment - added htmlTag, htmlInBalanced, isInlineTag, isBlockTag, isTextTag * tagsoup is a new dependency. * Text.Pandoc.Parsing: Generalized type on readWith. * Benchmark.hs: Added length calculation to force full evaluation. * Updated HTML reader tests. * Updated markdown and textile readers to use the functions from the HTML reader. * Note: The markdown reader now correctly handles some cases it did not before. For example: <hr/> is reproduced without adding a space. <script> a = '<b>'; </script> is parsed correctly.
2010-12-24	Use functions from Text.Pandoc.Generic instead of processWith(M).	John MacFarlane	1	-1/+2

2010-12-14	Fixed regression in parsing _emph_	John MacFarlane	1	-1/+1
	There was a bug in parsing '_emph_, ...': when followed by a comma, underscore emphasis did not register. (Thanks to gwern for pointing this out.) This bug was introduced by the change in c66921f2acea456af527b93e2daa1d8594798642
2010-12-13	Moved special handling of punctuation in suffix out of markdown reader.	Nathan Gass	1	-7/+2
	This allows different writers to handle punctuation in the suffix differently.
2010-12-13	Markdown reader: Further fix to abbrevs.	John MacFarlane	1	-1/+1

2010-12-13	Markdown reader: Fixed abbrev handler to allow abbrev at end of line.	John MacFarlane	1	-2/+2
	E.g., Mr. Frank.
2010-12-13	Markdown reader: Fixed referenceKey parser to allow space after newline.	John MacFarlane	1	-2/+1

2010-12-13	Markdown reader: Fixed regression in reference key parser.	John MacFarlane	1	-0/+1
	* The recent change allowing spaces and newlines in the URL caused problems when reference keys are stacked up without blank lines between. This is now fixed. * Added test.
2010-12-12	Markdown reader: fix superscripts with links.	John MacFarlane	1	-1/+1
	Moved inlineNote parser after superscript parser, so ^[link](/foo)^ gets recognized as a superscripted link, not an inline note followed by garbage. Thanks to Conal Elliott for pointing out the problem.
2010-12-10	Markdown reader: small cosmetic code improvements.	John MacFarlane	1	-8/+6

2010-12-10	Removed HTML sanitization.	John MacFarlane	1	-11/+5
	This is better done on the resulting HTML; use the xss-sanitize library for this. xss-sanitize is based on pandoc's sanitization, but improves it. - Removed stateSanitize from ParserState. - Removed --sanitize-html option.
2010-12-10	Markdown reader: Allow linebreaks in URLs (treat as spaces).	John MacFarlane	1	-6/+21
	Also, a string of consecutive spaces or tabs is now parsed as a single space. If you have multiple spaces in your URL, use %20%20.
2010-12-10	Markdown reader: Rewrote para parser for better efficiency.	John MacFarlane	1	-10/+8
	This change avoids repeated parsing of inline lists for 'plain' blocks.
2010-12-08	Markdown reader: minor footnote changes.	John MacFarlane	1	-2/+3
	Don't skipNonindentSpaces in noteMarker, since it's also used in the inline note parser.
2010-12-07	Smart punctuation: recognize entities.	John MacFarlane	1	-1/+1
	Now “Hi” gets parsed as a Quoted DoubleQuote inline.
2010-12-07	Markdown reader: Moved smartPunctuation parser, for slight speed bump.	John MacFarlane	1	-1/+1

2010-12-07	Moved smartPunctuation from Markdown to Parsing.	John MacFarlane	1	-89/+2
	+ Parameterized smartPunctuation on an inline parser. + Handle smartPunctuation in Textile reader.
2010-12-06	Markdown reader: better handling of intraword _.	John MacFarlane	1	-3/+5
	The 'str' parser now reads internal _'s as part of the string. This prevents pandoc from getting started looking for an emphasized block, which can cause exponential slowdowns in some cases. Resolves Issue #182.
2010-12-06	Markdown reader: handle curly quotes better.	John MacFarlane	1	-15/+14
	Previously, curly quotes were just parsed literally, leading to problems in some output formats. Now they are parsed as Quoted inlines, if --smart is specified. Resolves Issue #270.
2010-12-05	Fix regression: markdown references should be case-insensitive.	John MacFarlane	1	-2/+2
	This broke when we added the Key type. We had assumed that the custom case-insensitive Ord instance would ensure case-insensitive matching, but that is not how Data.Map works. * Added a test case for case-insensitivity in markdown-reader-more * Removed old refsMatch from Text.Pandoc.Parsing module; * hid the 'Key' constructor; * dropped the custom Ord and Eq instances, deriving instead; * added fromKey and toKey to convert between Keys and Inline lists; * toKey ensures that keys are case-insensitive, since this is the only way the API provides to construct a Key. Resolves Issue #272.
2010-12-03	Merge branch 'citeproc' into master.	John MacFarlane	1	-37/+92
	Conflicts: src/Text/Pandoc/Definition.hs
2010-12-03	punctuation handling, and more html-specific handling	paul.rivier	1	-1/+2

2010-11-28	Merge branch 'master' into citeproc	John MacFarlane	1	-0/+3

2010-11-28	Markdown parser performance improvement.	John MacFarlane	1	-0/+3
	Do a quick lookahead to make sure what follows looks like a setext header before parsing any Inlines. This gives a 15% performance boost in one benchmark. Many thanks to knieriem for finding the problem (in peg-markdown): https://github.com/jgm/peg-markdown/issues/issue/3
2010-11-26	Markdown suffix parser fix.	John MacFarlane	1	-2/+7
	If suffix doesn't begin with punctuation, include opening comma and space in result. Previously, @item [only a suffix] would result in something like Doe (2002only a suffix) because there was no opening delimiter.
2010-11-26	Split locator and suffix in Biblio rather than Markdown parser.	John MacFarlane	1	-36/+2
	Patch from Nathan Gass.
2010-11-22	Check biblio for all citations, not just textual.	John MacFarlane	1	-5/+5

2010-11-18	Markdown citation parser: small refactoring for clarity.	John MacFarlane	1	-1/+5

2010-11-18	Markdown citation parser improvements and test updates.	John MacFarlane	1	-13/+15
	Now we handle a suffix after a bare locator, e.g. @item1 [p. 30, suffix] The suffix now includes any punctuation that introduces it. A few tests fail because of problems with citeproc (extra space before the suffix, missing space after comma separating multiple page ranges in the locator).
2010-11-18	Markdown reader: Revised parser for new citation syntax.	John MacFarlane	1	-56/+73
	Suffixes and prefixes are now [Inline]. The locator is separated from the citation key by a blank space. The locator consists of one introductory word and any number of words containing at least one digit. The suffix, if any, is separated from the locator by a comma, and continues til the end of the citation.
2010-11-17	Don't pass a [Str ""] as citationPrefix.	John MacFarlane	1	-1/+1

2010-11-16	Updated for changes in Citaiton type.	John MacFarlane	1	-3/+4
	citationPrefix now [Inline] rather than String; citationSuffix added. This change presupposes no changes in citeproc-hs. It passes a string for these values to citeproc-hs. Eventually, citeproc-hs should use an [Inline] for these as well.
2010-11-12	Repairs to citation parser + citation test suite.	John MacFarlane	1	-8/+8

2010-11-12	Revised citation parsers for markdown reader.	John MacFarlane	1	-44/+96
	Added a form for in-text citations: @doe99 [30; see also @smith99].
2010-11-11	Added support for textual citations (but not yet markdown syntax).	John MacFarlane	1	-1/+1
	Patch from Andrea Rossato.
2010-11-09	Merge branch 'master' into citeproc	John MacFarlane	1	-1/+1

2010-11-09	Allow HTML comments as inline elements in markdown.	John MacFarlane	1	-1/+1
	So, aaa <!-- comment --> bbb can be a single paragraph.
2010-11-06	Removed CITEPROC CPP conditionals from library code.	John MacFarlane	1	-4/+0
	By Cabal policy, the API should not change depending on flags.
2010-11-06	Changes to use citeproc-hs 0.3.	John MacFarlane	1	-1/+5

2010-10-27	Changes to use citeproc 0.3.	John MacFarlane	1	-12/+20
	Patch from Andrea Rossato. Note: the markdown syntax is preliminary and will probably change.
2010-10-26	Parse blanklines after macro definitions.	John MacFarlane	1	-1/+1

2010-10-26	Process LaTeX macros in markdown, and apply to TeX math.	John MacFarlane	1	-3/+28
	Example: \newcommand{\plus}[2]{#1 + #2} $\plus{3}{4}$ yields: 3+4
2010-07-12	Modified example refs so they can occur before or after target.	John MacFarlane	1	-5/+15
	The refs are now replaced by numbers at the final stage, using processWith.
2010-07-11	Merge branch 'atlists'. Added auto-numbered example lists.	John MacFarlane	1	-1/+11