pandoc - Conversion between markup formats

Age	Commit message (Collapse)	Author	Files	Lines
2012-08-01	Major rewrite of markdown reader.	John MacFarlane	1	-14/+43
	* Use Builder's Inlines/Blocks instead of lists. * Return values in the reader monad, which are then run (at the end of parsing) against the final parser state. This allows links, notes, and example numbers to be resolved without a second parser pass. * An effect of using Builder is that everything is normalized automatically. * New exports from Text.Pandoc.Parsing: widthsFromIndices, NoteTable', KeyTable', Key', toKey', withQuoteContext, singleQuoteStart, singleQuoteEnd, doubleQuoteStart, doubleQuoteEnd, ellipses, apostrophe, dash * Updated opendocument tests. * Don't derive Show for ParserState. * Benchmarks: markdown reader takes 82% of the time it took before. Markdown writer takes 92% of the time (here the speedup is probably due to the fact that everything is normalized by default).
2012-07-27	Removed commented-out pandoc2 code.	John MacFarlane	1	-41/+0
	This will be developed in a branch, noreparsing.
2012-07-27	Parser: Changed types to use type alias Parser, not Parsec.	John MacFarlane	1	-97/+138

2012-07-26	Fixed whitespace errors.	John MacFarlane	1	-25/+25

2012-07-26	Parsing: Removed failIfStrict.	John MacFarlane	1	-5/+0

2012-07-26	Parsing: Added guardEnabled, guardDisabled.	John MacFarlane	1	-3/+14

2012-07-25	Moved stateApplyMacros, stateIndentedCodeClasses to ReaderOptions.	John MacFarlane	1	-6/+2

2012-07-25	stateCitations -> readerCitations.	John MacFarlane	1	-2/+0

2012-07-25	Moved stateLiterateHaskell to readerLiterateHaskell in Options.	John MacFarlane	1	-3/+1

2012-07-25	Got rid of stateStandalone, which was hardly used anyway.	John MacFarlane	1	-2/+0
	The only possible effect will be with rst fragments that begin with an rst title block, which will now cause the header transform.
2012-07-25	Moved stateOldDashes to readerOldDashes in ReaderOptions.	John MacFarlane	1	-5/+1

2012-07-25	Moved stateTabStop to readerTabStop in ReaderOptions.	John MacFarlane	1	-3/+0

2012-07-25	Moved stateColumns to readerColumns in ReaderOptions.	John MacFarlane	1	-3/+1

2012-07-25	Moved ParseRaw from ParserState to ReaderOptions.	John MacFarlane	1	-2/+0

2012-07-25	Text.Pandoc.Parsing: Added getOption.	John MacFarlane	1	-4/+6

2012-07-25	Options -> ReaderOptions.	John MacFarlane	1	-3/+3
	Better to keep reader and writer options separate.
2012-07-25	Put smart, strict in separate options field in state.	John MacFarlane	1	-8/+7
	This is the beginning of a larger transition that will make Options, not ParserState, the parameter of the read functions. (Options will also be used in writers, in place of WriterOptions.) Next step is to remove strict, replacing it with granular tests for different extensions.
2012-07-24	Better algorithm for oneOfStrings.	John MacFarlane	1	-2/+9
	This goes character by character, not backtracking.
2012-07-24	Refactored table parsers, captions now not part of core tableWith.	John MacFarlane	1	-10/+4

2012-07-22	Revised code for pipe tables.	John MacFarlane	1	-94/+4
	* All tables now require at least one body row. * Renamed from 'extra' to 'pipe' tables. * Moved functions from Parsing to Readers.Markdown. * Cleaned up code; revised to parse in one pass rather than parsing a raw string, splitting it, and parsing the components. * Allow pipe tables without pipes on the ends (as PHP Markdown Extra does).
2012-07-22	Merge pull request #510 from mytskine/markdown-extra	John MacFarlane	1	-1/+97
	Markdown extra tables [part of the multi-markdown syntax for tables]
2012-07-20	Use Parser as type synonym for Parsec.	John MacFarlane	1	-1/+3

2012-07-20	Text.Pandoc.Parsing: Export all Parsec functions used in pandoc code.	John MacFarlane	1	-1/+52
	No other module directly imports Parsec. This will make it easier to change the parsing backend in the future, if we want to.
2012-07-20	Use Text.Parsec instead of Text.ParserCombinators.Parsec.	John MacFarlane	1	-103/+103

2012-07-19	Provide Data.Default instances for ParserState and WriterOptions.	John MacFarlane	1	-2/+6
	Now you can use def (which is re-exported by Text.Pandoc) instead of defaultParserState or defaultWriterOptions. For now, these are still defined too, so existing code need not change. Closes #546.
2012-06-29	Changed macro parser so it returns raw macro if stateApplyMacros false.	John MacFarlane	1	-5/+8
	Closes #554.
2012-04-24	textile reader improvements : better conformance to RedCloth Textile inlines	paul.rivier	1	-0/+5

2012-03-24	Add parsing support for the rST default-role directive.	Greg Maslov	1	-2/+4

2012-02-21	Added support for markdown-extra tables in the markdown parser	François Gannaz	1	-1/+97
	Only tables whose lines begin with a "\|" are supported. There are 2 warnings about unused variables when compiling.
2012-02-07	Limit nesting of strong/emph.	John MacFarlane	1	-0/+2
	This avoids exponential lookahead in parasitic cases, like a*aa*aa*aa*aa*aa*aa*aa**. Added stateMaxNestingLevel to ParserState. We set this to 6, so you can still have Emph inside Emph, just not indefinitely.
2012-02-05	Parsing: Make characterReference fail if entity not found.	John MacFarlane	1	-2/+2

2012-02-05	Removed module Text.Pandoc.CharacterReferences.	John MacFarlane	1	-1/+11
	Moved characterReference parser to Text.Pandoc.Parsing. decodeCharacterReferences is now replaced by fromEntities in Text.Pandoc.XML.
2012-02-04	Complete rewrite of LaTeX reader.	John MacFarlane	1	-4/+20
	* The new reader is more robust, accurate, and extensible. It is still quite incomplete, but it should be easier now to add features. * Text.Pandoc.Parsing: Added withRaw combinator. * Markdown reader: do escapedChar before raw latex inline. Otherwise we capture commands like \{. * Fixed latex citation tests for new citeproc. * Handle \include{} commands in latex. This is done in pandoc.hs, not the (pure) latex reader. But the reader exports the needed function, handleIncludes. * Moved err and warn from pandoc.hs to Shared. * Fixed tests - raw tex should sometimes have trailing space. * Updated lhs-test for highlighting-kate changes.
2012-01-27	Fixed table parsing with wide or combining characters.	John MacFarlane	1	-1/+1
	Closes #348. Closes #108.
2012-01-01	New treatment of dashes in --smart mode.	John MacFarlane	1	-5/+29
	* `---` is always em-dash, `--` is always en-dash. * pandoc no longer tries to guess when `-` should be en-dash. * A new option, `--old-dashes`, is provided for legacy documents. Rationale: The rules for en-dash are too complex and language-dependent for a guesser to work reliably. This change gives users greater control. The alternative of using unicode isn't very good, since unicode em- and en- dashes are barely distinguishable in a monospace font.
2011-12-29	Better smart quote parsing.	John MacFarlane	1	-1/+7
	* Added stateLastStrPos to ParserState. This lets us keep track of whether we're parsing the position immediately after a 'str'. If we encounter a ' in such a location, it must be an apostrophe, and can't be a single quote start. * Set this in the markdown, textile, html, and rst str parsers. * Closes #360.
2011-12-27	Replaced Apostrophe, Ellipses, EmDash, EnDash w/ unicode strings.	John MacFarlane	1	-6/+6

2011-12-27	Pretty: return Str with unicode instead of Apostrophe.	John MacFarlane	1	-1/+1

2011-12-05	Parsing: Removed charsInBalanced', added param to charsInBalanced.	John MacFarlane	1	-20/+13
	The extra parameter is a character parser. This is needed for proper handling of escapes, etc.
2011-12-05	Parsing: Changed type of escaped to return Char	John MacFarlane	1	-5/+2

2011-07-30	Added nonspaceChar to Text.Pandoc.Parsing.	John MacFarlane	1	-0/+5

2011-07-25	Smart quotes: handle '...hi' properly.	John MacFarlane	1	-1/+2
	Also added test case.
2011-07-23	Properly handle characters in the 128..159 range.	John MacFarlane	1	-7/+7
	These aren't valid in HTML, but many HTML files produced by Windows tools contain them. We substitute correct unicode characters.
2011-04-29	Revert "Parsing: Use new type aliases, PandocParser, GeneralParser."	John MacFarlane	1	-123/+118
	This reverts commit ec5410bc4e9d228b7dc0123061d80f9addf825bf.
2011-04-29	Parsing: Use new type aliases, PandocParser, GeneralParser.	John MacFarlane	1	-118/+123
	This should make it easier to change the types later.
2011-03-18	Changed uri parser so it doesn't include trailing punctuation.	John MacFarlane	1	-3/+19
	So, in RST, 'http://google.com.' should be parsed as a link to 'http://google.com' followed by a period. The parser is smart enough to recognize balanced parentheses, as often occur in wikipedia links: 'http://foo.bar/baz_(bam)'. Also added ()s to RST specialChars, so '(http://google.com)' will be parsed as a link in parens. Added test cases. Resolves Issue #291.
2011-01-26	Add support for attributes in inline Code.	John MacFarlane	1	-1/+1
	Additional related changes: * URLs in Code in autolinks now use class "url". * Require highlighting-kate 0.2.8.2, which omits the final <br/> tag, essential for inline code.
2011-01-26	Bumped version to 1.8; depend on pandoc-types 1.8.	John MacFarlane	1	-7/+6
	The old TeX, HtmlInline and RawHtml elements have been removed and replaced by generic RawInline and RawBlock elements. All modules updated to use the new raw elements.
2011-01-19	More small parser rewrites for small performance gains.	John MacFarlane	1	-9/+11

2011-01-19	Parsing: Rewrote spaceChar for significant speedup in readers.	John MacFarlane	1	-1/+1