aboutsummaryrefslogtreecommitdiff
path: root/src/Text
AgeCommit message (Collapse)AuthorFilesLines
2010-12-10Markdown reader: small cosmetic code improvements.John MacFarlane1-8/+6
2010-12-10Removed HTML sanitization.John MacFarlane3-103/+10
This is better done on the resulting HTML; use the xss-sanitize library for this. xss-sanitize is based on pandoc's sanitization, but improves it. - Removed stateSanitize from ParserState. - Removed --sanitize-html option.
2010-12-10Markdown reader: Allow linebreaks in URLs (treat as spaces).John MacFarlane1-6/+21
Also, a string of consecutive spaces or tabs is now parsed as a single space. If you have multiple spaces in your URL, use %20%20.
2010-12-10Markdown reader: Rewrote para parser for better efficiency.John MacFarlane1-10/+8
This change avoids repeated parsing of inline lists for 'plain' blocks.
2010-12-09textile redcloth definition listspaul.rivier1-2/+29
2010-12-09Textile reader: better treatment of acronyms.John MacFarlane1-1/+1
We now parse PBS(Public Broadcasting System) as if it were "PBS (Public Broadcasting System)".
2010-12-08RST reader: Added footnote suppport.John MacFarlane1-3/+51
Resolves issue #258. Note that there are some differences in how docutils and pandoc treat footnotes. Currently pandoc ignores the numeral or symbol used in the note; footnotes are put in an auto-numbered ordered list.
2010-12-08Markdown reader: minor footnote changes.John MacFarlane1-2/+3
Don't skipNonindentSpaces in noteMarker, since it's also used in the inline note parser.
2010-12-08Textile reader: Implemented footnotes.John MacFarlane1-4/+43
2010-12-07Made --smart work with RST reader.John MacFarlane1-2/+3
2010-12-07Make --smart work in HTML reader.John MacFarlane1-2/+3
2010-12-07Smart punctuation: recognize entities.John MacFarlane2-9/+23
Now “Hi” gets parsed as a Quoted DoubleQuote inline.
2010-12-07Rewrote normalizeSpaces (mostly aesthetic reasons).John MacFarlane1-14/+11
2010-12-07Markdown reader: Moved smartPunctuation parser, for slight speed bump.John MacFarlane1-1/+1
2010-12-07Smart punctuation: don't alllow ellipses containing spaces.John MacFarlane1-1/+1
Previously we allowed '. . .', ' . . . ', etc. This caused too many complications, and removed author's flexibility in combining ellipses with spaces and periods.
2010-12-07Moved smartPunctuation from Markdown to Parsing.John MacFarlane3-102/+99
+ Parameterized smartPunctuation on an inline parser. + Handle smartPunctuation in Textile reader.
2010-12-07Textile reader: implemented acronyms, (tm), (r), (c).John MacFarlane1-6/+29
2010-12-06Markdown reader: better handling of intraword _.John MacFarlane1-3/+5
The 'str' parser now reads internal _'s as part of the string. This prevents pandoc from getting started looking for an emphasized block, which can cause exponential slowdowns in some cases. Resolves Issue #182.
2010-12-06Markdown reader: handle curly quotes better.John MacFarlane1-15/+14
Previously, curly quotes were just parsed literally, leading to problems in some output formats. Now they are parsed as Quoted inlines, if --smart is specified. Resolves Issue #270.
2010-12-05Fix regression: markdown references should be case-insensitive.John MacFarlane3-47/+27
This broke when we added the Key type. We had assumed that the custom case-insensitive Ord instance would ensure case-insensitive matching, but that is not how Data.Map works. * Added a test case for case-insensitivity in markdown-reader-more * Removed old refsMatch from Text.Pandoc.Parsing module; * hid the 'Key' constructor; * dropped the custom Ord and Eq instances, deriving instead; * added fromKey and toKey to convert between Keys and Inline lists; * toKey ensures that keys are case-insensitive, since this is the only way the API provides to construct a Key. Resolves Issue #272.
2010-12-05Org writer: Minor changes to documentation header.John MacFarlane1-3/+3
2010-12-04Added tests.Puneeth Chaganti1-1/+1
+ Added tables.org and writer.org to tests. + Added org.template to templates. + Changed RunTests.hs as required. + Minor changes to Org writer.
2010-12-04Added Org-mode writerPuneeth Chaganti2-0/+293
+ Added Text/Pandoc/Writers/Org.hs + Added to pandoc.cabal + Added to pandoc.hs and Text/Pandoc.hs exports.
2010-12-03Merge branch 'citeproc' into master.John MacFarlane5-221/+285
Conflicts: src/Text/Pandoc/Definition.hs
2010-12-03Textile reader: temporarily removed smartPunctuation.John MacFarlane1-2/+2
The smartPuncutation parser from the markdown parser was being used, but this creates two problems: * smart punctuation rules are slightly different in textile, for example, a single dash wish space around becomes an En dash. * the following gets parsed as a double quoted string followed by a colon, rather than as a link: "emphasized text":http://my.url.com This needs rethinking.
2010-12-03Textile reader: added hrule parser.John MacFarlane1-0/+13
2010-12-03Textile reader: Turn on smart punctuation by default.John MacFarlane1-2/+2
2010-12-03Textile reader: drop leading, trailing newline in pre block.John MacFarlane1-2/+10
This is consistent with how the other readers work.
2010-12-03Textile reader: modified str to handle acronyms, hyphens.John MacFarlane1-3/+16
* A single hyphen between two word characters is no longer a potential strikeout-starter. * Acronym explanations are dropped.
2010-12-03Textile reader: parse raw by default.John MacFarlane1-0/+2
It's part of the textile spec to allow raw HTML, just as with markdown. -R is no longer needed in test suite.
2010-12-03punctuation handling, and more html-specific handlingpaul.rivier2-8/+33
2010-12-03html inlines and html blocks handling in textile readerPaul Rivier1-17/+26
2010-12-03textile reader now ignores html/css attributesPaul Rivier1-8/+34
2010-12-03removed support for textile Inserted constructPaul Rivier4-9/+1
2010-12-03fix autolink by promoting it in the parser list, fix table parabreakPaul Rivier1-7/+5
2010-12-03more support for Textile reader (explicit links, images), tests and cabal ↵Paul Rivier1-17/+44
entries
2010-12-03simpler table cell handlingpaul.rivier1-24/+12
2010-12-03preliminary material toward table supportpaul.rivier2-2/+68
2010-12-03textile reader now imports import Text.Pandoc.Parsingpaul.rivier1-1/+2
2010-12-03Basic Textile Readerpaul.rivier6-1/+261
2010-11-28Merge branch 'master' into citeprocJohn MacFarlane1-0/+3
2010-11-28Markdown parser performance improvement.John MacFarlane1-0/+3
Do a quick lookahead to make sure what follows looks like a setext header before parsing any Inlines. This gives a 15% performance boost in one benchmark. Many thanks to knieriem for finding the problem (in peg-markdown): https://github.com/jgm/peg-markdown/issues/issue/3
2010-11-28Fixed spacing bug for reference-style citations.John MacFarlane1-1/+4
2010-11-27Merge branch 'master' into citeprocJohn MacFarlane1-2/+4
2010-11-27OpenDocument writer: don't print raw TeX.John MacFarlane1-2/+4
2010-11-27Merge branch 'master' into citeprocJohn MacFarlane1-0/+4
2010-11-27LaTeX writer: Escape curly quotes.John MacFarlane1-0/+4
2010-11-27Biblio: If locator ends with ",", add it to the suffix.John MacFarlane1-1/+9
2010-11-27Added procOpts parameter to citeproc call.John MacFarlane1-1/+2
2010-11-27Merge branch 'master' into citeprocJohn MacFarlane2-0/+417