aboutsummaryrefslogtreecommitdiff
path: root/src/Text
AgeCommit message (Collapse)AuthorFilesLines
2011-01-14HTML reader: parse simple tables.John MacFarlane1-2/+22
Resolves Issue #106. Thanks to Rodja Trappe for the idea and some sample code.
2011-01-14HTML reader: parse location tags in pSatisfy.John MacFarlane1-13/+17
This avoids the need for manual parsing all over the place.
2011-01-14LaTeX writer: Escape strings in \href{..}.John MacFarlane1-2/+2
Previously strings weren't escaped, so %5D would be interpreted as a LaTeX comment!
2011-01-14Simplified Text.Pandoc.CharacterReferences by using TagSoup entity lookupJohn MacFarlane1-266/+11
2011-01-14LateX writer: In nonsimple tables, put cells in \parbox.John MacFarlane1-9/+12
Otherwise we can get problems with linebreaks, and cell spacing isn't right. Thanks to Jef Allbright for pointing out the problem.
2011-01-14Parsing: Fixed bug in grid table parser.John MacFarlane1-5/+5
Spaces at end of line were not being stripped properly, resulting in unintended LineBreaks.
2011-01-11Improvements to --html5 support:John MacFarlane1-13/+36
+ <nav> for TOC, <figure> for figures, type attribute in <ol>. + Don't add math javascript in html5. + Use style attributes instead of deprecated width, align. + html template: move <title> after <meta>. Note: charset needs to be declared before title. + slidy and s5 templates: move <title> after <meta>. + html template: Added link to html5 shim for IE. + Make --html5 have an effect only for 'html' writer (not s5, slidy, epub).
2011-01-11Preliminary support for HTML5.John MacFarlane2-1/+6
+ Added writerHtml5 writer option. + Added --html5 option. + Added support for lang in html tag (so you can do 'pandoc -s --V lang=en', for example). + Updated html template with conditionals for HTML5. + When HTML5 selected, use <header> tag around title in document, and use <section> tags instead of <div>s if --section-divs specified.
2011-01-11Text.Pandoc: Improved readers, writers lists for lhs variants.John MacFarlane1-6/+12
Now the lhs variants set the needed literate Haskell flag in parser state and writer options.
2011-01-07LaTeX reader: Support simple tables.John MacFarlane1-0/+58
2011-01-06RST writer: blank line after literate Haskell code block.John MacFarlane1-3/+3
2011-01-06Markdown writer: blank line after delimited code block.John MacFarlane1-2/+2
2011-01-06LaTeX reader: allow spaces btw \\begin or \\end and {.John MacFarlane1-10/+27
2011-01-06LaTeX reader: Removed unnecessary 'spaces'.John MacFarlane1-4/+0
2011-01-06HTML reader: Fixed bug in htmlTag for comments.John MacFarlane1-2/+9
2011-01-05LaTeX reader: Apply macros to non-math; handle ensuremath.John MacFarlane1-9/+23
2011-01-05LaTeX reader: Don't handle \label and \ref specially.John MacFarlane1-11/+0
Put labels in {} instead of ().
2011-01-05LaTeX reader: Support \L \l accents.John MacFarlane1-1/+8
2011-01-05Updated for texmath 0.5.John MacFarlane1-13/+13
2011-01-05Fixed macro parsing.John MacFarlane1-8/+10
2011-01-05LaTeX reader: Parse inside arguments when ignoring commands.John MacFarlane1-9/+11
2011-01-05LaTeX reader: Don't handle \index separately.John MacFarlane1-7/+0
Instead, just put it in list of commands to ignore.
2011-01-05LaTeX reader: Added "index" to ignorable commands.John MacFarlane1-1/+1
2011-01-05LaTeX reader: skip space before option or argument.John MacFarlane1-1/+1
2011-01-05LaTeX reader: Skip \index commands.John MacFarlane1-1/+8
2011-01-05LaTeX reader: Removed \group (we want to parse inside {}).John MacFarlane1-6/+0
2011-01-05LaTeX reader: Better handling of preamble, inc. parsing macros.John MacFarlane1-10/+20
2011-01-04LaTeX reader: Parse bracketed {parts} as raw TeX.John MacFarlane1-2/+2
2011-01-04Markdown reader: Removed unneeded definitions.John MacFarlane1-10/+8
specialChars, strChar, specialCharsMinusLt.
2011-01-04LaTeX reader: parse macros and apply to math.John MacFarlane1-7/+8
2011-01-04Moved 'macro' and 'applyMacros'' from markdown reader to Parsing.John MacFarlane2-26/+27
2011-01-01Fixed regression in markdown reader.John MacFarlane1-3/+3
'(_hi_)' was being parsed with literal underscores (no emphasis). The fix: the 'str' parser now only parses alphanumerics and embedded underscores. All other symbols are handled by the 'symbol' parser. This has a slight effect on the AST, since you'll get [Str "hi",Str ":"] insntead of [Str "hi:"]. But there should not be a visible effect in any of the writers. Thanks to gwern for pointing out the regression.
2011-01-01Updated copyright notices.John MacFarlane1-1/+1
2010-12-30LaTeX reader: Allow ignored comments after \end{document}.John MacFarlane1-3/+1
2010-12-30HTML reader: Fixed some parsing bugs.John MacFarlane1-22/+28
2010-12-30Added support for listings package code blocks and inline code.Puneeth Chaganti1-2/+9
2010-12-30Textile reader: Slight speed improvement.John MacFarlane1-5/+5
2010-12-30New HTML reader using tagsoup as a lexer.John MacFarlane4-628/+424
* The new reader is faster and more accurate. * API changes for Text.Pandoc.Readers.HTML: - removed rawHtmlBlock, anyHtmlBlockTag, anyHtmlInlineTag, anyHtmlTag, anyHtmlEndTag, htmlEndTag, extractTagType, htmlBlockElement, htmlComment - added htmlTag, htmlInBalanced, isInlineTag, isBlockTag, isTextTag * tagsoup is a new dependency. * Text.Pandoc.Parsing: Generalized type on readWith. * Benchmark.hs: Added length calculation to force full evaluation. * Updated HTML reader tests. * Updated markdown and textile readers to use the functions from the HTML reader. * Note: The markdown reader now correctly handles some cases it did not before. For example: <hr/> is reproduced without adding a space. <script> a = '<b>'; </script> is parsed correctly.
2010-12-26normalize: Don't reduce [Space] to [].John MacFarlane1-4/+1
2010-12-26Improved 'normalize'.John MacFarlane1-41/+44
Now normalizeInlines is split into consolidateInlines and removeEmptyInlines. We need to remove empties before consolidating.
2010-12-26Markdown writer: Fixed bug in Image.John MacFarlane1-1/+1
URI was getting unescaped twice!
2010-12-25Improved normalize.John MacFarlane1-0/+15
2010-12-24Use functions from Text.Pandoc.Generic instead of processWith(M).John MacFarlane9-17/+28
2010-12-22HTML reader: Simplified parsing of <script> sections.John MacFarlane1-24/+1
I had previously assumed that we needed to ignore </script> occuring in a string literal or javascript comment. It turns out, though, that browsers aren't that smart.
2010-12-22Made --smart work with HTML reader.John MacFarlane1-4/+13
It did not work before, because - and quotes were gobbled up by the str parser.
2010-12-22RST reader: Added unicode quote characters to specialChars.John MacFarlane1-1/+1
(So they can trigger Quoted environments.)
2010-12-22RST reader: recouped speed loss due to addition of --smart.John MacFarlane1-4/+4
This was achieved by rearranging the parsers in inline. Benchmarks went from 500ms to 307ms -- not quite back to the 279ms we had in 1.6, before supporting smart punctuation and footnotes, but close.
2010-12-22ODT writer: Don't wrap text in opendocument.John MacFarlane1-1/+1
2010-12-22Removed all dependencies on 'pretty' package.John MacFarlane1-4/+0
2010-12-22Texinfo writer: Updated to use Pretty.John MacFarlane1-56/+37