path: root/src/Text/Pandoc/Readers
Age | Commit message | Author | Files | Lines
2011-01-28 | RST reader: skip blanklines at beginning, not all leading spaces. | John MacFarlane | 1 | -1/+1
If you skip all spaces, it becomes impossible to start with a blockquote.
2011-01-28 | RST reader: Skip blank space at beginning. | John MacFarlane | 1 | -0/+1
Resolves Debian Bug #611328.
2011-01-26 | Add support for attributes in inline Code. | John MacFarlane | 6 | -13/+19
Additional related changes:
* URLs in Code in autolinks now use class "url".
* Require highlighting-kate 0.2.8.2, which omits the final <br/> tag, essential for inline code.
2011-01-26 | RST reader: Improved field lists. | John MacFarlane | 1 | -59/+56
Field lists now work properly with block content. (Thanks to Lachlan Musicman for pointing out the bug.) In addition, definition list items are now always Para instead of Plain -- which matches behavior of rst2xml.py. Finally, in image blocks, the alt attribute is parsed properly and used for the alt, not also the title.
2011-01-26 | LaTeX reader: Fixed an incomplete pattern match. | John MacFarlane | 1 | -1/+3
2011-01-26 | RST reader: Include line breaks in raw field list parser output. | John MacFarlane | 1 | -1/+3
Note: field list items can have lists, etc. as values.
2011-01-26 | RST reader: Allow spaces in field list names. | John MacFarlane | 1 | -1/+1
2011-01-26 | Markdown reader: Don't parse latex/context environments as inline. | John MacFarlane | 1 | -9/+15
2011-01-26 | Distinguish latex & context environments; blank line after in writers. | John MacFarlane | 1 | -3/+4
2011-01-26 | Bumped version to 1.8; depend on pandoc-types 1.8. | John MacFarlane | 5 | -19/+22
The old TeX, HtmlInline and RawHtml elements have been removed and replaced by generic RawInline and RawBlock elements. All modules updated to use the new raw elements.
2011-01-23 | Textile writer: Don't HTML-escape between @'s. | John MacFarlane | 1 | -1/+1
2011-01-23 | Textile reader: Fixed bug (swallowed p at beginning of paragraph). | John MacFarlane | 1 | -3/+3
The problem was a missing 'try' in the maybeExplicitBlock parser. Test case, a paragraph beginning with 'p', has been added.
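Why a missing 'try' matters can be sketched with plain Parsec. This is not the pandoc code: explicitPara, plainPara, blockNoTry, blockWithTry and runBlock are hypothetical names, and the sketch shows the failure as a hard parse error rather than the swallowed character described above.

```haskell
import Text.Parsec
import Text.Parsec.String (Parser)

-- Hypothetical stand-in for an explicit Textile block such as "p. text".
explicitPara :: Parser String
explicitPara = do
  _ <- char 'p'
  _ <- char '.'          -- fails here on "paragraph", after 'p' was consumed
  _ <- char ' '
  body <- many anyChar
  return ("explicit: " ++ body)

-- Fallback: an ordinary paragraph, taken verbatim.
plainPara :: Parser String
plainPara = do
  body <- many1 anyChar
  return ("plain: " ++ body)

-- Without 'try': once explicitPara has consumed the 'p', (<|>) does not
-- run plainPara, so a paragraph starting with 'p' cannot be parsed.
blockNoTry :: Parser String
blockNoTry = explicitPara <|> plainPara

-- With 'try': a failed explicitPara rewinds the input, so plainPara
-- gets to see the whole paragraph, including the leading 'p'.
blockWithTry :: Parser String
blockWithTry = try explicitPara <|> plainPara

-- Helper that turns Parsec's error type into a String for easy comparison.
runBlock :: Parser String -> String -> Either String String
runBlock p input = either (Left . show) Right (parse p "" input)
```

Under these assumptions, runBlock blockWithTry "paragraph" succeeds with the fallback parser, while runBlock blockNoTry "paragraph" returns a Left.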
2011-01-23 | Textile reader: Support <tt> for inline code. | John MacFarlane | 1 | -3/+11
2011-01-23 | Textile reader: Added code blocks with bc. | John MacFarlane | 1 | -2/+11
2011-01-22 | Markdown reader: slight speedup by moving whitespace parser. | John MacFarlane | 1 | -2/+2
2011-01-22 | RST reader: Big speed improvement (300->260ms). | John MacFarlane | 1 | -2/+2
Moved whitespace parser to top of inline parsers.
2011-01-21 | Haddock comment improvements. | John MacFarlane | 1 | -1/+2
2011-01-19 | Added Text.Pandoc.Readers.Native (readNative). | John MacFarlane | 1 | -0/+80
readNative can now read full pandoc documents, block lists, blocks, inline lists, or inlines. It will interpret
    Str "hi"
as if it were
    Pandoc (Meta [] [] []) [Plain [Str "hi"]]
This should make testing easier.
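The fallback behaviour described above can be sketched with derived Read instances. The types below are simplified stand-ins for the pandoc-types definitions, and readNativeSketch is a hypothetical name; whether the real readNative is implemented this way is an assumption.

```haskell
import Control.Applicative ((<|>))
import Text.Read (readMaybe)

-- Simplified stand-ins for the pandoc AST (not the real pandoc-types).
data Inline = Str String | Space
  deriving (Show, Read, Eq)

data Block = Plain [Inline] | Para [Inline]
  deriving (Show, Read, Eq)

data Meta = Meta [Inline] [[Inline]] [Inline]
  deriving (Show, Read, Eq)

data Pandoc = Pandoc Meta [Block]
  deriving (Show, Read, Eq)

emptyMeta :: Meta
emptyMeta = Meta [] [] []

-- Try the input as a full document first, then as progressively smaller
-- fragments, wrapping each fragment until it is a complete document.
readNativeSketch :: String -> Maybe Pandoc
readNativeSketch s =
      readMaybe s                                                -- Pandoc
  <|> (Pandoc emptyMeta <$> readMaybe s)                         -- [Block]
  <|> (Pandoc emptyMeta . (:[]) <$> readMaybe s)                 -- Block
  <|> (Pandoc emptyMeta . (:[]) . Plain <$> readMaybe s)         -- [Inline]
  <|> (Pandoc emptyMeta . (:[]) . Plain . (:[]) <$> readMaybe s) -- Inline
```

With these toy types, a lone inline such as Str "hi" ends up wrapped in a Plain block inside a document with empty metadata, which is the shape the commit message describes.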
2011-01-19 | Use spaceChar instead of oneOf " \t" in rst reader. | John MacFarlane | 1 | -5/+5
2011-01-19 | Replaced more noneOf/oneOf parsers. | John MacFarlane | 1 | -5/+11
2011-01-19 | Replaced uses of oneOf with more efficient parsers. | John MacFarlane | 1 | -12/+19
This speeds up the markdown reader.
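The kind of replacement these commits describe might look like the sketch below: a oneOf over a character list swapped for a direct predicate that accepts the same characters. The definitions are illustrative only; pandoc's actual spaceChar may be written differently, and countBlanks is a hypothetical helper.

```haskell
import Text.Parsec
import Text.Parsec.String (Parser)

-- Original style: membership test against a list of characters.
spaceCharOneOf :: Parser Char
spaceCharOneOf = oneOf " \t"

-- Replacement style (assumed): a direct comparison on the character.
spaceCharSatisfy :: Parser Char
spaceCharSatisfy = satisfy (\c -> c == ' ' || c == '\t')

-- Count the leading blanks of a line with either parser; both variants
-- should accept exactly the same input.
countBlanks :: Parser Char -> String -> Either String Int
countBlanks p input =
  either (Left . show) Right (parse (length <$> many p) "" input)
```

Both parsers are interchangeable in behaviour, so a swap like this changes only the cost of each character test, not the set of documents accepted.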
2011-01-14 | HTML reader: parse simple tables. | John MacFarlane | 1 | -2/+22
Resolves Issue #106. Thanks to Rodja Trappe for the idea and some sample code.
2011-01-14 | HTML reader: parse location tags in pSatisfy. | John MacFarlane | 1 | -13/+17
This avoids the need for manual parsing all over the place.
2011-01-07 | LaTeX reader: Support simple tables. | John MacFarlane | 1 | -0/+58
2011-01-06 | LaTeX reader: allow spaces btw \\begin or \\end and {. | John MacFarlane | 1 | -10/+27
2011-01-06 | LaTeX reader: Removed unnecessary 'spaces'. | John MacFarlane | 1 | -4/+0
2011-01-06 | HTML reader: Fixed bug in htmlTag for comments. | John MacFarlane | 1 | -2/+9
2011-01-05 | LaTeX reader: Apply macros to non-math; handle ensuremath. | John MacFarlane | 1 | -9/+23
2011-01-05 | LaTeX reader: Don't handle \label and \ref specially. | John MacFarlane | 1 | -11/+0
Put labels in {} instead of ().
2011-01-05 | LaTeX reader: Support \L \l accents. | John MacFarlane | 1 | -1/+8
2011-01-05 | Updated for texmath 0.5. | John MacFarlane | 1 | -13/+13
2011-01-05 | LaTeX reader: Parse inside arguments when ignoring commands. | John MacFarlane | 1 | -9/+11
2011-01-05 | LaTeX reader: Don't handle \index separately. | John MacFarlane | 1 | -7/+0
Instead, just put it in list of commands to ignore.
2011-01-05 | LaTeX reader: Added "index" to ignorable commands. | John MacFarlane | 1 | -1/+1
2011-01-05 | LaTeX reader: skip space before option or argument. | John MacFarlane | 1 | -1/+1
2011-01-05 | LaTeX reader: Skip \index commands. | John MacFarlane | 1 | -1/+8
2011-01-05 | LaTeX reader: Removed \group (we want to parse inside {}). | John MacFarlane | 1 | -6/+0
2011-01-05 | LaTeX reader: Better handling of preamble, inc. parsing macros. | John MacFarlane | 1 | -10/+20
2011-01-04 | LaTeX reader: Parse bracketed {parts} as raw TeX. | John MacFarlane | 1 | -2/+2
2011-01-04 | Markdown reader: Removed unneeded definitions. | John MacFarlane | 1 | -10/+8
specialChars, strChar, specialCharsMinusLt.
2011-01-04 | LaTeX reader: parse macros and apply to math. | John MacFarlane | 1 | -7/+8
2011-01-04 | Moved 'macro' and 'applyMacros'' from markdown reader to Parsing. | John MacFarlane | 1 | -24/+0
2011-01-01 | Fixed regression in markdown reader. | John MacFarlane | 1 | -3/+3
'(_hi_)' was being parsed with literal underscores (no emphasis). The fix: the 'str' parser now only parses alphanumerics and embedded underscores. All other symbols are handled by the 'symbol' parser. This has a slight effect on the AST, since you'll get [Str "hi",Str ":"] instead of [Str "hi:"]. But there should not be a visible effect in any of the writers. Thanks to gwern for pointing out the regression.
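The effect on the AST can be illustrated with a small tokenizer written against base only. It is a sketch, not the markdown reader: Inline is a toy type, strRun and tokenize are hypothetical names, and every non-word character simply becomes its own Str, whereas the real reader would hand underscores to its emphasis parser.

```haskell
import Data.Char (isAlphaNum)

-- Toy inline type; the real AST has many more constructors.
data Inline = Str String
  deriving (Show, Eq)

-- Take a run of alphanumerics, allowing an underscore only when it is
-- embedded, i.e. has alphanumerics on both sides.
strRun :: String -> (String, String)
strRun s =
  case span isAlphaNum s of
    (w, '_' : rest@(c : _))
      | not (null w), isAlphaNum c ->
          let (w', rest') = strRun rest
          in (w ++ "_" ++ w', rest')
    (w, rest) -> (w, rest)

-- Words become one Str; any other character becomes a Str of its own,
-- standing in for what the 'symbol' parser would produce.
tokenize :: String -> [Inline]
tokenize [] = []
tokenize s@(c : cs)
  | isAlphaNum c = let (w, rest) = strRun s in Str w : tokenize rest
  | otherwise    = Str [c] : tokenize cs
```

In this sketch tokenize "hi:" yields two strings rather than one, and the underscores in "(_hi_)" are kept out of the word so a later stage can treat them as emphasis markers.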
2010-12-30 | LaTeX reader: Allow ignored comments after \end{document}. | John MacFarlane | 1 | -3/+1
2010-12-30 | HTML reader: Fixed some parsing bugs. | John MacFarlane | 1 | -22/+28
2010-12-30 | Added support for listings package code blocks and inline code. | Puneeth Chaganti | 1 | -2/+9
2010-12-30 | Textile reader: Slight speed improvement. | John MacFarlane | 1 | -5/+5
2010-12-30 | New HTML reader using tagsoup as a lexer. | John MacFarlane | 3 | -625/+421
* The new reader is faster and more accurate.
* API changes for Text.Pandoc.Readers.HTML:
  - removed rawHtmlBlock, anyHtmlBlockTag, anyHtmlInlineTag, anyHtmlTag, anyHtmlEndTag, htmlEndTag, extractTagType, htmlBlockElement, htmlComment
  - added htmlTag, htmlInBalanced, isInlineTag, isBlockTag, isTextTag
* tagsoup is a new dependency.
* Text.Pandoc.Parsing: Generalized type on readWith.
* Benchmark.hs: Added length calculation to force full evaluation.
* Updated HTML reader tests.
* Updated markdown and textile readers to use the functions from the HTML reader.
* Note: The markdown reader now correctly handles some cases it did not before. For example:
      <hr/>
  is reproduced without adding a space.
      <script> a = '<b>'; </script>
  is parsed correctly.
2010-12-24 | Use functions from Text.Pandoc.Generic instead of processWith(M). | John MacFarlane | 1 | -1/+2
2010-12-22 | HTML reader: Simplified parsing of <script> sections. | John MacFarlane | 1 | -24/+1
I had previously assumed that we needed to ignore </script> occurring in a string literal or javascript comment. It turns out, though, that browsers aren't that smart.
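The simplified rule, ending the script at the first closing tag no matter where it appears, can be sketched as follows. splitAtScriptEnd is a hypothetical helper, not a function from the HTML reader, and it only recognizes the exact form </script> (case-insensitively), which is a simplification of what browsers accept.

```haskell
import Data.Char (toLower)
import Data.List (isPrefixOf)

-- Split the text that follows an opening <script> tag at the first
-- "</script>", without looking at string literals or comments.
-- Returns the script body and whatever follows the closing tag.
splitAtScriptEnd :: String -> Maybe (String, String)
splitAtScriptEnd = go []
  where
    closer = "</script>"
    go _ [] = Nothing                        -- no closing tag found
    go acc s@(c : cs)
      | closer `isPrefixOf` map toLower s =
          Just (reverse acc, drop (length closer) s)
      | otherwise = go (c : acc) cs
```

Given a script whose string literal contains </script>, the body is cut inside that literal, which matches the browser behaviour the commit message refers to.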