path: root/src/Text/Pandoc/Readers
Age | Commit message | Author | Files | Lines
2014-03-25 | Parsing: Added HasMacros, simplified other typeclasses. | John MacFarlane | 1 | -2/+2
Removed updateHeaderMap, setHeaderMap, getHeaderMap, updateIdentifierList, setIdentifierList, getIdentifierList.
2014-03-25 | API changes to HasReaderOptions, HasHeaderMap, HasIdentifierList. | John MacFarlane | 1 | -8/+8
Previously these were typeclasses of monads. They've been changed to be typeclasses of states. This simplifies the instance definitions and provides more flexibility.
This is an API change! However, it should be backwards compatible unless you're defining instances of HasReaderOptions, HasHeaderMap, or HasIdentifierList. The old getOption function should work as before (albeit with a more general type).
The function askReaderOption has been removed. extractReaderOptions has been added. getOption has been given a default definition.
In HasHeaderMap, extractHeaderMap and updateHeaderMap have been added. Default definitions have been given for getHeaderMap, putHeaderMap, and modifyHeaderMap.
In HasIdentifierList, extractIdentifierList and updateIdentifierList have been added. Default definitions have been given for getIdentifierList, putIdentifierList, and modifyIdentifierList.
The ultimate goal here is to allow different parsers to use their own, tailored parser states (instead of ParserState) while still using shared functions.
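The idea of a typeclass over states rather than monads can be illustrated with a minimal sketch. This is not pandoc's actual code: `Opts`, `MyState`, and `getOptionFrom` are hypothetical stand-ins, and only the name `extractReaderOptions` comes from the message above.

```haskell
-- Hypothetical stand-in for pandoc's ReaderOptions record.
data Opts = Opts
  { optColumns :: Int
  , optStrict  :: Bool
  }

-- A typeclass of states, not of monads: an instance only has to say
-- how to pull the options out of its own state type.
class HasReaderOptions st where
  extractReaderOptions :: st -> Opts

-- A tailored parser state (hypothetical) that carries the options
-- alongside reader-specific data.
data MyState = MyState
  { myOpts  :: Opts
  , myDepth :: Int
  }

instance HasReaderOptions MyState where
  extractReaderOptions = myOpts

-- Shared helper (hypothetical) that works for any state with an instance.
getOptionFrom :: HasReaderOptions st => (Opts -> a) -> st -> a
getOptionFrom f = f . extractReaderOptions
```

Under these assumptions, a new reader only needs a one-line instance for its own state type to reuse every helper written against the class.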
2014-03-25 | LaTeX reader: Better handling of "table" environment. | John MacFarlane | 1 | -0/+1
Positioning options no longer rendered verbatim. Partially addresses #1204.
2014-03-24 | Merge pull request #1068 from jaimeMF/mw-images-langs | John MacFarlane | 1 | -1/+5
MediaWiki reader: Accept image links in more languages
2014-03-24 | Markdown reader: Fixed regression on line breaks in strict mode. | John MacFarlane | 1 | -1/+1
Closes #1203.
2014-03-04 | Add a simple Emacs Org-mode reader | Albert Krewinkel | 1 | -0/+552
The basic structure of org-mode documents is recognized; however, org-mode features like todo markers, tags etc. are not supported yet.
2014-02-26 | Markdown reader: Improved parsing of nested divs. | John MacFarlane | 1 | -0/+2
Formerly a closing div tag would be missed if it came right after other block-level tags.
2014-02-26 | Markdown parser: avoid backtracking when closing `</div>` not found. | John MacFarlane | 1 | -6/+13
2014-02-26 | Markdown reader: small efficiency improvement. | John MacFarlane | 1 | -1/+1
Switched `notFollowedBy' rawHtmlBlocks` -> `notFollowedBy' (htmlTag isBlockTag)`, which is more efficient.
2014-02-25 | Added readerTrace to ReaderOptions, --trace command line opt. | John MacFarlane | 1 | -1/+11
This is to debug backtracking-related parsing bugs. So far it is only implemented for markdown, but it would be good to extend it to latex and html readers.
2014-02-21 | Fixed bug in reference link parsing in markdown_mmd. | John MacFarlane | 1 | -1/+1
The bug was triggered by:
    Link to [Google][]. Link to [twitter][].
    [Google]: http://google.com
    [twitter]: http://twitter.com
2014-02-19 | Make rst figures true figures. Closes #1168. | John MacFarlane | 1 | -1/+1
Thanks to CasperVector.
2014-02-15 | Enhanced Pandoc's support for rST roles. | Merijn Verstraaten | 1 | -10/+81
rST parser now supports:
- All built-in rST roles
- New role definition
- Role inheritance
Issues/TODO:
- Silently ignores illegal fields on roles
- Silently drops class annotations for roles
- Only supports :format: fields with a single format for :raw: roles, requires a change to Text.Pandoc.Definition.Format to support multiple formats.
- Allows direct use of :raw: role, rST only allows indirect (i.e., inherited use of :raw:).
2014-02-04 | Slight code reorganization in endline. | John MacFarlane | 1 | -5/+3
2014-01-22 | Fixed mediawiki ordered list parsing. | John MacFarlane | 1 | -10/+15
Closes #1122.
2014-01-20 | HTML reader: Fixed bug reading inline math with `$$`. | John MacFarlane | 1 | -2/+2
See #225.
2014-01-16 | Merge pull request #974 from merijn/master | John MacFarlane | 1 | -6/+17
Added support for LaTeX style literate Haskell code blocks in rST.
2014-01-09 | Markdown parser: be more permissive about citation keys. | John MacFarlane | 1 | -3/+4
Keys may now start with an underscore as well as a letter. Underscores do not count as internal punctuation, but are treated like alphanumerics, so "key:_2008" will work, as it did not before. (This change was necessary to use keys generated by zotero.) Closes #1111, closes #1011.
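The key rule described above can be sketched as a small validator. This is an illustration, not the Markdown reader's parser: `validKey` and its helpers are hypothetical names, and the set of characters accepted as internal punctuation is an assumption made for the example.

```haskell
import Data.Char (isAlphaNum, isLetter)

-- Underscores are treated like alphanumerics, per the commit message.
isKeyChar :: Char -> Bool
isKeyChar c = isAlphaNum c || c == '_'

-- ASSUMPTION: an illustrative set of internal punctuation characters.
isInternalPunct :: Char -> Bool
isInternalPunct c = c `elem` ":.#$%&-+?<>~/"

-- A key starts with a letter or underscore; afterwards, each punctuation
-- character must be followed immediately by a key character.
validKey :: String -> Bool
validKey []       = False
validKey (c : cs) = (isLetter c || c == '_') && go cs
  where
    go []       = True
    go (x : xs)
      | isKeyChar x       = go xs
      | isInternalPunct x = case xs of
          (y : _) | isKeyChar y -> go xs
          _                     -> False
      | otherwise         = False
```

In this sketch `validKey "key:_2008"` holds because the underscore after the colon counts as a key character, which matches the example given in the commit message.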
2014-01-07 | Markdown reader: Allow hard line breaks in table cells. | John MacFarlane | 1 | -3/+3
The \-newline form must be used; the two-space+newline form won't work, since in a table cell nearly every line ends with two spaces.
2014-01-01 | HTML reader: Parse name/content pairs from meta tags as metadata. | John MacFarlane | 1 | -1/+10
Closes #1106.
2013-12-19 | HLint: use fromMaybe | Henry de Valence | 4 | -10/+12
Replace uses of `maybe x id` with `fromMaybe x`.
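A minimal before/after sketch of this HLint suggestion follows. The function names and the default string are hypothetical and not taken from the pandoc source.

```haskell
import Data.Maybe (fromMaybe)

-- Before: `maybe x id` returns x for Nothing and unwraps a Just.
titleOld :: Maybe String -> String
titleOld = maybe "Untitled" id

-- After: `fromMaybe x` expresses the same thing directly.
titleNew :: Maybe String -> String
titleNew = fromMaybe "Untitled"
```

Both definitions behave identically; the second simply names the intent.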
2013-12-19 | HLint: redundant parens | Henry de Valence | 2 | -2/+2
Remove parens enclosing a single element.
2013-12-19 | HLint: use `elem` and `notElem` | Henry de Valence | 2 | -7/+6
Replaces long conditional chains with calls to `elem` and `notElem`.
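As a sketch of the kind of rewrite meant here, using hypothetical predicates that are not taken from the pandoc source:

```haskell
-- Before: a chain of equality tests joined with (||).
isBulletOld :: Char -> Bool
isBulletOld c = c == '-' || c == '*' || c == '+'

-- After: membership in a list of candidates.
isBulletNew :: Char -> Bool
isBulletNew c = c `elem` "-*+"

-- The negated form uses notElem instead of chained (/=) and (&&).
notBullet :: Char -> Bool
notBullet c = c `notElem` "-*+"
```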
2013-12-17 | LaTeX reader: Support babel-style quoting: `` "`..."' ``. | John MacFarlane | 1 | -0/+2
2013-12-16 | Docbook reader: Avoid failure if tbody contains no tr or row elements. | John MacFarlane | 1 | -1/+3
2013-12-15 | Properly handle script blocks in strict mode. | John MacFarlane | 1 | -1/+3
(That is, markdown-markdown_in_html_blocks.) Previously a spurious `<p>` tag was being added. Closes #1093.
2013-12-13 | LaTeX reader: add support for Verb macro | Jeff Arnold | 1 | -0/+1
2013-12-12 | Markdown reader: Fixed regression in title blocks. | John MacFarlane | 1 | -4/+4
If author field was empty, date was being ignored. Closes #1089.
2013-12-07 | Another small performance improvement. | John MacFarlane | 1 | -3/+5
2013-12-07 | Small performance improvement in list parsing. | John MacFarlane | 1 | -2/+3
2013-12-07 | Markdown reader: Fixed bug with literal `</div>` in lists. | John MacFarlane | 1 | -3/+4
Closes #1078.
2013-12-06 | HTML reader: Parse LaTeX math if appropriate options are set. | John MacFarlane | 3 | -35/+10
* Moved inlineMath, displayMath from Markdown reader to Parsing.
* Export them from Parsing. (API change.)
* Generalize their types.
2013-12-04 | Stop parsing "list lines" when we hit a block tag. | John MacFarlane | 1 | -1/+3
This fixes exponential slowdown in certain input, e.g. a series of lists followed by `</div>`.
2013-11-24 | MediaWiki reader: Accept image links in more languages | Jaime Marquínez Ferrándiz | 1 | -1/+5
Some Wikipedia language editions use a localized form of 'File' (for example, 'Archivo' in Spanish).
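One way to picture the check is a lookup of the link's namespace prefix against a list of known image namespaces. The sketch below is hypothetical: `imageNamespaces` and `imageTarget` are invented names, the list is deliberately incomplete, and the actual reader may recognize these prefixes differently.

```haskell
import Data.Char (toLower)

-- ASSUMPTION: a small illustrative list of localized namespace names.
imageNamespaces :: [String]
imageNamespaces = ["file", "image", "archivo"]

-- Return the part after "Namespace:" when the namespace names an image,
-- comparing case-insensitively; otherwise return Nothing.
imageTarget :: String -> Maybe String
imageTarget link =
  case break (== ':') link of
    (ns, ':' : rest)
      | map toLower ns `elem` imageNamespaces -> Just rest
    _ -> Nothing
```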
2013-11-22 | Fixed bug with intraword emphasis. | John MacFarlane | 1 | -1/+2
Closes #1066.
2013-11-18 | LaTeX reader: Parse contents of curly quotes or matched `"` as quotes. | John MacFarlane | 1 | -7/+17
2013-11-17 | LaTeX reader: Support `\textnormal` as span with class "nodecor". | John MacFarlane | 1 | -0/+1
This is needed for pandoc-citeproc.
2013-11-17 | MediaWiki reader: Add automatic header identifiers. | John MacFarlane | 1 | -2/+20
2013-11-07 | recognize svg tag in HTML Reader | MinRK | 1 | -1/+1
avoids adding lots of `<p>` tags in embedded SVG content, for instance in markdown to HTML.
2013-11-03 | Markdown reader: Correctly handle empty bullet list items. | John MacFarlane | 1 | -4/+5
For example:
    - one
    -
    - two
This should NOT be parsed as a setext header followed by a list.
2013-11-03 | HTML reader: Use pandoc Div and Span for raw "<div>", "<span>". | John MacFarlane | 1 | -10/+25
Only if --parse-raw.
2013-11-01 | TexMath: Export readTeXMath', which attends to display/inline. | John MacFarlane | 1 | -5/+19
Deprecate readTeXMath, and use readTeXMath' in all the writers. Require texmath >= 0.6.5.
2013-10-29 | Markdown reader: Yaml block must start immediately after `---`. | John MacFarlane | 1 | -0/+1
If there's a blank line after `---`, we interpret it as a horizontal rule.
2013-10-22 | DocBook reader: Handle numerical attributes starting with decimal. | John MacFarlane | 1 | -6/+7
Also use safeRead instead of read.
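The motivation can be shown with a small sketch. Haskell's `read` throws a run-time error on malformed input and does not accept a leading decimal point, so a total reader is safer. The example uses `readMaybe` from base rather than pandoc's `safeRead`, and `readNumAttr` is a hypothetical name.

```haskell
import Text.Read (readMaybe)

-- Parse a numeric attribute without crashing on bad input.
-- A value such as ".5" is given a leading zero first, because the
-- standard Read instance for Double rejects a bare leading point.
readNumAttr :: String -> Maybe Double
readNumAttr s@('.' : _) = readMaybe ('0' : s)
readNumAttr s           = readMaybe s
```

Callers can then fall back to a default when the result is `Nothing` instead of failing the whole conversion.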
2013-10-21 | LaTeX reader: Improved citation parsing. | John MacFarlane | 1 | -6/+13
This fixes a run-time error that occurred with `\citet{}` (empty list of keys). It also ensures that empty keys don't get produced.
2013-10-18 | MediaWiki reader: Trim contents of `<math>` tags. | John MacFarlane | 1 | -5/+5
Otherwise we get problems when converting to markdown. Closes #1027.
2013-10-11 | LaTeX reader: Ensure that preamble doesn't contribute to text of doc. | John MacFarlane | 1 | -8/+8
2013-10-11 | LaTeX reader: Fixed character escaping in \url{}. | John MacFarlane | 1 | -3/+1
Previously `\~` wasn't handled properly, among others.
2013-09-28 | LaTeX reader: Parse {groups} as Span. | John MacFarlane | 1 | -1/+10
This is needed for accurate conversion of bibtex titles, since we need to know what was protected from titlecase conversions.
2013-09-14 | Markdown reader: unresolved citations fall back to original text. | John MacFarlane | 1 | -11/+14
Not ???. Reason: Less surprising, especially for people using @ as in twitter.