aboutsummaryrefslogtreecommitdiff
path: root/src/Text/Pandoc/Readers
AgeCommit message (Collapse)AuthorFilesLines
2017-08-09RST reader: reorganize block parsers for ~20% faster parsing.John MacFarlane1-3/+4
2017-08-09Removed spurious comments.John MacFarlane1-4/+0
2017-08-09Org reader: use org-language attribute rather than data-org-language.John MacFarlane1-1/+1
2017-08-09Org reader: use tag-name attribute instead of data-tag-name.John MacFarlane1-1/+1
2017-08-09LaTeX reader: Use `label` instead of `data-label` for label in caption.John MacFarlane1-1/+1
See d441e656db576f266c4866e65ff9e4705d376381, #3639.
2017-08-09HTML reader: parse <main> like <div role=main>. (#3791)bucklereed1-7/+11
* HTML reader: parse <main> like <div role=main>. * <main> closes <p> and behaves like a block element generally
2017-08-09Muse reader: simplify tableCell implementation (#3846)Alexander1-3/+1
2017-08-08RST reader: support :widths: attribute for table directive.John MacFarlane1-3/+13
2017-08-08Thread options through CommonMark reader.John MacFarlane1-81/+77
This is more efficient than doing AST traversals for emojis and hard breaks. Also make behavior sensitive to `raw_html` extension.
2017-08-08Support `hard_line_breaks` in CommonMark reader.John MacFarlane1-0/+7
2017-08-08CommonMark reader: support `emoji` extension.John MacFarlane1-1/+19
2017-08-08CommonMark reader: support `gfm_auto_identifiers`.John MacFarlane1-0/+31
Added `Ext_gfm_auto_identifiers`: new constructor for `Extension` in `Text.Pandoc.Extensions` [API change]. Use this in githubExtensions. Closes #2821.
2017-08-07CommonMark reader: make exts depend on extensions.John MacFarlane1-2/+4
2017-08-07Remove GFM modules; use CMarkGFM for both gfm and commonmark.John MacFarlane2-191/+63
We no longer have a separate readGFM and writeGFM; instead, we'll use readCommonMark and writeCommonMark with githubExtensions. It remains to implement these extensions conditionally. Closes #3841.
2017-08-07Markdown reader: fixed spurious parsing as citation as reference def.John MacFarlane1-2/+4
We now disallow reference keys starting with `@` if the `citations` extension is enabled. Closes #3840.
2017-08-07Added gfm (GitHub-flavored CommonMark) as an input and output format.John MacFarlane2-2/+187
This uses bindings to GitHub's fork of cmark, so it should parse gfm exactly as GitHub does (excepting certain postprocessing steps, involving notifications, emojis, etc.). * Added Text.Pandoc.Readers.GFM (exporting readGFM) * Added Text.Pandoc.Writers.GFM (exporting writeGFM) * Added `gfm` as input and output forma Note that tables are currently always rendered as HTML in the writer; this can be improved when CMarkGFM supports tables in output.
2017-08-07Small tweak to previous commit.John MacFarlane1-1/+1
2017-08-07LaTeX reader: Support simple `\def` macros.John MacFarlane1-2/+21
Note that we still don't support macros with fancy parameter delimiters, like \def\foo#1..#2{...}
2017-08-07LaTeX reader: Support `\let`.John MacFarlane2-14/+33
Also, fix regular macros so they're expanded at the point of use, and NOT also the point of definition. `\let` macros, by contrast, are expanded at the point of definition. Added an `ExpansionPoint` field to `Macro` to track this difference.
2017-08-06Muse reader: debug indented paragraph support (#3839)Alexander1-21/+5
Take only first line indentation into account and do not start new paragraph on indentation change.
2017-08-06Docx reader: Avoid 0-level headers.Jesse Rosenthal1-6/+5
We used to parse paragraphs styled with "HeadingN" as "nth-level header." But if a document has a custom style named "Heading0", this will produce a 0-level header, which shouldn't exist. We only parse this style if N>0. Otherwise we treat it as a normal style name, and follow its dependencies, if any. Closes #3830.
2017-08-06Muse reader: debug list and list item separation rules (#3837)Alexander1-5/+4
2017-08-05LaTeX reader: plainbreak, fancybreak et al from the memoir class (#3833)bucklereed1-0/+9
2017-08-03Muse reader: do not allow headers in blockquotes (#3831)Alexander Krotov1-1/+1
2017-08-03Muse reader: do not parse blocks inside comments (#3828)Alexander Krotov1-1/+1
2017-07-25Merge branch 'textcolor-support' of https://github.com/schrieveslaach/pandoc ↵John MacFarlane1-0/+20
into schrieveslaach-textcolor-support
2017-07-24LaTeX reader: support etoolbox's ifstrequal.John MacFarlane1-0/+13
2017-07-24LaTeX reader: removed code for non-exported 'macro'.John MacFarlane1-8/+0
2017-07-24LaTeX reader: Removed 'macro'.John MacFarlane2-11/+2
It is no longer necessary, since the rawLaTeXBlock parser will parse macro definitions. This also avoids the need for a separate latexMacro parser in the Markdown reader.
2017-07-24LaTeX reader: some improvements in macro parsing.John MacFarlane1-19/+19
Fixed applyMacros so that it operates on the whole string, not just the first token! Don't remove macro definitions from the output, even if Ext_latex_macros is set, so that macros will be applied. Since they're only applied to math in Markdown, removing the macros can have bad effects. Even for math macros, keeping them should be harmless.
2017-07-24Refactored some common code in LaTeX reader.John MacFarlane1-37/+15
2017-07-22HTML Reader: parse figure and figcaption (#3813)Mauro Bieg1-0/+20
2017-07-21LaTeX reader: improved heuristic for raw block/inline.John MacFarlane1-2/+18
An unknown command at the beginning of the line that could be either block or inline is treated as block if we have a sequence of block commands followed by a newline or a `\startXXX` command (which might start a raw ConTeXt environment).
2017-07-21Added TikiWiki reader (#3800)rlpowell1-0/+658
Added TikiWiki reader, including tests and documentation. It's probably not *complete*, but it works pretty well, handles all the basics (and some not-so-basics).
2017-07-21Muse reader: fix reading of lists inside tags (#3802)Alexander Krotov1-3/+2
2017-07-21LaTeX reader: handle optional args in raw `\titleformat`.John MacFarlane1-8/+15
Closes #3804.
2017-07-19LaTeX reader: fixed regression with starred environment names.John MacFarlane1-14/+3
Closes #3803.
2017-07-13Add LaTeX xspace support (#3797)schrieveslaach1-0/+12
2017-07-13Merge branch 'master' of https://github.com/jgm/pandoc into textcolor-supportMarc Schreiber22-1196/+3177
2017-07-12Fixed #3760. (#3784)Yuchen Pei1-6/+26
Using the same solution as in the LaTeX reader: equation -> displaymath align -> displaymath \begin{aligned} ... \end{aligned} etc..
2017-07-12Remove redundant imports (#3796)Alexander Krotov1-1/+1
2017-07-12Make sure \write18 is parsed as raw LaTeX.John MacFarlane1-7/+9
The change is in the LaTeX reader's treatment of raw commands, but it also affects the Markdown reader.
2017-07-11HTML reader: Ensure that paragraphs are closed properly...John MacFarlane1-0/+2
when the parent block element closes, even without `</p>`. Closes #3794.
2017-07-07Use takeP in LaTeX reader.John MacFarlane1-4/+4
2017-07-07Rewrote LaTeX reader with proper tokenization.John MacFarlane9-1095/+1720
This rewrite is primarily motivated by the need to get macros working properly. A side benefit is that the reader is significantly faster (27s -> 19s in one benchmark, and there is a lot of room for further optimization). We now tokenize the input text, then parse the token stream. Macros modify the token stream, so they should now be effective in any context, including math. Thus, we no longer need the clunky macro processing capacities of texmath. A custom state LaTeXState is used instead of ParserState. This, plus the tokenization, will require some rewriting of the exported functions rawLaTeXInline, inlineCommand, rawLaTeXBlock. * Added Text.Pandoc.Readers.LaTeX.Types (new exported module). Exports Macro, Tok, TokType, Line, Column. [API change] * Text.Pandoc.Parsing: adjusted type of `insertIncludedFile` so it can be used with token parser. * Removed old texmath macro stuff from Parsing. Use Macro from Text.Pandoc.Readers.LaTeX.Types instead. * Removed texmath macro material from Markdown reader. * Changed types for Text.Pandoc.Readers.LaTeX's rawLaTeXInline and rawLaTeXBlock. (Both now return a String, and they are polymorphic in state.) * Added orgMacros field to OrgState. [API change] * Removed readerApplyMacros from ReaderOptions. Now we just check the `latex_macros` reader extension. * Allow `\newcommand\foo{blah}` without braces. Fixes #1390. Fixes #2118. Fixes #3236. Fixes #3779. Fixes #934. Fixes #982.
2017-06-30Make `east_asian_line_breaks` affect all readers/writers.John MacFarlane1-2/+1
Closes #3703.
2017-06-28Muse reader: parse indented blockquotes (#3769)Alexander Krotov1-1/+22
2017-06-27Require nonempty alt text for `implicit_figures`.John MacFarlane1-1/+2
A figure with an empty caption doesn't make sense. Closes #2844.
2017-06-27RST reader: support anchors.John MacFarlane1-1/+23
E.g. `hello` .. _hello: paragraph This is supported by putting "paragraph" in a Div with id `hello`. Closes #262.
2017-06-27RST reader: Handle chained link definitions.John MacFarlane1-7/+20
For example, .. _hello: .. _goodbye: example.com Here both `hello` and `goodbye` should link to `example.com`. Fixes the first part of #262.