aboutsummaryrefslogtreecommitdiff
path: root/test/Tests/Readers
AgeCommit message (Collapse)AuthorFilesLines
2017-10-02Org reader: support `\n` export optionAlbert Krewinkel1-0/+7
The `\n` export option turns all newlines in the text into hard linebreaks. Closes #3950
2017-09-30Removed writerSourceURL, add source URL to common state.John MacFarlane2-8/+4
Removed `writerSourceURL` from `WriterOptions` (API change). Added `stSourceURL` to `CommonState`. It is set automatically by `setInputFiles`. Text.Pandoc.Class now exports `setInputFiles`, `setOutputFile`. The type of `getInputFiles` has changed; it now returns `[FilePath]` instead of `Maybe [FilePath]`. Functions in Class that formerly took the source URL as a parameter now have one fewer parameter (`fetchItem`, `downloadOrRead`, `setMediaResource`, `fillMediaBag`). Removed `WriterOptions` parameter from `makeSelfContained` in `SelfContained`.
2017-09-28Muse reader: parse anchorsAlexander Krotov1-0/+20
2017-09-26Muse reader: test metadata parsingAlexander Krotov1-0/+7
2017-09-25Org reader: update emphasis border charsAlbert Krewinkel1-9/+9
The org reader was updated to match current org-mode behavior: the set of characters which are acceptable to occur as the first or last character in an org emphasis have been changed and now allows all non-whitespace chars at the inner border of emphasized text (see `org-emphasis-regexp-components`). Fixes: #3933
2017-09-11Merge pull request #3911 from labdsf/muse-reader-bracesJohn MacFarlane1-0/+33
Muse reader: parse {{{ }}} example syntax
2017-09-11Muse reader: allow inline markup to be followed by punctuationAlexander Krotov1-2/+18
Previously code was not allowed to be followed by comma, and emphasis was allowed to be followed by letter.
2017-09-11Muse reader: parse {{{ }}} example syntaxAlexander Krotov1-0/+33
2017-09-11Muse reader: parse verbatim tagAlexander Krotov1-0/+2
2017-09-10Muse reader: trim newlines from <example>sAlexander Krotov1-0/+25
2017-09-09Muse reader: debug inline code markupAlexander Krotov1-2/+32
2017-09-06Muse reader: Allow finishing header with EOF (#3897)Alexander1-6/+6
2017-09-05Muse reader: require at least one space char after * in header (#3895)Alexander1-0/+1
2017-09-04Muse reader: parse <div> tag (#3888)Alexander1-0/+8
2017-08-29Muse reader: parse verse markup (#3882)Alexander1-0/+24
2017-08-28RST reader: handle blank lines correctly in line blocks (#3881)Alexander1-0/+13
Previously pandoc would sometimes combine two line blocks separated by blanks, and ignore trailing blank lines within the line block. Test is checked to be consisted with http://rst.ninjs.org/
2017-08-25Muse reader: parse <verse> tag (#3872)Alexander1-0/+21
2017-08-24LaTeX reader: RN and Rn, from biblatex (#3854)bucklereed1-0/+23
2017-08-22Muse reader: avoid crashes on multiparagraph inline tags (#3866)Alexander1-0/+10
Test checks that behavior is consistent with Amusewiki
2017-08-22Muse reader: do not allow closing tags with EOF (#3863)Alexander1-0/+6
This behavior is compatible to Amusewiki
2017-08-21Muse reader: add definition list support (#3860)Alexander1-0/+52
2017-08-17RST reader/writer: support unknown interpreted text roles...John MacFarlane1-1/+2
...by parsing them as Span with "role" attributes. This way they can be manipulated in the AST. Closes #3407.
2017-08-09Org reader: use org-language attribute rather than data-org-language.John MacFarlane1-9/+9
2017-08-09Org reader: use tag-name attribute instead of data-tag-name.John MacFarlane1-1/+1
2017-08-09HTML reader: parse <main> like <div role=main>. (#3791)bucklereed1-0/+15
* HTML reader: parse <main> like <div role=main>. * <main> closes <p> and behaves like a block element generally
2017-08-06Muse reader: debug indented paragraph support (#3839)Alexander1-6/+29
Take only first line indentation into account and do not start new paragraph on indentation change.
2017-08-06Docx reader: Add tests for avoiding zero-level header.Jesse Rosenthal1-0/+4
2017-08-06Muse reader: debug list and list item separation rules (#3837)Alexander1-0/+89
2017-08-05LaTeX reader: plainbreak, fancybreak et al from the memoir class (#3833)bucklereed1-0/+26
2017-08-03Muse reader: do not allow headers in blockquotes (#3831)Alexander Krotov1-1/+7
2017-07-30Class: Removed unnecessary withMedia, improved haddocks.John MacFarlane1-1/+3
2017-07-21Muse reader: fix reading of lists inside tags (#3802)Alexander Krotov1-0/+28
2017-07-07Rewrote LaTeX reader with proper tokenization.John MacFarlane1-1/+2
This rewrite is primarily motivated by the need to get macros working properly. A side benefit is that the reader is significantly faster (27s -> 19s in one benchmark, and there is a lot of room for further optimization). We now tokenize the input text, then parse the token stream. Macros modify the token stream, so they should now be effective in any context, including math. Thus, we no longer need the clunky macro processing capacities of texmath. A custom state LaTeXState is used instead of ParserState. This, plus the tokenization, will require some rewriting of the exported functions rawLaTeXInline, inlineCommand, rawLaTeXBlock. * Added Text.Pandoc.Readers.LaTeX.Types (new exported module). Exports Macro, Tok, TokType, Line, Column. [API change] * Text.Pandoc.Parsing: adjusted type of `insertIncludedFile` so it can be used with token parser. * Removed old texmath macro stuff from Parsing. Use Macro from Text.Pandoc.Readers.LaTeX.Types instead. * Removed texmath macro material from Markdown reader. * Changed types for Text.Pandoc.Readers.LaTeX's rawLaTeXInline and rawLaTeXBlock. (Both now return a String, and they are polymorphic in state.) * Added orgMacros field to OrgState. [API change] * Removed readerApplyMacros from ReaderOptions. Now we just check the `latex_macros` reader extension. * Allow `\newcommand\foo{blah}` without braces. Fixes #1390. Fixes #2118. Fixes #3236. Fixes #3779. Fixes #934. Fixes #982.
2017-06-28Muse reader: parse indented blockquotes (#3769)Alexander Krotov1-1/+6
2017-06-27HTML reader: Use the lang value of <html> to set the lang meta value. (#3765)bucklereed1-0/+6
* HTML reader: Use the lang value of <html> to set the lang meta value. * Fix for pre-AMP environments.
2017-06-26Muse reader: fix horizontal rule parsing (#3762)Alexander Krotov1-1/+12
Do not parse 3 dashes as horizontal rule and allow whitespace after rule
2017-06-25Muse reader: Require space before and after '=' for code (#3758)Alexander Krotov1-0/+8
2017-06-20Muse reader: check that headers start at the first column (#3749)Alexander Krotov1-0/+17
2017-06-19Muse reader: fix list item continuation parsing (#3747)Alexander Krotov1-0/+13
2017-06-19Add Muse reader (#3620)Alexander Krotov1-0/+264
2017-06-11Switched Writer types to use Text.John MacFarlane3-3/+6
* XML.toEntities: changed type to Text -> Text. * Shared.tabFilter -- fixed so it strips out CRs as before. * Modified writers to take Text. * Updated tests, benchmarks, trypandoc. [API change] Closes #3731.
2017-06-10Changed all readers to take Text instead of String.John MacFarlane8-573/+588
Readers: Renamed StringReader -> TextReader. Updated tests. API change.
2017-05-31Org reader: respect export option for tagsAlbert Krewinkel1-0/+6
Tags are appended to headlines by default, but will be omitted when the `tags` export option is set to nil. Closes: #3713
2017-05-31Org reader: include tags in headlinesAlbert Krewinkel1-17/+19
The Emacs default is to include tags in the headline when exporting. Instead of just empty spans, which contain the tag name as attribute, tags are rendered as small caps and wrapped in those spans. Non-breaking spaces serve as separators for multiple tags.
2017-05-28Org reader: Fix cite parsing behaviourHerwig Stuetz1-0/+36
Until now, org-ref cite keys included special characters also at the end. This caused problems when citations occur right before colons or at the end of a sentence. With this change, all non alphanumeric characters at the end of a cite key are ignored. This also adds `,` to the list of special characters that are legal in cite keys to better mirror the behaviour of org-export.
2017-05-27Org reader: subject full doc tree to headline transformationsAlbert Krewinkel1-2/+16
Emacs parses org documents into a tree structure, which is then post-processed during exporting. The reader is changed to do the same, turning the document into a single tree of headlines starting at levelĀ 0. Fixes: #3695
2017-05-18Org reader: fix smart parsing behaviorAlbert Krewinkel1-2/+2
Parsing of smart quotes and special characters can either be enabled via the `smart` language extension or the `'` and `-` export options. Smart parsing is active if either the extension or export option is enabled. Only smart parsing of special characters (like ellipses and en and em dashes) is enabled by default, while smart quotes are disabled. This means that all smart parsing features will be enabled by adding the `smart` language extension. Fine-grained control is possible by leaving the language extension disabled. In that case, smart parsing is controlled via the aforementioned export OPTIONS only. Previously, all smart parsing was disabled unless the language extension was enabled.
2017-05-12Replace `repeat' and `take' with `replicate'Alexander Krotov1-1/+1
2017-05-06Org reader: support macrosAlbert Krewinkel1-0/+18
Closes: #3401
2017-04-23Org reader: allow multi-word arguments to src block paramsAlbert Krewinkel1-0/+9
The reader now correctly parses src block parameter list even if parameter arguments contain multiple words. Closes: #3477