aboutsummaryrefslogtreecommitdiff
path: root/test/Tests/Readers
AgeCommit message (Collapse)AuthorFilesLines
2019-11-12Switch to new pandoc-types and use Text instead of String [API change].despresc8-16/+23
PR #5884. + Use pandoc-types 1.20 and texmath 0.12. + Text is now used instead of String, with a few exceptions. + In the MediaBag module, some of the types using Strings were switched to use FilePath instead (not Text). + In the Parsing module, new parsers `manyChar`, `many1Char`, `manyTillChar`, `many1TillChar`, `many1Till`, `manyUntil`, `mantyUntilChar` have been added: these are like their unsuffixed counterparts but pack some or all of their output. + `glob` in Text.Pandoc.Class still takes String since it seems to be intended as an interface to Glob, which uses strings. It seems to be used only once in the package, in the EPUB writer, so that is not hard to change.
2019-11-04HTML Reader/Writer - Add support for <var> and <samp> (#5861)Amogh Rathore1-0/+6
Closes #5799
2019-11-03Docx reader: fix list number resumption for sublists. Closes #4324.John MacFarlane1-0/+4
The first list item of a sublist should not resume numbering from the number of the last sublist item of the same level, if that sublist was a sublist of a different list item. That is, we should not get: ``` 1. one 1. sub one 2. sub two 2. two 3. sub one ```
2019-10-27Org reader: fix parsing of empty comment linesAlbert Krewinkel1-1/+11
Comment lines in Org-mode can be completely empty; both of these line should produce no output: # a comment # The reader used to produce a wrong result for the latter, but ignores that line as well now. Fixes: #5856
2019-10-23Add Reader support for HTML <samp> element (#5843)Amogh Rathore1-0/+6
The `<samp>` element is parsed as a Span with class `sample`. Closes #5792.
2019-10-15Muse reader: do not allow closing asterisks to be followed by "*"Alexander Krotov1-3/+23
2019-10-15Muse reader: do not split series of asterisks into symbols and emphasisAlexander Krotov1-0/+8
Fixes #5821
2019-10-15Muse reader: do not terminate emphasis on "*" not followed by spaceAlexander Krotov1-0/+4
2019-10-04hlint Muse reader testsAlexander Krotov1-1/+1
2019-09-21[Docx Reader] Use style names, not ids, for assigning semantic meaningNikolay Yakimov1-0/+9
Motivating issues: #5523, #5052, #5074 Style name comparisons are case-insensitive, since those are case-insensitive in Word. w:styleId will be used as style name if w:name is missing (this should only happen for malformed docx and is kept as a fallback to avoid failing altogether on malformed documents) Block quote detection code moved from Docx.Parser to Readers.Docx Code styles, i.e. "Source Code" and "Verbatim Char" now honor style inheritance Docx Reader now honours "Compact" style (used in Pandoc-generated docx). The side-effect is that "Compact" style no longer shows up in docx+styles output. Styles inherited from "Compact" will still show up. Removed obsolete list-item style from divsToKeep. That didn't really do anything for a while now. Add newtypes to differentiate between style names, ids, and different style types (that is, paragraph and character styles) Since docx style names can have spaces in them, and pandoc-markdown classes can't, anywhere when style name is used as a class name, spaces are replaced with ASCII dashes `-`. Get rid of extraneous intermediate types, carrying styleId information. Instead, styleId is saved with other style data. Use RunStyle for inline style definitions only (lacking styleId and styleName); for Character Styles use CharStyle type (which is basicaly RunStyle with styleId and StyleName bolted onto it).
2019-09-15Revert "FB2 reader test: better diagnostics on failure."John MacFarlane1-28/+1
This reverts commit c65af7d1a2f35cbfd1235df2960f7156d38e8f92.
2019-09-15FB2 reader test: better diagnostics on failure.John MacFarlane1-1/+28
2019-09-14FB2 reader test: Another attempt to fix test failure on GitHub CI.John MacFarlane1-4/+5
2019-09-13Revert "FB2 reader test: filter CRs."John MacFarlane1-2/+2
This reverts commit e35147d715a737bb854e0c527243f77d970d1b86.
2019-09-13FB2 reader test: filter CRs.John MacFarlane1-2/+2
This may help with the test failure on GitHub CI. https://github.com/jgm/pandoc/commit/b59e6d03762becd5c9d767463ce7ba5062a1b4a0/checks
2019-09-04Roff readers: better parsing of groups.John MacFarlane1-1/+1
We now allow groups where the closing `\\}` isn't at the beginning of a line. Closes #5410.
2019-07-28Use doctemplates 0.3, change type of writerTemplate.John MacFarlane3-3/+3
* Require recent doctemplates. It is more flexible and supports partials. * Changed type of writerTemplate to Maybe Template instead of Maybe String. * Remove code from the LaTeX, Docbook, and JATS writers that looked in the template for strings to determine whether it is a book or an article, or whether csquotes is used. This was always kludgy and unreliable. To use csquotes for LaTeX, set `csquotes` in your variables or metadata. It is no longer sufficient to put `\usepackage{csquotes}` in your template or header includes. To specify a book style, use the `documentclass` variable or `--top-level-division`. * Change template code to use new API for doctemplates.
2019-07-22Org reader: accept ATTR_LATEX in block attributesAlbert Krewinkel1-0/+7
Attributes for LaTeX output are accepted as valid block attributes; however, their values are ignored. Fixes: #5648
2019-07-14Muse: add RTL supportAlexander Krotov1-0/+3
Closes #5551
2019-07-13Merge pull request #5589 from blmage/fix-3992John MacFarlane1-1/+33
Add support for EPUB2 covers (fix #3992)
2019-06-22add tests for EPUB2 and EPUB3 cover readermartinfrancois1-1/+33
2019-06-20Add a test for MathML formulas in ODT documentsblmage1-0/+1
2019-06-09Muse reader: test that links inside image descriptions workAlexander Krotov1-0/+3
2019-05-25Muse reader: allow images inside link descriptionsAlexander Krotov1-0/+3
2019-05-13Org reader: fix planning elements in headers level 3 and higherAlbert Krewinkel1-0/+24
Planning info is now always placed before the subtree contents. Previously, the planning info was placed after the content if the header's subtree was converted to a list, which happens with headers of level 3 and higher per default. Fixes: #5494
2019-05-13Org reader: omit, but warn about unknown export optionsAlbert Krewinkel1-0/+18
Unknown export options are properly ignored and omitted from the output.
2019-05-11FB2 reader: add notes parsing testAlexander Krotov1-0/+1
2019-05-05Org reader: prefer plain symbols over math symbolsAlbert Krewinkel1-3/+7
Symbols like `\alpha` are output plain and unemphasized, not as math. Fixes: #5483
2019-05-05Org reader: recognize emphasis after TODO/DONE keywordAlbert Krewinkel1-0/+7
Fixes: #5484
2019-03-25HTML reader: read `data-foo` attribute into `foo`.John MacFarlane1-1/+1
The HTML writer adds the `data-` prefix for HTML5 for nonstandard attributes. But the attributes are represented in the AST without the `data-` prefix, so we should strip this when reading HTML. Closes #5392.
2019-02-18Docx reader: add tests for trimming last inline.Jesse Rosenthal1-0/+4
2019-02-18Muse reader: add secondary note supportAlexander Krotov1-0/+17
2019-02-12Docx reader: Add test for reading sdts in footnotes.Jesse Rosenthal1-0/+4
2019-02-06Docx reader: Tests for alternate document.xmlJesse Rosenthal1-2/+7
2019-02-06Muse reader: test that block level markup does not break <verbatim>Alexander Krotov1-0/+7
2019-02-04Add missing copyright notices and remove license boilerplate (#5112)Albert Krewinkel28-0/+311
Quite a few modules were missing copyright notices. This commit adds copyright notices everywhere via haddock module headers. The old license boilerplate comment is redundant with this and has been removed. Update copyright years to 2019. Closes #4592.
2019-01-09RST reader: change treatment of `number-lines` directives. (#5207)Brian Leung1-2/+2
Directives of this type without numeric inputs should not have a `startFrom` attribute; with a blank value, the writers can produce extra whitespace.
2019-01-08Removed superfluous sourceCode class on code blocks.John MacFarlane2-8/+8
* These were added by the RST reader and, for literate Haskell, by the Markdown and LaTeX readers. There is no point to this class, and it is not applied consistently by all readers. See #5047. * Reverse order of `literate` and `haskell` classes on code blocks when parsing literate Haskell. Better if `haskell` comes first.
2019-01-07Org reader: handle `minlevel` option differently. (#5190)Brian Leung1-1/+6
When `minlevel` exceeds the original minimum level observed in the file to be included, every heading should be shifted rightward.
2019-01-06Add DokuWiki reader (#5108)Alexander1-0/+315
Closes #1792
2019-01-01Org reader: fix self-link parsing regressionAlbert Krewinkel1-0/+4
Fixes a regression introduced by the previous commit.
2019-01-01Org reader: fix treatment of links to imagesAlbert Krewinkel1-13/+17
Links with descriptions which are pointing to images are no longer read as inline images, but as proper links. Fixes: #5191
2018-12-25Muse reader tests: test #cover directiveAlexander Krotov1-0/+5
2018-12-10Docx: add test for lists with level overrides.Jesse Rosenthal1-0/+4
2018-11-14Muse reader: trim whitespace before parsing grid table cellsAlexander Krotov1-0/+11
2018-11-14Muse reader: add grid tables supportAlexander Krotov1-0/+43
2018-11-02table testsYan Pashkovsky1-0/+11
2018-10-30Roff reader: properly handle unknown backslash escapes.John MacFarlane1-0/+2
They are treated as regular characters, according to groff 7. Cloess #5034.
2018-10-28Roff tokenizer: better handling of escapes.John MacFarlane1-2/+2
2018-10-28Roff tokenizer: revamped font parsing using escapeArg.John MacFarlane1-1/+1
Add support for \C'...' escapes.