aboutsummaryrefslogtreecommitdiff
path: root/tests/Tests/Readers
AgeCommit message (Collapse)AuthorFilesLines
2017-01-30Fixed some compiler warnings in tests.John MacFarlane2-4/+0
2017-01-25Removed readerVerbosity and writerVerbosity.John MacFarlane1-1/+2
API change. Also added a verbosity parameter to makePDF.
2017-01-25Removed unneeded imports.John MacFarlane1-2/+0
2017-01-25Made `smart` extension default for pandoc markdown.John MacFarlane5-22/+27
Updated tests.
2017-01-25Removed readerSmart and the --smart option; added Ext_smart extension.John MacFarlane2-2/+4
Now you will need to do -f markdown+smart instead of -f markdown --smart This change opens the way for writers, in addition to readers, to be sensitive to +smart, but this change hasn't yet been made. API change. Command-line option change. Updated manual.
2017-01-25Make Extensions a custom type instead of a Set Extension.John MacFarlane1-4/+3
The type is implemented in terms of an underlying bitset which should be more efficient. API change: from Text.Pandoc.Extensions export Extensions, emptyExtensions, extensionsFromList, enableExtension, disableExtension, extensionEnabled.
2017-01-25Removed `--normalize` option and normalization functions from Shared.John MacFarlane2-2/+2
* Removed normalize, normalizeInlines, normalizeBlocks from Text.Pandoc.Shared. These shouldn't now be necessary, since normalization is handled automatically by the Builder monoid instance. * Remove `--normalize` command-line option. * Don't use normalize in tests. * A few revisions to readers so they work well without normalize.
2017-01-25Added ReaderOptions parameter to readNative.John MacFarlane2-2/+2
This makes it similar to the other readers -- even though ReaderOptions is essentially ignored, the uniformity is nice.
2017-01-25Class: no more MonadState CommonState.John MacFarlane1-2/+2
- Added getCommonState, putCommonState, getsCommonState, modifyCommonState to PandocMonad interface. - Removed MonadState CommonState instances.
2017-01-25ODT test: remove unnecessary imports.Jesse Rosenthal1-1/+1
2017-01-25Make Txt2Tags test pass.Jesse Rosenthal1-4/+9
We don't have a good way to set things that aren't in the common state. That will be the next order of business.
2017-01-25Update all tests to use new readers and error structure.Jesse Rosenthal9-25/+28
2017-01-25Changed readNative to use PandocMonad.John MacFarlane2-2/+6
2017-01-25Update tests to work with runPure.John MacFarlane2-2/+2
2017-01-19Org reader: allow short hand for single-line raw blocksAlbert Krewinkel1-0/+12
Single-line raw blocks can be given via `#+FORMAT: raw line`, where `FORMAT` must be one of `latex`, `beamer`, `html`, or `texinfo`. Closes: #3366
2017-01-06Org reader: accept org-ref citations followed by commasAlbert Krewinkel1-0/+12
Bugfix for an issue which, whenever the citation was immediately followed by a comma, prevented correct parsing of org-ref citations.
2017-01-05Org reader: ensure emphasis markup can be nestedAlbert Krewinkel1-0/+4
Nested emphasis markup (e.g. `/*strong and emphasized*/`) was interpreted incorrectly in that the inner markup was not recognized.
2016-12-24markdown reader: disallow space between inline code and attributes (#3326)Mauro Bieg1-1/+2
closes #3323
2016-12-13Test case for issue #3223 (#3308)hubertp-lshift1-0/+1
2016-12-13Test case for bug 2752 (#3306)hubertp-lshift1-0/+1
2016-12-08Docx reader: Ensure one-row tables don't have header.Jesse Rosenthal1-0/+4
Tables in MS Word are set by default to have special first-row formatting, which pandoc uses to determine whether or not they have a header. This means that one-row tables will, by default, have only a header -- which we imagine is not what people want. This change ensures that a one-row table is not understood to be a header only. Note that this means that it is impossible to produce a header-only table from docx, even though it is legal pandoc. But we believe that in nearly all cases, it will be an accidental (and unwelcome) result Closes #3285.
2016-11-30Options: Removed writerStandalone, made writerTemplate a Maybe.John MacFarlane2-6/+6
Previously setting writerStandalone = True did nothing unless a template was provided in writerTemplate. Now a fragment will be generated if writerTemplate is Nothing; otherwise, the specified template will be used and standalone output generated. [API change]
2016-11-26[odt] Infer table's caption from the paragraph (#3224)hubertp-lshift1-0/+1
ODT's reader always put empty captions for the parsed tables. This commit 1) checks paragraphs that follow the table definition 2) treats specially a paragraph with a style named 'Table' 3) does some postprocessing of the paragraphs that combines tables followed immediately by captions The ODT writer used 'TableCaption' style name for the caption paragraph. This commit follows the open office approach which allows for appending captions to table but uses a built-in style named 'Table' instead of 'TableCaption'. Any users of odt format (both writer and reader) are therefore required to change the style's name to 'Table', if necessary.
2016-11-19Org reader: Ensure images in paragraphs are not parsed as figuresAlbert Krewinkel1-12/+22
This fixes a regression introduced in 7e5220b57c5a48fabe6e43ba270db812593d3463.
2016-11-13HTML reader: only treat "a" element as link if it has href.John MacFarlane1-0/+4
Otherwise treat as span. Closes #3226.
2016-11-09Org reader: allow HTML attribs on non-figure imagesAlbert Krewinkel1-0/+6
Images which are the only element in a paragraph can still be given HTML attributes, even if the image does not have a caption and is hence not a figure. The following will add set the `width` attribute of the image to `50%`: #+ATTR_HTML: :width 50% [[file:image.jpg]] Closes: #3222
2016-11-08Inline code when text has a special styleHubert Plociniczak1-0/+1
When a piece of text has a text 'Source_Text' then we assume that this is a piece of the document that represents a code that needs to be inlined. Addapted an odt writer to also reflect that change; previously it was just writing a 'preformatted' text using a non-distinguishable font style. Code blocks are still not recognized by the ODT reader. That's a separate issue.
2016-10-30Org reader: support `ATTR_HTML` for special blocksAlbert Krewinkel1-0/+9
Special blocks (i.e. blocks with unrecognized names) can be prefixed with an `ATTR_HTML` block attribute. The attributes defined in that meta-directive are added to the `Div` which is used to represent the special block. Closes: #3182
2016-10-30Org reader: support the `todo` export optionAlbert Krewinkel1-0/+6
The `todo` export option allows to toggle the inclusion of TODO keywords in the output. Setting this to `nil` causes TODO keywords to be dropped from headlines. The default is to include the keywords.
2016-10-30Org reader: add support for todo-markersAlbert Krewinkel1-125/+165
Headlines can have optional todo-markers which can be controlled via the `#+TODO`, `#+SEQ_TODO`, or `#+TYP_TODO` meta directive. Multiple such directives can be given, each adding a new set of recognized todo-markers. If no custom todo-markers are defined, the default `TODO` and `DONE` markers are used. Todo-markers are conceptually separate from headline text and are hence excluded when autogenerating headline IDs. The markers are rendered as spans and labelled with two classes: One class is the markers name, the other signals the todo-state of the marker (either `todo` or `done`).
2016-10-26Markdown Reader: add attributes for autolink (#3183)Daniele D'Orazio1-1/+10
2016-10-24Export Text.Pandoc.Error in Text.Pandoc.John MacFarlane7-13/+1
[API change]
2016-10-23Tighten up parsing of raw email addresses.John MacFarlane1-0/+5
Technically `**@user` is a valid email address, but if we allow things like this, we get bad results in markdown flavors that autolink raw email addresses. (See #2940.) So we exclude a few valid email addresses in order to avoid these more common bad cases. Closes #2940.
2016-10-18Merge pull request #3165 from hubertp-lshift/feature/odt-imageJohn MacFarlane1-2/+4
[odt] images parser
2016-10-17Infer caption from the text following the imgHubert Plociniczak1-2/+4
Frame can contain other frames with the text boxes. This is something that has not been considered before and meant that the whole construction of images was broken in those cases. Also the captions were fixed/ignored.
2016-10-17RST reader: Add test for space-before-note.Jesse Rosenthal1-0/+9
2016-10-14Org reader: allow figure with empty captionAlbert Krewinkel1-0/+6
A `#+CAPTION` attribute before an image is enough to turn an image into a figure. This wasn't the case because the `parseFromString` function, which processes the caption value, would fail on empty values. Adding a newline character to the caption value fixes this. Fixes: #3161
2016-10-14Remove Tests.ArbitraryJesse Rosenthal5-5/+5
Use exported Arbitrary instances from pandoc-types instead.
2016-10-13Parse line-oriented markup as LineBlockAlbert Krewinkel2-13/+9
Markup-features focusing on lines as distinctive part of the markup are read into `LineBlock` elements. This currently means line blocks in reStructuredText and Markdown (the latter only if the `line_block` extension is enabled), the `linegroup`/`line` combination from the Docbook 5.1 working draft, and Org-mode `VERSE` blocks.
2016-08-30Org reader: respect unnumbered header propertyAlbert Krewinkel1-0/+9
Sections the `unnumbered` property should, as the name implies, be excluded from the automatic numbering of section provided by some output formats. The Pandoc convention for this is to add an "unnumbered" class to the header. The reader treats properties as key-value pairs per default, so a special case is added to translate the above property to a class instead. Closes #3095.
2016-08-29Merge branch 'org-meta-handling'Albert Krewinkel1-63/+124
2016-08-29Docx reader: test for nested anchor spans in headerJesse Rosenthal1-0/+4
This ensures that anchor spans in header with content (or with other anchor spans inside) will resolve to links to a header id properly.
2016-08-29Org reader: respect `creator` export optionAlbert Krewinkel1-0/+6
The `creator` option controls whether the creator meta-field should be included in the final markup. Setting `#+OPTIONS: creator:nil` will drop the creator field from the final meta-data output. Org-mode recognizes the special value `comment` for this field, causing the creator to be included in a comment. This is difficult to translate to Pandoc internals and is hence interpreted the same as other truish values (i.e. the meta field is kept if it's present).
2016-08-29Org reader: respect `email` export optionAlbert Krewinkel1-0/+6
The `email` option controls whether the email meta-field should be included in the final markup. Setting `#+OPTIONS: email:nil` will drop the email field from the final meta-data output.
2016-08-29Org reader: respect `author` export optionAlbert Krewinkel1-0/+6
The `author` option controls whether the author should be included in the final markup. Setting `#+OPTIONS: author:nil` will drop the author from the final meta-data output.
2016-08-29Put Org reader export option tests into test groupAlbert Krewinkel1-61/+65
Using a separate test group instead of prefixing the test subject should be clearer than the current approach.
2016-08-29Org reader: read HTML_head as header-includesAlbert Krewinkel1-0/+7
HTML-specific head content can be defined in `#+HTML_head` lines. They are parsed as format-specific inlines to ensure that they will only show up in HTML output.
2016-08-29Org reader: set classoption meta from LaTeX_class_optionsAlbert Krewinkel1-0/+5
2016-08-29Org reader: set documentclass meta from LaTeX_classAlbert Krewinkel1-0/+5
2016-08-29Org reader: read LaTeX_header as header-includesAlbert Krewinkel1-0/+7
LaTeX-specific header commands can be defined in `#+LaTeX_header` lines. They are parsed as format-specific inlines to ensure that they will only show up in LaTeX output.