aboutsummaryrefslogtreecommitdiff
path: root/tests/Tests/Readers
AgeCommit message (Collapse)AuthorFilesLines
2014-06-23Add test for correctly trimming spaces in formatting.Jesse Rosenthal1-0/+4
This used to be fixed in the tree-walking. We need to make sure we're doing it right now.
2014-06-22Docx reader tests: add tests for normalization deep in blocks.Jesse Rosenthal1-0/+4
2014-06-22Add normalization test.Jesse Rosenthal1-0/+4
Add torture-test for new normalization functions. One problem that this test demonstrates is that word has a tendency to turn off formatting at a space, and then turn it back on after. I'm not sure yet whether this is something we should fix.
2014-06-20Docx reader tests: Introduce NoNormPandoc type.Jesse Rosenthal1-2/+27
This is just a wrapper around Pandoc that doesn't normalize with `toString`. We want to make sure that our own normalization process works. If, in the future, we are able to hook into the builder's normalization, this will be removed.
2014-06-19Add tabs tests.Jesse Rosenthal1-1/+6
2014-06-19Add ReaderOptions to the docx testsJesse Rosenthal1-8/+11
This will allow for testing different media embedding (in addition to any other applicable options.)
2014-06-16Naming: Use Docx instead of DocX.John MacFarlane1-3/+3
For consistency with the existing writer.
2014-06-16Merge branch 'docx' of https://github.com/jkr/pandoc into jkr-docxJohn MacFarlane1-0/+68
2014-06-16Org reader: make tildes create inline code.John MacFarlane1-3/+3
Closes #1345. Also relabeled 'code' and 'verbatim' parsers to accord with the org-mode manual. I'm not sure what the distinction between code and verbatim is supposed to be, but I'm pretty sure both should be represented as Code inlines in pandoc. The previous behavior resulted in the text not appearing in any output format.
2014-06-16Add DocX automated tests.Jesse Rosenthal1-0/+68
Note this makes use of input and output files in the tests/ dir.
2014-05-20Org reader: support for inline LaTeXAlbert Krewinkel1-0/+27
Inline LaTeX is now accepted and parsed by the org-mode reader. Both, math symbols (like \tau) and LaTeX commands (like \cite{Coffee}), can be used without any further escaping.
2014-05-14Org reader: support Pandocs citation extensionAlbert Krewinkel1-0/+22
Citations are defined via the "normal citation" syntax used in markdown, with the sole difference that newlines are not allowed between "[...]". This is for consistency, as org-mode generally disallows newlines between square brackets. The extension is turned on by default and can be turned off via the default syntax-extension mechanism, i.e. by specifying "org-citation" as the input format. Move `citeKey` from Readers.Markdown into Parsing The function can be used by other readers, so it is made accessible for all parsers.
2014-05-10Org reader: Fix block parameter reader, relax constraintsAlbert Krewinkel1-0/+12
The reader produced wrong results for block containing non-letter chars in their parameter arguments. This patch relaxes constraints in that it allows block header arguments to contain any non-space character (except for ']' for inline blocks). Thanks to Xiao Hanyu for noticing this.
2014-05-09Org reader: Fix parsing of blank lines within blocksAlbert Krewinkel1-0/+9
Blank lines were parsed as two newlines instead of just one. Thanks to Xiao Hanyu (@xiaohanyu) for pointing this out.
2014-05-09Org reader: Support arguments for code blocksAlbert Krewinkel1-0/+14
The general form of source block headers (`#+BEGIN_SRC <language> <switches> <header arguments>`) was not recognized by the reader. This patch adds support for the above form, adds header arguments to the block's key-value pairs and marks the block as a rundoc block if header arguments are present. This closes #1286.
2014-05-06Org reader: Read inline code blocksAlbert Krewinkel1-0/+18
Org's inline code blocks take forms like `src_haskell(print "hi")` and are frequently used to include results from computations called from within the document. The blocks are read as inline code and marked with the special class `rundoc-block`. Proper handling and execution of these blocks is the subject of a separate library, rundoc, which is work in progress. This closes #1278.
2014-05-04Added test for #1154.John MacFarlane1-0/+9
2014-05-01Org reader: Add support for custom link typesAlbert Krewinkel1-0/+26
Org allows users to define their own custom link types. E.g., in a document with a lot of links to Wikipedia articles, one can define a custom wikipedia link-type via #+LINK: wp https://en.wikipedia.org/wiki/ This allows to write [[wp:Org_mode][Org-mode]] instead of the equivallent [[https://en.wikipedia.org/wiki/Org_mode][Org-mode]].
2014-04-25Org reader: Enable internal linksAlbert Krewinkel1-0/+25
Internal links in Org are possible by using an anchor-name as the target of a link: [[some-anchor][This]] is an internal link. It links <<some-anchor>> here.
2014-04-24Org reader: Recognize plain and angle linksAlbert Krewinkel1-0/+14
This adds support for plain links (like http://zeitlens.com) and angle links (like <http://moltkeplatz.de>).
2014-04-19Org reader: Allow for compact definition listsAlbert Krewinkel1-0/+12
Use `Text.Pandoc.Shared.compactify'DL` to allow for compact definition lists.
2014-04-19Org reader: Fix parsing of footnotesAlbert Krewinkel1-0/+42
Footnotes can consist of multiple blocks and end only at a header or at the beginning of another footnote. This fixes the previous behavior, which restricted notes to a single paragraph.
2014-04-19Org reader: Apply captions to code blocks and tablesAlbert Krewinkel1-1/+30
The `Table` blocktype already takes the caption as an argument, while code blocks are wrapped in a `Div` block together with a labelling `Span`.
2014-04-18Org reader: Add support for plain LaTeX fragmentsAlbert Krewinkel1-1/+19
This adds support for LaTeX fragments like the following: ``` \begin{equation} \int fg \mathrm{d}x \end{equation} ```
2014-04-18Org reader: Fix parsing of loose listsAlbert Krewinkel1-5/+16
Loose lists (i.e. lists with blankline separated items), were parsed as multiple lists, each containing a single item. This patch fixes this issue.
2014-04-17Org reader: Support more types of '#+BEGIN_<type>' blocksAlbert Krewinkel1-23/+74
Support for standard org-blocks is improved. The parser now handles "HTML", "LATEX", "ASCII", "EXAMPLE", "QUOTE" and "VERSE" blocks in a sensible fashion.
2014-04-17Org reader: Support footnotesAlbert Krewinkel1-0/+4
2014-04-13Merge pull request #1239 from tarleb/org-linebreakJohn MacFarlane1-0/+4
Org linebreaks
2014-04-12Org reader: Read linebreaksAlbert Krewinkel1-0/+4
Linebreaks are marked by the string `\\` at the end of a line.
2014-04-12Org reader: Add support for figuresAlbert Krewinkel1-0/+21
Support for figures (images with name and caption) is added.
2014-04-11Org reader: Fix parsing of sub-/superscript expressionsAlbert Krewinkel1-9/+30
This fixes the org-reader's handling of sub- and superscript expressions. Simple expressions (like `2^+10`), expressions in parentheses (`a_(n+1)`) and nested sexp (like `a_(nested()parens)`) are now read correctly.
2014-04-10Org reader: Support more inline/display math variantsAlbert Krewinkel1-6/+30
Support all of the following variants as valid ways to define inline or display math inlines: - `\[..\]` (display) - `$$..$$` (display) - `\(..\)` (inline) - `$..$` (inline) This closes #1223. Again.
2014-04-09Org reader: Precise rules for the recognition of markupAlbert Krewinkel1-2/+23
The inline parsers have been rewritten using the org source code as a reference. This fixes a couple of bugs related to erroneous markup recognition.
2014-04-07Org reader: Support inline math (like $E=mc^2$)Albert Krewinkel1-0/+4
Closes #1223.
2014-04-06Org reader: Add support for definition listsAlbert Krewinkel1-3/+23
2014-04-05Org reader: Support inline imagesAlbert Krewinkel1-2/+10
2014-04-05Org reader: Fix parsing of nested inlinesAlbert Krewinkel1-0/+4
Text such as /*this*/ was not correctly parsed as a strong, emphasised word. This was due to the end-of-word recognition being to strict as it did not accept markup chars as part of a word. The fix involves an additional parser state field, listing the markup chars which might be parsed as part of a word.
2014-03-04Add a simple Emacs Org-mode readerAlbert Krewinkel1-0/+533
The basic structure of org-mode documents is recognized; however, org-mode features like todo markers, tags etc. are not supported yet.
2013-12-19HLint: redundant parensHenry de Valence2-3/+3
Remove parens enclosing a single element.
2013-11-22Fixed bug with intraword emphasis.John MacFarlane1-0/+5
Closes #1066.
2013-09-01Markdown reader: Don't autolink a bare URI that is followed by `</a>`.John MacFarlane1-1/+4
Closes #937.
2013-09-01Use registerHeader in RST and LaTeX readers.John MacFarlane1-5/+5
This will give automatic unique identifiers, unless `-auto_identifiers` is specified.
2013-08-22Write id for code block to label attr in latex when listing is usedFlorian Eitel1-0/+7
The code: ~~~{#test} asdf ~~~ gets compiled to html: <pre id="test"> asdf </pre> So it is possible to link to the identifier `test` But this doesn't happen on latex When using the listings package (`--listings`) it is possible to set the identifier using the `label=test` property: \begin{lstlisting}[label=id] hi \end{lstlisting} And this is exactly what this patch is doing. Modified LaTeX Reader/Writer and added tests for this.
2013-07-21LaTeX reader: Don't add spurious ", " to citation suffixes.John MacFarlane1-18/+18
This is added when needed in Text.Pandoc.Biblio anyway.
2013-06-24Use new flexible metadata type.John MacFarlane1-11/+23
* Depend on pandoc 1.12. * Added yaml dependency. * `Text.Pandoc.XML`: Removed `stripTags`. (API change.) * `Text.Pandoc.Shared`: Added `metaToJSON`. This will be used in writers to create a JSON object for use in the templates from the pandoc metadata. * Revised readers and writers to use the new Meta type. * `Text.Pandoc.Options`: Added `Ext_yaml_title_block`. * Markdown reader: Added support for YAML metadata block. Note that it must come at the beginning of the document. * `Text.Pandoc.Parsing.ParserState`: Replace `stateTitle`, `stateAuthors`, `stateDate` with `stateMeta`. * RST reader: Improved metadata. Treat initial field list as metadata when standalone specified. Previously ALL fields "title", "author", "date" in field lists were treated as metadata, even if not at the beginning. Use `subtitle` metadata field for subtitle. * `Text.Pandoc.Templates`: Export `renderTemplate'` that takes a string instead of a compiled template.. * OPML template: Use 'for' loop for authors. * Org template: '#+TITLE:' is inserted before the title. Previously the writer did this.
2013-06-19Added test for #882.John MacFarlane1-0/+5
2013-03-17Revert "LaTeX reader: citation handling changes."John MacFarlane1-8/+2
This reverts commit f7229b147314042f946dfded3b441ab0fae260a0.
2013-03-09LaTeX reader: citation handling changes.John MacFarlane1-2/+8
Previously, a LaTeX citation would always be parsed as a Citation element, with the raw LaTeX in the [Inline] part. Now, the LaTeX citation is parsed as a Citation element only if `--biblio` was specified (i.e. only if there is a nonempty set of references in readerReferences). Otherwise it is parsed as raw LaTeX. This will make it possible to simplify some things in the markdown writer. It also makes the LaTeX reader behave more like the Markdown reader.
2013-01-23Eliminated use of TH in test suite.John MacFarlane2-20/+19
2013-01-15Changed Ext_autolink_urls -> Ext_autolink_bare_uris.John MacFarlane1-1/+1
Added tests.