path: root/src
Age | Commit message | Author | Files | Lines
2020-07-23 | Markdown writer: use numerical labels for refs... | John MacFarlane | 1 | -5/+9
...that are longer than 999 characters or contain square brackets. For conformity with commonmark. Closes #6560
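The rule described in this commit can be sketched as follows. This is an illustrative Python sketch, not pandoc's Haskell implementation; the function name `reference_label` and its `index` parameter are hypothetical.

```python
def reference_label(label: str, index: int) -> str:
    """Pick the label to print for a reference link.

    Assumption: `index` is a running number the writer assigns to the
    reference. Labels longer than 999 characters, or containing square
    brackets, are replaced by that number.
    """
    if len(label) > 999 or "[" in label or "]" in label:
        return str(index)
    return label
```

A label of exactly 999 characters is kept as-is; only longer ones are replaced.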
2020-07-23 | Add `raw_markdown` extension affecting `ipynb` reader. | John MacFarlane | 3 | -4/+10
Specifying `-f ipynb+raw_markdown` will cause Markdown cells to be represented as raw Markdown blocks, instead of being parsed. This is not what you want when going from `ipynb` to other formats, but it may be useful when going from `ipynb` to Markdown or to `ipynb`, to avoid semantically insignificant changes in the contents of the Markdown cells that might otherwise be introduced. Closes #5408.
2020-07-23 | Minimal support for SIRange in LaTeX reader (#6418) | Emerson Harkin | 1 | -0/+24
Add support for `\SIRange{firstnumber}{secondnumber}{unit}` provided by siunitx. An en-dash is used instead of localized "to".
2020-07-23 | Col-span and row-span in LaTeX reader (#6470) | Laurent P. René de Cotret | 1 | -19/+57
Add multirow and multicolumn support in LaTeX reader. Partially addresses #6311.
2020-07-23 | Further improvements to ams theorem support, and a test. | John MacFarlane | 1 | -2/+3
See #1608.
2020-07-23 | LaTeX reader: Add identifier in divs for ams theorem environments. | John MacFarlane | 1 | -1/+1
2020-07-22 | LaTeX reader: Support ams `\theoremstyle`. | John MacFarlane | 2 | -8/+38
2020-07-22 | Implement first optional argument for `\newtheorem`. | John MacFarlane | 1 | -23/+30
This allows groups of theorem environments to be put in the same numbering sequence.
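As an illustration of a shared numbering sequence, here is a minimal Python sketch. It is not the LaTeX reader's code; the class `TheoremCounters` and its method names are hypothetical. It models `\newtheorem{lemma}[theorem]{Lemma}` as a lemma environment that draws numbers from the theorem counter.

```python
class TheoremCounters:
    """Track theorem numbers, letting environments share one counter."""

    def __init__(self):
        self.counter_of = {}  # environment name -> name of the counter it uses
        self.counts = {}      # counter name -> last number issued

    def newtheorem(self, name, shared_with=None):
        # With no optional argument the environment gets its own counter;
        # otherwise it reuses the counter of the named environment.
        if shared_with is None:
            key = name
        else:
            key = self.counter_of.get(shared_with, shared_with)
        self.counter_of[name] = key
        self.counts.setdefault(key, 0)

    def next_number(self, name):
        key = self.counter_of[name]
        self.counts[key] += 1
        return self.counts[key]
```

With `theorem` and `lemma` sharing a counter, a theorem, a lemma, and another theorem are numbered 1, 2, 3 rather than 1, 1, 2.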
2020-07-22 | LaTeX reader: Don't boldface alt title in theorems. | John MacFarlane | 1 | -2/+2
2020-07-22 | LaTeX reader: support theorem environments and `\newtheorem`. | John MacFarlane | 2 | -3/+65
Includes numbering and labels and refs. Note that numbering support is not complete; we don't reset numbers with sections for example.
2020-07-22 | LaTeX reader: support ams proof environment. | John MacFarlane | 2 | -0/+37
2020-07-22 | Moved more from LaTeX reader to LaTeX.Parsing. | John MacFarlane | 2 | -63/+67
2020-07-22 | Docx writer: support --number-sections. | John MacFarlane | 1 | -4/+17
Closes #1413.
2020-07-21 | RST reader: fix csv tables with multiline cells. | John MacFarlane | 1 | -1/+6
Closes #6549.
2020-07-20 | Move some code from T.P.R.LaTeX to T.P.R.LaTeX.Parsing. | John MacFarlane | 2 | -70/+78
We need to reduce the size of the LaTeX reader to ease compilation on resource-limited systems. More can be done in this vein.
2020-07-19 | Markdown writer: move asciify out of escapeString. | John MacFarlane | 1 | -9/+9
Otherwise unsmartify doesn't catch quotes that have already been turned to entities.
2020-07-19 | Remove use of cmark-gfm for commonmark/gfm rendering. | John MacFarlane | 3 | -429/+97
Instead rely on the markdown writer with appropriate extensions. Export writeCommonMark variant from Markdown writer. This changes a few small things in rendering markdown, e.g. w/r/t requiring backslashes before spaces inside super/subscripts.
2020-07-19 | Markdown writer: use unicode super/subscript characters... | John MacFarlane | 1 | -18/+44
when possible if the superscript or subscript extension or raw_html aren't available.
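The fallback can be sketched roughly as below, in Python rather than pandoc's Haskell. The names `SUPERSCRIPT`, `SUBSCRIPT`, and `to_unicode_script` are hypothetical, and the character tables cover only digits and a few symbols as an assumption. "When possible" is modelled as: every character must have a Unicode equivalent, otherwise no conversion is done.

```python
PLAIN = "0123456789+-=()"
SUPERSCRIPT = dict(zip(PLAIN, "⁰¹²³⁴⁵⁶⁷⁸⁹⁺⁻⁼⁽⁾"))
SUBSCRIPT = dict(zip(PLAIN, "₀₁₂₃₄₅₆₇₈₉₊₋₌₍₎"))


def to_unicode_script(text, table):
    """Return `text` in Unicode super/subscript characters, or None.

    None signals that at least one character has no equivalent in
    `table`, so the caller must fall back to another rendering.
    """
    if all(ch in table for ch in text):
        return "".join(table[ch] for ch in text)
    return None
```

For instance, `to_unicode_script("2", SUPERSCRIPT)` gives `²`, while a superscript containing a letter such as `x` yields `None`.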
2020-07-19 | Markdown writer: render caption as following paragraph... | John MacFarlane | 1 | -2/+5
when `Ext_table_caption` not enabled.
2020-07-19 | Add commonmark_x output format... | John MacFarlane | 4 | -3/+34
commonmark with a number of useful extensions (more than gfm).
2020-07-19 | Trim down githubMarkdownExtensions. | John MacFarlane | 1 | -9/+11
Previously it included all of the following, which make sense for the legacy markdown_github but not for gfm, since they are part of base commonmark and thus can't be turned off in gfm:
- `Ext_all_symbols_escapable`
- `Ext_backtick_code_blocks`
- `Ext_fenced_code_blocks`
- `Ext_space_in_atx_header`
- `Ext_intraword_underscores`
- `Ext_lists_without_preceding_blankline`
- `Ext_shortcut_reference_links`
These have been removed from `githubMarkdownExtensions`, though they're still turned on for legacy `markdown_github`.
2020-07-19 | Add generic `attributes` extension. | John MacFarlane | 2 | -8/+3
This allows attributes to be added to any block or inline element, in principle. (Though in many cases this will be done by adding a Div or Span container, since pandoc's AST doesn't have a slot for attributes for most elements.) Currently this is only possible with the commonmark and gfm readers. Add `Ext_attributes` constructor for `Extension` [API change].
2020-07-19 | Use commonmark-hs to parse commonmark/gfm... | John MacFarlane | 2 | -198/+57
...instead of cmark-gfm (a wrapper around a C library). We can now support many more pandoc extensions for commonmark and gfm. Add fenced_code_attributes to gfm/commonmark extensions.
2020-07-19 | Merge pull request #6495 from tarleb/html5-figure-accessiblity | John MacFarlane | 1 | -2/+10
HTML writer: improve alt-text/caption handling for HTML5
2020-07-18 | DocBook reader: parse releaseinfo as metadata. | John MacFarlane | 1 | -0/+1
Closes #6542. Note that you'll need to put releaseinfo somewhere in your template if you want this to be part of the converted output.
2020-07-17 | Jira writer: keep image caption as alt attribute | Albert Krewinkel | 1 | -10/+12
Fixes #6529
2020-07-14 | Docbook reader: remove misleading comment... | John MacFarlane | 1 | -1/+1
suggesting that releaseinfo is handled. It isn't.
2020-07-13 | Merge pull request #6527 from lierdakil/fix-6514 | John MacFarlane | 3 | -55/+59
[Docx Reader] Only use bCs/iCs on runs with rtl or cs property
2020-07-13 | [Docx Reader] Only use bCs/iCs on runs with rtl or cs property | Nikolay Yakimov | 3 | -55/+59
Fixes #6514
2020-07-12 | Merge pull request #6513 from brisad/master | John MacFarlane | 1 | -1/+3
Escape starting periods in ms writer code blocks
2020-07-12 | Merge pull request #6509 from lierdakil/docx-smush-inlines-refactor | John MacFarlane | 1 | -62/+39
[Docx Reader] Refactor/update Text.Pandoc.Readers.Docx.Combine.smushInlines
2020-07-12 | Ms writer: fix code highlighting with blank lines. | John MacFarlane | 1 | -5/+5
Previously blank lines were simply omitted from highlighted code.
2020-07-12 | RST reader: fix spurious newlines in some attributes from directives. | John MacFarlane | 1 | -1/+2
2020-07-12 | RST reader: avoid extra newline in included code blocks. | John MacFarlane | 1 | -2/+2
2020-07-08 | Escape starting periods in ms writer code blocks | Michael Hoffmann | 1 | -1/+3
If a line of ms code block output starts with a period (.), it should be prepended by '\&' so that it is not interpreted as a roff command. Fixes #6505
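The escaping rule can be sketched as follows. This is an illustrative Python version, not the writer's actual Haskell code; the function name `escape_leading_periods` is hypothetical.

```python
def escape_leading_periods(code: str) -> str:
    """Prefix every line that starts with '.' with the roff escape \\&.

    \\& is a zero-width character in roff, so the line is no longer read
    as a request while its visible output is unchanged.
    """
    return "\n".join(
        "\\&" + line if line.startswith(".") else line
        for line in code.split("\n")
    )
```

Only a period in the first column is affected; periods elsewhere in a line are left alone.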
2020-07-07 | [Docx Reader] Use null instead of isEmpty in Readers.Docx.Combine | Nikolay Yakimov | 1 | -9/+5
2020-07-07 | [Docx Reader] Remove unused LANGUAGE from Readers.Docx.Combine | Nikolay Yakimov | 1 | -2/+0
2020-07-07 | [Docx Reader] Remove no-op stack/unstackInlines in Readers.Docx.Combine | Nikolay Yakimov | 1 | -6/+4
2020-07-07 | [Docx Reader] Get rid of unused NullModifier in Readers.Docx.Combine | Nikolay Yakimov | 1 | -18/+15
2020-07-07 | [Docx Reader] Refactor/update smushInlines | Nikolay Yakimov | 1 | -44/+32
2020-07-02 | Revert "Ipnyb: allow lossless round-tripping of markdown cell content." | John MacFarlane | 2 | -7/+3
This reverts commit efbc2050315b60c8a753dee6255465f1083019ab.
2020-07-02 | Revert "Ipynb reader: fix duplication of 'source' attribute." | John MacFarlane | 1 | -1/+1
This reverts commit 2d009366cef2358ec2c99612ae2c73068841306c.
2020-07-02 | Ipynb reader: fix duplication of 'source' attribute. | John MacFarlane | 1 | -1/+1
See #5408.
2020-07-01 | HTML writer: improve alt-text/caption handling for HTML5 | Albert Krewinkel | 1 | -2/+10
Screen readers read an image's `alt` attribute and the figure caption, both of which come from the same source in pandoc. The figure caption is hidden from screen readers with the `aria-hidden` attribute. This improves accessibility. For HTML4, where `aria-hidden` is not allowed, pandoc still uses an empty `alt` attribute to avoid duplicate contents. Closes: #6491
2020-07-01 | Org reader: respect tables-excluding export setting | Albert Krewinkel | 3 | -2/+7
Tables can be removed from the final document with the `#+OPTIONS: |:nil` export setting.
2020-06-30 | Org reader: respect export setting disabling footnotes | Albert Krewinkel | 3 | -2/+7
Footnotes can be removed from the final document with the `#+OPTIONS: f:nil` export setting.
2020-06-30 | Ipnyb: allow lossless round-tripping of markdown cell content. | John MacFarlane | 2 | -3/+7
The reader now parses the contents of the markdown cell to a Pandoc structure, but *also* stores the raw markdown in a `source` attribute on the cell Div. When we convert back to markdown, this attribute is stripped off and the original source is used. When we convert to other formats, the attribute is usually ignored (though it will come through in HTML as a `data-source` attribute, not unhelpfully). I'll note some potential drawbacks of this approach:
- It makes it impossible to use pandoc to clean up or change the contents of markdown cells, e.g. going from `+smart` to `-smart`.
- There may be formats where the addition of the `source` attribute is problematic. I can't think of any, though.
Closes #5408.
2020-06-30 | Org reader: respect export setting which disables entities | Albert Krewinkel | 3 | -6/+16
MathML-like entities, e.g., `\alpha`, can be disabled with the `#+OPTIONS: e:nil` export setting.
2020-06-29 | Merge pull request #6328 from lierdakil/defaults-meta-parse | John MacFarlane | 3 | -57/+45
Unify defaults metadata and markdown metadata parsers
2020-06-29 | Org reader: keep unknown keyword lines as raw org | Albert Krewinkel | 2 | -2/+13
The lines of unknown keywords, like `#+SOMEWORD: value` are no longer read as metadata, but kept as raw `org` blocks. This ensures that more information is retained when round-tripping org-mode files; additionally, this change makes it possible to support non-standard org extensions via filters.