pandoc - Conversion between markup formats

Age	Commit message (Collapse)	Author	Files	Lines
2021-05-09	Org reader: Resolve org includes relative to ...	John MacFarlane	2	-2/+5
	...the directory containing the file containing the INCLUDE directive. Closes #5501.
2021-05-09	T.P.Parsing: improve include file functions.	John MacFarlane	2	-3/+3
	Remove old `insertIncludedFileF`. [API change] Give `insertIncludedFile` a more general type, allowing it to be used where `insertIncludedFileF` was.
2021-05-09	Change reader types, allowing better tracking of source positions.	John MacFarlane	2	-2/+3
	Previously, when multiple file arguments were provided, pandoc simply concatenated them and passed the contents to the readers, which took a Text argument. As a result, the readers had no way of knowing which file was the source of any particular bit of text. This meant that we couldn't report accurate source positions on errors or include accurate source positions as attributes in the AST. More seriously, it meant that we couldn't resolve resource paths relative to the files containing them (see e.g. #5501, #6632, #6384, #3752). Add Text.Pandoc.Sources (exported module), with a `Sources` type and a `ToSources` class. A `Sources` wraps a list of `(SourcePos, Text)` pairs. [API change] A parsec `Stream` instance is provided for `Sources`. The module also exports versions of parsec's `satisfy` and other Char parsers that track source positions accurately from a `Sources` stream (or any instance of the new `UpdateSourcePos` class). Text.Pandoc.Parsing now exports these modified Char parsers instead of the ones parsec provides. Modified parsers to use a `Sources` as stream [API change]. The readers that previously took a `Text` argument have been modified to take any instance of `ToSources`. So, they may still be used with a `Text`, but they can also be used with a `Sources` object. In Text.Pandoc.Error, modified the constructor PandocParsecError to take a `Sources` rather than a `Text` as first argument, so parse error locations can be accurately reported. T.P.Error: showPos, do not print "-" as source name.
2021-03-20	Text.Pandoc.Parsing: remove F type synonym.	John MacFarlane	1	-2/+2
	Muse and Org were defining their own F anyway, with their own state. We therefore move this definition to the Markdown reader.
2021-03-19	Use NonEmpty instead of minimumDef.	John MacFarlane	1	-2/+2

2021-03-18	Use minimumDef instead of minimum (partial function).	John MacFarlane	1	-2/+2

2021-02-18	Org reader: fix bug in org-ref citation parsing.	Albert Krewinkel	1	-1/+1
	The org-ref syntax allows to list multiple citations separated by comma. This fixes a bug that accepted commas as part of the citation id, so all citation lists were parsed as one single citation. Fixes: #7101
2021-02-13	Org: support task_lists extension	Albert Krewinkel	1	-2/+39
	The tasks lists extension is now supported by the org reader and writer; the extension is turned on by default. Closes: #6336
2021-01-09	Org reader: allow multiple pipe chars in todo sequences	Albert Krewinkel	1	-4/+10
	Additional pipe chars, used to separate "action" state from "no further action" states, are ignored. E.g., for the following sequence, both `DONE` and `FINISHED` are states with no further action required. #+TODO: UNFINISHED \| DONE \| FINISHED Previously, parsing of the todo sequence failed if multiple pipe chars were included. Closes: #7014
2021-01-08	Update copyright notices for 2021 (#7012)	Albert Krewinkel	9	-9/+9

2021-01-03	Org reader: mark verbatim code with class "verbatim". (#6998)	Dimitri Sabadie	1	-1/+1
	* Replace org-mode’s verbatim from code to codeWith. This adds the `"verbatim"` class so that exporters can apply a specific style on it. For instance, it will be possible for HTML to add a CSS rule for code + verbatim class. * Alter test for org-mode’s verbatim change. See previous commit for further detail on the new implementation.
2021-01-01	Org reader: restructure output of captioned code blocks	Albert Krewinkel	1	-14/+12
	The Div wrapper of code blocks with captions now has the class "captioned-content". The caption itself is added as a Plain block inside a Div of class "caption". This makes it easier to write filters which match on captioned code blocks. Existing filters will need to be updated. Closes: #6977
2020-12-05	Org reader: preserve targets of spurious links	Albert Krewinkel	1	-5/+4
	Links with (internal) targets that the reader doesn't know about are converted into emphasized text. Information on the link target is now preserved by wrapping the text in a Span of class `spurious-link`, with an attribute `target` set to the link's original target. This allows to recover and fix broken or unknown links with filters. See: #6916
2020-11-22	Org reader: parse `#+LANGUAGE` into `lang` metadata field	Albert Krewinkel	1	-0/+2
	Fixes: #6845
2020-11-18	Replace org #+KEYWORDS with #+keywords	TEC	1	-11/+11
	As of ~2 years ago, lower case keywords became the standard (though they are handled case insensitive, as always): https://code.orgmode.org/bzg/org-mode/commit/13424336a6f30c50952d291e7a82906c1210daf0 Upper case keywords are exclusive to the manual: - https://orgmode.org/list/871s50zn6p.fsf@nicolasgoaziou.fr/ - https://orgmode.org/list/87tuuw3n15.fsf@nicolasgoaziou.fr/
2020-10-14	Fix typos in comments, doc strings, error messages, and tests	Albert Krewinkel	1	-1/+1
	Typos reported by https://fossies.org/linux/test/pandoc-master.tar.gz/codespell.html See: #6738
2020-09-25	Org reader: fix HLint warnings	Albert Krewinkel	1	-2/+2

2020-09-13	Fix hlint suggestions, update hlint.yaml (#6680)	Christian Despres	3	-16/+13
	* Fix hlint suggestions, update hlint.yaml Most suggestions were redundant brackets. Some required LambdaCase. The .hlint.yaml file had a small typo, and didn't ignore camelCase suggestions in certain modules.
2020-07-01	Org reader: respect tables-excluding export setting	Albert Krewinkel	3	-2/+7
	Tables can be removed from the final document with the `#+OPTION: \|:nil` export setting.
2020-06-30	Org reader: respect export setting disabling footnotes	Albert Krewinkel	3	-2/+7
	Footnotes can be removed from the final document with the `#+OPTION: f:nil` export setting.
2020-06-30	Org reader: respect export setting which disables entities	Albert Krewinkel	3	-6/+16
	MathML-like entities, e.g., `\alpha`, can be disabled with the `#+OPTION: e:nil` export setting.
2020-06-29	Org reader: keep unknown keyword lines as raw org	Albert Krewinkel	2	-2/+13
	The lines of unknown keywords, like `#+SOMEWORD: value` are no longer read as metadata, but kept as raw `org` blocks. This ensures that more information is retained when round-tripping org-mode files; additionally, this change makes it possible to support non-standard org extensions via filters.
2020-06-29	Org reader: unify keyword handling	Albert Krewinkel	1	-75/+67
	Handling of export settings and other keywords (like `#+LINK`) has been combined and unified.
2020-06-29	Org reader: support LATEX_HEADER_EXTRA and HTML_HEAD_EXTRA settings	Albert Krewinkel	1	-5/+9
	These export settings are treated like their non-extra counterparts, i.e., the values are added to the `header-includes` metadata list.
2020-06-29	Org reader: allow multiple #+SUBTITLE export settings	Albert Krewinkel	1	-0/+1
	The values of all lines are read as inlines and collected in the `subtitle` metadata field.
2020-06-28	Org reader: read `#+INSTITUTE` values as text with markup	Albert Krewinkel	1	-7/+13
	The value is stored in the `institute` metadata field and used in the default beamer presentation template.
2020-06-28	Org reader: update behavior of author, keywords export settings	Albert Krewinkel	1	-19/+9
	The behavior of the `#+AUTHOR` and `#+KEYWORD` export settings has changed: Org now allows multiple such lines and adds a space between the contents of each line. Pandoc now always parses these settings as meta inlines; setting values are no longer treated as comma-separated lists. Note that a Lua filter can be used to restore the previous behavior.
2020-06-28	Org reader: refactor export setting handling	Albert Krewinkel	1	-79/+67

2020-06-27	Org reader: read description lines as inlines	Albert Krewinkel	1	-10/+46
	`#+DESCRIPTION` lines are treated as text with markup. If multiple such lines are given, then all lines are read and separated by soft linebreaks. Closes: #6485
2020-06-25	Org reader: honor tex export option	Albert Krewinkel	4	-30/+75
	The `tex` export option can be set with `#+OPTION: tex:nil` and allows three settings: - `t` causes LaTeX fragments to be parsed as TeX or added as raw TeX, - `nil` removes all LaTeX fragments from the document, and - `verbatim` treats LaTeX as text. The default is `t`. Closes: #4070
2020-06-20	Recognize images with uppercase extensions	Albert Krewinkel	1	-1/+2
	Fixes: #6472
2020-04-28	Support new Underline element in readers and writers (#6277)	Vaibhav Sagar	1	-2/+1
	Deprecate `underlineSpan` in Shared in favor of `Text.Pandoc.Builder.underline`.
2020-04-15	Use the new builders, modify readers to preserve empty headers	despresc	1	-1/+7
	The Builder.simpleTable now only adds a row to the TableHead when the given header row is not null. This uncovered an inconsistency in the readers: some would unconditionally emit a header filled with empty cells, even if the header was not present. Now every reader has the conditional behaviour. Only the XWiki writer depended on the header row being always present; it now pads its head as necessary.
2020-04-15	Adapt to the newest Table type, fix some previous adaptation issues	despresc	1	-2/+2
	- Writers.Native is now adapted to the new Table type. - Inline captions should now be conditionally wrapped in a Plain, not a Para block. - The toLegacyTable function now lives in Writers.Shared.
2020-04-15	Implement the new Table type	despresc	1	-7/+7

2020-03-22	Finer grained imports of Text.Pandoc.Class submodules (#6203)	Albert Krewinkel	5	-5/+5
	This should speed-up recompilation after changes in `Text.Pandoc.Class`, as the number of modules affected by a change will be smaller in general. It also offers faster insights into the parts of `T.P.Class` used within a module.
2020-03-15	Use implicit Prelude (#6187)	Albert Krewinkel	9	-18/+0
	* Use implicit Prelude The previous behavior was introduced as a fix for #4464. It seems that this change alone did not fix the issue, and `stack ghci` and `cabal repl` only work with GHC 8.4.1 or newer, as no custom Prelude is loaded for these versions. Given this, it seems cleaner to revert to the implicit Prelude. * PandocMonad: remove outdated check for base version Only base versions 4.9 and later are supported, the check for `MIN_VERSION_base(4,8,0)` is therefore unnecessary. * Always use custom prelude Previously, the custom prelude was used only with older GHC versions, as a workaround for problems with ghci. The ghci problems are resolved by replacing package `base` with `base-noprelude`, allowing for consistent use of the custom prelude across all GHC versions.
2020-03-13	Update copyright year (#6186)	Albert Krewinkel	9	-9/+9
	* Update copyright year * Copyright: add notes for Lua and Jira modules
2020-02-08	Org reader: simplify parsing of sub- and superscripts	Albert Krewinkel	1	-11/+11
	Speeds up parsing of single-word, markup-less sub- and superscripts. Fixes: #6127
2020-02-07	Resolve HLint warnings	Albert Krewinkel	1	-1/+1
	All warnings are either fixed or, if more appropriate, HLint is configured to ignore them. HLint suggestions remain. * Ignore "Use camelCase" warnings in Lua and legacy code * Fix or ignore remaining HLint warnings * Remove redundant brackets * Remove redundant `return`s * Remove redundant as-pattern * Fuse mapM_/map * Use `.` to shorten code * Remove redundant `fmap` * Remove unused LANGUAGE pragmas * Hoist `not` in Text.Pandoc.App * Use fewer imports for `Text.DocTemplates` * Remove redundant `do`s * Remove redundant `$`s * Jira reader: remove unnecessary parentheses
2020-02-06	Use traverse instead of reimplementing it (#6124)	Joseph C. Sible	1	-3/+1

2019-12-19	Org reader: fix parsing problem for colons in headline	Albert Krewinkel	2	-11/+27
	Fixed a problem where words surrounded by colons could causing parse failures in some cases when they occurred in headers. Fixes: #5993
2019-12-18	Org reader: wrap named table in div, using name as id	Albert Krewinkel	1	-12/+10
	Closes: #5984
2019-12-13	Org reader: add table labels to caption if both are present	Albert Krewinkel	1	-3/+11
	The table `#+NAME:` or `#+LABEL:` is added to the table's caption in the form of an empty span with the label set as the span's ID. Closes: #5984
2019-11-20	Fix typos (#5919)	Brian Wignall	1	-1/+1

2019-11-12	Switch to new pandoc-types and use Text instead of String [API change].	despresc	9	-320/+338
	PR #5884. + Use pandoc-types 1.20 and texmath 0.12. + Text is now used instead of String, with a few exceptions. + In the MediaBag module, some of the types using Strings were switched to use FilePath instead (not Text). + In the Parsing module, new parsers `manyChar`, `many1Char`, `manyTillChar`, `many1TillChar`, `many1Till`, `manyUntil`, `mantyUntilChar` have been added: these are like their unsuffixed counterparts but pack some or all of their output. + `glob` in Text.Pandoc.Class still takes String since it seems to be intended as an interface to Glob, which uses strings. It seems to be used only once in the package, in the EPUB writer, so that is not hard to change.
2019-10-27	Org reader: fix parsing of empty comment lines	Albert Krewinkel	1	-1/+3
	Comment lines in Org-mode can be completely empty; both of these line should produce no output: # a comment # The reader used to produce a wrong result for the latter, but ignores that line as well now. Fixes: #5856
2019-09-08	Org reader: modify handling of example blocks. (#5717)	Brian Leung	2	-14/+43
	* Org reader: allow the `-i` switch to ignore leading spaces. * Org reader: handle awkwardly-aligned code blocks within lists. Code blocks in Org lists must have their #+BEGIN_ aligned in a reasonable way, but their other components can be positioned otherwise.
2019-08-26	Fix inline parsing in grid table cells.	John MacFarlane	1	-1/+1
	* T.P.Parsing: Change type of `setLastStrPos` so it takes a `Maybe SourcePos` rather than a `SourcePos`. [API change] * T.P.Parsing: Make `parseFromString'` and `gridTableWith` and `gridTableWith'` polymorphic in the parser state, constraining it with `HasLastStrPosition`. [API change] Closes #5708.
2019-07-22	Org reader: accept ATTR_LATEX in block attributes	Albert Krewinkel	1	-3/+11
	Attributes for LaTeX output are accepted as valid block attributes; however, their values are ignored. Fixes: #5648