pandoc - Conversion between markup formats

Age	Commit message	Author	Files	Lines
2021-05-09	Change reader types, allowing better tracking of source positions.	John MacFarlane	1	-45/+52
	Previously, when multiple file arguments were provided, pandoc simply concatenated them and passed the contents to the readers, which took a Text argument. As a result, the readers had no way of knowing which file was the source of any particular bit of text. This meant that we couldn't report accurate source positions on errors or include accurate source positions as attributes in the AST. More seriously, it meant that we couldn't resolve resource paths relative to the files containing them (see e.g. #5501, #6632, #6384, #3752). Add Text.Pandoc.Sources (exported module), with a `Sources` type and a `ToSources` class. A `Sources` wraps a list of `(SourcePos, Text)` pairs. [API change] A parsec `Stream` instance is provided for `Sources`. The module also exports versions of parsec's `satisfy` and other Char parsers that track source positions accurately from a `Sources` stream (or any instance of the new `UpdateSourcePos` class). Text.Pandoc.Parsing now exports these modified Char parsers instead of the ones parsec provides. Modified parsers to use a `Sources` as stream [API change]. The readers that previously took a `Text` argument have been modified to take any instance of `ToSources`. So, they may still be used with a `Text`, but they can also be used with a `Sources` object. In Text.Pandoc.Error, modified the constructor PandocParsecError to take a `Sources` rather than a `Text` as first argument, so parse error locations can be accurately reported. T.P.Error: showPos, do not print "-" as source name.
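	The reader change described above can be sketched as follows. This is a minimal sketch, assuming a pandoc library version that includes this commit (2.14 or later) and assuming the `ToSources` instance for `[(FilePath, Text)]` provided by Text.Pandoc.Sources. The helper name `parseFiles` and the file names are hypothetical.

```haskell
{-# LANGUAGE OverloadedStrings #-}
import Data.Text (Text)
import Text.Pandoc (Pandoc, PandocError, def, readMarkdown, runPure)

-- Hypothetical helper: parse several named inputs as one document.
-- Because each chunk of text is paired with its file name, the reader
-- can attribute source positions to the file the text came from.
parseFiles :: [(FilePath, Text)] -> Either PandocError Pandoc
parseFiles inputs = runPure (readMarkdown def inputs)

main :: IO ()
main = print (parseFiles [("a.md", "# One\n"), ("b.md", "Two\n")])
```

	Passing a plain `Text` value to `readMarkdown` continues to work, since `Text` is also an instance of `ToSources`.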
2021-04-19	Issue error message when reader or writer format is malformed.	John MacFarlane	1	-3/+3
	Previously we exited with an error status but (due to a bug) no message. Closes #7231.
2021-01-08	Update copyright notices for 2021 (#7012)	Albert Krewinkel	1	-1/+1

2020-09-21	Add built-in citation support using new citeproc library.	John MacFarlane	1	-0/+8
	This deprecates the use of the external pandoc-citeproc filter; citation processing is now built in to pandoc.
	* Add dependency on citeproc library.
	* Add Text.Pandoc.Citeproc module (and some associated unexported modules under Text.Pandoc.Citeproc). Exports `processCitations`. [API change]
	* Add data files needed for Text.Pandoc.Citeproc: default.csl in the data directory, and a citeproc directory that is just used at compile-time. Note that we've added file-embed as a mandatory rather than a conditional dependency, because of the biblatex localization files. We might eventually want to use readDataFile for this, but it would take some code reorganization.
	* Text.Pandoc.Logging: Add `CiteprocWarning` to `LogMessage` and use it in `processCitations`. [API change]
	* Add tests from the pandoc-citeproc package as command tests (including some tests pandoc-citeproc did not pass).
	* Remove instructions for building pandoc-citeproc from CI and release binary build instructions. We will no longer distribute pandoc-citeproc.
	* Markdown reader: tweak abbreviation support. Don't insert a nonbreaking space after a potential abbreviation if it comes right before a note or citation. Inserting the space there messes up several things, including citeproc's moving of note citations.
	* Add `csljson` as an input and output format. This allows pandoc to convert between `csljson` and other bibliography formats, and to generate formatted versions of CSL JSON bibliographies.
	* Add module Text.Pandoc.Writers.CslJson, exporting `writeCslJson`. [API change]
	* Add module Text.Pandoc.Readers.CslJson, exporting `readCslJson`. [API change]
	* Added `bibtex`, `biblatex` as input formats. This allows pandoc to convert between BibLaTeX and BibTeX and other bibliography formats, and to generate formatted versions of BibTeX/BibLaTeX bibliographies.
	* Add module Text.Pandoc.Readers.BibTeX, exporting `readBibTeX` and `readBibLaTeX`. [API change]
	* Make "standalone" implicit if output format is a bibliography format. This is needed because pandoc readers for bibliography formats put the bibliographic information in the `references` field of metadata; and unless standalone is specified, metadata gets ignored. (TODO: This needs improvement. We should trigger standalone for the reader when the input format is bibliographic, and for the writer when the output format is markdown.)
	* Carry over `citationNoteNum` to `citationNoteNumber`. This was just ignored in pandoc-citeproc.
	* Text.Pandoc.Filter: Add `CiteprocFilter` constructor to Filter. [API change] This runs the processCitations transformation. We need to treat it like a filter so it can be placed in the sequence of filter runs (after some, before others). In FromYAML, this is parsed from `citeproc` or `{type: citeproc}`, so this special filter may be specified either way in a defaults file (or by `citeproc: true`, though this gives no control of positioning relative to other filters). TODO: we need to add something to the manual section on defaults files for this.
	* Add deprecation warning if `pandoc-citeproc` filter is used.
	* Add `--citeproc/-C` option to trigger citation processing. This behaves like a filter and will be positioned relative to filters as they appear on the command line.
	* Rewrote the manual on citations, adding a dedicated Citations section which also includes some information formerly found in the pandoc-citeproc man page.
	* Look for CSL styles in the `csl` subdirectory of the pandoc user data directory. This changes the old pandoc-citeproc behavior, which looked in `~/.csl`. Users can simply symlink `~/.csl` to the `csl` subdirectory of their pandoc user data directory if they want the old behavior.
	* Add support for CSL bibliography entry formatting to LaTeX, HTML, Ms writers. Added CSL-related CSS to styles.html.
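	The bibliography conversion mentioned in this commit can be sketched with the library functions it names. This is a minimal sketch, assuming `readBibTeX` and `writeCslJson` are re-exported from the top-level Text.Pandoc module in the installed version; the BibTeX entry is made up purely for illustration.

```haskell
{-# LANGUAGE OverloadedStrings #-}
import Data.Text (Text)
import qualified Data.Text.IO as TIO
import Text.Pandoc (def, readBibTeX, runIOorExplode, writeCslJson)

-- Made-up BibTeX entry, used only as sample input.
bib :: Text
bib = "@book{doe2020, author = {Doe, Jane}, title = {An Example}, year = {2020}}"

main :: IO ()
main = do
  json <- runIOorExplode $ do
    -- The reader stores the entries in the `references` metadata field.
    doc <- readBibTeX def bib
    -- The CSL JSON writer serializes that metadata field.
    writeCslJson def doc
  TIO.putStrLn json
```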
2020-07-19	Add commonmark_x output format...	John MacFarlane	1	-0/+1
	commonmark with a number of useful extensions (more than gfm).
2020-03-15	Use implicit Prelude (#6187)	Albert Krewinkel	1	-2/+0
	* Use implicit Prelude. The previous behavior was introduced as a fix for #4464. It seems that this change alone did not fix the issue, and `stack ghci` and `cabal repl` only work with GHC 8.4.1 or newer, as no custom Prelude is loaded for these versions. Given this, it seems cleaner to revert to the implicit Prelude.
	* PandocMonad: remove outdated check for base version. Only base versions 4.9 and later are supported; the check for `MIN_VERSION_base(4,8,0)` is therefore unnecessary.
	* Always use custom prelude. Previously, the custom prelude was used only with older GHC versions, as a workaround for problems with ghci. The ghci problems are resolved by replacing package `base` with `base-noprelude`, allowing for consistent use of the custom prelude across all GHC versions.
2020-03-13	Update copyright year (#6186)	Albert Krewinkel	1	-1/+1
	* Update copyright year
	* Copyright: add notes for Lua and Jira modules
2020-01-31	Add Text.Pandoc.Readers.CSV (readCSV).	John MacFarlane	1	-0/+3
	This adds csv as an input format. The CSV table is converted into a pandoc simple table. Closes #6100.
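	A short usage sketch of the new reader, assuming `readCSV` is re-exported from the top-level Text.Pandoc module in the installed version. The CSV content is invented sample data, and the `Text` annotation is there so the example also type-checks on later versions where readers accept any `ToSources` instance.

```haskell
{-# LANGUAGE OverloadedStrings #-}
import Data.Text (Text)
import qualified Data.Text.IO as TIO
import Text.Pandoc (def, readCSV, runIOorExplode, writeNative)

-- Invented sample data; the first row is treated as the table header.
csv :: Text
csv = "name,qty\napple,3\npear,5\n"

main :: IO ()
main = do
  out <- runIOorExplode $ do
    doc <- readCSV def csv   -- CSV text becomes a pandoc table
    writeNative def doc      -- show the resulting AST
  TIO.putStrLn out
```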
2019-12-17	Add jira reader (#5913)	Albert Krewinkel	1	-0/+3
	Closes #5556
2019-12-09	More informative JSON parse error.	John MacFarlane	1	-1/+1
	Closes #5973.
2019-11-12	Switch to new pandoc-types and use Text instead of String [API change].	despresc	1	-5/+6
	PR #5884.
	+ Use pandoc-types 1.20 and texmath 0.12.
	+ Text is now used instead of String, with a few exceptions.
	+ In the MediaBag module, some of the types using Strings were switched to use FilePath instead (not Text).
	+ In the Parsing module, new parsers `manyChar`, `many1Char`, `manyTillChar`, `many1TillChar`, `many1Till`, `manyUntil`, `manyUntilChar` have been added: these are like their unsuffixed counterparts but pack some or all of their output.
	+ `glob` in Text.Pandoc.Class still takes String since it seems to be intended as an interface to Glob, which uses strings. It seems to be used only once in the package, in the EPUB writer, so that is not hard to change.
2019-09-29	Raise error on unsupported extensions. Closes #4338.	John MacFarlane	1	-6/+20
	+ An error is now raised if you try to specify (enable or disable) an extension that does not affect the given format, e.g. `docx+pipe_tables`.
	+ The `--list-extensions[=FORMAT]` option now lists only extensions that affect the given FORMAT.
	+ Text.Pandoc.Error: Add constructors `PandocUnknownReaderError`, `PandocUnknownWriterError`, `PandocUnsupportedExtensionError`. [API change]
	+ Text.Pandoc.Extensions now exports `getAllExtensions`, which returns the extensions that affect a given format (whether enabled by default or not). [API change]
	+ Text.Pandoc.Extensions: change type of `parseFormatSpec` from `Either ParseError (String, Extensions -> Extensions)` to `Either ParseError (String, [Extension], [Extension])` [API change].
	+ Text.Pandoc.Readers: change type of `getReader` so it returns a value in the PandocMonad instance rather than an Either [API change]. Exceptions for unknown formats and unsupported extensions are now raised by this function and need not be handled by the calling function.
	+ Text.Pandoc.Writers: change type of `getWriter` so it returns a value in the PandocMonad instance rather than an Either [API change]. Exceptions for unknown formats and unsupported extensions are now raised by this function and need not be handled by the calling function.
2019-03-01	Remove license boilerplate.	John MacFarlane	1	-19/+0
	The haddock module header contains essentially the same information, so the boilerplate is redundant and just one more thing to get out of sync.
2019-02-04	Add missing copyright notices and remove license boilerplate (#5112)	Albert Krewinkel	1	-2/+2
	Quite a few modules were missing copyright notices. This commit adds copyright notices everywhere via haddock module headers. The old license boilerplate comment is redundant with this and has been removed. Update copyright years to 2019. Closes #4592.
2019-01-22	Support ipynb (Jupyter notebook) as input and output format.	John MacFarlane	1	-0/+3
	[API change]
	* Depend on ipynb library.
	* Add `ipynb` as input and output format.
	* Added Text.Pandoc.Readers.Ipynb (supports both nbformat v3 and v4).
	* Added Text.Pandoc.Writers.Ipynb (supports nbformat v4).
	* Added ipynb readers and writers to T.P.Readers, T.P.Writers, and T.P.Extensions. Register the file extension .ipynb for this format.
	* Add `PandocIpynbDecodingError` constructor to Text.Pandoc.Error.Error.
	* Note: there is no template for ipynb.
2019-01-06	Add DokuWiki reader (#5108)	Alexander	1	-0/+3
	Closes #1792
2018-11-29	Changed types of writeJSON and readJSON.	John MacFarlane	1	-8/+7
	Previously they were not monadic; we now have them run in an instance of the Pandoc monad, like the other readers and writers. [API change]
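	With the monadic types, `readJSON` and `writeJSON` compose with the other readers and writers in a single `PandocMonad` computation. A minimal sketch, assuming a pandoc version that includes this change; the helper name `roundTrip` is hypothetical.

```haskell
{-# LANGUAGE OverloadedStrings #-}
import Data.Text (Text)
import Text.Pandoc
  (PandocError, def, readJSON, readMarkdown, runPure, writeJSON)

-- Hypothetical helper: parse Markdown, serialize the AST to JSON,
-- parse the JSON back, and report whether the two documents are equal.
roundTrip :: Text -> Either PandocError Bool
roundTrip src = runPure $ do
  doc  <- readMarkdown def src
  json <- writeJSON def doc
  doc' <- readJSON def json
  return (doc == doc')

main :: IO ()
main = print (roundTrip "Hello *world*.\n")
```

	Any failure in one of the three steps surfaces as a `Left` carrying a `PandocError`, so the caller handles errors in one place.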
2018-05-09	Merge branch 'master' into groff_reader	Yan Pashkovsky	1	-2/+7

2018-04-26	Add FB2 reader (#4539)	Alexander	1	-0/+3

2018-03-29	Removed unused extensions (#4506)	Anabra	1	-3/+3

2018-03-18	Use NoImplicitPrelude and explicitly import Prelude.	John MacFarlane	1	-0/+2
	This seems to be necessary if we are to use our custom Prelude with ghci. Closes #4464.
2018-02-25	initial	Yan Pas	1	-0/+2

2018-01-05	Update copyright notices to include 2018	Albert Krewinkel	1	-2/+2

2017-12-20	Add Basic JATS reader based on DocBook reader	Hamish Mackenzie	1	-1/+4

2017-10-27	Automatic reformatting by stylish-haskell.	John MacFarlane	1	-5/+5

2017-10-26	Creole reader (#3994)	Sascha Wilde	1	-0/+3
	This is feature complete but not very thoroughly tested yet.
2017-08-07	Remove GFM modules; use CMarkGFM for both gfm and commonmark.	John MacFarlane	1	-3/+1
	We no longer have a separate readGFM and writeGFM; instead, we'll use readCommonMark and writeCommonMark with githubExtensions. It remains to implement these extensions conditionally. Closes #3841.
2017-08-07	Added gfm (GitHub-flavored CommonMark) as an input and output format.	John MacFarlane	1	-0/+3
	This uses bindings to GitHub's fork of cmark, so it should parse gfm exactly as GitHub does (excepting certain postprocessing steps, involving notifications, emojis, etc.).
	* Added Text.Pandoc.Readers.GFM (exporting readGFM)
	* Added Text.Pandoc.Writers.GFM (exporting writeGFM)
	* Added `gfm` as input and output format.
	Note that tables are currently always rendered as HTML in the writer; this can be improved when CMarkGFM supports tables in output.
2017-07-21	Added TikiWiki reader (#3800)	rlpowell	1	-0/+3
	Added TikiWiki reader, including tests and documentation. It's probably not complete, but it works pretty well, handles all the basics (and some not-so-basics).
2017-06-24	Readers.getReader, Writers.getWriter API change.	John MacFarlane	1	-8/+4
	Now these functions return a pair of a reader/writer and an Extensions, instead of building the extensions into the reader/writer. The calling code must explicitly set readerExtensions or writerExtensions using the Extensions returned. The point of the change is to make it possible for the calling code to determine what extensions are being used. See #3659.
2017-06-19	Added Vimwiki reader (#3705).	Yuchen Pei	1	-0/+3
	* New module Text.Pandoc.Readers.Vimwiki, exporting readVimwiki [API change].
	* New input format `vimwiki`.
	* New data file, `data/vimwiki.css`, for displaying the HTML produced by this reader and pandoc's HTML writer in the style of vimwiki's own HTML export.
2017-06-19	Add Muse reader (#3620)	Alexander Krotov	1	-0/+3

2017-06-10	Changed all readers to take Text instead of String.	John MacFarlane	1	-1/+0
	Readers: Renamed StringReader -> TextReader. Updated tests. API change.
2017-06-10	Readers: Changed StringReader -> TextReader.	John MacFarlane	1	-23/+26

2017-04-26	API change: move reader functions to Text.Pandoc.Readers	Albert Krewinkel	1	-0/+146
	Reader helper functions were defined in the top-level Text.Pandoc module. These functions are moved to the Readers submodule to enable reuse in other submodules.