pandoc - Conversion between markup formats

Age	Commit message (Collapse)	Author	Files	Lines
2016-05-11	Org reader: move parser state into separate module	Albert Krewinkel	2	-158/+232
	The org reader code has become large and confusing. Extracting smaller parts into submodules should help to clean things up.
2016-05-11	Retake on strut with \minipage inside tables	Jose Luis Duran	1	-2/+2
	Reimplement on 4c684561ee0665b014e887ae559b7020e4e9f2d3 The problem with 4c68456 was a space between the cell contents and the `\strut` that affected the alignment.
2016-05-09	Avoid lazy foldl in LaTeX writer.	John MacFarlane	1	-4/+5

2016-05-09	Merge pull request #2907 from tarleb/org-fixes	John MacFarlane	2	-5/+10
	Org fixes (reader and writer)
2016-05-09	Org writer: print empty table rows	Albert Krewinkel	1	-1/+1
	Empty table rows should not be dropped from the output, so row-height is always set to be at least 1.
2016-05-09	Org reader: fix inline-LaTeX regression	Albert Krewinkel	1	-4/+9
	The last fix for whitespace handling of inline LaTeX commands was incorrect, preventing correct recognition of inline LaTeX commands which contain spaces. This fix ensures that only trailing whitespace is cut off.
2016-05-09	Allow spaces before '!' in MediaWiki table header	roblabla	1	-1/+1

2016-05-05	Merge pull request #2898 from tarleb/org-table-refactoring	John MacFarlane	1	-59/+59
	Org reader: table parsing code refactoring and fixes
2016-05-04	Org reader: fix spacing after LaTeX-style symbols	Albert Krewinkel	1	-5/+7
	The org-reader was droping space after unescaped LaTeX-style symbol commands: `\ForAll \Auml` resulted in `∀Ä` but should give `∀ Ä` instead. This seems to be because the LaTeX-reader treats the command-terminating space as part of the command. Dropping the trailing space from the symbol-command fixes this issue.
2016-05-04	Org reader: fix handling of empty table cells, rows	Albert Krewinkel	1	-13/+17
	This fixes Org mode parsing of some corner cases regarding empty cells and rows. Empty cells weren't parsed correctly, e.g. `\|\|\|` should be two empty cells, but would be parsed as a single cell containing a pipe character. Empty rows where parsed as alignment rows and dropped from the output. This fixes #2616.
2016-05-04	Org reader: refactor rows-to-table conversion	Albert Krewinkel	1	-25/+25
	This refactores the codes conversing a list table lines to an org table ADT. The old code was simplified and is now slightly less ugly.
2016-05-04	Org reader: stop padding short table rows	Albert Krewinkel	1	-24/+20
	Emacs Org-mode doesn't add any padding to table rows. The first row (header or first body row) is used to determine the column count, no other magic is performed. The org reader was padding rows to the length of the longest table row. This was done due to a misunderstanding of how Org handles tables. This feature reflected how Org-mode handles tables when pressing <TAB>. The Org exporter however, which is what the reader should implement, doesn't do any of this. So this was a mis-feature that made the reader more complex and reduced comparability. It was hence removed.
2016-05-01	Merge pull request #2890 from bcdevices/docbook5-writer	John MacFarlane	2	-6/+17
	Docbook5 write support
2016-05-01	Add class option for code block in RST reader	Sidharth Kapur	1	-6/+8
	According to http://docutils.sourceforge.net/docs/ref/rst/directives.html#code, the code directive supports the ":class:" option.
2016-05-01	Binary fmts throw PandocError on zip-archive fail	Jesse Rosenthal	2	-3/+7
	Commit 91dc3342 made `readDocx` throw PandocError if there was an unarchiving error. This extends that fix to `readOdt` and `readEPUB`.
2016-05-01	LaTeX writer: use {} around options containing special chars.	John MacFarlane	1	-4/+9
	Closes #2892.
2016-05-01	Docx Reader: Throw PandocError on unzip failure	Jesse Rosenthal	1	-4/+5
	Previously, readDocx would error out if zip-archive failed. We change the archive extraction step from `toArchive` to `toArchiveOrFail`, which returns an Either value.
2016-04-29	Docbook5 writer: Properly handle ulink/link	Ivo Clarysse	1	-1/+3

2016-04-29	Write out Docbook 5 namespace	Ivo Clarysse	2	-6/+9

2016-04-29	HTML writer: ensure mathjax link is added when math appears in footnote.	John MacFarlane	1	-3/+2
	Previously if a document only had math in a footnote, the MathJax link would not be added. Closes #2881.
2016-04-29	Add docbook5 writer support	Ivo Clarysse	2	-3/+9

2016-04-27	Revert "LaTeX writer: Add `\strut` to fix multiline tables"	John MacFarlane	1	-2/+1
	This reverts commit 4c684561ee0665b014e887ae559b7020e4e9f2d3. See https://groups.google.com/d/msg/pandoc-discuss/u6J-_aCProU/UufN3IYRAgAJ This should fix uneven spacing issues in multiline tables.
2016-04-26	Merge pull request #2735 from mb21/patch-1	John MacFarlane	1	-1/+1
	LaTeX Writer: fix polyglossia to babel env mapping
2016-04-26	Merge pull request #2829 from adunning/patch-1	John MacFarlane	1	-7/+17
	LaTeX writer: Add missing languages.
2016-04-26	Merge pull request #2876 from shosti/org-code-indent	John MacFarlane	1	-4/+20
	Ignore leading space in org code blocks
2016-04-26	LaTeX writer: ignore --incremental unless -t beamer.	John MacFarlane	1	-1/+2
	Closes #2843.
2016-04-26	Ignore leading space in org code blocks	Emanuel Evans	1	-4/+20
	Fixes #2862 Also fix up tab handling for leading whitespace in code blocks.
2016-04-15	Docx Reader: parse `moveTo` and `moveFrom`	Jesse Rosenthal	1	-2/+2
	`moveTo` and `moveFrom` are track-changes tags that are used when a block of text is moved in the document. We now recognize these tags and treat them the same as `insert` and `delete`, respectively. So, `--track-changes=accept` will show the moved version, while `--track-changes=reject` will show the original version.
2016-04-10	Markdown reader: Fix pandoc title blocks with lines ending in 2 spaces.	John MacFarlane	1	-19/+23
	Closes #2799. Also added -s to markdown-reader-more test.
2016-04-10	Markdown + HTML readers: be more forgiving about unescaped &.	John MacFarlane	1	-10/+15
	We are now more forgiving about parsing invalid HTML with unescaped `&` as raw HTML. (Previously any unescaped `&` would cause pandoc not to recognize the string as raw HTML.) Closes #2410.
2016-04-01	LaTeX writer: Add missing languages.	Andrew Dunning	1	-7/+17
	Updates the list from the hyphenation files at <http://mirror.ctan.org/language/hyph-utf8/tex/generic/hyph-utf8/loadhyph/>.
2016-03-30	Recognize `la-x-classic` as Classical Latin.	Andrew Dunning	1	-0/+2
	This allows one to access the hyphenation patterns at <http://mirrors.ctan.org/language/hyph-utf8/tex/generic/hyph-utf8/patterns/tex/hyph-la-x-classic.tex>, using its private language tag.
2016-03-26	EPUB writer: set 'navpage' variable on nav page.	John MacFarlane	1	-1/+2
	This allows templates to treat it differently.
2016-03-25	Removed two superfluous lines.	John MacFarlane	1	-2/+0

2016-03-24	LaTeX writer: better positioning for hypertarget in figures.	John MacFarlane	1	-16/+23
	Closes #2813.
2016-03-24	LaTeX writer: Fixed position of label in figures.	John MacFarlane	1	-3/+3
	Partially addresses #2813. This isn't perfect, because now the hypertarget is in the wrong place -- when you link to the figure, the screen is positioned with the caption at the top, and most of the figure off screen. So this needs a bit more tweaking.
2016-03-22	Updated copyright dates to include 2016.	John MacFarlane	17	-34/+34

2016-03-22	Fixed bug in Markdown raw HTML parsing.	John MacFarlane	1	-1/+1
	This was a regression, with the rewrite of `htmlInBalanced` (from `Text.Pandoc.Readers.HTML`) in 1.17. It caused newlines to be omitted in raw HTML blocks. Closes #2804.
2016-03-20	LaTeX Writer: fix polyglossia to babel env mapping	Mauro Bieg	1	-1/+1
	allow for optional argument in square brackets, closes #2728
2016-03-19	Merge pull request #2637 from mb21/latex-figure-label	John MacFarlane	1	-19/+24
	LaTeX writer: figure label
2016-03-18	ConTeXt writer: fix whitespace at line beginning in line blocks.	John MacFarlane	1	-1/+11
	Add a `\strut` after `\crlf` before space. Closes #2744, #2745. Thanks to @c-foster. This uses the fix suggested by @c-foster. Mid-line spaces are still not supported, because of limitations of the Markdown parser.
2016-03-18	LaTeX writer: Avoid double toprule in headerless table with caption.	John MacFarlane	1	-7/+10
	Closes #2742.
2016-03-18	Docx reader: Handle alternate content	Jesse Rosenthal	1	-14/+37
	Some word functions -- especially graphics -- give various choices for content so there can be backwards compatibility. This follows the largely undocumented feature by working through the choices until we find one that works. Note that we had to split out the processing of child elems of runs into a separate function so we can recurse properly. Any processing of an element within a run (other than a plain run) should go into `childElemToRun`.
2016-03-16	Docx reader: Don't make numbered heads into lists.	Jesse Rosenthal	1	-6/+8
	Word uses list numbering styles to number its headings. We only call something a numbered list if it does not also heave a heading style.
2016-03-15	Introduce file-scope parsing (parse-before-combine)	Jesse Rosenthal	1	-0/+2
	Traditionally pandoc operates on multiple files by first concetenating them (around extra line breaks) and then processing the joined file. So it only parses a multi-file document at the document scope. This has the benefit that footnotes and links can be in different files, but it also introduces a couple of difficulties: - it is difficult to join files with footnotes without some sort of preprocessing, which makes it difficult to write academic documents in small pieces. - it makes it impossible to process multiple binary input files, which can't be catted. - it makes it impossible to process files from different input formats. This commit introduces alternative method. Instead of catting the files first, it parses the files first, and then combines the parsed output. This makes it impossible to have links across multiple files, and auto-identified headers won't work correctly if headers in multiple files have the same name. On the other hand, footnotes across multiple files will work correctly and will allow more freedom for input formats. Since ByteStringReaders can currently only read one binary file, and will ignore subsequent files, we also changes the behavior to automatically parse before combining if using the ByteStringReader. If we use one file, it will work as normal. If there is more than one file it will combine them after parsing (assuming that the format is the same). Note that this is intended to be an optional method, defaulting to off. Turn it on with `--file-scope`.
2016-03-12	Add readDocxWithWarnings	Jesse Rosenthal	1	-6/+15
	The regular readDocx just becomes a special case.
2016-03-12	Docx Reader: Add state to the parser, for warnings	Jesse Rosenthal	1	-6/+19
	In order to be able to collect warnings during parsing, we add a state monad transformer to the D monad. At the moment, this only includes a list of warning strings (nothing currently triggers them, however). We use StateT instead of WriterT to correspond more closely with the warnings behavior in T.P.Parsing.
2016-03-10	Fixed behavior of base tag.	John MacFarlane	1	-17/+11
	+ If the base path does not end with slash, the last component will be replaced. E.g. base = `http://example.com/foo` combines with `bar.html` to give `http://example.com/bar.html`. + If the href begins with a slash, the whole path of the base is replaced. E.g. base = `http://example.com/foo/` combines with `/bar.html` to give `http://example.com/bar.html`. Closes #2777.
2016-03-10	Docx Writer: handle image alt text	mb21	1	-2/+2
	closes #2754
2016-03-09	Markdown reader: Improved pipe table parsing.	John MacFarlane	1	-15/+15
	Fixes #2765. Added test case.