pandoc - Conversion between markup formats

Age	Commit message	Author	Files	Lines
2016-11-26	HTML reader: improved table parsing.	John MacFarlane	1	-11/+24
	We now check explicitly for non-1 rowspan or colspan attributes, and fail when we encounter them. Previously we checked that each row had the same number of cells, but that could be true even with rowspans/colspans. And there are cases where it isn't true in tables that we can handle fine -- e.g. when a tr element is empty. So now we just pad rows with empty cells when needed. Closes #3027.
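The check-then-pad strategy described in this commit can be illustrated with a minimal Python sketch. It is not pandoc's Haskell implementation: the cell representation (a dict with optional `rowspan`/`colspan` keys) and the name `normalize_rows` are assumptions made purely for illustration.

```python
def normalize_rows(rows):
    """Pad rows to a common width; reject cells with a non-1 rowspan/colspan.

    `rows` is a list of rows, each row a list of cell dicts. The dict layout
    is a hypothetical stand-in for parsed <td>/<th> elements.
    """
    # Fail early on any cell that spans more than one row or column.
    for row in rows:
        for cell in row:
            if int(cell.get("rowspan", 1)) != 1 or int(cell.get("colspan", 1)) != 1:
                raise ValueError("rowspan/colspan other than 1 is not supported")

    # The widest row determines the column count (0 for an empty table).
    width = max((len(row) for row in rows), default=0)

    # Short rows, including an empty <tr>, are padded with empty cells.
    return [row + [{"text": ""} for _ in range(width - len(row))] for row in rows]
```

With this sketch, a table whose second `tr` is empty is accepted and padded, while a cell carrying `colspan="2"` causes the whole table to be rejected.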
2016-11-26	[odt] Infer table's caption from the paragraph (#3224)	hubertp-lshift	1	-6/+21
	ODT's reader always put empty captions for the parsed tables. This commit: 1) checks paragraphs that follow the table definition; 2) treats a paragraph with a style named 'Table' specially; 3) does some postprocessing of the paragraphs that combines tables followed immediately by captions. The ODT writer used the 'TableCaption' style name for the caption paragraph. This commit follows the OpenOffice approach, which allows for appending captions to tables but uses a built-in style named 'Table' instead of 'TableCaption'. Any users of the odt format (both writer and reader) are therefore required to change the style's name to 'Table', if necessary.
2016-11-26	LaTeX reader: don't treat `\vspace` and `\hspace` as block commands.	John MacFarlane	1	-1/+0
	Fixed an error which came up, for example, with `\vspace` inside a caption. (Captions expect inlines.) Closes #3256.
2016-11-24	Org reader: respect column width settings	Albert Krewinkel	2	-28/+48
	Table column properties can optionally specify the width with which a column is displayed in the buffer. Some exporters, notably the ODT exporter in org-mode v9.0, use these values to calculate relative column widths. The org reader now implements the same behavior. Note that the org-mode LaTeX and HTML exporters in Emacs don't support this feature yet, which should be kept in mind by users who use the column width parameters. Closes: #3246
2016-11-20	Allow beamer-style <...> options in raw LaTeX (also in Markdown).	John MacFarlane	1	-1/+13
	This allows use of things like `\only<2,3>{my content}` in Markdown that is going to be converted to beamer. Closes #3184.
2016-11-19	LaTeX reader: improved table handling.	John MacFarlane	1	-4/+13
	We can now parse all of the tables emitted by pandoc in our tests. The only things we don't get yet are alignments and column widths in more complex tables. See #2669.
2016-11-19	LaTeX reader: limited support for minipage.	John MacFarlane	1	-0/+2

2016-11-19	Un-break Travis build	Albert Krewinkel	1	-2/+2
	Remove whitespace before function documentation. The extra spaces cause problems with documentation tools, and Travis tests are failing because of this.
2016-11-19	LaTeX reader: improved parsing of tables.	John MacFarlane	1	-5/+13
	Reader can now parse simple LaTeX tables such as those generated by pandoc itself. We still can't handle pandoc multiline tables which involve minipages and column widths. Partially addresses #2669.
2016-11-19	Fixed xref lookup in DocBook reader. Closes #3243.	John MacFarlane	1	-4/+6
	It previously only worked when the qnames lacked the docbook namespace URI.
2016-11-19	Org reader: Ensure images in paragraphs are not parsed as figures	Albert Krewinkel	3	-15/+32
	This fixes a regression introduced in 7e5220b57c5a48fabe6e43ba270db812593d3463.
2016-11-16	Small caps in Bracketed Spans (#3191)	ickc	1	-1/+7
	* Markdown reader: modify bracketedSpan to check small caps
	* MANUAL.txt: add description on the use of `bracketed_spans` for small caps
	* Improve markdown readers: bracketedSpan function EXACTLY as spanHtml
2016-11-15	Allow alignments to be specified in Markdown grid tables.	John MacFarlane	1	-17/+23

2016-11-13	HTML reader: only treat "a" element as link if it has href.	John MacFarlane	1	-7/+19
	Otherwise treat as span. Closes #3226.
2016-11-10	Docx reader: add a placeholder value for CHART.	Jesse Rosenthal	2	-0/+17
	We wrap `[CHART]` in a `<span class="chart">`. Note that it maps to inlines because, in docx, anything in a drawing tag can be part of a larger paragraph.
2016-11-10	Docx reader: Be more specific in parsing images	Jesse Rosenthal	1	-6/+10
	We don't want to match just "w:drawing", because that could also include charts. Now we specify "w:drawing"//"pic:pic". This shouldn't change behavior at all, but it's a first step toward allowing other sorts of drawing data as well.
2016-11-09	Org reader: allow HTML attribs on non-figure images	Albert Krewinkel	1	-6/+8
	Images which are the only element in a paragraph can still be given HTML attributes, even if the image does not have a caption and is hence not a figure. The following will set the `width` attribute of the image to `50%`:
	    #+ATTR_HTML: :width 50%
	    [[file:image.jpg]]
	Closes: #3222
2016-11-08	Inline code when text has a special style	Hubert Plociniczak	1	-6/+20
	When a piece of text has the text style 'Source_Text', we assume that it represents code that needs to be inlined. Adapted the odt writer to reflect that change as well; previously it was just writing 'preformatted' text using a non-distinguishable font style. Code blocks are still not recognized by the ODT reader. That's a separate issue.
2016-11-05	Markdown reader: Allow reference link labels starting with @...	John MacFarlane	1	-1/+2
	...if citations extension disabled. Example: in
	    [link text][@a]
	
	    [@a]: url
	`link text` isn't hyperlinked because `[@a]` is parsed as a citation. Previously this happened whether or not the `citations` extension was enabled. Now it happens only if the `citations` extension is enabled. Closes #3209.
2016-11-02	Docx Reader: abstract out function to avoid code repetition.	Jesse Rosenthal	1	-16/+14

2016-11-02	Docx reader: Handle Alt text and titles in images.	Jesse Rosenthal	2	-11/+28
	We use the "description" field as alt text and the "title" field as title. These can be accessed through the "Format Picture" dialog in Word.
2016-11-02	Docx reader utils: handle empty namespace in elemName	Jesse Rosenthal	1	-1/+2
	Previously, if given an empty namespace: (elemName ns "" "foo") `elemName` would output a QName with a `Just ""` namespace. This is never what we want. Now we output a `Nothing`. If someone does want a `Just ""` in the namespace, they can enter the QName value explicitly.
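The behavior change can be sketched in Python, with `None` playing the role of Haskell's `Nothing`. This is only an illustration, not pandoc's code: the `QName` field layout, the `elem_name` signature, and the example namespace URI are assumptions, and exactly which field of the real `QName` held the `Just ""` is not stated in the commit message.

```python
from typing import Dict, NamedTuple, Optional


class QName(NamedTuple):
    """Hypothetical stand-in for an XML qualified name."""
    name: str
    uri: Optional[str]
    prefix: Optional[str]


def elem_name(ns: Dict[str, str], prefix: str, name: str) -> QName:
    """Build a QName, mapping an empty prefix to None instead of ""."""
    if prefix == "":
        # Never produce an empty-string namespace; use "no namespace" instead.
        return QName(name, None, None)
    # Unknown prefixes simply get no URI in this sketch.
    return QName(name, ns.get(prefix), prefix)
```

A caller who really wants an empty-string namespace can still construct `QName("foo", None, "")` directly, which mirrors the escape hatch mentioned in the commit message.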
2016-11-02	HTML reader: treat `<math>` as MathML by default...	John MacFarlane	1	-8/+11
	unless something else is explicitly specified in xmlns. Provided it parses as MathML, of course. Also fixed default which should be to inline math if no display attribute is used.
2016-11-02	LaTeX reader: Handle BVerbatim from fancyvrb. Fixes #3203.	John MacFarlane	1	-10/+15

2016-11-01	Handle hungarumlaut in LaTeX reader. Closes #3201.	John MacFarlane	1	-0/+16

2016-11-01	[odt] Infer tables' header props from rows (#3199)	hubertp-lshift	1	-2/+9
	The ODT reader simply provided an empty header list, which meant that the contents of the whole table, even if not empty, were simply ignored. While we still do not infer headers, we at least have to provide default properties of columns.
2016-10-31	LaTeX reader: allow for []s inside LaTeX optional args.	John MacFarlane	1	-1/+2
	Fixes cases like:
	    \begin{center}
	    \begin{tikzpicture}[baseline={([yshift=+-.5ex]current bounding box.center)}, level distance=24pt]
	    \Tree [.{S} [.NP John\index{i} ] [.VP [.V likes ] [.NP himself\index{i,*j} ]]]
	    \end{tikzpicture}
	    \end{center}
2016-10-30	Org reader: support `ATTR_HTML` for special blocks	Albert Krewinkel	1	-9/+22
	Special blocks (i.e. blocks with unrecognized names) can be prefixed with an `ATTR_HTML` block attribute. The attributes defined in that meta-directive are added to the `Div` which is used to represent the special block. Closes: #3182
2016-10-30	Org reader: support the `todo` export option	Albert Krewinkel	3	-2/+7
	The `todo` export option toggles the inclusion of TODO keywords in the output. Setting this to `nil` causes TODO keywords to be dropped from headlines. The default is to include the keywords.
2016-10-30	Org reader: add support for todo-markers	Albert Krewinkel	3	-5/+98
	Headlines can have optional todo-markers which can be controlled via the `#+TODO`, `#+SEQ_TODO`, or `#+TYP_TODO` meta directive. Multiple such directives can be given, each adding a new set of recognized todo-markers. If no custom todo-markers are defined, the default `TODO` and `DONE` markers are used. Todo-markers are conceptually separate from headline text and are hence excluded when autogenerating headline IDs. The markers are rendered as spans and labelled with two classes: one class is the marker's name, the other signals the todo-state of the marker (either `todo` or `done`).
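The separation of todo-markers from headline text can be illustrated with a small Python sketch. It is not the Org reader's Haskell code: the function names, the dict layout, and the simplified ID scheme (lowercased words joined by hyphens) are all assumptions for illustration.

```python
def parse_headline(text, todo_words=("TODO",), done_words=("DONE",)):
    """Split an optional leading todo-marker off a headline.

    Returns the marker (or None), the two classes the marker span would
    carry (todo-state and marker name), and the remaining title text.
    """
    first, _, rest = text.partition(" ")
    if first in todo_words:
        return {"marker": first, "classes": ["todo", first], "title": rest.strip()}
    if first in done_words:
        return {"marker": first, "classes": ["done", first], "title": rest.strip()}
    # No recognized marker: the whole text is the title.
    return {"marker": None, "classes": [], "title": text}


def headline_id(headline):
    """Derive an ID from the title only, so the marker never leaks into it."""
    return "-".join(headline["title"].lower().split())
```

Custom marker sets, as declared by a `#+TODO`-style directive, would be passed in through `todo_words` and `done_words`; a headline such as `TODO Write docs` then yields the ID `write-docs` regardless of its marker.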
2016-10-26	Markdown Reader: add attributes for autolink (#3183)	Daniele D'Orazio	1	-1/+3

2016-10-24	Export Text.Pandoc.Error in Text.Pandoc.	John MacFarlane	1	-3/+2
	[API change]
2016-10-22	Added `angle_brackets_escapable` extension.	John MacFarlane	1	-0/+2
	This is needed because github flavored Markdown has a slightly different set of escapable symbols than original Markdown; it includes angle brackets. Closes #2846.
2016-10-22	EPUB reader: don't add root path to data: URIs.	John MacFarlane	1	-1/+3
	Closes #3150. Thanks to @lep for the bug report and patch.
2016-10-19	Image with a caption needs special formatting	Hubert Plociniczak	1	-2/+6
	The LaTeX writer only handles captions if the image's title is prefixed with 'fig:'.
2016-10-18	Merge pull request #3166 from hubertp-lshift/bug/3134	John MacFarlane	1	-3/+2
	Issue 3143: Don't duplicate text for anchors
2016-10-18	Merge pull request #3165 from hubertp-lshift/feature/odt-image	John MacFarlane	3	-38/+138
	[odt] images parser
2016-10-18	Better fix for the problem with ghc 7.8.	John MacFarlane	1	-1/+3

2016-10-18	Try to fix build error on ghc 7.8.	John MacFarlane	1	-1/+1
	@tarleb this is an interesting one; see the build log (https://travis-ci.org/jgm/pandoc/jobs/168612017). It only failed on ghc 7.8; I think this must have to do with the change making Applicative a superclass of Monad, hence this change.
2016-10-18	Issue 3143: Don't duplicate text for anchors	Hubert Plociniczak	1	-3/+2
	When creating an anchor element we were adding its representation as well as the original content, leading to text duplication.
2016-10-17	Minor refactoring	Hubert Plociniczak	1	-10/+6

2016-10-17	Infer caption from the text following the img	Hubert Plociniczak	1	-20/+47
	A frame can contain other frames with text boxes. This had not been considered before, and it meant that the whole construction of images was broken in those cases. Also the captions were fixed/ignored.
2016-10-17	RST reader: skip whitespace before note.	Jesse Rosenthal	1	-2/+3
	RST requires a space before a footnote marker. We discard those spaces so that footnotes will be adjacent to the text that comes before them. This is in line with what rst2latex does. rst2html does not discard the space, but its HTML output is different from pandoc's, so this seems the most semantically correct approach. Closes #3163.
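The effect on the text can be approximated with a short Python sketch. This is an assumption-laden illustration, not the reader's actual parser logic: it works on raw text with a regular expression, the name `strip_space_before_notes` is hypothetical, and it only recognizes numbered, `#`-style, and `*` footnote references.

```python
import re

# Whitespace that is immediately followed by a footnote reference such as
# [1]_, [#]_, [#label]_ or [*]_. The lookahead keeps the reference itself.
_SPACE_BEFORE_NOTE = re.compile(r"\s+(?=\[(?:\d+|#[\w-]*|\*)\]_)")


def strip_space_before_notes(text):
    """Remove the whitespace that RST requires before a footnote marker."""
    return _SPACE_BEFORE_NOTE.sub("", text)
```

Other bracketed references (for example citation-style labels that do not match the patterns above) are left untouched by this sketch.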
2016-10-14	Org reader: allow figure with empty caption	Albert Krewinkel	1	-3/+1
	A `#+CAPTION` attribute before an image is enough to turn an image into a figure. This wasn't the case because the `parseFromString` function, which processes the caption value, would fail on empty values. Adding a newline character to the caption value fixes this. Fixes: #3161
2016-10-14	Merge pull request #3146 from hubertp-lshift/feature/odt-list-start-value	John MacFarlane	2	-13/+21
	[ODT Parser] Include list's starting value
2016-10-14	Added tests and a corner case for starting number	Hubert Plociniczak	1	-0/+1
	Review revealed that we didn't handle the case when the starting point is an empty string. While this is not a valid .odt file, we simply added a special case to deal with it. Also added tests for the new feature.
2016-10-13	Parse line-oriented markup as LineBlock	Albert Krewinkel	4	-9/+9
	Markup features focusing on lines as a distinctive part of the markup are read into `LineBlock` elements. This currently means line blocks in reStructuredText and Markdown (the latter only if the `line_block` extension is enabled), the `linegroup`/`line` combination from the DocBook 5.1 working draft, and Org-mode `VERSE` blocks.
2016-10-12	[ODT Parser] Include list's starting value	Hubert Plociniczak	2	-13/+20
	Previously the starting value of the lists' items was hardcoded to 1. In reality, ODT's list style definition can provide a new starting value in one of its attributes. Writers already handle the modified start value, so there is no need to change anything in that area.
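Reading the starting value with a fallback of 1 might look like the following Python sketch, which also covers the empty-string corner case mentioned in the 2016-10-14 follow-up commit. The attribute name `text:start-value`, the dict-of-attributes input, and the function name are assumptions for illustration; the commit message does not name the attribute, and this is not the ODT reader's Haskell code.

```python
def list_start_value(attrs, default=1):
    """Return a list's starting number from its style attributes.

    `attrs` is a hypothetical dict of attribute name to raw string value.
    A missing, empty, or non-numeric value falls back to `default`.
    """
    raw = attrs.get("text:start-value")  # assumed attribute name
    if raw is None or raw.strip() == "":
        # Absent attribute, or the (invalid but tolerated) empty string.
        return default
    try:
        return int(raw)
    except ValueError:
        return default
```

Falling back to 1 keeps the previous hardcoded behavior for every document that does not specify a usable starting value.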
2016-10-12	Basic support for images in ODT documents	Hubert Plociniczak	3	-38/+115
	Highly influenced by the docx support; refactored some code to avoid repetition.
2016-10-10	Org reader: trim verse lines properly	Albert Krewinkel	1	-2/+4
	An empty verse line should not result in `Str ""` but in `mempty`.