pandoc - Conversion between markup formats

Age	Commit message (Collapse)	Author	Files	Lines
2016-12-08	Docx reader: Ensure one-row tables don't have header.	Jesse Rosenthal	3	-0/+9
	Tables in MS Word are set by default to have special first-row formatting, which pandoc uses to determine whether or not they have a header. This means that one-row tables will, by default, have only a header -- which we imagine is not what people want. This change ensures that a one-row table is not understood to be a header only. Note that this means that it is impossible to produce a header-only table from docx, even though it is legal pandoc. But we believe that in nearly all cases, it will be an accidental (and unwelcome) result Closes #3285.
2016-12-07	Fixed tests with dynamic linking.	John MacFarlane	1	-3/+12
	Closes #2709.
2016-12-07	RST reader: fix hyperlink aliases.	John MacFarlane	2	-0/+3
	`link <google_>`_ .. _google: https://google.com is really a reference link. Closes #3283.
2016-12-04	LaTeX writer: Fix unnumbered headers when used with `--top-level`	Albert Krewinkel	1	-0/+22
	Fix interaction of top-level divisions `part` or `chapter` with unnumbered headers when emitting LaTeX. Headers are ensured to be written using stared commands (like `\subsection*{}`). Fixes: #3272
2016-12-04	Markdown writer: Fixed incorrect word wrapping.	John MacFarlane	3	-6/+6
	Previously pandoc would sometimes wrap lines too early due to this bug. Closes #3277.
2016-11-30	Options: Removed writerStandalone, made writerTemplate a Maybe.	John MacFarlane	5	-14/+12
	Previously setting writerStandalone = True did nothing unless a template was provided in writerTemplate. Now a fragment will be generated if writerTemplate is Nothing; otherwise, the specified template will be used and standalone output generated. [API change]
2016-11-30	Use new module from texmath to lookup MS font codepoints.	John MacFarlane	1	-0/+1
	+ Removed Text.Pandoc.Readers.Docx.Fonts + Moved its code to texmath; we now use (from texmath 0.9) Text.TeXMath.Unicode.Fonts + Use texmath 0.9 (currently from git). + Updated epub tests because texmath now handles more mathml.
2016-11-27	Refactor top-level division selection (#3261)	Albert Krewinkel	2	-15/+52
	The "default" option is no longer represented as `Nothing` but via a new type constructor, making the `Maybe` wrapper superfluous. The default behavior of using heuristics can now be enabled explicitly by setting `--top-level-division=default`. API change (`Text.Pandoc.Options`): The `Division` type was renamed to `TopLevelDivision`. The `Section`, `Chapter`, and `Part` constructors were renamed to `TopLevelSection`, `TopLevelChapter`, and `TopLevelPart`, respectively. An additional `TopLevelDefault` constructor was added, which is now also the new default value of the `writerTopLevelDivision` field in `WriterOptions`.
2016-11-26	[odt] Infer table's caption from the paragraph (#3224)	hubertp-lshift	4	-6/+6
	ODT's reader always put empty captions for the parsed tables. This commit 1) checks paragraphs that follow the table definition 2) treats specially a paragraph with a style named 'Table' 3) does some postprocessing of the paragraphs that combines tables followed immediately by captions The ODT writer used 'TableCaption' style name for the caption paragraph. This commit follows the open office approach which allows for appending captions to table but uses a built-in style named 'Table' instead of 'TableCaption'. Any users of odt format (both writer and reader) are therefore required to change the style's name to 'Table', if necessary.
2016-11-26	Allow to overwrite top-level division type heuristics (#3258)	Albert Krewinkel	2	-3/+3
	Pandoc uses heuristics to determine the most resonable top-level division type when emitting LaTeX or Docbook markup. It is now possible to overwrite this implicitly set top-level division via the `top-level-division` command line parameter. API change (`Text.Pandoc.Options`): the type of the `writerTopLevelDivision` field in of the `WriterOptions` data type is altered from `Division` to `Maybe Division`. The field's default value is changed from `Section` to `Nothing`. Closes: #3197
2016-11-19	Fixed xref lookup in DocBook reader. Closes #3243.	John MacFarlane	1	-3/+3
	It previously only worked when the qnames lacked the docbook namespace URI.
2016-11-19	Org reader: Ensure images in paragraphs are not parsed as figures	Albert Krewinkel	1	-12/+22
	This fixes a regression introduced in 7e5220b57c5a48fabe6e43ba270db812593d3463.
2016-11-15	Allow alignments to be specified in Markdown grid tables.	John MacFarlane	2	-0/+42

2016-11-15	Markdown writer: fixed inconsistent spacing issue.	John MacFarlane	3	-3/+1
	Previously a tight bullet sublist got rendered with a blank line after, while a tight ordered sublist did not. Now we don't get the blank line in either case. Closes #3232.
2016-11-13	HTML reader: only treat "a" element as link if it has href.	John MacFarlane	1	-0/+4
	Otherwise treat as span. Closes #3226.
2016-11-09	Org reader: allow HTML attribs on non-figure images	Albert Krewinkel	1	-0/+6
	Images which are the only element in a paragraph can still be given HTML attributes, even if the image does not have a caption and is hence not a figure. The following will add set the `width` attribute of the image to `50%`: #+ATTR_HTML: :width 50% [[file:image.jpg]] Closes: #3222
2016-11-08	Inline code when text has a special style	Hubert Plociniczak	4	-62/+55
	When a piece of text has a text 'Source_Text' then we assume that this is a piece of the document that represents a code that needs to be inlined. Addapted an odt writer to also reflect that change; previously it was just writing a 'preformatted' text using a non-distinguishable font style. Code blocks are still not recognized by the ODT reader. That's a separate issue.
2016-11-02	Docx reader/writer: Update tests for img title and alt	Jesse Rosenthal	5	-4/+4
	Closes #3204
2016-11-01	[odt] Infer tables' header props from rows (#3199)	hubertp-lshift	1	-1/+1
	ODT reader simply provided an empty header list which meant that the contents of the whole table, even if not empty, was simply ignored. While we still do not infer headers we at least have to provide default properties of columns.
2016-10-31	Added a test case with a complex raw latex environment in Markdown.	John MacFarlane	2	-0/+10

2016-10-30	Org reader: support `ATTR_HTML` for special blocks	Albert Krewinkel	1	-0/+9
	Special blocks (i.e. blocks with unrecognized names) can be prefixed with an `ATTR_HTML` block attribute. The attributes defined in that meta-directive are added to the `Div` which is used to represent the special block. Closes: #3182
2016-10-30	Org reader: support the `todo` export option	Albert Krewinkel	1	-0/+6
	The `todo` export option allows to toggle the inclusion of TODO keywords in the output. Setting this to `nil` causes TODO keywords to be dropped from headlines. The default is to include the keywords.
2016-10-30	Org reader: add support for todo-markers	Albert Krewinkel	1	-125/+165
	Headlines can have optional todo-markers which can be controlled via the `#+TODO`, `#+SEQ_TODO`, or `#+TYP_TODO` meta directive. Multiple such directives can be given, each adding a new set of recognized todo-markers. If no custom todo-markers are defined, the default `TODO` and `DONE` markers are used. Todo-markers are conceptually separate from headline text and are hence excluded when autogenerating headline IDs. The markers are rendered as spans and labelled with two classes: One class is the markers name, the other signals the todo-state of the marker (either `todo` or `done`).
2016-10-26	Markdown Reader: add attributes for autolink (#3183)	Daniele D'Orazio	1	-1/+10

2016-10-24	Export Text.Pandoc.Error in Text.Pandoc.	John MacFarlane	7	-13/+1
	[API change]
2016-10-23	Tighten up parsing of raw email addresses.	John MacFarlane	1	-0/+5
	Technically `**@user` is a valid email address, but if we allow things like this, we get bad results in markdown flavors that autolink raw email addresses. (See #2940.) So we exclude a few valid email addresses in order to avoid these more common bad cases. Closes #2940.
2016-10-19	Merge pull request #3108 from tarleb/part	John MacFarlane	2	-6/+112
	Add command line option allowing to set type of top-level divisions
2016-10-19	Add option for top-level division type	Albert Krewinkel	2	-6/+112
	The `--chapters` option is replaced with `--top-level-division` which allows users to specify the type as which top-level headers should be output. Possible values are `section` (the default), `chapter`, or `part`. The formats LaTeX, ConTeXt, and Docbook allow `part` as top-level division, TEI only allows to set the `type` attribute on `div` containers. The writers are altered to respect this option in a sensible way.
2016-10-19	Image with a caption needs special formatting	Hubert Plociniczak	2	-2/+2
	Latex Writer only handles captions if the image's title is prefixed with 'fig:'.
2016-10-18	Merge pull request #3166 from hubertp-lshift/bug/3134	John MacFarlane	2	-1/+1
	Issue 3143: Don't duplicate text for anchors
2016-10-18	Merge pull request #3165 from hubertp-lshift/feature/odt-image	John MacFarlane	4	-2/+7
	[odt] images parser
2016-10-18	Issue 3143: Don't duplicate text for anchors	Hubert Plociniczak	2	-1/+1
	When creating an anchor element we were adding its representation as well as the original content, leading to text duplication.
2016-10-17	Org writer: drop space before footnote markers	Albert Krewinkel	1	-4/+4
	The writer no longer adds an extra space before footnote markers. Fixes: #3162
2016-10-17	Infer caption from the text following the img	Hubert Plociniczak	4	-2/+7
	Frame can contain other frames with the text boxes. This is something that has not been considered before and meant that the whole construction of images was broken in those cases. Also the captions were fixed/ignored.
2016-10-17	RST reader: Add test for space-before-note.	Jesse Rosenthal	1	-0/+9

2016-10-14	Org reader: allow figure with empty caption	Albert Krewinkel	1	-0/+6
	A `#+CAPTION` attribute before an image is enough to turn an image into a figure. This wasn't the case because the `parseFromString` function, which processes the caption value, would fail on empty values. Adding a newline character to the caption value fixes this. Fixes: #3161
2016-10-14	Remove Tests.Arbitrary	Jesse Rosenthal	18	-211/+17
	Use exported Arbitrary instances from pandoc-types instead.
2016-10-14	Merge pull request #3146 from hubertp-lshift/feature/odt-list-start-value	John MacFarlane	6	-3/+3
	[ODT Parser] Include list's starting value
2016-10-14	Added tests and a corner case for starting number	Hubert Plociniczak	6	-3/+3
	Review revealed that we didn't handle the case when the starting point is an empty string. While this is not a valid .odt file, we simply added a special case to deal with it. Also added tests for the new feature.
2016-10-13	Parse line-oriented markup as LineBlock	Albert Krewinkel	4	-17/+25
	Markup-features focusing on lines as distinctive part of the markup are read into `LineBlock` elements. This currently means line blocks in reStructuredText and Markdown (the latter only if the `line_block` extension is enabled), the `linegroup`/`line` combination from the Docbook 5.1 working draft, and Org-mode `VERSE` blocks.
2016-10-11	Markdown writer: add test for note placement.	Jesse Rosenthal	1	-2/+138

2016-10-02	AsciiDoc writer: avoid unnecessary use of "unconstrained" emphasis.	John MacFarlane	1	-4/+4
	In AsciiDoc, you must use a special form of emphasis (double `__`) for intraword emphasis. Pandoc was previously using this more than necessary. Closes #3068.
2016-09-28	Markdown reader: added bracket syntax for native spans.	John MacFarlane	2	-1/+6
	See #168. Text.Pandoc.Options.Extension has a new constructor `Ext_brackted_spans`, which is enabled by default in pandoc's Markdown.
2016-09-28	Updated test suite.	John MacFarlane	4	-0/+24

2016-09-28	Merge pull request #3093 from wilx/master-figure-placement	John MacFarlane	1	-1/+1
	LaTeX: Do not set [htbp] figure placement options.
2016-09-20	LaTeX writer: change braced backtick to \textasciigrave{}	Jesse Rosenthal	1	-1/+1
	Backticks in verbatim environments are converted to open-single-quotes. This change makes them appear as backticks. This corresponds to how we treat `'' in verbatim environments (with \textquotesingle{}).
2016-09-19	Add test for backtick in verbatim.	Jesse Rosenthal	1	-0/+2

2016-08-30	Org reader: respect unnumbered header property	Albert Krewinkel	1	-0/+9
	Sections the `unnumbered` property should, as the name implies, be excluded from the automatic numbering of section provided by some output formats. The Pandoc convention for this is to add an "unnumbered" class to the header. The reader treats properties as key-value pairs per default, so a special case is added to translate the above property to a class instead. Closes #3095.
2016-08-29	Merge branch 'org-meta-handling'	Albert Krewinkel	1	-63/+124

2016-08-29	Docx reader: test for nested anchor spans in header	Jesse Rosenthal	3	-0/+14
	This ensures that anchor spans in header with content (or with other anchor spans inside) will resolve to links to a header id properly.