aboutsummaryrefslogtreecommitdiff
diff options
context:
space:
mode:
-rw-r--r--debian/changelog185
1 files changed, 185 insertions, 0 deletions
diff --git a/debian/changelog b/debian/changelog
index 11a9b0278..9f24b2510 100644
--- a/debian/changelog
+++ b/debian/changelog
@@ -1,3 +1,188 @@
+pandoc (0.46) unstable; urgency=low
+
+ [ John MacFarlane ]
+
+ * Made -H, -A, and -B options cumulative: if they are specified
+ multiple times, multiple files will be included.
+
+ * Added optional HTML sanitization using a whitelist.
+ When this option is specified (--sanitize-html on the command line),
+ unsafe HTML tags will be replaced by HTML comments, and unsafe HTML
+ attributes will be removed. This option should be especially useful
+ for those who want to use pandoc libraries in web applications, where
+ users will provide the input.
+
+ + Main.hs: Added --sanitize-html option.
+
+ + Text.Pandoc.Shared: Added stateSanitizeHTML to ParserState.
+
+ + Text.Pandoc.Readers.HTML:
+ - Added whitelists of sanitaryTags and sanitaryAttributes.
+ - Added parsers to check these lists (and state) to see if a given
+ tag or attribute should be counted unsafe.
+ - Modified anyHtmlTag and anyHtmlEndTag to replace unsafe tags
+ with comments.
+ - Modified htmlAttribute to remove unsafe attributes.
+ - Modified htmlScript and htmlStyle to remove these elements if
+ unsafe.
+
+ + Modified README and man pages to document new option.
+
+ * Improved handling of email addresses in markdown and reStructuredText.
+ Consolidated uri and email address parsers. (Resolves Issue #37.)
+
+ + New emailAddress and uri parsers in Text.Pandoc.Shared.
+ - uri parser uses parseURI from Network.URI.
+ - emailAddress parser properly handles email addresses with periods
+ in them.
+
+ + Removed uri and emailAddress parsers from Text.Pandoc.Readers.RST
+ and Text.Pandoc.Readers.Markdown.
+
+ * Markdown reader:
+
+ + Fixed emph parser so that "*hi **there***" is parsed as a Strong
+ nested in an Emph. (A '*' is only recognized as the end of the
+ emphasis if it's not the beginning of a strong emphasis.)
+
+ + Moved blockQuote parser before list parsers for performance.
+
+ + Modified 'source' parser to allow backslash-escapes in URLs.
+ So, for example, [my](/url\(1\)) yields a link to /url(1).
+ Resolves Issue #34.
+
+ + Disallowed links within links. (Resolves Issue #35.)
+ - Replaced inlinesInBalanced with inlinesInBalancedBrackets, which
+ instead of hard-coding the inline parser takes an inline parser
+ as a parameter.
+ - Modified reference and inlineNote to use inlinesInBalancedBrackets.
+ - Removed unneeded inlineString function.
+ - Added inlineNonLink parser, which is now used in the definition of
+ reference.
+ - Added inlineParsers list and redefined inline and inlineNonLink parsers
+ in terms of it.
+ - Added failIfLink parser.
+
+ + Better handling of parentheses in URLs and quotation marks in titles.
+ - 'source' parser first tries to parse URL with balanced parentheses;
+ if that doesn't work, it tries to parse everything beginning with
+ '(' and ending with ')'.
+ - source parser now uses an auxiliary function source'.
+ - linkTitle parser simplified and improved, under assumption that it
+ will be called in context of source'.
+
+ + Make 'block' conditional on strictness state, instead of using
+ failIfStrict in block parsers. Use a different ordering of parsers
+ in strict mode (raw HTML block before paragraph) for performance.
+ In non-strict mode use rawHtmlBlocks instead of htmlBlock.
+ Simplified htmlBlock, since we know it's only called in strict
+ mode.
+
+ + Improved handling of raw HTML. (Resolves Issue #36.)
+ - Tags that can be either block or inline (e.g. <ins>) should
+ be treated as block when appropriate and as inline when
+ appropriate. Thus, for example,
+ <ins>hi</ins>
+ should be treated as a paragraph with inline <ins> tags, while
+ <ins>
+ hi
+ </ins>
+ should be treated as a paragraph within <ins> tags.
+ - Moved htmlBlock after para in list of block parsers. This ensures
+ that tags that can be either block or inline get parsed as inline
+ when appropriate.
+ - Modified rawHtmlInline' so that block elements aren't treated as
+ inline.
+ - Modified para parser so that paragraphs containing only HTML tags and
+ blank space are not allowed. Treat these as raw HTML blocks instead.
+
+ + Fixed bug wherein HTML preceding a code block could cause it to
+ be parsed as a paragraph. The problem is that the HTML parser
+ used to eat all blank space after an HTML block, including the
+ indentation of the code block. (Resolves Issue #39.)
+ - In Text.Pandoc.Readers.HTML, removed parsing of following space
+ from rawHtmlBlock.
+ - In Text.Pandoc.Readers.Markdown, modified rawHtmlBlocks so that
+ indentation is eaten *only* on the first line after the HTML
+ block. This means that in
+ <div>
+ foo
+ <div>
+ the foo won't be treated as a code block, but in
+ <div>
+
+ foo
+
+ </div>
+ it will. This seems the right approach for least surprise.
+
+ * RST reader:
+
+ + Fixed bug in parsing explicit links (resolves Issue #44).
+ The problem was that we were looking for inlines until a '<' character
+ signaled the start of the URL; so, if you hit a reference-style link,
+ it would keep looking til the end of the document. Fix: change
+ inline => (notFollowedBy (char '`') >> inline). Note that this won't
+ allow code inlines in links, but these aren't allowed in resT anyway.
+
+ + Cleaned up parsing of reference names in key blocks and links.
+ Allow nonquoted reference links to contain isolated '.', '-', '_', so
+ so that strings like 'a_b_' count as links.
+
+ + Removed unnecessary check for following link in str.
+ This is unnecessary now that link is above str in the definition of
+ 'inline'.
+
+ * HTML reader:
+
+ + Modified rawHtmlBlock so it parses </html> and </body> tags.
+ This allows these tags to be handled correctly in Markdown.
+ HTML reader now uses rawHtmlBlock', which excludes </html> and </body>,
+ since these are handled in parseHtml. (Resolves Issue #38.)
+
+ + Fixed bug (emph parser was looking for <IT> tag, not <I>).
+
+ + Don't interpret contents of style tags as markdown.
+ (Resolves Issue #40.)
+ - Added htmlStyle, analagous to htmlScript.
+ - Use htmlStyle in htmlBlockElement and rawHtmlInline.
+ - Moved "script" from the list of tags that can be either block or
+ inline to the list of block tags.
+
+ + Modified rawHtmlBlock to use anyHtmlBlockTag instead of anyHtmlTag
+ and anyHtmlEndTag. This fixes a bug in markdown parsing, where
+ inline tags would be included in raw HTML blocks.
+
+ + Modified anyHtmlBlockTag to test for (not inline) rather than
+ directly for block. This allows us to handle e.g. docbook in
+ the markdown reader.
+
+ * LaTeX reader: Properly recognize --parse-raw in rawLaTeXInline.
+ Updated LaTeX reader test to use --parse-raw.
+
+ * HTML writer:
+
+ + Modified rules for automatic HTML header identifiers to
+ ensure that identifiers begin with an alphabetic character.
+ The new rules are described in README. (Resolves Issue #33.)
+
+ + Changed handling of titles in HTML writer so you don't get
+ "titleprefix - " followed by nothing.
+
+ * ConTeXt writer: Use wrappers around Doc elements to ensure proper
+ spacing. Each block element is wrapped with either Pad or Reg.
+ Pad'ed elements are guaranteed to have a blank line in between.
+
+ * RST writer:
+
+ + Refactored RST writer to use a record instead of a tuple for state,
+ and to include options in state so it doesn't need to be passed as
+ a parameter.
+
+ + Use an interpreted text role to render math in restructuredText.
+ See http://www.american.edu/econ/itex2mml/mathhack.rst for the
+ strategy.
+
pandoc (0.45) unstable; urgency=low
[ John MacFarlane ]