pandoc - Conversion between markup formats

Age	Commit message (Collapse)	Author	Files	Lines
2017-03-12	Issue warning for duplicate header identifiers.	John MacFarlane	1	-5/+11
	As noted in the previous commit, an autogenerated identifier may still coincide with an explicit identifier that is given for a header later in the document, or with an identifier on a div, span, link, or image. This commit adds a warning in this case, so users can supply an explicit identifier. * Added `DuplicateIdentifier` to LogMessage. * Modified HTML, Org, MediaWiki readers so their custom state type is an instance of HasLogMessages. This is necessary for `registerHeader` to issue warnings. See #1745.
2017-03-04	Fixed some loose ends in #1592.	John MacFarlane	1	-1/+3
	Added test cases. Fixed HTML reader to parse a span with class "smallcaps" as SmallCaps. Fixed Markdown writer to render SmallCaps as a native span when native spans are enabled.
2017-02-20	Tighten up HasQuoteContext instance in HTML reader.	John MacFarlane	1	-1/+1
	We constrain it to the state used in the HTML reader. Otherwise we can get overlap with the general instance for ParserState m.
2017-02-11	Use new warnings throughout the code base.	John MacFarlane	1	-6/+4

2017-02-10	Added Text.Pandoc.Logging (exported module).	John MacFarlane	1	-1/+2
	This now contains the Verbosity definition previously in Options, as well as a new LogMessage datatype that will eventually be used instead of raw strings for warnings. This will enable us, among other things, to provide machine-readable warnings if desired. See #3392.
2017-02-10	HTML reader: Added warnings for ignored material.	John MacFarlane	1	-5/+14
	See #3392.
2017-02-06	Removed --parse-raw and readerParseRaw.	John MacFarlane	1	-7/+7
	These were confusing. Now we rely on the +raw_tex or +raw_html extension with latex or html input. Thus, instead of --parse-raw -f latex we use -f latex+raw_tex and instead of --parse-raw -f html we use -f html+raw_html
2017-01-25	More logging-related changes.	John MacFarlane	1	-9/+5
	Class: * Removed getWarnings, withWarningsToStderr * Added report * Added logOutput to PandocMonad * Make logOutput streaming in PandocIO monad * Properly reverse getLog output Readers: * Replaced use of trace with report DEBUG. TWiki Reader: Put everything inside PandocMonad m. API changes.
2017-01-25	Changes to verbosity in writer and reader options.	John MacFarlane	1	-3/+3
	API changes: Text.Pandoc.Options: * Added Verbosity. * Added writerVerbosity. * Added readerVerbosity. * Removed writerVerbose. * Removed readerTrace. pandoc CLI: The `--trace` option sets verbosity to DEBUG; the `--quiet` option sets it to ERROR, and the `--verbose` option sets it to INFO. The default is WARNING.
2017-01-25	Unify Errors.	Jesse Rosenthal	1	-1/+2

2017-01-25	Working on readers.	Jesse Rosenthal	1	-98/+115

2016-12-08	Removed debug trace from HTML reader.	John MacFarlane	1	-2/+1

2016-12-07	HTML reader: Understand `style=width:` as well as `width` in `col`.	John MacFarlane	1	-2/+7
	Closes #3286.
2016-12-06	Fixed some bad regressions in HTML table parser.	John MacFarlane	1	-3/+3
	This regression leads to the introduction of empty rows in some circumstances. Closes #3280.
2016-11-26	HTML reader: improved table parsing.	John MacFarlane	1	-11/+24
	We now check explicitly for non-1 rowspan or colspan attributes, and fail when we encounter them. Previously we checked that each row had the same number of cells, but that could be true even with rowspans/colspans. And there are cases where it isn't true in tables that we can handle fine -- e.g. when a tr element is empty. So now we just pad rows with empty cells when needed. Closes #3027.
2016-11-13	HTML reader: only treat "a" element as link if it has href.	John MacFarlane	1	-7/+19
	Otherwise treat as span. Closes #3226.
2016-11-02	HTML reader: treat `<math>` as MathML by default...	John MacFarlane	1	-8/+11
	unless something else is explicitly specified in xmlns. Provided it parses as MathML, of course. Also fixed default which should be to inline math if no display attribute is used.
2016-09-02	Remove Compat.Monoid	Jesse Rosenthal	1	-1/+1
	This was only necessary for GHC versions with base below 4.5 (i.e., ghc < 7.4).
2016-05-21	HTML reader: fixed bug in pClose.	John MacFarlane	1	-1/+1
	This caused exponential parsing behavior in documnets with unclosed tags in dl, dd, dt.
2016-04-10	Markdown + HTML readers: be more forgiving about unescaped &.	John MacFarlane	1	-10/+15
	We are now more forgiving about parsing invalid HTML with unescaped `&` as raw HTML. (Previously any unescaped `&` would cause pandoc not to recognize the string as raw HTML.) Closes #2410.
2016-03-22	Fixed bug in Markdown raw HTML parsing.	John MacFarlane	1	-1/+1
	This was a regression, with the rewrite of `htmlInBalanced` (from `Text.Pandoc.Readers.HTML`) in 1.17. It caused newlines to be omitted in raw HTML blocks. Closes #2804.
2016-03-10	Fixed behavior of base tag.	John MacFarlane	1	-17/+11
	+ If the base path does not end with slash, the last component will be replaced. E.g. base = `http://example.com/foo` combines with `bar.html` to give `http://example.com/bar.html`. + If the href begins with a slash, the whole path of the base is replaced. E.g. base = `http://example.com/foo/` combines with `/bar.html` to give `http://example.com/bar.html`. Closes #2777.
2016-02-20	Fixed some linter warnings.	John MacFarlane	1	-3/+3

2016-02-20	HTML reader: rewrote htmlInBalanced.	John MacFarlane	1	-10/+39
	This version avoids an exponential performance problem with `<script>` tags, and it should be faster in general. Closes #2730.
2016-02-16	HTML reader: properly handle an empty cell in a simple table.	John MacFarlane	1	-0/+1
	Closes #2718.
2016-01-29	HTML reader: handle multiple meta tags with same name.	John MacFarlane	1	-2/+6
	Put them in a list in the metadata so they are all preserved, rather than (as before) throwing out all but one..
2016-01-22	Changed type of Shared.uniqueIdent argument from [String] to Set String.	John MacFarlane	1	-3/+3
	This avoids performance problems in documents with many identically named headers. Closes #2671.
2015-12-12	Modified readers to emit SoftBreak when appropriate.	John MacFarlane	1	-1/+4

2015-11-19	Merge branch 'new-image-attributes' of https://github.com/mb21/pandoc into ↵	John MacFarlane	1	-15/+11
	mb21-new-image-attributes * Bumped version to 1.16. * Added Attr field to Link and Image. * Added `common_link_attributes` extension. * Updated readers for link attributes. * Updated writers for link attributes. * Updated tests * Updated stack.yaml to build against unreleased versions of pandoc-types and texmath. * Fixed various compiler warnings. Closes #261. TODO: * Relative (percentage) image widths in docx writer. * ODT/OpenDocument writer (untested, same issue about percentage widths). * Update pandoc-citeproc.
2015-11-09	Restored Text.Pandoc.Compat.Monoid.	John MacFarlane	1	-1/+1
	Don't use custom prelude for latest ghc. This is a better approach to making 'stack ghci' and 'cabal repl' work. Instead of using NoImplicitPrelude, we only use the custom prelude for older ghc versions. The custom prelude presents a uniform API that matches the current base version's prelude. So, when developing (presumably with latest ghc), we don't use a custom prelude at all and hence have no trouble with ghci. The custom prelude no longer exports (<>): we now want to match the base 4.8 prelude behavior.
2015-11-09	Revert "Use -XNoImplicitPrelude and 'import Prelude' explicitly."	John MacFarlane	1	-1/+0
	This reverts commit c423dbb5a34c2d1195020e0f0ca3aae883d0749b.
2015-11-08	Use -XNoImplicitPrelude and 'import Prelude' explicitly.	John MacFarlane	1	-0/+1
	This is needed for ghci to work with pandoc, given that we now use a custom prelude. Closes #2503.
2015-10-22	Fixed over-eager raw HTML inline parsing.	John MacFarlane	1	-0/+1
	Tightened up the inline HTML parser so it disallows TagWarnings. This only affects the markdown reader when the `markdown_in_html_blocks` option is disabled. Closes #2469.
2015-10-14	Use custom Prelude to avoid compiler warnings.	John MacFarlane	1	-2/+2
	- The (non-exported) prelude is in prelude/Prelude.hs. - It exports Monoid and Applicative, like base 4.8 prelude, but works with older base versions. - It exports (<>) for mappend. - It hides 'catch' on older base versions. This allows us to remove many imports of Data.Monoid and Control.Applicative, and remove Text.Pandoc.Compat.Monoid. It should allow us to use -Wall again for ghc 7.10.
2015-10-11	HTML reader/writer: better handling of "section" elements.	John MacFarlane	1	-3/+10
	Previously `<section>` tags were just parsed as raw HTML blocks. With this change, section elements are parsed as Div elements with the class "section". The HTML writer will use `<section>` tags to render these Divs in HTML5; otherwise they will be rendered as `<div class="section">`. Closes #2438.
2015-08-08	HTML reader: add auto identifiers if not present on headers.	John MacFarlane	1	-7/+17
	This makes TOC linking work properly. The same thing needs to be done to the org reader to fix #2354; in addition, `Ext_auto_identifiers` should be added to the list of default extensions for org in Text.Pandoc.
2015-08-07	Updated readers, writers and README for link attribute	mb21	1	-14/+4

2015-08-07	Updated readers and writers for new image attribute parameter.	John MacFarlane	1	-1/+7
	(mb21)
2015-07-27	HTML Reader: Detect font-variant with pickStyleAttrProps	Ophir Lifshitz	1	-6/+5

2015-07-24	HTML Reader: Parse <ol> type, class, and inline list-style(-type) CSS	Ophir Lifshitz	1	-17/+30

2015-07-21	Fix regression: allow HTML comments containing `--`.	John MacFarlane	1	-4/+4
	Technically this isn't allowed in an HTML comment, but we've always allowed it, and so do most other implementations. It is handy if e.g. you want to put command line arguments in HTML comments.
2015-07-21	HTML reader: handle type attribute on ol.	John MacFarlane	1	-1/+8
	E.g. `<ol type="i">`. Closes #2313.
2015-07-10	Avoid parsing partial URLs as HTML tags.	John MacFarlane	1	-1/+8
	Closes #2277.
2015-06-04	HTML reader: allow `<body>` to close `<head>`.	John MacFarlane	1	-0/+1

2015-05-13	HTML reader: Support base tag.	John MacFarlane	1	-7/+28
	We only support the href attribute, as there's no place for "target" in the Pandoc document model for links. Added HTML reader test module, with tests for this feature. Closes #1751.
2015-05-11	HTML reader: Fixed detection of self-closing tags.	John MacFarlane	1	-2/+2
	Earlier versions had a bug and would wrongly think opening tags containing attributes with slashes in them were self-closing. Closes #2146.
2015-04-29	HTML reader: Allow multiple colgroups in table.	John MacFarlane	1	-1/+1
	Closes #2122.
2015-04-26	Updated copyright notices to -2015. Closes #2111.	John MacFarlane	1	-2/+2

2015-04-17	More principled fix for #1820.	John MacFarlane	1	-5/+7
	If the tag parses as a comment, we check to see if the input starts with `<!--`. If not, it's bogus comment mode and we fail htmlTag. Includes test case. Closes #1820.
2015-04-17	Fixed `htmlTag` in HTML reader.	John MacFarlane	1	-1/+1
	Require that `<!` or `<?` be followed by nonspace. This prevents `</ div>` from being parsed as a comment. Closes #1820.