pandoc - Conversion between markup formats

Age	Commit message	Author	Files	Lines
2011-01-01	Fixed regression in markdown reader.	John MacFarlane	3	-201/+201
	'(_hi_)' was being parsed with literal underscores (no emphasis). The fix: the 'str' parser now only parses alphanumerics and embedded underscores. All other symbols are handled by the 'symbol' parser. This has a slight effect on the AST, since you'll get [Str "hi",Str ":"] instead of [Str "hi:"]. But there should not be a visible effect in any of the writers. Thanks to gwern for pointing out the regression.
2010-12-30	New HTML reader using tagsoup as a lexer.	John MacFarlane	2	-8/+8
	* The new reader is faster and more accurate. * API changes for Text.Pandoc.Readers.HTML: - removed rawHtmlBlock, anyHtmlBlockTag, anyHtmlInlineTag, anyHtmlTag, anyHtmlEndTag, htmlEndTag, extractTagType, htmlBlockElement, htmlComment - added htmlTag, htmlInBalanced, isInlineTag, isBlockTag, isTextTag * tagsoup is a new dependency. * Text.Pandoc.Parsing: Generalized type on readWith. * Benchmark.hs: Added length calculation to force full evaluation. * Updated HTML reader tests. * Updated markdown and textile readers to use the functions from the HTML reader. * Note: The markdown reader now correctly handles some cases it did not before. For example: <hr/> is reproduced without adding a space. <script> a = '<b>'; </script> is parsed correctly.
2010-12-22	Made --smart work with HTML reader.	John MacFarlane	1	-71/+71
	It did not work before, because - and quotes were gobbled up by the str parser.
2010-12-22	Texinfo writer: Updated to use Pretty.	John MacFarlane	2	-16/+1

2010-12-22	Man writer: updated to use Pretty.	John MacFarlane	2	-28/+20

2010-12-21	OpenDocument writer: Updated to use Pretty.	John MacFarlane	2	-132/+302

2010-12-21	Docbook writer: Updated to use Pretty.	John MacFarlane	1	-127/+106

2010-12-20	Markdown writer: use \ for newline instead of two spaces at eol.	John MacFarlane	1	-1/+1
	(Unless --strict.)
2010-12-20	ConTeXt writer: Updated to use Text.Pandoc.Pretty.	John MacFarlane	2	-89/+80

2010-12-19	Fixed markdown-citations.ieee.txt.	John MacFarlane	1	-1/+1

2010-12-19	Fixed markdown-citations.txt.	John MacFarlane	1	-3/+1

2010-12-19	Fixed biblatex/natbib citation writer tests.	John MacFarlane	2	-32/+2

2010-12-19	LaTeX writer: Modified to use Pretty.	John MacFarlane	1	-72/+35
	Improved footnote formatting, removed spurious blank lines.
2010-12-18	LaTeX writer: Use \paragraph, \subparagraph for level 4,5 headers.	John MacFarlane	1	-2/+2

2010-12-17	Added new prettyprinting module.	John MacFarlane	12	-583/+278
	* Added Text.Pandoc.Pretty. This is better suited for pandoc than the 'pretty' package. One advantage is that we now get proper wrapping; Emph [Inline] is no longer treated as a big unwrappable unit. Previously we only got breaks for spaces at the "outer level." We can also more easily avoid doubled blank lines. Performance is significantly better as well. * Removed Text.Pandoc.Blocks. Text.Pandoc.Pretty allows you to define blocks and concatenate them. * Modified markdown, RST, org writers to use Text.Pandoc.Pretty instead of Text.PrettyPrint.HughesPJ. * Text.Pandoc.Shared: Added writerColumns to WriterOptions. * Markdown, RST, Org writers now break text at writerColumns. * Added --columns command-line option, which sets stColumns and writerColumns. * Table parsing: If the size of the header > stColumns, use the header size as 100% for purposes of calculating relative widths of columns.
2010-12-15	Added 'tests' Cabal flag.	John MacFarlane	1	-220/+0
	+ This ensures that test-pandoc gets built. + 'cabal test' now runs this. + The old tests/RunTests.hs has been removed, and src/test-pandoc.hs added.
2010-12-15	Use top-level header at end as bibliography title for natbib and biblatex output.	Nathan Gass	2	-3/+2

2010-12-15	Remove punctuation at start of suffix for natbib and biblatex output.	Nathan Gass	3	-9/+9
	This is necessary as the latex citation commands include their own punctuation, which resulted in doubled commas for markdown documents where citeproc output works correctly.
2010-12-14	Added normalize function to latex citation tests.	Nathan Gass	1	-9/+21
	This is necessary because converting from markdown to latex correctly changes hyphens to en-dashes and some spaces to non-breaking spaces. Converting back to markdown does not undo these changes, and so the tests have to undo them.
2010-12-14	Added citation tests.	Nathan Gass	7	-22/+245
	Added tests for latex citation writer and reader, markdown citation writer and additional markup in citations.
2010-12-13	Added support for latex cite commands in latex reader.	Nathan Gass	1	-4/+4

2010-12-13	Disabled colored boxes around cites in latex template.	Nathan Gass	3	-3/+3

2010-12-13	Markdown reader: Fixed regression in reference key parser.	John MacFarlane	1	-1/+0
	* The recent change allowing spaces and newlines in the URL caused problems when reference keys are stacked up without blank lines between. This is now fixed. * Added test.
2010-12-10	Markdown reader: Allow linebreaks in URLs (treat as spaces).	John MacFarlane	2	-2/+3
	Also, a string of consecutive spaces or tabs is now parsed as a single space. If you have multiple spaces in your URL, use %20%20.
2010-12-09	textile redcloth definition lists	paul.rivier	2	-0/+24

2010-12-09	Textile reader: better treatment of acronyms.	John MacFarlane	1	-1/+1
	We now parse PBS(Public Broadcasting System) as if it were "PBS (Public Broadcasting System)".
2010-12-08	RST reader: Added footnote support.	John MacFarlane	2	-1/+35
	Resolves issue #258. Note that there are some differences in how docutils and pandoc treat footnotes. Currently pandoc ignores the numeral or symbol used in the note; footnotes are put in an auto-numbered ordered list.
2010-12-08	Textile reader: Implemented footnotes.	John MacFarlane	2	-2/+11

2010-12-07	Made --smart work with RST reader.	John MacFarlane	2	-48/+48

2010-12-07	Smart punctuation: don't allow ellipses containing spaces.	John MacFarlane	1	-1/+1
	Previously we allowed '. . .', ' . . . ', etc. This caused too many complications, and removed the author's flexibility in combining ellipses with spaces and periods.
2010-12-07	Moved smartPunctuation from Markdown to Parsing.	John MacFarlane	1	-8/+8
	+ Parameterized smartPunctuation on an inline parser. + Handle smartPunctuation in Textile reader.
2010-12-07	Textile reader: implemented acronyms, (tm), (r), (c).	John MacFarlane	2	-1/+19

2010-12-07	Fixed bugs in ieee.csl (Andrea Rossato).	John MacFarlane	1	-2/+2

2010-12-07	Updated ieee citation test for punctuation-in-quote.	John MacFarlane	1	-2/+2

2010-12-06	Markdown reader: handle curly quotes better.	John MacFarlane	3	-2/+11
	Previously, curly quotes were just parsed literally, leading to problems in some output formats. Now they are parsed as Quoted inlines, if --smart is specified. Resolves Issue #270.
2010-12-05	Fix regression: markdown references should be case-insensitive.	John MacFarlane	2	-1/+15
	This broke when we added the Key type. We had assumed that the custom case-insensitive Ord instance would ensure case-insensitive matching, but that is not how Data.Map works. * Added a test case for case-insensitivity in markdown-reader-more * Removed old refsMatch from Text.Pandoc.Parsing module; * hid the 'Key' constructor; * dropped the custom Ord and Eq instances, deriving instead; * added fromKey and toKey to convert between Keys and Inline lists; * toKey ensures that keys are case-insensitive, since this is the only way the API provides to construct a Key. Resolves Issue #272.
2010-12-04	Added tests.	Puneeth Chaganti	3	-0/+938
	+ Added tables.org and writer.org to tests. + Added org.template to templates. + Changed RunTests.hs as required. + Minor changes to Org writer.
2010-12-03	Merge branch 'citeproc' into master.	John MacFarlane	9	-2/+1068
	Conflicts: src/Text/Pandoc/Definition.hs
2010-12-03	Textile reader: added hrule parser.	John MacFarlane	1	-1/+1

2010-12-03	Textile reader: drop leading, trailing newline in pre block.	John MacFarlane	1	-2/+2
	This is consistent with how the other readers work.
2010-12-03	Textile reader: updated test suite to include raw HTML.	John MacFarlane	1	-6/+6

2010-12-03	Textile reader: parse raw by default.	John MacFarlane	1	-1/+1
	It's part of the textile spec to allow raw HTML, just as with markdown. -R is no longer needed in test suite.
2010-12-03	punctuation handling, and more html-specific handling	paul.rivier	2	-48/+48

2010-12-03	html inlines and html blocks handling in textile reader	Paul Rivier	2	-1/+30

2010-12-03	textile reader now ignores html/css attributes	Paul Rivier	2	-1/+26

2010-12-03	fix autolink by promoting it in the parser list, fix table parabreak	Paul Rivier	2	-26/+4

2010-12-03	more support for Textile reader (explicit links, images), tests and cabal entries	Paul Rivier	3	-0/+279

2010-11-28	Revamped tests, using markdown output instead of HTML.	John MacFarlane	8	-257/+136
	This is easier to inspect.
2010-11-28	Citation tests: removed spurious double-spaces.	John MacFarlane	3	-9/+9

2010-11-28	Updated citation tests to use en-dash between ranges.	John MacFarlane	3	-12/+12
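
Note on the 2010-12-05 entry (case-insensitive markdown references): the idea that toKey is the only way to construct a Key, and that it normalizes case, can be sketched roughly as follows. This is a minimal illustration, not pandoc's actual code. It assumes keys are plain Strings (the commit converts between Keys and Inline lists), and the names KeyTable, insertRef and lookupRef are hypothetical helpers introduced only for this example.

```haskell
import Data.Char (toLower)
import qualified Data.Map as M

-- Hypothetical stand-in for pandoc's Key type. In a real module the
-- constructor would not be exported, so toKey is the only way in.
-- Eq and Ord are derived rather than hand-written.
newtype Key = Key String
  deriving (Eq, Ord, Show)

-- Normalize case once, at construction time (assumption: String keys).
toKey :: String -> Key
toKey = Key . map toLower

-- Recover the normalized label.
fromKey :: Key -> String
fromKey (Key s) = s

-- Hypothetical reference table mapping keys to URLs.
type KeyTable = M.Map Key String

-- Insert a reference definition under its normalized key.
insertRef :: String -> String -> KeyTable -> KeyTable
insertRef label url = M.insert (toKey label) url

-- Look up a reference; the label's case does not matter because
-- both insertion and lookup go through toKey.
lookupRef :: String -> KeyTable -> Maybe String
lookupRef label = M.lookup (toKey label)
```

Because every key passes through toKey, the derived Eq and Ord instances compare already-normalized values, so no custom instance is needed for case-insensitive matching.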