aboutsummaryrefslogtreecommitdiff
path: root/src/Text
AgeCommit message (Collapse)AuthorFilesLines
2012-09-26UTF8: Better error message for invalid UTF8.John MacFarlane1-4/+6
Read bytestring and use Text's decodeUtf8 instead of using System.IO's hGetContents. This way you get a message saying "invalid UTF-8 stream" instead of "invalid byte sequence." You are also told which byte caused the problem.
2012-09-25Make --id-prefix affect footnote IDs in markdown writer.John MacFarlane2-4/+5
Closes #614.
2012-09-25Removed need for utf8-string package.John MacFarlane6-31/+55
* Depend on text. * Expose Text.Pandoc.UTF8. * Text.Pandoc.UTF8 now exports toString, fromString, toStringLazy, fromStringLazy. * These are used instead of the old utf8-string functions.
2012-09-25UTF8: use universalNewlineMode in reading.John MacFarlane1-1/+2
This treats both '\r\n' and '\n' as '\n' on input, no matter what platform we're running on.
2012-09-24Texinfo writer: Fixed internal cross-references.John MacFarlane1-7/+26
Now we insert anchors after each header, and use @ref instead of @uref for links. Commas are now escaped as @comma{} only when needed; previously all commas were escaped. (This change is needed, in part, because @ref commands must be followed by a real comma or period.) Also insert a blank line in from of @verbatim environments.
2012-09-23Revert "More intelligent handling of text encodings."John MacFarlane4-22/+12
This reverts commit 7272735b3d413a644fd9ab01eeae8ae9cd5a925b.
2012-09-23More intelligent handling of text encodings.John MacFarlane4-12/+22
Previously, UTF-8 was enforced for both input and output. The new system: * For input, UTF-8 is tried first; if an error is raised, the locale encoding is tried. * For output, the locale encoding is always used.
2012-09-23Removed unneeded CPP conditional.John MacFarlane1-44/+0
Removed code that was conditional on base < 4.2, since now we require base >= 4.2.
2012-09-23Revert "Use local encoding for input/output rather than forcing UTF8."John MacFarlane1-1/+2
This reverts commit c69837adb648a479167be5e2d37279a02be8060c.
2012-09-23Use local encoding for input/output rather than forcing UTF8.John MacFarlane1-2/+1
Note that system templates are stored as UTF8 and will still be read as such, even if the local encoding is different. Text downloaded from URLs will also be treated as UTF-8.
2012-09-23UTF8: Export decodeArg.John MacFarlane1-1/+5
2012-09-23Export encodePath/decodePath from UTF8.John MacFarlane1-0/+1
Removed duplicate code in src/pandoc.hs.
2012-09-22Markdown reader: Fixed link parser to avoid exponential slowdowns.John MacFarlane1-8/+12
Previously the parser would hang on input like this: [[[[[[[[[[[[[[[[[[hi We fixed this by making the link parser parser characters between balanced brackets (skipping brackets in inline code spans), then parsing the result as an inline list. One change is that [hi *there]* bud](/url) is now no longer parsed as a link. But in this respect pandoc behaved differently from most other implementations anyway, so that seems okay. All current tests pass. Added test for this case. Closes #620.
2012-09-22LaTeX reader: Added a 'try' in rawLaTeXBlock.John MacFarlane1-1/+1
This allows the markdown reader to treat '\begin' (not followed by an argument) as a raw string rather than erroring out when it doesn't find a '{'. Closes #622.
2012-09-16LaTeX writer: Don't rely on the enumerate package.John MacFarlane1-17/+27
Instead, use standard LaTeX commands to change numbering style.
2012-09-16LaTeX writer: Properly escape strings inside \url{}.John MacFarlane1-1/+2
Closes #576.
2012-09-16RST writer: Fixed hyperlinked images.John MacFarlane1-17/+28
* Use :target: field when you have a simple linked image. * Don't wrap the reference. * Cleaned up code. * Closes #611.
2012-09-16RST reader: Small tweaks to raw field lists.John MacFarlane1-3/+2
* Don't allow line breaks in field names. * Strip off initial newline from 'raw' when field body begins on next line.
2012-09-16Merge pull request #596 from dlax/rst-fieldlistsJohn MacFarlane1-2/+2
improvements to rst reader for field lists
2012-09-15MediaWiki reader: Allow table in table cell.John MacFarlane1-1/+3
2012-09-15LaTeX writer: Don't escape _ in hyperref identifiers.John MacFarlane1-1/+1
2012-09-15LaTeX writer: don't escape _ in url.John MacFarlane1-1/+1
2012-09-15LaTeX writer: Don't use \endhead with longtable.John MacFarlane1-1/+1
It causes the heading to be a different size form the body, sometimes.
2012-09-15Markdown writer: Made sensitive to Ext_hard_line_breaks.John MacFarlane1-4/+8
2012-09-15LaTeX writer: Use longtable instead of ctable.John MacFarlane1-19/+17
2012-09-15MediaWiki reader: Added smart doublequotes with -S option.John MacFarlane1-2/+10
Also disallow ' in the "trail" of an internal link; allowing it causes bugs with `'''[[Link]]'''`.
2012-09-15MediaWiki parser: Improved endline to stop bad breaks.John MacFarlane1-2/+0
2012-09-15MediaWiki reader: Allow def list item with just a term.John MacFarlane1-1/+4
2012-09-15MediaWiki reader: Properly handle templates in list items.John MacFarlane1-4/+15
2012-09-15HTML reader: Modified htmlTag for fewer false positives.John MacFarlane1-1/+1
A tag must start with `<` followed by `!`,`?`, `/`, or a letter. This makes it more useful in the wikimedia and markdown parsers.
2012-09-15MediaWiki reader: Handle ref tags as footnotes.John MacFarlane1-0/+1
2012-09-15MediaWiki reader: Better parsing of templates.John MacFarlane1-9/+10
2012-09-15MediaWiki reader: Treat "references" tag as block.John MacFarlane1-5/+2
2012-09-15MediaWiki reader: Interpret width attribute on table columns.John MacFarlane1-14/+33
2012-09-15MediaWiki reader: Got alignments working.John MacFarlane1-6/+12
They only work on headers, because pandoc forces same alignment for all cells in a column.
2012-09-15MediaWiki reader: Misc fixes, put category links at end.John MacFarlane1-22/+41
2012-09-15MediaWiki reader: Skip inline templates, handle <source>.John MacFarlane1-5/+10
2012-09-15MediaWiki reader: Parse styles on table cells.John MacFarlane1-0/+6
2012-09-15MediaWiki reader: Parse styles after '{|' table start.John MacFarlane1-1/+18
2012-09-14MediaWiki reader: || between table cells.John MacFarlane1-9/+10
2012-09-14MediaWiki reader: Improved table parsing.John MacFarlane1-10/+11
2012-09-14MediaWiki reader: Table improvements, added simple table tests.John MacFarlane1-4/+9
2012-09-14MediaWiki reader: Support for basic mediawiki tables.John MacFarlane1-5/+52
2012-09-13MediaWiki reader: Images.John MacFarlane1-3/+25
2012-09-13MediaWiki reader: internal links.John MacFarlane1-4/+4
2012-09-13MediaWiki reader: Implemented basic internal links.John MacFarlane1-41/+62
Including word-ending links and the "pipe trick."
2012-09-13MediaWiki reader: Implemented numbered external links.John MacFarlane1-8/+12
2012-09-13MediaWiki reader: Use MWState instead of ParserState.John MacFarlane2-6/+25
2012-09-13MediaWiki reader: Ignore `<gallery>` tags.John MacFarlane1-5/+6
2012-09-13MediaWiki reader: Support HTML lists.John MacFarlane1-12/+29