path: root/src/Text
Age | Commit message | Author | Files | Lines
2012-09-28 | HTML writer: Use toHtml instead of pre-escaping. | John MacFarlane | 1 | -4/+7
We work around the problem that blaze-html unnecessarily escapes `'` by pre-escaping just the `'` characters, instead of the whole string. If blaze-html later stops escaping `'` characters, we can simplify strToHtml to toHtml. Note that this change yields a significant speed boost (111ms to 94ms on one benchmark). Closes #629.
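The workaround described above can be pictured in a few lines. This is a Python sketch rather than pandoc's Haskell, and `str_to_html` here is only an illustrative stand-in built on Python's `html.escape`, not the blaze-html API: it escapes everything except `'` by splitting on apostrophes, escaping each piece, and rejoining with literal apostrophes.

```python
import html


def str_to_html(text: str) -> str:
    """Escape HTML special characters but leave apostrophes untouched.

    Illustrative only: mirrors the idea of handling just the `'`
    characters separately and letting the general escaper do the rest.
    """
    # The pieces contain no apostrophes, so the general escaper never
    # sees one; the apostrophes are re-inserted verbatim by the join.
    return "'".join(html.escape(piece, quote=True) for piece in text.split("'"))


print(str_to_html("it's <b> & done"))
```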
2012-09-28 | RST reader: Made para parser slightly more efficient. | John MacFarlane | 1 | -12/+19
2012-09-27 | RST reader: Use Text.Pandoc.Builder. | John MacFarlane | 1 | -174/+180
This will give us more flexibility in the future. It also gives built-in normalization. Performance slightly better.
2012-09-27 | Shared: Export compactify', formerly in Markdown reader. | John MacFarlane | 2 | -21/+25
2012-09-27 | Parsing: Changed type of stateSubstitutions to use Inlines. | John MacFarlane | 1 | -2/+2
2012-09-27 | Removed nullBlock. | John MacFarlane | 2 | -7/+1
Don't use nullBlock in Textile reader. Better to know about parsing problems than to skip stuff when we get stuck.
2012-09-27 | RST reader: Support :target: on image substitutions. | John MacFarlane | 1 | -26/+26
2012-09-27 | Added stateSubstitutions to ParserState, use for RST substitutions. | John MacFarlane | 2 | -12/+23
2012-09-27 | Removed Ext_monospace_autolinks. | John MacFarlane | 2 | -8/+2
2012-09-27 | Added Ext_autolink_urls. | John MacFarlane | 2 | -0/+10
2012-09-27 | Renamed Ext_autolink_code_spans to Ext_monospace_autolinks. | John MacFarlane | 2 | -3/+3
2012-09-26 | RST reader: Support :target: on .. image:: blocks. | John MacFarlane | 1 | -4/+8
Still not supported on substitution definitions.
2012-09-26 | Biblio: Remove workaround for toCapital. | John MacFarlane | 1 | -10/+2
Now citeproc-hs is fixed upstream, so this is no longer needed. Closes #531.
2012-09-26 | Shared: Count \r as space in removeLeading/TrailingSpace. | John MacFarlane | 1 | -1/+1
2012-09-26 | UTF8: Better error message for invalid UTF8. | John MacFarlane | 1 | -4/+6
Read bytestring and use Text's decodeUtf8 instead of using System.IO's hGetContents. This way you get a message saying "invalid UTF-8 stream" instead of "invalid byte sequence." You are also told which byte caused the problem.
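As a rough illustration of the behavior described above (read raw bytes, decode strictly, and report the offending byte), here is a Python sketch. It is not pandoc's Haskell code; the function name `decode_utf8_strict` and the exact message wording are assumptions made for the example.

```python
def decode_utf8_strict(data: bytes) -> str:
    """Decode bytes as UTF-8, naming the offending byte on failure."""
    try:
        return data.decode("utf-8")
    except UnicodeDecodeError as err:
        # err.start is the offset of the first byte that could not be decoded.
        bad_byte = data[err.start]
        raise ValueError(
            f"invalid UTF-8 stream: byte 0x{bad_byte:02x} at offset {err.start}"
        ) from err


print(decode_utf8_strict("caf\u00e9".encode("utf-8")))
```

Decoding the bytes explicitly, instead of relying on the I/O layer's implicit decoding, is what makes it possible to say which byte was wrong.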
2012-09-25 | Make --id-prefix affect footnote IDs in markdown writer. | John MacFarlane | 2 | -4/+5
Closes #614.
2012-09-25 | Removed need for utf8-string package. | John MacFarlane | 6 | -31/+55
* Depend on text.
* Expose Text.Pandoc.UTF8.
* Text.Pandoc.UTF8 now exports toString, fromString, toStringLazy, fromStringLazy.
* These are used instead of the old utf8-string functions.
2012-09-25 | UTF8: use universalNewlineMode in reading. | John MacFarlane | 1 | -1/+2
This treats both '\r\n' and '\n' as '\n' on input, no matter what platform we're running on.
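The effect on input can be sketched as follows. This is a Python illustration of the observable result only, not the Haskell I/O setting itself; `normalize_newlines` is a hypothetical helper name.

```python
def normalize_newlines(text: str) -> str:
    """Map Windows-style '\\r\\n' line endings to '\\n'; plain '\\n' is unchanged."""
    return text.replace("\r\n", "\n")


print(repr(normalize_newlines("first\r\nsecond\nthird")))
```

Applying this on every platform means the readers only ever have to deal with one kind of line ending.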
2012-09-24 | Texinfo writer: Fixed internal cross-references. | John MacFarlane | 1 | -7/+26
Now we insert anchors after each header, and use @ref instead of @uref for links. Commas are now escaped as @comma{} only when needed; previously all commas were escaped. (This change is needed, in part, because @ref commands must be followed by a real comma or period.) Also insert a blank line in front of @verbatim environments.
2012-09-23 | Revert "More intelligent handling of text encodings." | John MacFarlane | 4 | -22/+12
This reverts commit 7272735b3d413a644fd9ab01eeae8ae9cd5a925b.
2012-09-23 | More intelligent handling of text encodings. | John MacFarlane | 4 | -12/+22
Previously, UTF-8 was enforced for both input and output. The new system:
* For input, UTF-8 is tried first; if an error is raised, the locale encoding is tried.
* For output, the locale encoding is always used.
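The input half of the scheme described in this (later reverted) commit can be sketched like this. It is a Python illustration under stated assumptions, not the original Haskell: `decode_input` and its `fallback_encoding` parameter are hypothetical names, and the locale lookup uses Python's `locale.getpreferredencoding`.

```python
import locale
from typing import Optional


def decode_input(data: bytes, fallback_encoding: Optional[str] = None) -> str:
    """Try UTF-8 first; if decoding fails, fall back to the locale encoding."""
    try:
        return data.decode("utf-8")
    except UnicodeDecodeError:
        # Assumption: the fallback is whatever the locale reports, unless
        # the caller names an encoding explicitly (handy for testing).
        encoding = fallback_encoding or locale.getpreferredencoding(False)
        return data.decode(encoding)


print(decode_input("na\u00efve".encode("utf-8")))
print(decode_input(b"na\xefve", fallback_encoding="latin-1"))
```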
2012-09-23 | Removed unneeded CPP conditional. | John MacFarlane | 1 | -44/+0
Removed code that was conditional on base < 4.2, since now we require base >= 4.2.
2012-09-23 | Revert "Use local encoding for input/output rather than forcing UTF8." | John MacFarlane | 1 | -1/+2
This reverts commit c69837adb648a479167be5e2d37279a02be8060c.
2012-09-23 | Use local encoding for input/output rather than forcing UTF8. | John MacFarlane | 1 | -2/+1
Note that system templates are stored as UTF8 and will still be read as such, even if the local encoding is different. Text downloaded from URLs will also be treated as UTF-8.
2012-09-23 | UTF8: Export decodeArg. | John MacFarlane | 1 | -1/+5
2012-09-23 | Export encodePath/decodePath from UTF8. | John MacFarlane | 1 | -0/+1
Removed duplicate code in src/pandoc.hs.
2012-09-22 | Markdown reader: Fixed link parser to avoid exponential slowdowns. | John MacFarlane | 1 | -8/+12
Previously the parser would hang on input like this:

    [[[[[[[[[[[[[[[[[[hi

We fixed this by making the link parser parse characters between balanced brackets (skipping brackets in inline code spans), then parsing the result as an inline list. One change is that

    [hi *there]* bud](/url)

is now no longer parsed as a link. But in this respect pandoc behaved differently from most other implementations anyway, so that seems okay. All current tests pass. Added test for this case. Closes #620.
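The balanced-bracket idea can be sketched as a single left-to-right scan per attempt. This is a Python illustration, not the Markdown reader's actual parser: `bracketed_span` is a hypothetical helper, and its handling of inline code spans (matching a run of backticks against the next occurrence of the same run) is a deliberate simplification.

```python
def bracketed_span(text: str, start: int = 0):
    """Return (inner_text, end_index) for a balanced [...] at text[start].

    Returns None if text[start] is not '[' or the brackets never balance.
    Brackets inside backtick-delimited code spans are not counted.
    """
    if start >= len(text) or text[start] != "[":
        return None
    depth = 0
    i = start
    n = len(text)
    while i < n:
        ch = text[i]
        if ch == "`":
            # Simplified code-span skip: take the run of backticks, then
            # jump past the next occurrence of the same run, if any.
            j = i
            while j < n and text[j] == "`":
                j += 1
            fence = text[i:j]
            close = text.find(fence, j)
            i = j if close == -1 else close + len(fence)
            continue
        if ch == "[":
            depth += 1
        elif ch == "]":
            depth -= 1
            if depth == 0:
                return text[start + 1:i], i + 1
        i += 1
    return None  # unbalanced: the scan simply reaches the end of the input


print(bracketed_span("[hi [there] bud](/url)"))
print(bracketed_span("[" * 18 + "hi"))
```

On the second example from the commit message, the first balanced span in `[hi *there]* bud](/url)` is `[hi *there]`, which is followed by `*` rather than `(`, so under this sketch no link would be recognized there either.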
2012-09-22 | LaTeX reader: Added a 'try' in rawLaTeXBlock. | John MacFarlane | 1 | -1/+1
This allows the markdown reader to treat '\begin' (not followed by an argument) as a raw string rather than erroring out when it doesn't find a '{'. Closes #622.
2012-09-16 | LaTeX writer: Don't rely on the enumerate package. | John MacFarlane | 1 | -17/+27
Instead, use standard LaTeX commands to change numbering style.
2012-09-16 | LaTeX writer: Properly escape strings inside \url{}. | John MacFarlane | 1 | -1/+2
Closes #576.
2012-09-16 | RST writer: Fixed hyperlinked images. | John MacFarlane | 1 | -17/+28
* Use :target: field when you have a simple linked image.
* Don't wrap the reference.
* Cleaned up code.
* Closes #611.
2012-09-16 | RST reader: Small tweaks to raw field lists. | John MacFarlane | 1 | -3/+2
* Don't allow line breaks in field names.
* Strip off initial newline from 'raw' when field body begins on next line.
2012-09-16 | Merge pull request #596 from dlax/rst-fieldlists | John MacFarlane | 1 | -2/+2
improvements to rst reader for field lists
2012-09-15 | MediaWiki reader: Allow table in table cell. | John MacFarlane | 1 | -1/+3
2012-09-15 | LaTeX writer: Don't escape _ in hyperref identifiers. | John MacFarlane | 1 | -1/+1
2012-09-15 | LaTeX writer: don't escape _ in url. | John MacFarlane | 1 | -1/+1
2012-09-15 | LaTeX writer: Don't use \endhead with longtable. | John MacFarlane | 1 | -1/+1
It causes the heading to be a different size from the body, sometimes.
2012-09-15 | Markdown writer: Made sensitive to Ext_hard_line_breaks. | John MacFarlane | 1 | -4/+8
2012-09-15 | LaTeX writer: Use longtable instead of ctable. | John MacFarlane | 1 | -19/+17
2012-09-15 | MediaWiki reader: Added smart doublequotes with -S option. | John MacFarlane | 1 | -2/+10
Also disallow ' in the "trail" of an internal link; allowing it causes bugs with `'''[[Link]]'''`.
2012-09-15 | MediaWiki parser: Improved endline to stop bad breaks. | John MacFarlane | 1 | -2/+0
2012-09-15 | MediaWiki reader: Allow def list item with just a term. | John MacFarlane | 1 | -1/+4
2012-09-15 | MediaWiki reader: Properly handle templates in list items. | John MacFarlane | 1 | -4/+15
2012-09-15 | HTML reader: Modified htmlTag for fewer false positives. | John MacFarlane | 1 | -1/+1
A tag must start with `<` followed by `!`, `?`, `/`, or a letter. This makes it more useful in the MediaWiki and markdown parsers.
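The rule stated above amounts to a two-character check. The following Python sketch illustrates it; `looks_like_tag_start` is a hypothetical name, and restricting "a letter" to ASCII letters is an assumption of the example, not something the log specifies.

```python
import string


def looks_like_tag_start(text: str) -> bool:
    """True if text begins with '<' followed by '!', '?', '/', or a letter."""
    if len(text) < 2 or text[0] != "<":
        return False
    # Assumption: "letter" means an ASCII letter here.
    return text[1] in "!?/" or text[1] in string.ascii_letters


for sample in ["<div>", "</p>", "<!-- c -->", "<?xml", "<3", "< b"]:
    print(sample, looks_like_tag_start(sample))
```

Rejecting inputs such as `<3` or `< b` early is what cuts down the false positives when ordinary text contains a stray `<`.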
2012-09-15 | MediaWiki reader: Handle ref tags as footnotes. | John MacFarlane | 1 | -0/+1
2012-09-15 | MediaWiki reader: Better parsing of templates. | John MacFarlane | 1 | -9/+10
2012-09-15 | MediaWiki reader: Treat "references" tag as block. | John MacFarlane | 1 | -5/+2
2012-09-15 | MediaWiki reader: Interpret width attribute on table columns. | John MacFarlane | 1 | -14/+33
2012-09-15 | MediaWiki reader: Got alignments working. | John MacFarlane | 1 | -6/+12
They only work on headers, because pandoc forces same alignment for all cells in a column.
2012-09-15 | MediaWiki reader: Misc fixes, put category links at end. | John MacFarlane | 1 | -22/+41