aboutsummaryrefslogtreecommitdiff
path: root/src/Text/Pandoc
AgeCommit message (Collapse)AuthorFilesLines
2013-01-17Revert "Markdown reader: Be less restrictive about class attributes."John MacFarlane1-1/+1
This reverts commit b158b3fb2182b5c8a6f03759277f4f73cdc24797.
2013-01-16Markdown reader: Be less restrictive about class attributes.John MacFarlane1-1/+1
This allows attributes like `C++`. Any nonspace and non-}.
2013-01-16Fixed problem with duplicate ids in epub.John MacFarlane1-12/+4
2013-01-16Implemented Ext_link_attributes in markdown reader.John MacFarlane2-31/+28
Also simplified source URL and link title parsers.
2013-01-16Implemented Ext_mmd_header_identifiers in markdown writer.John MacFarlane1-0/+2
2013-01-16Implemented Ext_mmd_header_identifiers in markdown reader.John MacFarlane2-4/+23
2013-01-16Options: Added multimarkdownExtensions.John MacFarlane1-2/+18
Also added 'markdown_mmd' as input/output option.
2013-01-15LaTeX writer: Prevent paragraphs containing only linebreaks or spaces.John MacFarlane1-4/+9
2013-01-15Added `writerHtmlQTags` and `--html-q-tags` option.John MacFarlane2-1/+3
The previous default was to use `<q>` tags in HTML5. But `<q>` tags are also valid HTML4, and they are not very robust in HTML5. Some user agents don't support them, and some CSS resets prevent pandoc's quotes CSS from working properly (e.g. bootstrap). It seems a better default just to insert quote characters, but the option is provided for those who have gotten used to using `<q>` tags.
2013-01-15Changed Ext_autolink_urls -> Ext_autolink_bare_uris.John MacFarlane3-5/+8
Added tests.
2013-01-15Case-insensitive parsing of URI schemes.John MacFarlane1-1/+1
2013-01-15Parsing: Improve oneOfStrings, export oneOfStringsCI.John MacFarlane1-7/+20
oneOfStrings will now take the longest match it can in a list of strings, so if 'foo' and 'foobar' are both included, 'foobar' will match even if 'foo' is first in the list.
2013-01-15Revised URI parser.John MacFarlane1-27/+50
* It no longer uses Network.URIs URI parser, which is too restrictive (not allowing unicode URIs unless encoded). * It allows many more schemes. * It better handles punctuation so as to avoid capturing trailing punctuation in bare URLs.
2013-01-15Use 'fig:' instead of '\SOH' in title to indicate figure.John MacFarlane14-27/+27
Revises 1a4b47e93368bfbd31daccdfedbd9527ee740201
2013-01-14Implemented Ext_implicit_figures.John MacFarlane14-18/+45
* In markdown reader, add a '\1' character to the beginning of the title of an image that is alone in its paragraph, if implicit_figures extension is selected. * In writers, check for Para [Image alt (src,'\1':tit)] and treat it as a figure if possible. * Updated tests. This is a bit of a hack, but it allows us to make implicit_figures an extension of the markdown reader, rather than the writers.
2013-01-14Removed some cruft.John MacFarlane1-10/+0
2013-01-14Parsing: Fixed uri -- escape unicode URLs.John MacFarlane1-2/+2
Otherwise Network.URI.parseURI fails on e.g. Chinese URLs. Changed an incorrect test in markdown-reader-more.
2013-01-14Added Ext_implicit_figures.John MacFarlane1-2/+2
No code to hook this in yet.
2013-01-14Parsing: Simplified and improved singleQuoteStart.John MacFarlane1-8/+2
This makes 's', 'l', etc. parse properly. Formerly we had some English-centric heuristics, but they are no longer needed now that we keep track of the last 'Str' position in state. Closes #698.
2013-01-13RST writer: Fixes bug with links with duplicate text.John MacFarlane1-6/+10
We now (a) use anonymous links for links with inline URLs, and (b) use an inline link instead of a reference link if the reference link would require a label that has already been used for a different link. Closes #511.
2013-01-13RST writer: Properly handle images with no alt text.John MacFarlane1-6/+7
Closes #678.
2013-01-13Markdown reader: Support RST-style line blocks.John MacFarlane2-1/+15
This depends on the new Ext_line_blocks extension.
2013-01-13Moved lineBlockLines to Parsing.John MacFarlane2-12/+19
This will be used by both RST and markdown readers.
2013-01-13RST reader: Refactored line block parser.John MacFarlane1-14/+10
2013-01-13RST writer: Allow soft breaks w continuations in line blocks.John MacFarlane1-1/+1
2013-01-13RST reader: Line block improvements.John MacFarlane1-3/+8
* Use nonbreaking spaces for initial indent (otherwise lost in HTML and LaTeX). * Allow multiple paragraphs in a single line block.
2013-01-13Use line block in RST writer when a paragraph contains linebreaks.John MacFarlane1-4/+8
Previously linebreaks weren't supported in RST, since RST has no native linebreak construct.
2013-01-12Markdown writer: Support Ext_auto_identifiers and Ext_header_attributes.John MacFarlane1-13/+23
2013-01-12HTML reader: Added html5 tags to list of block-level tags.John MacFarlane1-5/+8
2013-01-12Escape `|` as `\vert` in LaTeX math.John MacFarlane1-2/+11
This avoids a clash with highlighting-kate's macros, which redefine | as a short verbatim delimiter. Thanks to Björn Peemöller for raising this issue.
2013-01-11Changed hspace in last commit to 0.333em.John MacFarlane1-1/+1
This seems to match the spacing of ~ exactly.
2013-01-11LaTeX writer: Use `\hspace*` for nonbreaking space after line break.John MacFarlane1-1/+13
Since `~` spaces after a line break are just ignored. Closes #687.
2013-01-11Revised EPUB writer given changes in header attributes.John MacFarlane1-7/+18
We need to ensure that all headers have attributes, since this is no longer guaranteed by hierarchicalize. Explicitly given headers remain the same.
2013-01-11EPUB writer: Download images at URLs.John MacFarlane1-3/+3
2013-01-11Refactoring:John MacFarlane5-33/+41
* Shared now exports fetchItem (instead of getItem) and openURL * fetchItem has different parameters than getItem and includes some logic formerly in the ODT and Docx writers * getItem still used in SelfContained
2013-01-11ODT, Docx writers: Properly handle URL refs for images.John MacFarlane2-20/+38
These images are now downloaded instead of being ignored (as used to happen in the docx reader) or causing an error (as used to happen in the odt reader).
2013-01-11ImageSize: Remove no longer exported readImageSize.John MacFarlane1-4/+1
2013-01-11Improvements to docx writer.John MacFarlane1-65/+63
Avoid reading image files again when we've already processed them.
2013-01-11Docx writer: Preliminary improvements.John MacFarlane1-18/+23
* Use getItem to fetch images, so we can get them over the net if they have absolute URLs. * Added TODO notes for cleaning up the logic.
2013-01-11ImageSize: Use strict, not lazy bytestrings.John MacFarlane1-2/+2
2013-01-11Move getItem from SelfContained to Share; export getItem.John MacFarlane2-30/+31
2013-01-11Options: Added githubMarkdownExtensions.John MacFarlane1-0/+15
Added github_markdown as input/output option.
2013-01-10Options: Added phpMarkdownExtraExtensions.John MacFarlane1-0/+14
And added markdown_phpextra input/output format.
2013-01-10Implemented Ext_header_attributes.John MacFarlane2-12/+22
This allows explicit attributes to be put on headers, using a syntax like that for code blocks: {#id .class .class k=v k=v}
2013-01-10Ext_header_identifiers -> Ext_auto_identifiers.John MacFarlane2-3/+3
2013-01-09More improvements in emailAddress parser.John MacFarlane1-23/+17
2013-01-09Made email parser more correct.John MacFarlane1-12/+14
Now it's based on RFC 822, though it still doesn't implement quoted strings in email addresses.
2013-01-09Fixed regression in example lists.John MacFarlane1-6/+7
This was associated with e1be934dc2ff061e5200017d6503280686983e1f.
2013-01-09Added Attr field to Header.John MacFarlane24-86/+108
Previously header ids were autogenerated by the writers. Now they are generated (unless supplied explicitly) in the markdown parser, if the `header_identifiers` extension is selected. In addition, the textile reader now supports id attributes on headers.
2013-01-08Biblio: Make sure mvPunc and deNote run on metadata too.John MacFarlane1-4/+3
This fixed a bug with notes on titles using footnote styles.