aboutsummaryrefslogtreecommitdiff
path: root/src/Text
AgeCommit message (Collapse)AuthorFilesLines
2013-01-15Case-insensitive parsing of URI schemes.John MacFarlane1-1/+1
2013-01-15Parsing: Improve oneOfStrings, export oneOfStringsCI.John MacFarlane1-7/+20
oneOfStrings will now take the longest match it can in a list of strings, so if 'foo' and 'foobar' are both included, 'foobar' will match even if 'foo' is first in the list.
2013-01-15Revised URI parser.John MacFarlane1-27/+50
* It no longer uses Network.URIs URI parser, which is too restrictive (not allowing unicode URIs unless encoded). * It allows many more schemes. * It better handles punctuation so as to avoid capturing trailing punctuation in bare URLs.
2013-01-15Use 'fig:' instead of '\SOH' in title to indicate figure.John MacFarlane14-27/+27
Revises 1a4b47e93368bfbd31daccdfedbd9527ee740201
2013-01-14Implemented Ext_implicit_figures.John MacFarlane14-18/+45
* In markdown reader, add a '\1' character to the beginning of the title of an image that is alone in its paragraph, if implicit_figures extension is selected. * In writers, check for Para [Image alt (src,'\1':tit)] and treat it as a figure if possible. * Updated tests. This is a bit of a hack, but it allows us to make implicit_figures an extension of the markdown reader, rather than the writers.
2013-01-14Removed some cruft.John MacFarlane1-10/+0
2013-01-14Parsing: Fixed uri -- escape unicode URLs.John MacFarlane1-2/+2
Otherwise Network.URI.parseURI fails on e.g. Chinese URLs. Changed an incorrect test in markdown-reader-more.
2013-01-14Added Ext_implicit_figures.John MacFarlane1-2/+2
No code to hook this in yet.
2013-01-14Parsing: Simplified and improved singleQuoteStart.John MacFarlane1-8/+2
This makes 's', 'l', etc. parse properly. Formerly we had some English-centric heuristics, but they are no longer needed now that we keep track of the last 'Str' position in state. Closes #698.
2013-01-13RST writer: Fixes bug with links with duplicate text.John MacFarlane1-6/+10
We now (a) use anonymous links for links with inline URLs, and (b) use an inline link instead of a reference link if the reference link would require a label that has already been used for a different link. Closes #511.
2013-01-13RST writer: Properly handle images with no alt text.John MacFarlane1-6/+7
Closes #678.
2013-01-13Markdown reader: Support RST-style line blocks.John MacFarlane2-1/+15
This depends on the new Ext_line_blocks extension.
2013-01-13Moved lineBlockLines to Parsing.John MacFarlane2-12/+19
This will be used by both RST and markdown readers.
2013-01-13RST reader: Refactored line block parser.John MacFarlane1-14/+10
2013-01-13RST writer: Allow soft breaks w continuations in line blocks.John MacFarlane1-1/+1
2013-01-13RST reader: Line block improvements.John MacFarlane1-3/+8
* Use nonbreaking spaces for initial indent (otherwise lost in HTML and LaTeX). * Allow multiple paragraphs in a single line block.
2013-01-13Use line block in RST writer when a paragraph contains linebreaks.John MacFarlane1-4/+8
Previously linebreaks weren't supported in RST, since RST has no native linebreak construct.
2013-01-12Markdown writer: Support Ext_auto_identifiers and Ext_header_attributes.John MacFarlane1-13/+23
2013-01-12HTML reader: Added html5 tags to list of block-level tags.John MacFarlane1-5/+8
2013-01-12Escape `|` as `\vert` in LaTeX math.John MacFarlane1-2/+11
This avoids a clash with highlighting-kate's macros, which redefine | as a short verbatim delimiter. Thanks to Björn Peemöller for raising this issue.
2013-01-11Changed hspace in last commit to 0.333em.John MacFarlane1-1/+1
This seems to match the spacing of ~ exactly.
2013-01-11LaTeX writer: Use `\hspace*` for nonbreaking space after line break.John MacFarlane1-1/+13
Since `~` spaces after a line break are just ignored. Closes #687.
2013-01-11Revised EPUB writer given changes in header attributes.John MacFarlane1-7/+18
We need to ensure that all headers have attributes, since this is no longer guaranteed by hierarchicalize. Explicitly given headers remain the same.
2013-01-11EPUB writer: Download images at URLs.John MacFarlane1-3/+3
2013-01-11Refactoring:John MacFarlane5-33/+41
* Shared now exports fetchItem (instead of getItem) and openURL * fetchItem has different parameters than getItem and includes some logic formerly in the ODT and Docx writers * getItem still used in SelfContained
2013-01-11ODT, Docx writers: Properly handle URL refs for images.John MacFarlane2-20/+38
These images are now downloaded instead of being ignored (as used to happen in the docx reader) or causing an error (as used to happen in the odt reader).
2013-01-11ImageSize: Remove no longer exported readImageSize.John MacFarlane1-4/+1
2013-01-11Improvements to docx writer.John MacFarlane1-65/+63
Avoid reading image files again when we've already processed them.
2013-01-11Docx writer: Preliminary improvements.John MacFarlane1-18/+23
* Use getItem to fetch images, so we can get them over the net if they have absolute URLs. * Added TODO notes for cleaning up the logic.
2013-01-11ImageSize: Use strict, not lazy bytestrings.John MacFarlane1-2/+2
2013-01-11Move getItem from SelfContained to Share; export getItem.John MacFarlane2-30/+31
2013-01-11Options: Added githubMarkdownExtensions.John MacFarlane2-0/+17
Added github_markdown as input/output option.
2013-01-10Added markdown_phpextra as writer option.John MacFarlane1-0/+1
2013-01-10Options: Added phpMarkdownExtraExtensions.John MacFarlane2-9/+25
And added markdown_phpextra input/output format.
2013-01-10Implemented Ext_header_attributes.John MacFarlane2-12/+22
This allows explicit attributes to be put on headers, using a syntax like that for code blocks: {#id .class .class k=v k=v}
2013-01-10Ext_header_identifiers -> Ext_auto_identifiers.John MacFarlane2-3/+3
2013-01-09More improvements in emailAddress parser.John MacFarlane1-23/+17
2013-01-09Made email parser more correct.John MacFarlane1-12/+14
Now it's based on RFC 822, though it still doesn't implement quoted strings in email addresses.
2013-01-09Fixed regression in example lists.John MacFarlane1-6/+7
This was associated with e1be934dc2ff061e5200017d6503280686983e1f.
2013-01-09Added Attr field to Header.John MacFarlane24-86/+108
Previously header ids were autogenerated by the writers. Now they are generated (unless supplied explicitly) in the markdown parser, if the `header_identifiers` extension is selected. In addition, the textile reader now supports id attributes on headers.
2013-01-08Biblio: Make sure mvPunc and deNote run on metadata too.John MacFarlane1-4/+3
This fixed a bug with notes on titles using footnote styles.
2013-01-08Added / to the list of letters that can occur in a citation key.John MacFarlane1-1/+1
2013-01-07Implemented tight lists in context, latex writers.John MacFarlane2-13/+27
2013-01-07Shared: export isTightList.John MacFarlane1-0/+7
2013-01-06Don't put the text of an autolink in Code font.John MacFarlane14-22/+32
2013-01-06Mediawiki reader: interpret HTML in verbatim contexts as text.John MacFarlane1-2/+1
Thanks to Jeff Runningen for reporting that mediawiki works this way.
2013-01-06Docx writer: Use rIdNN identifiers for r:embed in images.John MacFarlane1-1/+1
2013-01-06UTF8 module: Remove `\r` when reading.John MacFarlane1-4/+7
This should prevent problems with extra CRs on windows.
2013-01-06Docx writer: Use separate footnotes.xml for notes.John MacFarlane1-21/+32
This seems to help LibreOffice convert the file, even though it was valid docx before. Note that the references in notes must be in word/_rels/footnotes.xml.rel. We handle this now by simply making that file contain all the references in word/_rels/document.xml.rel. Something better could be done eventually, but this works. Closes #637.
2013-01-05LaTeX tables: Use minipage rather than parbox for block containers.John MacFarlane1-2/+2
This allows verbatim code to be included in grid tables. Closes #663.