aboutsummaryrefslogtreecommitdiff
path: root/src/Text
AgeCommit message (Collapse)AuthorFilesLines
2014-10-25Docx reader: fix for Issue #1692 (i18n styles)Nikolay Yakimov2-56/+130
This patch builds paragraph styles tree, then checks if paragraph has style.styleId or style/name.val matching predetermined patterns. Works with "Heading#" (name.val="heading #") for headings and "Quote"|"BlockQuote"|"BlockQuotation" (name.val="Quote"|"Block Text") for block quotes.
2014-10-24Docx Writer: Partial fix for #1607Nikolay Yakimov1-7/+35
International heading styles are inferred based on `<w:name val="heading #">` fallback, if there are no en-US "Heading#" styles
2014-10-23HTML writer: Make header attributes work outside top level.John MacFarlane1-4/+14
Previously they only appeared on top level header elements. Now they work e.g. in blockquotes. Closes #1711.
2014-10-21DOCX writer: Look in user data dir for archive reference.docx.John MacFarlane1-1/+1
2014-10-21Merge pull request #1706 from tarleb/org-symbol-entitiesJohn MacFarlane1-1/+8
Org reader: parse LaTeX-style MathML entities
2014-10-21Merge pull request #1668 from gbataille/widthFromRef2John MacFarlane1-17/+36
Getting the page width from the reference file
2014-10-20Pretty: Make CR + BLANKLINE = BLANKLINE.John MacFarlane1-0/+3
This fixes an extra blank line we were getting at the end of markdown fragments (as well as rst, org, etc.) Closes #1705.
2014-10-20Org reader: parse LaTeX-style MathML entitiesAlbert Krewinkel1-1/+8
Org supports special symbols which can be included using LaTeX syntax, but are actually MathML entities. Examples for this are `\nbsp` (non-breaking space), `\Aacute` (the letter A with accent acute) or `\copy` (the copyright sign ©). This fixes #1657.
2014-10-19Parsing: fixed `inlineMath` so it handles `\text{..}` containing `$`.John MacFarlane1-1/+23
For example: `$x = \text{the $n$th root of $y$}`. Closes #1677.
2014-10-18Markdown reader: allow `startnum` to work without `fancy_lists`.John MacFarlane1-2/+2
Formerly `pandoc -f markdown-fancy_lists+startnum` did not work properly.
2014-10-18Merge pull request #1680 from shelf/masterJohn MacFarlane1-7/+26
Respect indent when parsing Org bullet lists
2014-10-18Merge pull request #1700 from tarleb/org-emphasis-fixJohn MacFarlane1-5/+5
Org reader: fix rules for emphasis recognition
2014-10-18Org reader: Drop COMMENT document treesAlbert Krewinkel1-1/+26
Document trees under a header starting with the word `COMMENT` are comment trees and should not be exported. Those trees are dropped silently. This closes #1678.
2014-10-18Org reader: fix rules for emphasis recognitionAlbert Krewinkel1-5/+5
Things like `/hello,/` or `/hi'/` were falsy recognized as emphasised strings. This is wrong, as `,` and `'` are forbidden border chars and may not occur on the inner border of emphasized text. This patch enables the reader to matches the reference implementation in that it reads the above strings as plain text.
2014-10-17Fix indent issue for definition listsTimothy Humphries1-14/+25
Tidy up fix for #1650, #1698 as per comments in #1680. Fix same issue for definition lists with the same method.
2014-10-15Escape spaces. Fixes jgm/pandoc#1694.Bjorn Buckwalter1-1/+3
2014-10-12Respect indent when parsing Org bullet listsTimothy Humphries1-2/+10
Fixes issue with top-level bullet list parsing. Previously we would use `many1 spaceChars` rather than respecting the list's indent level. We also permitted `*` bullets on unindented lists, which should unambiguously parse as `header 1`. Combined, this meant headers at a different indent level were being unwittingly slurped into preceding bullet lists, as per Issue #1650.
2014-10-08Merge pull request #1674 from freiric/masterJohn MacFarlane1-6/+4
fix inDirectory to reset to the original directory in case an exception ...
2014-10-08EPUB reader: Further URI handling improvements.John MacFarlane1-25/+12
Now we outsource most of the work to `fetchItem'`. Also, do not include queries in file extensions. Improves fix to #1671. It is possible that this will have some unexpected effects, so further testing would be good.
2014-10-08EPUB writer: correctly resolve relative URIs. (Closes #1671.)John MacFarlane1-11/+14
2014-10-08Fixed absolute URI detection in EPUB writer. Closes #1672.John MacFarlane1-2/+2
2014-10-08fix inDirectory to reset to the original directory in case an exception occursFreiric Barral1-6/+4
2014-10-08EPUB writer: Don't add sourceURL to absolute URIs!John MacFarlane1-5/+10
Closes #1669. If there are further issues, please open a new, targeted issue on the tracker. Some notes on the further issues you gestured at: Data URIs are indeed dereferenced, but why is this a problem? (The function being used to fetch from URLs is used for many different formats. Preserving data URIs would make sense in EPUBs, but not for e.g. PDF output. And by dereferencing we can get a smaller, more efficient EPUB, with the data stored as bytes in a file rather than encoded in textual representation.) "absolute uris are not recognized" -- I assume that is the problem just fixed. If not, please open a new issue. "relative uris are resolved (wrongly) like file paths" -- can you give an example? `<base>` tag is ignored. Yes. I didn't know about the base tag. Could you open a new issue just for this?
2014-10-05Getting the page width from the reference fileGrégory Bataille1-17/+36
Uses it to scale images that are too large. When there is no reference files, default to a US letter portrait size to scale the images
2014-10-04add mime type for WebVTTJason Ronallo1-0/+1
2014-10-04Added `track` to list of tags treated by `--self-contained`.John MacFarlane1-1/+2
Closes #1664.
2014-10-03cref, sepWikiwide1-0/+2
Adding inlineCommands
2014-09-30RST writer: Wrap line blocks with spaces before continuations.John MacFarlane1-1/+1
Improves on fix to #1656.
2014-09-29Don't wrap lines in rST line blocks.John MacFarlane1-1/+1
Closes #1656. Fixing pandoc to wrap the lines but insert spaces would be much more complicated. This at least makes the output semantically correct.
2014-09-27Merge pull request #1601 from jkr/windowsfixJohn MacFarlane1-1/+1
Fix path-slashes inside archive for windows
2014-09-27Merge pull request #1589 from mszep/masterJohn MacFarlane1-7/+17
Add function to sanitize ConTeXt labels
2014-09-27Org Reader: Parse multi-inline terms correctly in definition listMatthew Pickering1-1/+1
Closes #1649
2014-09-26Fix 'Ext_lists_without_preceding_blankline' bug.Artyom1-2/+9
* Fixes #1636. * Adds a test.
2014-09-25HTML Reader: Recognise <br> tags inside <pre> blocksmpickering1-1/+6
Closes #1620
2014-09-25HTML Writer: Don't double render when email-obfuscation=nonempickering1-5/+5
Closes #1625
2014-09-25Add support for KaTeX HTML mathmpickering2-0/+20
Closes #1626
2014-09-25HTML Writer: MathML now outputted with tex annotation.mpickering1-1/+15
Closes #1635
2014-09-25Shared: Make collapseFilePath OS-agnosticmpickering1-5/+9
2014-09-25MediaBag: Fixes Windows specific path problemsmpickering1-5/+5
Changes the internal representation to fix the problem. I haven't tested this on windows. Closes #1597
2014-09-18ConTeXt writer: add function toLabelMark Szepieniec1-7/+17
This function can be used to sanitize reference labels so that they do not contain any of the illegal characters \#[]",{}%()|= . Currently only Links have their labels sanitized, because they are the only Elements that use passed labels.
2014-09-11Docx writer: Renumber header and footer relationships to avoid collisions.Jesse Rosenthal1-12/+56
We previously took the old relationship names of the headers and footer in secptr. That led to collisions. We now make a map of availabl names in the relationships file, and then rename in secptr.
2014-09-09LaTeX writer: Protect graphics in headers.Jesse Rosenthal1-6/+15
Graphics in `\section`/`\subsection` etc titles need to be `\protect`ed. This adds a state value and manually turns it on before every invocation of `sectionHeader` and manually turns it off after. Using a writer value and applying `local` would probably be cleaner, but this fits with the current style.
2014-09-06Docx Reader: Remove header class properly in other langsJesse Rosenthal1-4/+4
When we encounter one of the polyglot header styles, we want to remove that from the par styles after we convert to a header. To do that, we have to keep track of the style name, and remove it appropriately.
2014-09-05Docx reader: Use polyglot header list.Jesse Rosenthal1-7/+7
We're just keeping a list of header formats that different languages use as their default styles. At the moment, we have English, German, Danish, and French. We can continue to add to this. This is simpler than parsing the styles file, and perhaps less error-prone, since there seems to be some variations, even within a language, of how a style file will define headers.
2014-09-05Docx Reader: Start list of polyglot section headers.Jesse Rosenthal1-0/+7
2014-09-04Org reader: Added state changing blanklines.Jesse Rosenthal1-1/+8
This allows us to emphasize at the beginning of a new paragraph (or, in general, after blank lines).
2014-09-04Docx reader: Rewrite rewriteLink to work with new headers.Jesse Rosenthal1-9/+4
There could be new top-level headers after making lists, so we have to rewrite links after that.
2014-09-04Docx reader: Single-item headers in ordered lists are headers.Jesse Rosenthal1-4/+6
When users number their headers, Word understands that as a single item enumerated list. We make the assumption that such a list is, in fact, a header.
2014-09-02Docx reader: Fix window path for image lookup.Jesse Rosenthal1-1/+1
Don't use os-sensitive "combine", since we always want the paths in our zip-archive to use forward-slashes.
2014-09-01EPUB writer: Don't include nav node in spine unless --toc was requested.John MacFarlane1-4/+2
Previously we included it in the spine with `linear="no"`, leading to odd results in some readers. Closes #1593.