Age | Commit message | Author | Files | Lines | |
---|---|---|---|---|---|
2014-08-12 | Docx: More robust handling of multiple bookmarks in header. | Jesse Rosenthal | 1 | -6/+8 | |
2014-08-12 | Docx reader: Check for null-id'd anchors too. | Jesse Rosenthal | 1 | -1/+0 | |
Otherwise they get left dangling in the document. | |||||
2014-08-12 | Docx reader: accept explicit "Italic" and "Bold" rStyles. | Jesse Rosenthal | 2 | -18/+31 | |
Note that "Italic" can be on, and, from the last commit, `<w:i>` can be present, but be turned off. In that case, the turned-off tag takes precedence. So, we have to distinguish between something being off and something not being there. Hence, isItalic, isBold, isStrike, and isSmallCaps have become Maybes. | |||||
2014-08-12 | Docx reader: Add "BlockQuotation" to divs list. | Jesse Rosenthal | 1 | -1/+1 | |
2014-08-12 | Docx Reader: Fix font style parsing. | Jesse Rosenthal | 1 | -12/+27 | |
Before we just checked for the existence of a tag. Now, we make sure to check for its on/off value. | |||||
2014-08-12 | Merge pull request #1528 from mpickering/epubtitlepage | John MacFarlane | 1 | -4/+10 | |
EPUB Reader: Ignores titlepage attribute | |||||
2014-08-12 | EPUB Reader: Ignore title pages | Matthew Pickering | 1 | -4/+10 | |
2014-08-12 | DocBook: Support equations with mathml. | John MacFarlane | 1 | -4/+16 | |
equation, informalequation, inlineequation and mml:math elements. | |||||
2014-08-12 | Merge pull request #1524 from jkr/dropCap3 | John MacFarlane | 2 | -3/+11 | |
Docx reader: move dropcap combining logic to Reducible | |||||
2014-08-12 | Markdown reader: Improved parsing of indented code in list items. | John MacFarlane | 1 | -25/+42 | |
Indented code at the beginning of a list item must be indented eight spaces from the margin (or from the edge of the container), or four spaces past the list marker, whichever is farther. Some examples in `tests/markdown-reader-more.txt`. | |||||
2014-08-12 | Docx reader: move combining logic to Reducible | Jesse Rosenthal | 2 | -3/+11 | |
Introduces a new function in Reducibles, concatR. The idea is that if we have two lists of Reducibles (blocks or inlines), we can combine them and perform the reduction only on the joining parts (the last element of the first list and the first element of the second list). This is useful in cases where the two lists are already reduced, and we're only worried about the joining elements. This actually improves the efficiency a bit further, because concatR can be smart about empty lists. | |||||
2014-08-12 | Docx reader: Make dropcap combining more efficient. | Jesse Rosenthal | 1 | -1/+3 | |
Before, we had to run reduceList on the whole combined paragraph, which was redundant, and could take some time for long paragraphs. We only need to combine the drop cap with the first inline of the next paragraph. | |||||
2014-08-11 | Docx reader: combine inlines properly in dropcaps. | Jesse Rosenthal | 1 | -1/+1 | |
Make sure that adjacent inlines are combined properly in dropcaps. This updates the test results as well. | |||||
2014-08-11 | Docx reader: Use dropcap state. | Jesse Rosenthal | 1 | -9/+17 | |
If we get to a dropcap, we hold the inlines until the next paragraph and combine them there. | |||||
2014-08-11 | Add dropCap to paragraph style. | Jesse Rosenthal | 1 | -2/+12 | |
2014-08-11 | EPUB reader: use walk instead of bottomUp. | John MacFarlane | 1 | -2/+1 | |
This should be more efficient. | |||||
2014-08-11 | Merge pull request #1521 from jkr/emptyEmph | John MacFarlane | 1 | -5/+6 | |
Discard empty formatters | |||||
2014-08-11 | Merge pull request #1519 from mpickering/more | John MacFarlane | 1 | -1/+1 | |
EPUB Normalisation and anchors for div blocks in tex | |||||
2014-08-11 | Textile reader: list and HTML block parsing improvements. | John MacFarlane | 1 | -16/+13 | |
Closes #1513. Lists can now start without an intervening blank line. Also, html block-level tags that don't start a line are parsed as RawInline and don't interrupt paragraphs, as in RedCloth. | |||||
2014-08-11 | Docx reader: handle empty reducibles. | Jesse Rosenthal | 1 | -5/+6 | |
2014-08-11 | EPUB Reader: Fixed another normalisation problem. | Matthew Pickering | 1 | -1/+1 | |
2014-08-11 | Merge pull request #1516 from mpickering/epubmetadata | John MacFarlane | 1 | -6/+7 | |
EPUB improvements | |||||
2014-08-11 | Docx Parse: Improved font recognition when specified in rFonts element | Matthew Pickering | 1 | -8/+27 | |
2014-08-11 | Docx Fonts: Derives Show and Eq | Matthew Pickering | 1 | -0/+1 | |
2014-08-11 | EPUB Reader: Can now parse multiple meta data fields | Matthew Pickering | 1 | -2/+2 | |
2014-08-11 | EPUB reader: Fixed bug where filepaths weren't sufficiently normalised | Matthew Pickering | 1 | -4/+5 | |
2014-08-10 | Merge pull request #1510 from jkr/spacefix | John MacFarlane | 1 | -10/+12 | |
Docx reader: Fix spacing issue. | |||||
2014-08-10 | Removed OMath module, depend on texmath >= 0.8. | John MacFarlane | 2 | -439/+1 | |
2014-08-10 | Change head/tail to pattern guards. | Jesse Rosenthal | 1 | -7/+8 | |
2014-08-09 | Docx reader: Fix spacing issue. | Jesse Rosenthal | 1 | -9/+10 | |
Previously spaces at the beginning of Emph/Strong/etc were kept inside. This makes sure they are moved out. | |||||
2014-08-09 | Docx Parse: Recognises code points in sym elements which are in the private range | Matthew Pickering | 1 | -1/+4 | |
2014-08-09 | Added Text.Pandoc.Readers.Docx.Fonts | Matthew Pickering | 1 | -0/+237 | |
2014-08-09 | Docx Reader: Added recognition of sym element in paragraphs | Matthew Pickering | 1 | -0/+19 | |
2014-08-10 | EPUB: Fixed another mediabag related regression. | Matthew Pickering | 1 | -3/+5 | |
2014-08-09 | EPUB Reader: Changed image paths to be relative to manifest file | Matthew Pickering | 1 | -6/+6 | |
2014-08-08 | Merge branch 'newbranch' of https://github.com/mpickering/pandoc into mpickering-newbranch | John MacFarlane | 1 | -28/+19 | |
Conflicts: src/Text/Pandoc/Readers/EPUB.hs | |||||
2014-08-08 | Added `native_divs` and `native_spans` extensions. | John MacFarlane | 3 | -10/+9 | |
This allows users to turn off the default pandoc behavior of parsing contents of div and span tags in markdown and HTML as native pandoc Div blocks and Span inlines. Setting of default epub extensions has been moved from the EPUB reader to Text.Pandoc. | |||||
2014-08-08 | EPUB Reader: Improved robustness of image extraction | Matthew Pickering | 1 | -7/+9 | |
We now maintain the invariant that when fetchImages is called, all images have absolute paths. This patch fixes several bugs relating to this, as there are three places where images can be introduced: (1) during the HTML parse, (2) as spine elements, (3) as a cover image. For (1), the paths are corrected by the transformation renameImages. For (2) and (3), we need to append the "root" to the path we parse from the spine. | |||||
2014-08-08 | EPUB Reader: Fixed regressions in image extraction | Matthew Pickering | 1 | -7/+7 | |
Before, the images were relative to the position of the package file. The collapse function changed this so that they were absolute in the archive, but the fetchImages function wasn't updated to recognise this. | |||||
2014-08-08 | EPUB Reader: Use collapseFilePath | Matthew Pickering | 1 | -14/+3 | |
2014-08-08 | EPUB Reader: Removed incorrectly set reader flag | Matthew Pickering | 1 | -5/+9 | |
2014-08-08 | Merge pull request #1503 from jkr/streamlineMath | John MacFarlane | 3 | -12/+24 | |
OMath parser: Change signature of exported function. | |||||
2014-08-08 | OMath parser: Change signature of exported function. | Jesse Rosenthal | 3 | -12/+24 | |
This changes the signature of the exported `readOMML` to `String -> Either String [Exp]`, so it can now, in theory, be slotted into TeXMath. It doesn't have any real error reporting yet, but that might make more sense once I put it in a branch, and understand how it works in the other readers. It also now reads strings that parse to either oMath or oMathPara elements. Note that the distinction is lost in the output. It's up to the caller to remember the display type. | |||||
2014-08-08 | Textile reader: fixed list parsing bug. Closes #1500. | John MacFarlane | 1 | -5/+16 | |
2014-08-08 | Textile reader: don't allow inline formatting to extend over newline. | John MacFarlane | 1 | -1/+2 | |
This matches behavior of RedCloth, avoids some ugly bugs, and improves performance. | |||||
2014-08-08 | OMath: Finish initial cleanup. | Jesse Rosenthal | 2 | -536/+54 | |
This gets rid of commented-out functions, cleans up whitespace errors, and exports and imports the correct functions. | |||||
2014-08-08 | OMath: Remove Namespaces | Jesse Rosenthal | 2 | -287/+290 | |
We still need to test against prefixes, but this is only going to look at oMath fragments, so we're not going to be worried about looking up the real namespace. | |||||
2014-08-08 | OMath: Start phasing out internal OMath type. | Jesse Rosenthal | 1 | -0/+276 | |
This is the first step in removing the intermediate OMath type, which we no longer need since we're writing straight to TeXMath Exp. | |||||
2014-08-08 | OMath parser: don't group expressions if there's only one. | Jesse Rosenthal | 1 | -3/+3 | |
2014-08-08 | HTML EPUB exts: switch element can now be in either the inline or block position | Matthew Pickering | 1 | -9/+10 | |
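
The 2014-08-12 entries on font style parsing and on explicit "Italic" and "Bold" rStyles describe a three-way distinction: a tag can be on, off, or absent, and a turned-off tag takes precedence over the style. A minimal sketch of that idea follows. The names `toggle` and `resolveItalic` are hypothetical and do not come from the reader's source, and the set of values treated as "off" is an assumption about how on/off attributes are written.

```haskell
import Data.Maybe (fromMaybe)

-- | Interpret an on/off tag such as <w:i> (hypothetical helper).
--   Nothing          = the tag is absent
--   Just Nothing     = the tag is present with no value attribute
--   Just (Just v)    = the tag is present with value v
toggle :: Maybe (Maybe String) -> Maybe Bool
toggle Nothing        = Nothing      -- not there at all
toggle (Just Nothing) = Just True    -- bare tag counts as "on"
toggle (Just (Just v))
  | v `elem` ["0", "false", "off"] = Just False  -- assumed "off" spellings
  | otherwise                      = Just True

-- | An explicit tag (on or off) overrides what the named style says;
--   only when the tag is absent does the style decide.
resolveItalic :: Maybe Bool -> Bool -> Bool
resolveItalic explicitTag styleSaysItalic =
  fromMaybe styleSaysItalic explicitTag
```

With a plain `Bool` the "absent" and "off" cases would collapse into one value, which is why the commit message says that isItalic and its siblings became Maybes.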
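
The "move combining logic to Reducible" entry describes concatR as joining two already-reduced lists while reducing only at the seam. The sketch below illustrates that technique under stated assumptions: the `Reducible` class, its `merge` method, and the toy `Inl` type are stand-ins invented for this example, not the reader's actual definitions.

```haskell
-- | Hypothetical stand-in for the reader's Reducible class: try to merge
--   two adjacent elements, or return Nothing if they must stay separate.
class Reducible a where
  merge :: a -> a -> Maybe a

-- | Toy inline type used only for this illustration.
data Inl = Str String | Emph [Inl]
  deriving (Show, Eq)

instance Reducible Inl where
  merge (Str a)  (Str b)  = Just (Str (a ++ b))
  merge (Emph a) (Emph b) = Just (Emph (concatR a b))
  merge _        _        = Nothing

-- | Join two lists that are each already reduced. Only the last element
--   of the first list and the first element of the second are examined.
concatR :: Reducible a => [a] -> [a] -> [a]
concatR [] ys = ys          -- empty lists need no work at all
concatR xs [] = xs
concatR xs (y:ys) =
  case merge (last xs) y of -- xs is non-empty here, so last/init are safe
    Just z  -> init xs ++ (z : ys)
    Nothing -> xs ++ (y : ys)
```

For example, `concatR [Emph [Str "a"], Str "b"] [Str "c"]` evaluates to `[Emph [Str "a"], Str "bc"]`: only the two elements at the join are inspected, and the rest of each list is left alone.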
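
The 2014-08-09 "Fix spacing issue" entry says that spaces at the beginning of Emph/Strong/etc. are now moved out of the formatter. A small sketch of that transformation, assuming a toy `Inline` type and a hypothetical function name `hoistLeadingSpaces` (neither is taken from the reader's source):

```haskell
-- | Toy inline type used only for this illustration.
data Inline = Space | Str String | Emph [Inline]
  deriving (Show, Eq)

-- | Move leading spaces out of an Emph so they precede it instead.
--   If nothing but spaces was inside, the now-empty Emph is dropped.
hoistLeadingSpaces :: Inline -> [Inline]
hoistLeadingSpaces (Emph ils) =
  let (spaces, rest) = span (== Space) ils
  in  spaces ++ [Emph rest | not (null rest)]
hoistLeadingSpaces il = [il]
```

Applied to `Emph [Space, Str "word"]`, this yields `[Space, Emph [Str "word"]]`. Dropping an Emph that ends up empty is a choice made for this sketch, in the spirit of the "Discard empty formatters" merge listed above.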