aboutsummaryrefslogtreecommitdiff
AgeCommit message (Collapse)AuthorFilesLines
2021-02-28LaTeX reader efficiency improvements.John MacFarlane1-31/+42
In conjunction with other changes this makes the reader almost twice as fast on our benchmark as it was on Feb. 10.
2021-02-28Move setDefaultLanguage to T.P.Readers.LaTeX.Lang.John MacFarlane2-16/+22
2021-02-28LaTeX reader: remove two unnecessary parsers in inline.John MacFarlane1-2/+0
These are handled anyway by regularSymbol.
2021-02-28trypandoc: add 2 second timeout.John MacFarlane2-2/+3
2021-02-28Factor out T.P.Readers.LaTeX.Citation.John MacFarlane4-186/+232
2021-02-27Factor out T.P.Readers.LaTeX.Table.John MacFarlane4-363/+412
2021-02-27Split off T.P.Readers.LaTeX.Accent.John MacFarlane3-60/+87
To help reduce memory demands compiling the main LaTeX reader.
2021-02-27Add xml-conduit to stack.yaml extra-deps.John MacFarlane1-0/+1
2021-02-27Use skylighting 0.10.4.John MacFarlane2-4/+4
This version of skylighting uses xml-conduit rather than hxt. This speeds up parsing of XML syntax definitions fourfold, and removes four packages from pandoc's dependency graph: hxt-charproperties hxt-unicode hxt-regex-xmlschema hxt
2021-02-27Lua: use strict evaluation when retrieving AST value from the stackAlbert Krewinkel1-79/+77
Fixes: #6674
2021-02-26Add tools/parseTimings.pl.John MacFarlane1-0/+33
A script to help pin down which modules take the most time and memory to compile.
2021-02-26CI: hide successes in linux test output.John MacFarlane1-1/+1
2021-02-26Update tests for changes to https URLs.John MacFarlane6-6/+6
2021-02-26Fix/update URLs and use HTTP**S** where possible (#7122)Salim B17-48/+47
2021-02-26Wrap url colours in braces in the default LaTeX template (#7121)Loïc Grobol1-4/+4
2021-02-22CI: Add 'benchmark' to benchmark cache key.John MacFarlane1-2/+2
For benchmark we build with optimizations, so the regular cache is useless.
2021-02-22Use latest skylighting.John MacFarlane2-2/+4
2021-02-22Remove weigh-pandoc.John MacFarlane2-56/+0
It's not really useful any more, now that our regular benchmarks include data on allocation.
2021-02-22benchmark CI improvements.John MacFarlane1-3/+3
- don't build tests, just benchmarks. - get allocation info.
2021-02-22Rename benchmark in CI.John MacFarlane1-1/+1
2021-02-22Add linux benchmark to CI.John MacFarlane1-0/+56
2021-02-22T.P.CSV: fix parsing of unquoted values.John MacFarlane2-2/+16
Previously we didn't allow unescaped quotes in unquoted values, but they are allowed. Closes #7112.
2021-02-22tests: print accurate location if a test failsAlbert Krewinkel1-1/+1
Ensures that tasty-hunit reports the location of the failing test instead of the location of the helper `test` function.
2021-02-22Fall back to latin1 if UTF-8 decoding fails...John MacFarlane1-1/+7
...when handling URL argument served with no charset in the mime type. The assumption is that most pages that don't specify a charset in the mime type are either UTF-8 or latin1. I think that's a good assumption, though I'm not sure.
2021-02-22When downloading content from URL arguments, be sensitive to...John MacFarlane1-1/+9
the character encoding. We can properly handle UTF-8 and latin1 (ISO-8859-1); for others we raise an error. See #5600.
2021-02-22T.P.Error: Add PandocUnsupportedCharsetError constructor...John MacFarlane2-0/+5
...for PandocError. [API change]
2021-02-22Text.Pandoc.MIME: add exported function getCharset.John MacFarlane1-2/+15
[API change]
2021-02-22Text.Pandoc.UTF8: change IO functions to return Text, not String.John MacFarlane11-68/+70
[API change] This affects `readFile`, `getContents`, `writeFileWith`, `writeFile`, `putStrWith`, `putStr`, `putStrLnWith`, `putStrLn`. `hPutStrWith`, `hPutStr`, `hPutStrLnWith`, `hPutStrLn`, `hGetContents`. This avoids the need to uselessly create a linked list of characters when emiting output.
2021-02-21Update changelog.John MacFarlane1-0/+355
2021-02-21Fix changelog-helper.shJohn MacFarlane1-4/+11
2021-02-21LaTeX reader: further optimizations in satisfyTok.John MacFarlane1-5/+5
Benchmarks show 2/3 of the run time and 2/3 of the allocation of the Feb. 10 benchmarks.
2021-02-21LaTeX reader: removed sExpanded in state.John MacFarlane1-7/+2
This isn't actually needed and checking it doesn't change anything. Also remove an unnecessary `doMacros` before `satisfyTok`, which does it anyway.
2021-02-21LaTeX reader: further performance optimization.John MacFarlane1-23/+19
Avoid unnecessary 'doMacros'.
2021-02-20HTML reader: small performance tweak.John MacFarlane1-9/+5
2021-02-20T.P.Shared: remove some obsolete functions [API change].John MacFarlane1-43/+1
Removed: - `splitByIndices` - `splitStringByIndicies` - `substitute` - `underlineSpan` None of these are used elsewhere in the code base.
2021-02-20HTML reader: small efficiency improvements.John MacFarlane1-25/+18
Also, remove exported class NamedTag(..) [API change]. This was just intended to smooth over the transition from String to Text and is no longer needed. The functions isInlineTag and isBlockTag are no longer polymorphic.
2021-02-20LaTeX reader: Another small improvement to macro handling.John MacFarlane1-4/+3
2021-02-20LaTeX reader: avoid macro resolution code if no macros defined.John MacFarlane1-16/+19
2021-02-20T.P.Readers.LaTeX.Parsing: improve braced'.John MacFarlane1-16/+13
Remove the parameter, have it parse the opening brace, and make it more efficient.
2021-02-20revealjs writer: add 'center' option for vertical slide centering.maurerle1-0/+4
Closes #7104.
2021-02-20Benchmark improvements.John MacFarlane1-4/+17
+ Run writer benchmarks for binary formats too. + Alphabetize benchmarks. + Don't run benchmarks for bibliography formats (yet; we need a special input for them).
2021-02-20HTML reader: efficiency improvements.John MacFarlane1-81/+129
Do a lookahead to find the right parser to use. Benchmarks from 34ms to 23ms, with less allocation. Also speeds up the epub reader.
2021-02-19MANUAL: block-level formatting is not allowed in line blocks.John MacFarlane1-0/+3
Closes #7107.
2021-02-19make bench: compare against a baseline, use datestamps for bench results.John MacFarlane1-1/+8
2021-02-19Update default ODT styleLorenzo1-1/+1
As of now, the default style for ODT documents has a "First paragraph" style that inherits from "Standard" style and has no top or bottom margin. All subsequent paragraphs have "Text_20_body" style that inherits from "Standard" and add "0.0598in" margins on top and bottom. This makes the final document a bit ugly since the first paragraph has a small gap ("0.0598in") towards the second one, and all subsequent have double that. The proposed fix makes "First paragraph" inherit from "Text_20_body" instead so that it also has a consistent margin. Another approach would be to inherit "Text_20_body" and add a 0 margin on top.
2021-02-19Clarify `tex_math_dollars` extension.John MacFarlane1-1/+2
Note that no blank lines are allowed between the delimiters in display math.
2021-02-18'make bench': Create csv files for comparison.John MacFarlane3-3/+4
2021-02-18DocBook, JATS, OPML readers: performance optimization.John MacFarlane3-64/+8
With the new XML parser, we can avoid the expensive tree normalization step we used to do. This gives a significant speed boost in docbook and JATS parsing (e.g. 9.7 to 6 ms).
2021-02-18T.P.XML Improve fromEntities.John MacFarlane1-17/+13
2021-02-18T.P.PDF: disable `smart` when building PDF via LaTeX.John MacFarlane1-1/+5
This is to prevent accidental creation of ligatures like `` ?` `` and `` !` `` (especially in languages with quotations like German), and similar ligature issues. See jgm/citeproc#54.