path: root/src/Text/Pandoc/Readers/HTML
Age | Commit message | Author | Files | Lines
2021-11-24 | HTML reader: parse attributes on links and images. | John MacFarlane | 1 | -3/+4
Closes #6970.
2021-09-23 | HTML reader: handle empty tbody element in table. | John MacFarlane | 1 | -5/+8
Closes #7589.
2021-08-10 | HTML reader: treat comments as blank when parsing. | John MacFarlane | 1 | -5/+7
This modifies pBlank. Previously comments could sometimes flummox the parser. Closes #7482.
2021-07-06 | HTML reader: add col, colgroup to 'closes' definitions | John MacFarlane | 1 | -1/+3
2021-05-30 | HTML reader: fix column width regression. | John MacFarlane | 1 | -1/+1
Column widths specified with a style attribute were off by a factor of 100 in 2.14. Closes #7334.
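The kind of scaling involved can be illustrated with a minimal Python sketch. It is not pandoc's code: the function name `width_fraction_from_style` is hypothetical, and it assumes column widths are stored as fractions of the total width, so that a percentage must be divided by 100 exactly once.

```python
import re


def width_fraction_from_style(style):
    """Return the percentage `width` in a CSS style string as a fraction of 1.

    Returns None when no percentage width is present. Hypothetical helper,
    written only to illustrate the percent-to-fraction conversion.
    """
    # Match a `width` declaration at the start of the string or after a
    # semicolon, so that properties such as `max-width` are not picked up.
    match = re.search(r"(?:^|;)\s*width\s*:\s*([0-9]*\.?[0-9]+)\s*%", style)
    if match is None:
        return None
    # A percentage is converted to a fraction by dividing by 100 once;
    # skipping or repeating this step gives a result off by a factor of 100.
    return float(match.group(1)) / 100.0
```

Under these assumptions, `width_fraction_from_style("width: 25%")` gives `0.25`, while a style string with no percentage width gives `None`.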
2021-05-22 | Handle relative lengths (e.g. `2*`) in HTML column widths. | John MacFarlane | 1 | -14/+33
See <https://www.w3.org/TR/html4/types.html#h-6.6>. "A relative length has the form "i*", where "i" is an integer. When allotting space among elements competing for that space, user agents allot pixel and percentage lengths first, then divide up remaining available space among relative lengths. Each relative length receives a portion of the available space that is proportional to the integer preceding the "*". The value "*" is equivalent to "1*". Thus, if 60 pixels of space are available after the user agent allots pixel and percentage space, and the competing relative lengths are 1*, 2*, and 3*, the 1* will be alloted 10 pixels, the 2* will be alloted 20 pixels, and the 3* will be alloted 30 pixels." Closes #4063.
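The allotment rule quoted above can be sketched in a few lines of Python. This is an illustration of the HTML 4 rule only, not the reader's actual implementation; the names `parse_relative` and `allot_relative` are hypothetical, and pixel and percentage lengths are assumed to have been subtracted from the available space already.

```python
from fractions import Fraction


def parse_relative(spec):
    """Parse a relative length such as "2*" into its integer weight.

    A bare "*" is treated as "1*", as the HTML 4 text states.
    """
    spec = spec.strip()
    if not spec.endswith("*"):
        raise ValueError(f"not a relative length: {spec!r}")
    digits = spec[:-1]
    return int(digits) if digits else 1


def allot_relative(available, weights):
    """Divide `available` space among weights in proportion to each weight."""
    total = sum(weights)
    if total == 0:
        # Nothing competes for the space, so every share is zero.
        return [Fraction(0) for _ in weights]
    # Exact rational arithmetic avoids rounding drift between shares.
    return [Fraction(available) * w / total for w in weights]
```

With 60 pixels remaining and relative lengths `1*`, `2*`, and `3*`, `allot_relative(60, [parse_relative(s) for s in ["1*", "2*", "3*"]])` yields shares of 10, 20, and 30, matching the example in the quoted passage.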
2021-05-22 | Revert "HTML reader: simplify col width parsing" | John MacFarlane | 1 | -9/+13
This reverts commit f76fe2ab56606528d4710cc6c40bceb5788c3906.
2021-05-22 | HTML reader: simplify col width parsing | Albert Krewinkel | 1 | -13/+9
2021-03-19 | Protect partial uses of maximum with NonEmpty. | John MacFarlane | 1 | -1/+2
2021-03-15 | Remove an unneeded import | John MacFarlane | 1 | -1/+0
2021-03-15 | Use foldl' instead of foldl everywhere. | John MacFarlane | 1 | -0/+1
2021-01-08 | Update copyright notices for 2021 (#7012) | Albert Krewinkel | 4 | -5/+5
2020-12-10 | HTML reader: retain attribute prefixes and avoid duplicates. | John MacFarlane | 1 | -7/+13
Previously we stripped attribute prefixes, reading `xml:lang` as `lang` for example. This resulted in two duplicate `lang` attributes when `xml:lang` and `lang` were both used. This commit causes the prefixes to be retained, and also avoids invalid duplicate attributes. Closes #6938.
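A minimal Python sketch of the behavior described: prefixed names are kept as written, and repeated names are dropped. The helper `dedupe_attributes` is hypothetical, and keeping the first occurrence of a repeated name is an assumption; the commit message does not say which occurrence wins.

```python
def dedupe_attributes(attrs):
    """Drop repeated attribute names from a list of (name, value) pairs.

    Names are compared exactly as written, so "xml:lang" and "lang" are
    distinct. Assumption: the first occurrence of a name is the one kept.
    """
    seen = set()
    result = []
    for name, value in attrs:
        if name in seen:
            continue  # a later duplicate of an already-seen name is skipped
        seen.add(name)
        result.append((name, value))
    return result
```

Because the prefix is retained, `xml:lang` and `lang` no longer collapse into the same name, and only a literal repeat of `lang` is removed.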
2020-11-27 | HTML reader tests: improve test coverage of new features | Albert Krewinkel | 1 | -1/+2
2020-11-27 | HTML reader: support body headers, row head columns | Albert Krewinkel | 1 | -41/+61
Closes: #6312
2020-11-26 | HTML reader: improve support for table headers, footer, attributes | Albert Krewinkel | 2 | -66/+219
- `<tfoot>` elements are no longer added to the table body but used as table footer.
- Separate `<tbody>` elements are no longer combined into one.
- Attributes on `<thead>`, `<tbody>`, `<th>`/`<td>`, and `<tfoot>` elements are preserved.
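The three points above can be pictured with a small Python sketch. It does not reflect the reader's actual data types: the function `group_table_sections` and the dictionary layout are hypothetical, and each section is assumed to arrive as a `(tag, attrs, rows)` triple.

```python
def group_table_sections(sections):
    """Sort table sections into head, separate bodies, and foot.

    `sections` is a list of (tag, attrs, rows) triples in document order.
    Attributes stay attached to their section, each `tbody` remains its own
    entry, and `tfoot` becomes the footer instead of extra body rows.
    """
    table = {"head": None, "bodies": [], "foot": None}
    for tag, attrs, rows in sections:
        part = {"attrs": attrs, "rows": rows}
        if tag == "thead":
            table["head"] = part
        elif tag == "tfoot":
            table["foot"] = part
        elif tag == "tbody":
            table["bodies"].append(part)  # bodies are kept separate, not merged
        else:
            raise ValueError(f"unexpected table section: {tag}")
    return table
```

A table with two `<tbody>` elements and a `<tfoot>` therefore produces two entries under `bodies` and one under `foot`, each carrying its own attributes.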
2020-11-26 | HTML reader: allow finer grained options for tag omission | Albert Krewinkel | 2 | -12/+25
2020-11-24 | HTML reader: support row or column-spanning table cells | Albert Krewinkel | 1 | -16/+18
2020-11-24 | HTML reader: support blocks in caption | Albert Krewinkel | 1 | -5/+5
2020-11-24 | HTML reader: extract table parsing into separate module | Albert Krewinkel | 2 | -0/+137
2020-11-23 | HTML reader: extract submodules | Albert Krewinkel | 3 | -0/+331
Reducing module size should reduce memory use during compilation. This is preparatory work to tackle support for more table features.