1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
|
Check that the commonmark reader handles the `ascii_identifiers`
extension properly.
```
% pandoc -f commonmark+gfm_auto_identifiers+ascii_identifiers -t native
# non ascii ⚠️ räksmörgås
^D
[ Header
1
( "non-ascii--raksmorgas" , [] , [] )
[ Str "non"
, Space
, Str "ascii"
, Space
, Str "\9888\65039"
, Space
, Str "r\228ksm\246rg\229s"
]
]
```
Note that the emoji here is actually a composite character,
formed from \9888 and \65039. The latter is a combining mark,
so it survives...
```
% pandoc -f commonmark+gfm_auto_identifiers-ascii_identifiers -t native
# non ascii ⚠️ räksmörgås
^D
[ Header
1
( "non-ascii-\65039-r\228ksm\246rg\229s" , [] , [] )
[ Str "non"
, Space
, Str "ascii"
, Space
, Str "\9888\65039"
, Space
, Str "r\228ksm\246rg\229s"
]
]
```
`gfm` should have `ascii_identifiers` disabled by default.
```
% pandoc -f gfm -t native
# non ascii ⚠️ räksmörgås
^D
[ Header
1
( "non-ascii-\65039-r\228ksm\246rg\229s" , [] , [] )
[ Str "non"
, Space
, Str "ascii"
, Space
, Str "\9888\65039"
, Space
, Str "r\228ksm\246rg\229s"
]
]
```
|