aboutsummaryrefslogtreecommitdiff
path: root/test/command/4742.md
blob: a91dffb332f6ab9e5b470b097ab12e7f08593151 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
Check that the commonmark reader handles the `ascii_identifiers`
extension properly.

```
% pandoc -f commonmark+gfm_auto_identifiers+ascii_identifiers -t native
# non ascii ⚠️ räksmörgås
^D
[ Header
    1
    ( "non-ascii--raksmorgas" , [] , [] )
    [ Str "non"
    , Space
    , Str "ascii"
    , Space
    , Str "\9888\65039"
    , Space
    , Str "r\228ksm\246rg\229s"
    ]
]
```

Note that the emoji here is actually a composite character,
formed from \9888 and \65039. The latter is a combining mark,
so it survives...

```
% pandoc -f commonmark+gfm_auto_identifiers-ascii_identifiers -t native
# non ascii ⚠️ räksmörgås
^D
[ Header
    1
    ( "non-ascii-\65039-r\228ksm\246rg\229s" , [] , [] )
    [ Str "non"
    , Space
    , Str "ascii"
    , Space
    , Str "\9888\65039"
    , Space
    , Str "r\228ksm\246rg\229s"
    ]
]
```

`gfm` should have `ascii_identifiers` disabled by default.

```
% pandoc -f gfm -t native
# non ascii ⚠️ räksmörgås
^D
[ Header
    1
    ( "non-ascii-\65039-r\228ksm\246rg\229s" , [] , [] )
    [ Str "non"
    , Space
    , Str "ascii"
    , Space
    , Str "\9888\65039"
    , Space
    , Str "r\228ksm\246rg\229s"
    ]
]
```