aboutsummaryrefslogtreecommitdiff
path: root/test/command/4742.md
blob: 45f42db5a2f24496c8629f2da98134cd59c84d86 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
Check that the commonmark reader handles the `ascii_identifiers`
extension properly.

```
% pandoc -f commonmark+gfm_auto_identifiers+ascii_identifiers -t native
# non ascii ⚠️ räksmörgås
^D
[ Header 1
  ( "non-ascii--raksmorgas", [], [] )
  [ Str "non"
  , Space
  , Str "ascii"
  , Space
  , Str "\9888\65039"
  , Space
  , Str "r\228ksm\246rg\229s"
  ]
]
```

Note that the emoji here is actually a composite character,
formed from \9888 and \65039. The latter is a combining mark,
so it survives...

```
% pandoc -f commonmark+gfm_auto_identifiers-ascii_identifiers -t native
# non ascii ⚠️ räksmörgås
^D
[ Header 1
  ( "non-ascii-\65039-r\228ksm\246rg\229s", [], [] )
  [ Str "non"
  , Space
  , Str "ascii"
  , Space
  , Str "\9888\65039"
  , Space
  , Str "r\228ksm\246rg\229s"
  ]
]
```

`gfm` should have `ascii_identifiers` disabled by default.

```
% pandoc -f gfm -t native
# non ascii ⚠️ räksmörgås
^D
[ Header 1
  ( "non-ascii-\65039-r\228ksm\246rg\229s", [], [] )
  [ Str "non"
  , Space
  , Str "ascii"
  , Space
  , Str "\9888\65039"
  , Space
  , Str "r\228ksm\246rg\229s"
  ]
]
```