Full-width punctuation before the end of emphasis breaks the emphasis #490

ptmkenny · 2020-05-26T10:30:21Z

When I use emphasis (bold or italics) with Japanese full-width punctuation, if the punctuation comes immediately before the end of the emphasis, the parser ignores it.

CommonMark 1.4.3

What should happen: the bold/italics should be processed just like it is for English punctuation.

Example characters that cause this issue:

。！？、

Input:

**テスト。**テスト

**テスト**テスト

。**。テスト**テスト

。**テスト。**テスト

**テスト**。テスト

**テスト**テスト。

**テスト、**テスト

**テスト**テスト

、**、テスト**テスト

、**テスト、**テスト

**テスト**、テスト

**テスト**テスト、

**テスト。**テスト

**テスト**テスト

！**！テスト**テスト

！**テスト！**テスト

**テスト**！テスト

**テスト**テスト！

**テスト？**テスト

**テスト**テスト

？**？テスト**テスト

？**テスト？**テスト

**テスト**？テスト

**テスト**テスト？

Here's a screenshot of the output:

colinodell · 2020-05-26T14:00:03Z

Do you get the same results with other CommonMark parsers like https://spec.commonmark.org/dingus/?

ptmkenny · 2020-05-26T14:04:05Z

Hmm, yes, that URL returns the same incorrect output.

This Markdown is originally from Pandoc, which handles it just fine.

colinodell · 2020-05-26T14:22:10Z

The CommonMark spec does have some strict, complex rules around what should be considered valid emphasis, especially when dealing with punctuation: https://spec.commonmark.org/0.29/#emphasis-and-strong-emphasis

Because other CommonMark parsers are producing the same result, I would guess the issue is one of two things, either:

This behavior is valid and expected per the spec; or,
The spec does not take Japanese full-width punctuation into account

Either way, because this parser behaves the same as other CommonMark parsers, I don't think this is an issue with this library. I'd highly recommend taking this issue/question upstream:

They should be able to identify which of those two possibilities it is. And if something does need to change in the spec or reference implementation, we'll be sure to make corresponding changes here too.
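
For context on why these inputs fail, the delimiter-run rules in the linked section of the 0.29 spec can be sketched in Python roughly as follows. This is only an illustration, not the code of this library or of any reference implementation: the helper names (`is_whitespace`, `is_punctuation`, `flanking`, `closer_can_close`) are made up for the example, and it only covers the rules for `*`, not `_`.

```python
import string
import unicodedata


def is_whitespace(ch):
    """Unicode whitespace per the spec; None stands for start/end of line."""
    return ch is None or ch in "\t\n\f\r" or unicodedata.category(ch) == "Zs"


def is_punctuation(ch):
    """ASCII punctuation, or any character in a Unicode P* general category."""
    if ch is None:
        return False
    return ch in string.punctuation or unicodedata.category(ch).startswith("P")


def flanking(text, start, end):
    """Return (left_flanking, right_flanking) for the delimiter run text[start:end]."""
    before = text[start - 1] if start > 0 else None
    after = text[end] if end < len(text) else None
    left = not is_whitespace(after) and (
        not is_punctuation(after) or is_whitespace(before) or is_punctuation(before)
    )
    right = not is_whitespace(before) and (
        not is_punctuation(before) or is_whitespace(after) or is_punctuation(after)
    )
    return left, right


def closer_can_close(text):
    """Hypothetical helper: is the last `**` run in text right-flanking?"""
    start = text.rindex("**")
    return flanking(text, start, start + 2)[1]


# Print whether the final `**` of each sample may close strong emphasis.
for sample in ["**テスト。**テスト", "**テスト**。テスト", "**test.**test", "**test.** test"]:
    print(sample, closer_can_close(sample))
```

Under this sketch the closing `**` in `**テスト。**テスト` is not right-flanking: it is preceded by punctuation (`。` is in Unicode category Po) and followed by a letter, so it cannot close the emphasis. The same holds for the ASCII input `**test.**test`. English text usually avoids the problem only because a space follows the closing delimiter, as in `**test.** test`.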

ptmkenny · 2020-05-26T15:49:27Z

Thanks, I opened a new issue upstream!

colinodell · 2020-06-13T22:41:56Z

Because our implementation currently matches the spec, even though it doesn't handle your case as you'd expect, I'm going to close this issue. Please know that I will be tracking your upstream issue closely and will implement any resulting changes back here :)

colinodell added the question and spec compliance labels May 26, 2020

ptmkenny mentioned this issue May 26, 2020

Emphasis with CJK punctuation commonmark/commonmark-spec#650

Open

colinodell closed this as completed Jun 13, 2020
