Modify replacement properties of `encodeStringUtf8`/`decodeStringUtf8` #4928

hvr · 2017-12-03T21:55:36Z

This should finally address #4644

This changes `decodeStringUtf8` so that it no longer replaces U+FFFE and U+FFFF with U+FFFD, while `encodeStringUtf8` now replaces surrogate code points (i.e. U+D800 through U+DFFF, which cannot be encoded in well-formed UTF-8) with U+FFFD. Consequently, `decodeStringUtf8 . encodeStringUtf8` now round-trips all Unicode scalar values (i.e. [U+0000..U+D7FF] ∪ [U+E000..U+10FFFF]). This should finally address haskell#4644.
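To make the replacement properties described above concrete, here is a minimal, self-contained sketch that uses only `base`. It is not the code from this PR: `encodeSketch`, `decodeSketch`, `encodeChar` and `isSurrogate` are hypothetical names introduced for illustration, and the decoder's error handling (one U+FFFD per rejected lead byte) is an assumption, not necessarily what `decodeStringUtf8` does.

```haskell
module Main (main) where

import Data.Bits (shiftL, shiftR, (.&.), (.|.))
import Data.Char (chr, ord)
import Data.Word (Word8)

-- Hypothetical helper: True for surrogate code points U+D800..U+DFFF.
isSurrogate :: Char -> Bool
isSurrogate c = c >= '\xD800' && c <= '\xDFFF'

-- Sketch of an encoder: surrogates become U+FFFD before being encoded.
encodeSketch :: String -> [Word8]
encodeSketch = concatMap (encodeChar . sanitize)
  where
    sanitize c
      | isSurrogate c = '\xFFFD'
      | otherwise     = c

-- Encode one (non-surrogate) code point as 1 to 4 UTF-8 bytes.
encodeChar :: Char -> [Word8]
encodeChar c
  | n < 0x80    = [fromIntegral n]
  | n < 0x800   = [0xC0 .|. top 6, cont 0]
  | n < 0x10000 = [0xE0 .|. top 12, cont 6, cont 0]
  | otherwise   = [0xF0 .|. top 18, cont 12, cont 6, cont 0]
  where
    n = ord c
    top, cont :: Int -> Word8
    top s  = fromIntegral (n `shiftR` s)
    cont s = 0x80 .|. fromIntegral ((n `shiftR` s) .&. 0x3F)

-- Sketch of a decoder: malformed input yields U+FFFD, but the
-- noncharacters U+FFFE and U+FFFF are passed through unchanged.
decodeSketch :: [Word8] -> String
decodeSketch [] = []
decodeSketch (b : bs)
  | b < 0x80               = chr (fromIntegral b) : decodeSketch bs
  | b >= 0xC2 && b <= 0xDF = multi 1 (fromIntegral b .&. 0x1F) 0x80
  | b >= 0xE0 && b <= 0xEF = multi 2 (fromIntegral b .&. 0x0F) 0x800
  | b >= 0xF0 && b <= 0xF4 = multi 3 (fromIntegral b .&. 0x07) 0x10000
  | otherwise              = '\xFFFD' : decodeSketch bs
  where
    -- k continuation bytes expected; acc holds the lead byte's payload bits;
    -- minCp is the smallest code point allowed for this length (no overlongs).
    multi :: Int -> Int -> Int -> String
    multi k acc minCp =
      let (conts, rest) = splitAt k bs
      in if length conts == k && all isCont conts
           then let n = foldl step acc conts
                in (if valid minCp n then chr n else '\xFFFD') : decodeSketch rest
           else '\xFFFD' : decodeSketch bs
    isCont :: Word8 -> Bool
    isCont w = w .&. 0xC0 == 0x80
    step :: Int -> Word8 -> Int
    step a w = (a `shiftL` 6) .|. (fromIntegral w .&. 0x3F)
    valid :: Int -> Int -> Bool
    valid minCp n =
      n >= minCp && n <= 0x10FFFF && not (n >= 0xD800 && n <= 0xDFFF)

main :: IO ()
main = do
  let scalars = ['\x0' .. '\xD7FF'] ++ ['\xE000' .. '\x10FFFF']
  -- Every scalar value survives encode-then-decode.
  print (decodeSketch (encodeSketch scalars) == scalars)
  -- A lone surrogate is replaced by U+FFFD on encoding.
  print (decodeSketch (encodeSketch "\xD800") == "\xFFFD")
  -- U+FFFE and U+FFFF are preserved by the decoder.
  print (decodeSketch (encodeSketch "\xFFFE\xFFFF") == "\xFFFE\xFFFF")
```

Under these assumptions each of the three checks in `main` is intended to print `True`. The first mirrors the round-trip property claimed for `decodeStringUtf8 . encodeStringUtf8`.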

23Skidoo · 2017-12-04T08:31:32Z

Merged, thanks!

hvr added 2 commits December 3, 2017 22:31

Add unit-tests covering fromUTF8BS/toUTF8BS

b67871c

23Skidoo merged commit 6e1871a into haskell:master Dec 4, 2017

hvr deleted the pr/issue-4644 branch December 4, 2017 08:45

hvr mentioned this pull request Dec 4, 2017

Unit test Distribution.Utils.ShortText BinaryId fails #4644

Closed

hvr added this to the 2.2 milestone Dec 4, 2017
