RFR: 8354273: Restore even more pointless unicode characters to ASCII [v2]

Eirik Bjørsnøs eirbjo at openjdk.org
Fri Apr 18 06:38:47 UTC 2025


On Thu, 10 Apr 2025 10:36:31 GMT, Magnus Ihse Bursie <ihse at openjdk.org> wrote:

>> As a follow-up to [JDK-8354213](https://bugs.openjdk.org/browse/JDK-8354213), I found some additional places where unicode characters are unnecessarily used instead of pure ASCII.
>
> Magnus Ihse Bursie has updated the pull request incrementally with one additional commit since the last revision:
> 
>   Remove incorrectly copied "§anchor"

While the changes here look okay, I think the issue/PR title could be improved.

The replacement of the Unicode "En Dash" with the ASCII hyphen-minus, and the similar replacement of the Unicode "Horizontal Ellipsis" with three ASCII periods, are not really "restoring" much. These Unicode characters are also hardly "pointless", as they may carry different semantic meaning, behavior and rendering.

Normalizing them into ASCII is a valid choice, but perhaps a title like "Normalize even more Unicode characters as ASCII" would be more "fair" to these poor Unicode characters :-)
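
To illustrate what such a normalization amounts to, here is a minimal sketch of the two substitutions mentioned above. The class `AsciiNormalizer` and its method `toAscii` are hypothetical names made up for this example; they are not part of the PR or of the JDK, and the PR itself edits source files directly rather than using code like this.

```java
// Hypothetical helper for illustration only; not taken from the PR.
class AsciiNormalizer {

    // Replaces U+2013 EN DASH with ASCII hyphen-minus and
    // U+2026 HORIZONTAL ELLIPSIS with three ASCII periods.
    static String toAscii(String text) {
        return text
                .replace("\u2013", "-")
                .replace("\u2026", "...");
    }

    public static void main(String[] args) {
        // "pages 10<EN DASH>12<HORIZONTAL ELLIPSIS>"
        String input = "pages 10\u201312\u2026";
        System.out.println(toAscii(input)); // prints "pages 10-12..."
    }
}
```

Note that the mapping is lossy: once normalized, a range written with an en dash can no longer be told apart from a plain hyphen, which is the kind of semantic difference referred to above.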

-------------

PR Comment: https://git.openjdk.org/jdk/pull/24567#issuecomment-2814667275


More information about the core-libs-dev mailing list