RFR: 8303018: Unicode Emoji Properties [v3]

Eirik Bjorsnos duke at openjdk.org
Wed Mar 15 20:46:21 UTC 2023


On Wed, 15 Mar 2023 18:21:11 GMT, Naoto Sato <naoto at openjdk.org> wrote:

>> Proposing accessor methods to Emoji properties defined in [Unicode Technical Standard #51](https://unicode.org/reports/tr51/) in `java.lang.Character` class. This is per a request from the client group, as well as refining the currently existing ad-hoc emoji implementation in regex. A CSR has also been drafted, and I would appreciate reviews for it too.
>
> Naoto Sato has updated the pull request incrementally with two additional commits since the last revision:
> 
>  - InternalError message/mask constants cleanup
>  - Indentation/print comment fix

Unrelated side note: Reviewing this PR inspired me to see if one could generate more efficient switch expressions for the CharacterDataLatin1 methods as replacement for the property lookup / masking done today. This seemed to give a small improvements on benchmarks and also collapsed a few methods to simply "return false". Could be something to explore at a later point.

-------------

PR: https://git.openjdk.org/jdk/pull/13006



More information about the build-dev mailing list