RFR: 8291660: Grapheme support in BreakIterator [v3]

Stuart Marks smarks at openjdk.org
Fri Aug 26 21:00:55 UTC 2022


On Fri, 26 Aug 2022 18:12:04 GMT, Naoto Sato <naoto at openjdk.org> wrote:

>> This is to enhance the character break analysis in `java.text.BreakIterator` to conform to the extended grapheme cluster boundaries defined in https://www.unicode.org/reports/tr29/#Grapheme_Cluster_Boundaries. A corresponding CSR has also been drafted, as there will be behavioral changes with this modification.
>
> Naoto Sato has updated the pull request incrementally with one additional commit since the last revision:
> 
>   Reverting the fix to BreakIterator.isBoundary()

src/java.base/share/classes/java/text/BreakIterator.java line 107:

> 105:  * <a href="https://www.unicode.org/reports/tr29/#Grapheme_Cluster_Boundaries">
> 106:  * Grapheme Cluster Boundaries</a> section in the Unicode Standard Annex #29.
> 107:  *

I would say this should be an `@implSpec` rather than an `@implNote`, because we want applications to be able to rely on this behavior when they use the default implementation. Also, please make the corresponding update to the CSR.
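
For context, here is a minimal sketch of the kind of application code that would depend on this behavior. It only uses the existing `BreakIterator.getCharacterInstance(Locale)` API; the class name `GraphemeSketch` and the helper `clusters` are made up for illustration and are not part of the PR. How a given string is split (for example, an emoji ZWJ sequence) depends on whether the JDK in use includes this change, so no expected output is shown for such inputs.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper class, for illustration only.
public class GraphemeSketch {

    // Splits text at the boundaries reported by the default
    // character-instance BreakIterator.
    static List<String> clusters(String text) {
        BreakIterator it = BreakIterator.getCharacterInstance(Locale.ROOT);
        it.setText(text);
        List<String> out = new ArrayList<>();
        int start = it.first();
        // next() returns BreakIterator.DONE once the end of the text is passed.
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            out.add(text.substring(start, end));
        }
        return out;
    }

    public static void main(String[] args) {
        // Plain ASCII: one boundary per char.
        System.out.println(clusters("abc"));
        // Family emoji built from a ZWJ sequence; the number of pieces
        // reported here depends on the JDK's grapheme support.
        String family = "\uD83D\uDC68\u200D\uD83D\uDC69\u200D\uD83D\uDC67";
        System.out.println(clusters(family).size());
    }
}
```

If the behavior is documented as `@implSpec`, code like this can treat the boundaries returned by the default implementation as specified rather than incidental.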

-------------

PR: https://git.openjdk.org/jdk/pull/9991

