Unicode script support in Regex and Character class

Ulf Zibis Ulf.Zibis at gmx.de
Mon Apr 26 09:23:20 UTC 2010


Am 26.04.2010 07:28, schrieb Xueming Shen:
>
> Can I assume we are all OK with at least the API part of the latest 
> webrev/blenderrev of
> the script support in j.l.Character and j.u.r.Pattern, including the 
> j.l.Chareacter.getName().

I guess you mean:
     public static enum UnicodeScript {
         COMMON,
         ...;
         public static UnicodeScript of(int codePoint);
         public static final UnicodeScript forName(String scriptName);
     }
     public static String getName(int codePoint);

I'm ok with this api on enum base.

I would like to see the full names redundantly in the aliases map. Needs 
only ~100 * (4 + 4) bytes in HashMap<String, Character.UnicodeScript>.
I think there should be some more words in the javadoc about 
correlation/usecase/advantage of UnicodeScript against against UnicodeBlock.
I would like to have the 3 special cases INHERITED, COMMON and UNKNOWN 
together at the beginning or end of the enum list.


>
> http://cr.openjdk.java.net/~sherman/script/blenderrev.html
> http://cr.openjdk.java.net/~sherman/script/webrev
>
> Okutsu-san, Yuka, can one of you help review the corresponding CCC at
> http://ccc.sfbay.sun.com/6945564?

I get server not found: ccc.sfbay.sun.com

-Ulf





More information about the core-libs-dev mailing list