<i18n dev> RFR: 8306031: Update IANA Language Subtag Registry to Version 2023-04-13 [v2]

Naoto Sato naoto at openjdk.org
Wed Apr 19 22:54:48 UTC 2023


On Wed, 19 Apr 2023 20:07:32 GMT, Justin Lu <jlu at openjdk.org> wrote:

>> Update the registry and accompanying tests with the **IANA 4/13/2023** update.
>> 
>> This update introduces the case where an IANA entry can have a preferred value, but that preferred value has a preferred value as well.
>> 
>> This causes unexpected failures in JDK tests because of how locale equivalencies are created.
>> 
>> eg: `ar-ajp` has a preferred value of `ajp` but `ajp` has a preferred value of `apc`
>> 
>> Normally, when the JDK is built, the generated _LocaleEquivalentMaps.java_ contains the following:
>> 
>> 
>> ...
>> singleEquivMap.put("ar-ajp", "ajp");
>> singleEquivMap.put("ajp", "ar-ajp");
>> ...
>> multiEquivsMap.put("ajp", new String[] {"apc", "ar-apc"});
>> multiEquivsMap.put("apc", new String[] {"ajp", "ar-apc"});
>> multiEquivsMap.put("ar-apc", new String[] {"apc", "ajp"});
>> ...
>> 
>> 
>> When `LocaleMatcher.parse(ACCEPT_LANGUAGE)` is called with `ACCEPT_LANGUAGE` containing `apc` and `ajp` in that order, the following occurs:
>> 
>> `apc` is found and added, and then all of `apc`'s equivalencies are added: `ajp` and `ar-apc`.
>> 
>> When parse iterates to `ajp`, it finds that `ajp` is already in the list and does not add its equivalency `ar-ajp`.
>> 
>> To address this, the build process must be adjusted so that the equivalencies are built as:
>> 
>> 
>> ...
>> multiEquivsMap.put("ajp", new String[] {"apc", "ar-ajp", "ar-apc"});
>> multiEquivsMap.put("apc", new String[] {"ajp", "ar-ajp", "ar-apc"});
>> multiEquivsMap.put("ar-ajp", new String[] {"apc", "ajp", "ar-apc"});
>> multiEquivsMap.put("ar-apc", new String[] {"apc", "ajp", "ar-ajp"});
>> ...
>> 
>> 
>> The reasoning is that if `ar-ajp` has a preferred value of `ajp`, and `ajp` has a preferred value of `apc`, then `ar-ajp` is also equivalent to `apc` and to all of `apc`'s equivalencies. This way, when `LocaleMatcher.parse(ACCEPT_LANGUAGE)` iterates to `apc`, it adds all of its equivalencies, including `ar-ajp`.
>
> Justin Lu has updated the pull request incrementally with one additional commit since the last revision:
> 
>   Copyright
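
The chained preferred values described in the quoted message can be illustrated with a small sketch. This is not the code in `EquivMapsGenerator.java`; the class and method names below are hypothetical, and the `ar-apc` to `apc` mapping is inferred from the map entries quoted above. It follows each preferred-value chain to its end and then makes every tag in the resulting group equivalent to every other tag in it:

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical illustration only; not the JDK build tool.
public class EquivClosureSketch {

    // Follow the preferred-value chain until a tag with no preferred value is reached.
    static String resolveRoot(Map<String, String> preferred, String tag) {
        String current = tag;
        int steps = 0;
        while (preferred.containsKey(current)) {
            current = preferred.get(current);
            // An acyclic chain can never be longer than the number of entries.
            if (++steps > preferred.size()) {
                throw new IllegalStateException("Cycle in preferred values at " + tag);
            }
        }
        return current;
    }

    // Map every tag to all other tags that share its final preferred value.
    static Map<String, Set<String>> equivalents(Map<String, String> preferred) {
        Map<String, Set<String>> groups = new LinkedHashMap<>();
        for (String tag : preferred.keySet()) {
            String root = resolveRoot(preferred, tag);
            Set<String> group = groups.computeIfAbsent(root, k -> new TreeSet<>());
            group.add(root);
            group.add(tag);
        }
        Map<String, Set<String>> result = new LinkedHashMap<>();
        for (Set<String> group : groups.values()) {
            for (String member : group) {
                Set<String> others = new TreeSet<>(group);
                others.remove(member);
                result.put(member, others);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        Map<String, String> preferred = new LinkedHashMap<>();
        preferred.put("ar-ajp", "ajp");  // from the example in the message
        preferred.put("ajp", "apc");     // from the example in the message
        preferred.put("ar-apc", "apc");  // assumption inferred from the quoted maps
        equivalents(preferred)
                .forEach((tag, equivs) -> System.out.println(tag + " -> " + equivs));
    }
}
```

With this input, each of the four tags ends up with the other three as equivalents, which matches the shape of the adjusted `multiEquivsMap` entries, although the ordering within each entry may differ.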

make/jdk/src/classes/build/tools/generatelsrequivmaps/EquivMapsGenerator.java line 2:

> 1: /*
> 2:  * Copyright (c) 2012, 2023, Oracle and/or its affiliates. All rights reserved.

Cannot comment on unmodified lines, but instead of calculating the initial capacity by hand, `HashMap.newHashMap()` can be used for initializing maps.
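
As a rough illustration of that suggestion (the class and method names are hypothetical, and the manual formula is only one common way of sizing a map, not necessarily what the generator does), `HashMap.newHashMap(int)`, available since JDK 19, takes the expected number of mappings and picks a suitable capacity itself:

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration only; requires JDK 19 or later.
public class NewHashMapSketch {

    // Manual sizing: derive a capacity from the expected entry count
    // and the default load factor of 0.75.
    static Map<String, String> manualSizing(int expectedEntries) {
        return new HashMap<>((int) (expectedEntries / 0.75f) + 1);
    }

    // Factory sizing: pass the expected entry count directly.
    static Map<String, String> factorySizing(int expectedEntries) {
        return HashMap.newHashMap(expectedEntries);
    }

    public static void main(String[] args) {
        Map<String, String> map = factorySizing(2);
        map.put("ar-ajp", "ajp");
        map.put("ajp", "apc");
        System.out.println(map.size());
    }
}
```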

make/jdk/src/classes/build/tools/generatelsrequivmaps/EquivMapsGenerator.java line 144:

> 142:                 boolean foundInOther = false;
> 143:                 final String finalPref = ","+preferred;
> 144:                 final String inbtwnPref = ","+preferred+",";

Could this use a regex instead?
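
One possible shape for such a regex, as a sketch only: it assumes the map values are comma-separated tag lists and that the two strings are used to detect `preferred` either at the end of a value or between two commas. The class and method names are hypothetical.

```java
import java.util.regex.Pattern;

// Hypothetical illustration only; not the code under review.
public class PrefRegexSketch {

    // True if `preferred` appears after a comma and is followed
    // by another comma or by the end of the string.
    static boolean containsPref(String joinedTags, String preferred) {
        Pattern pattern = Pattern.compile("," + Pattern.quote(preferred) + "(?:,|$)");
        return pattern.matcher(joinedTags).find();
    }

    public static void main(String[] args) {
        System.out.println(containsPref("ar-ajp,ajp", "ajp"));
        System.out.println(containsPref("apc,ajpx", "ajp"));
    }
}
```

`Pattern.quote` keeps characters in the tag from being interpreted as regex syntax, and the single pattern covers both the trailing and the in-between case.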

make/jdk/src/classes/build/tools/generatelsrequivmaps/EquivMapsGenerator.java line 146:

> 144:                 final String inbtwnPref = ","+preferred+",";
> 145:                 // Check if current pref exists inside a value for another pref
> 146:                 List<StringBuilder> doublePrefs = initialLanguageMap.entrySet()

`values()` fits here

make/jdk/src/classes/build/tools/generatelsrequivmaps/EquivMapsGenerator.java line 150:

> 148:                                 e.getValue().toString().contains(inbtwnPref)))
> 149:                         .map(Map.Entry::getValue)
> 150:                         .collect(Collectors.toList());

`collect(Collectors.toList())` can be replaced with `toList()`.
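
Taken together, the `values()` and `toList()` suggestions might look roughly like the following sketch. The class and method names are hypothetical, the `endsWith` check is an assumption about the part of the filter not shown in the quoted lines, and `Stream.toList()` requires JDK 16 or later.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; not the code under review.
public class ValuesToListSketch {

    // Collect every value that mentions `preferred` after a comma.
    static List<StringBuilder> findContaining(Map<String, StringBuilder> languageMap,
                                              String preferred) {
        final String finalPref = "," + preferred;
        final String inbtwnPref = "," + preferred + ",";
        return languageMap.values().stream()      // no need for entrySet() plus map(getValue)
                .filter(v -> v.toString().endsWith(finalPref)   // assumed check
                        || v.toString().contains(inbtwnPref))
                .toList();                        // replaces collect(Collectors.toList())
    }

    public static void main(String[] args) {
        Map<String, StringBuilder> languageMap = new LinkedHashMap<>();
        languageMap.put("apc", new StringBuilder("apc,ajp"));
        languageMap.put("zh", new StringBuilder("zh,cmn"));
        System.out.println(findContaining(languageMap, "ajp").size());
    }
}
```

One difference to keep in mind is that `Stream.toList()` returns an unmodifiable list, so this swap only fits when the resulting list itself is not added to or removed from later. The `StringBuilder` elements remain mutable either way.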

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/13501#discussion_r1171923706
PR Review Comment: https://git.openjdk.org/jdk/pull/13501#discussion_r1171920639
PR Review Comment: https://git.openjdk.org/jdk/pull/13501#discussion_r1171922562
PR Review Comment: https://git.openjdk.org/jdk/pull/13501#discussion_r1171923033

