RFR: 8195686: ISO-8859-8-i charset cannot be decoded, should be mapped to ISO-8859-8

Pratiksha.Sawant duke at openjdk.org
Wed Sep 18 08:01:18 UTC 2024


On Fri, 23 Aug 2024 10:38:38 GMT, Pratiksha.Sawant <duke at openjdk.org> wrote:

> Mapping ISO-8859-8-I charset to ISO-8859-8.
> Below mentioned 2 aliases are added as part of this:-
> **ISO-8859-8-I**
> **ISO8859-8-I**
> 
> The bug report for the same:- https://bugs.openjdk.org/browse/JDK-8195686

Based on our analysis, we've identified that the file “EncodingMap.java” includes an entry where "ISO-8859-8-I" is defined as an alias for "ISO8859_8." This entry is found in the headstream repository, and we believe it makes sense to include this in the charsets file as well. You can reference the relevant section here: [EncodingMap.java](https://github.com/openjdk/jdk/blob/5381f553ad61ddaa44d49c3039a05511cc68bdd0/src/java.xml/share/classes/com/sun/org/apache/xerces/internal/util/EncodingMap.java#L770).

Moreover, the original bug submitter has expressed agreement with our proposed solution, as noted in the discussion [here](https://github.com/openjdk/jdk/pull/20690#issuecomment-2354559575).

If we decide to create a new charset mapping for "ISO-8859-8-I," it would essentially mirror "ISO-8859-8," differing only in the naming convention. This would function similarly to creating an alias in the charsets file.

Therefore, we propose that this approach is valid and appropriate for implementation.

-------------

PR Comment: https://git.openjdk.org/jdk/pull/20690#issuecomment-2357768469


More information about the nio-dev mailing list