<i18n dev> RFR: 8354266: Fix non-UTF-8 text encoding
Eirik Bjørsnøs
eirbjo at openjdk.org
Fri Apr 11 10:27:40 UTC 2025
On Fri, 11 Apr 2025 10:21:32 GMT, Magnus Ihse Bursie <ihse at openjdk.org> wrote:
>> src/demo/share/java2d/J2DBench/resources/textdata/arabic.ut8.txt line 11:
>>
>>> 9: [Arabic sample text; characters mis-encoded in the archive]
>>> 10:
>>> 11: [Arabic sample text; characters mis-encoded in the archive]
>>
>> Looks like most of the changes in java2d/* are related to spaces at the end of the line?
>
> No, those are just incidental changes (see https://github.com/openjdk/jdk/pull/24566#issuecomment-2795201480). The actual change for the java2d files is the removal of the initial UTF-8 BOM. GitHub has a hard time showing this, though, since the BOM is not visible.
I found the side-by-side diff in IntelliJ useful here, as it said "UTF-8 BOM" vs. "UTF-8".
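
For anyone who wants to check this without an IDE, here is a minimal sketch that reports whether a file starts with the UTF-8 BOM (the bytes EF BB BF, which encode U+FEFF). The class name BomCheck and its methods are hypothetical and not part of the PR; it assumes Java 11 or later.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;

public class BomCheck {
    // UTF-8 encoding of U+FEFF, the byte order mark.
    private static final byte[] UTF8_BOM = {(byte) 0xEF, (byte) 0xBB, (byte) 0xBF};

    // Returns true if the given leading bytes start with the UTF-8 BOM.
    public static boolean hasUtf8Bom(byte[] head) {
        if (head.length < UTF8_BOM.length) {
            return false;
        }
        for (int i = 0; i < UTF8_BOM.length; i++) {
            if (head[i] != UTF8_BOM[i]) {
                return false;
            }
        }
        return true;
    }

    // Reads at most the first three bytes of the file and checks them.
    public static boolean hasUtf8Bom(Path file) throws IOException {
        try (InputStream in = Files.newInputStream(file)) {
            return hasUtf8Bom(in.readNBytes(UTF8_BOM.length));
        }
    }

    public static void main(String[] args) throws IOException {
        // Each argument is treated as a path to a file to inspect.
        for (String arg : args) {
            Path p = Path.of(arg);
            System.out.println(p + ": " + (hasUtf8Bom(p) ? "UTF-8 BOM" : "no BOM"));
        }
    }
}
```

Running it against the old and new revisions of one of the textdata files should show the difference that the GitHub diff hides.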
-------------
PR Review Comment: https://git.openjdk.org/jdk/pull/24566#discussion_r2039263227