RFR: JDK-8032012, , String.toLowerCase/toUpperCase performance improvement

older
Re: RFR: JDK-8032012, ,...

Xueming Shen

16 Jan 2014 16 Jan '14

6:08 p.m.

Hi, The proposed changeset is to improve the performance (both speed and memory usage) of String.toLowerCase/toUpperCase, by (1) to separate the "most likely" use scenario (non supplementary character, no special case mapping handling) into simple/quick iteration loop code path (2) to use the package private constructor String(char[], boolean) to avoid unnecessary target char[] copy in fast-path case. (3) some tiny code cleanup. Since it's supposed to be a simple/quick round of code cleanup, I did not go too far to optimize the supplementary/special mapping code. The webrev is http://cr.openjdk.java.net/~sherman/8032012/ My small non-scientific speed measurement test suggests we can get 25% to 40% speed boost. http://cr.openjdk.java.net/~sherman/str_toCase/TestToCase.java Thanks! -Sherman

Show replies by date

Ulf Zibis

16 Jan 16 Jan

7:42 p.m.

New subject: RFR: JDK-8032012, , String.toLowerCase/toUpperCase performance improvement

Hi Sherman, nice performance trick :-) Do you remember the discussion about : https://bugs.openjdk.java.net/browse/JDK-6939278 ;-) (this bug was originally filed by me!) -Ulf Am 16.01.2014 19:08, schrieb Xueming Shen:

...

Hi,

The proposed changeset is to improve the performance (both speed and memory usage) of String.toLowerCase/toUpperCase, by

(1) to separate the "most likely" use scenario (non supplementary character, no special case mapping handling) into simple/quick iteration loop code path (2) to use the package private constructor String(char[], boolean) to avoid unnecessary target char[] copy in fast-path case. (3) some tiny code cleanup.

Since it's supposed to be a simple/quick round of code cleanup, I did not go too far to optimize the supplementary/special mapping code.

The webrev is

http://cr.openjdk.java.net/~sherman/8032012/

My small non-scientific speed measurement test suggests we can get 25% to 40% speed boost.

http://cr.openjdk.java.net/~sherman/str_toCase/TestToCase.java

Thanks! -Sherman

Paul Sandoz

20 Jan 20 Jan

5:24 p.m.

On Jan 16, 2014, at 7:08 PM, Xueming Shen <xueming.shen@oracle.com> wrote:

...

Hi,

The proposed changeset is to improve the performance (both speed and memory usage) of String.toLowerCase/toUpperCase, by

(1) to separate the "most likely" use scenario (non supplementary character, no special case mapping handling) into simple/quick iteration loop code path (2) to use the package private constructor String(char[], boolean) to avoid unnecessary target char[] copy in fast-path case. (3) some tiny code cleanup.

Since it's supposed to be a simple/quick round of code cleanup, I did not go too far to optimize the supplementary/special mapping code.

The webrev is

http://cr.openjdk.java.net/~sherman/8032012/

Some quick comments. In String.toLowerCase: - it would be nice to get rid of the pseudo goto using the "scan" labelled block. - there is no need for the "localeDependent" local variable. - you might be able to optimize by doing (could depend on the answer to the next point): int c = (int)value[i]; int lc = Character.toLowerCase(c); if (.....) { result[i] = (char)lc; } else { return toLowerCaseEx(result, i, locale, localeDependent); } - Do you need to check ERROR for the result of toLowerCase? 2586 if (c == Character.ERROR || Paul.

Xueming Shen

21 Jan 21 Jan

10:05 p.m.

New subject: RFR: JDK-8032012, , String.toLowerCase/toUpperCase performance improvement

Hi Paul, thanks for reviewing the changeset, comment inlined below. On 01/20/2014 09:24 AM, Paul Sandoz wrote:

...

Some quick comments.

In String.toLowerCase:

- it would be nice to get rid of the pseudo goto using the "scan" labelled block.

webrev has been updated to remove the pseudo goto by checking the "first" against "len" after the loop break.

...

- there is no need for the "localeDependent" local variable.

I just tried to not throw away the result of this "localeDependent", which is still needed in the toXCaseEx() methods.

...

- you might be able to optimize by doing (could depend on the answer to the next point):

int c = (int)value[i]; int lc = Character.toLowerCase(c); if (.....) { result[i] = (char)lc; } else { return toLowerCaseEx(result, i, locale, localeDependent); }

- Do you need to check ERROR for the result of toLowerCase?

2586 if (c == Character.ERROR ||

Yes, Character.toLowerCase() should never return ERROR (while the package private Character.toUpperCaseEx() will). In theory there is no need to check if the return value of Character.toUpperCase(int) > min_supplementary_code_point in our loop, because there is no bmp character returns a supplementary code point as its lower case. But since it's a data driven mapping table, there is no guarantee the unicode data table is not going to change in the "future", so I still keep the check. The webrev has been updated to use one single "if" inside the loop. I have a "single if" implementation for the toUpperCase() as well, though don't sure which one is better/faster :-) http://cr.openjdk.java.net/~sherman/8032012/ -Sherman

Paul Sandoz

22 Jan 22 Jan

3:20 p.m.

On Jan 21, 2014, at 11:05 PM, Xueming Shen <xueming.shen@oracle.com> wrote:

...

Hi Paul, thanks for reviewing the changeset, comment inlined below.

On 01/20/2014 09:24 AM, Paul Sandoz wrote:

...
Some quick comments.

In String.toLowerCase:

- it would be nice to get rid of the pseudo goto using the "scan" labelled block.

webrev has been updated to remove the pseudo goto by checking the "first" against "len" after the loop break.

Much for readable :-)

...

...
- there is no need for the "localeDependent" local variable.

I just tried to not throw away the result of this "localeDependent", which is still needed in the toXCaseEx() methods.

if (lang == "tr" || lang == "az" || lang == "lt") { // local dependent return toLowerCaseEx(result, firstUpper, locale, true); } // otherwise false is passed to subsequent calls to toLowerCaseEx ?

...

...
- you might be able to optimize by doing (could depend on the answer to the next point):

int c = (int)value[i]; int lc = Character.toLowerCase(c); if (.....) { result[i] = (char)lc; } else { return toLowerCaseEx(result, i, locale, localeDependent); }

- Do you need to check ERROR for the result of toLowerCase?

2586 if (c == Character.ERROR ||

Yes, Character.toLowerCase() should never return ERROR (while the package private Character.toUpperCaseEx() will). In theory there is no need to check if the return value of Character.toUpperCase(int) > min_supplementary_code_point in our loop, because there is no bmp character returns a supplementary code point as its lower case. But since it's a data driven mapping table, there is no guarantee the unicode data table is not going to change in the "future", so I still keep the check. The webrev has been updated to use one single "if" inside the loop.

I have a "single if" implementation for the toUpperCase() as well, though don't sure which one is better/faster :-)

OK, i suppose one could measure :-) Not sure how much it is worth obsessing over but... int c = (int)value[i]; if (c < '\u03A3') { result[i] = (char)Character.toLowerCase(c); // Is that safe? } else if (c < Character.MIN_HIGH_SURROGATE && c != 'u03A3' && (c = Character.toLowerCase(c)) < Character.MIN_SUPPLEMENTARY_CODE_POINT)) { result[i] = (char)c; } else { return toLowerCaseEx(result, i, locale, localeDependent); } or: int c = (int)value[i]; int lc = Character.toLowerCase(c); // is that safe? if (c < '\u03A3') { result[i] = (char)lc; } else if (c < Character.MIN_HIGH_SURROGATE && c != 'u03A3' && lc < Character.MIN_SUPPLEMENTARY_CODE_POINT)) { result[i] = (char)lc; } else { return toLowerCaseEx(result, i, locale, localeDependent); } or: int c = (int)value[i]; int lc = Character.toLowerCase(c); // is that safe? if (c < '\u03A3' || (c < Character.MIN_HIGH_SURROGATE && c != 'u03A3' && lc < Character.MIN_SUPPLEMENTARY_CODE_POINT))) { result[i] = (char)lc; } else { return toLowerCaseEx(result, i, locale, localeDependent); } FWIW i personally find those solutions easier to read, if they are safe w.r.t. Character.toLowerCase and that annoying greek character. Paul.

...

http://cr.openjdk.java.net/~sherman/8032012/

-Sherman

Ulf Zibis

10:10 p.m.

New subject: RFR: JDK-8032012, , String.toLowerCase/toUpperCase performance improvement

Am 22.01.2014 16:20, schrieb Paul Sandoz:

...

On Jan 21, 2014, at 11:05 PM, Xueming Shen <xueming.shen@oracle.com> wrote:

...
On 01/20/2014 09:24 AM, Paul Sandoz wrote:

...
- it would be nice to get rid of the pseudo goto using the "scan" labelled block. webrev has been updated to remove the pseudo goto by checking the "first" against "len" after the loop break. Much for readable :-)

I think, you should compare the performance of both versions on modern + 32-bit CPUs.

...

...
...
- you might be able to optimize by doing (could depend on the answer to the next point):

int c = (int)value[i]; int lc = Character.toLowerCase(c); if (.....) { result[i] = (char)lc; } else { return toLowerCaseEx(result, i, locale, localeDependent); }

- Do you need to check ERROR for the result of toLowerCase?

2586 if (c == Character.ERROR ||

Yes, Character.toLowerCase() should never return ERROR (while the package private Character.toUpperCaseEx() will). In theory there is no need to check if the return value of Character.toUpperCase(int) > min_supplementary_code_point in our loop, because there is no bmp character returns a supplementary code point as its lower case. But since it's a data driven mapping table, there is no guarantee the unicode data table is not going to change in the "future", so I still keep the check.

In my opinion this check should be subject of JDK's build test, but not of runtime code.

...

or:

int c = (int)value[i]; int lc = Character.toLowerCase(c); // is that safe? if (c < '\u03A3' || (c < Character.MIN_HIGH_SURROGATE && c != 'u03A3' && lc < Character.MIN_SUPPLEMENTARY_CODE_POINT))) { result[i] = (char)lc; } else { return toLowerCaseEx(result, i, locale, localeDependent); }

FWIW i personally find those solutions easier to read, if they are safe w.r.t. Character.toLowerCase and that annoying greek character.

I would like the 3rd version. -Ulf

Xueming Shen

5 Feb 5 Feb

5:58 p.m.

New subject: RFR: JDK-8032012, , String.toLowerCase/toUpperCase performance improvement

Hi, Let's try to wrap it up, otherwise I may drop the ball somewhere :-) On 01/22/2014 07:20 AM, Paul Sandoz wrote:

...

...
if (lang == "tr" || lang == "az" || lang == "lt") { // local dependent return toLowerCaseEx(result, firstUpper, locale, true); } // otherwise false is passed to subsequent calls to toLowerCaseEx

?

toLowerCaseEx will also be invoked later (in your another suggestion next), which needs a "localeDependent".

...

...
...
- you might be able to optimize by doing (could depend on the answer to the next point):

int c = (int)value[i]; int lc = Character.toLowerCase(c); if (.....) { result[i] = (char)lc; } else { return toLowerCaseEx(result, i, locale, localeDependent); }

- Do you need to check ERROR for the result of toLowerCase?

2586 if (c == Character.ERROR ||

Yes, Character.toLowerCase() should never return ERROR (while the package private Character.toUpperCaseEx() will). In theory there is no need to check if the return value of Character.toUpperCase(int)> min_supplementary_code_point in our loop, because there is no bmp character returns a supplementary code point as its lower case. But since it's a data driven mapping table, there is no guarantee the unicode data table is not going to change in the "future", so I still keep the check. The webrev has been updated to use one single "if" inside the loop.

I have a "single if" implementation for the toUpperCase() as well, though don't sure which one is better/faster :-)

OK, i suppose one could measure :-)

Not sure how much it is worth obsessing over but...

int c = (int)value[i]; if (c< '\u03A3') { result[i] = (char)Character.toLowerCase(c); // Is that safe? } else if (c< Character.MIN_HIGH_SURROGATE&& c != 'u03A3'&& (c = Character.toLowerCase(c))< Character.MIN_SUPPLEMENTARY_CODE_POINT)) { result[i] = (char)c; } else { return toLowerCaseEx(result, i, locale, localeDependent); }

or:

int c = (int)value[i]; int lc = Character.toLowerCase(c); // is that safe? if (c< '\u03A3') { result[i] = (char)lc; } else if (c< Character.MIN_HIGH_SURROGATE&& c != 'u03A3'&& lc< Character.MIN_SUPPLEMENTARY_CODE_POINT)) { result[i] = (char)lc; } else { return toLowerCaseEx(result, i, locale, localeDependent); }

or:

int c = (int)value[i]; int lc = Character.toLowerCase(c); // is that safe? if (c< '\u03A3' || (c< Character.MIN_HIGH_SURROGATE&& c != 'u03A3'&& lc< Character.MIN_SUPPLEMENTARY_CODE_POINT))) { result[i] = (char)lc; } else { return toLowerCaseEx(result, i, locale, localeDependent); }

FWIW i personally find those solutions easier to read, if they are safe w.r.t. Character.toLowerCase and that annoying greek character.

The existing code has the clear and explicit logic that (1) j.l.C.toLowerCase() can NOT handle surrogate(s) (2) j.l.C.toLowerCase() can NOT handle \u03A3 (3) if for some reasons j.l.C.toLowerCase() returns a supplementary character for a non-bmp character, the current "one-char" based result can NOT handle it, So in above cases, go to "ex" version. The suggested approach is kinda of hacky and the logic is not clear. For example it tries to optimize with c < \u03a3 cases, it works with the assumption that all lower cases of < \u03a3 characters are never a supplementary characters, which is true for now and probably will be true forever, but it actually changes the design and implementation logic (be driven by the backend unicode case mapping data, not an "assumption"). The (c< Character.MIN_HIGH_SURROGATE&& c != 'u03A3'&& lc< Character.MIN_SUPPLEMENTARY_CODE_POINT) part appears to be fine, but its logic is not as explicit as the existing one. Given the measurement indicates it does not bring in measurable improvement, personally I would prefer to keep the existing one, if you don't have a very strong opinion on this. http://cr.openjdk.java.net/~sherman/8032012/ Thanks! -Sherman

Paul Sandoz

7:09 p.m.

On Feb 5, 2014, at 6:58 PM, Xueming Shen <xueming.shen@oracle.com> wrote:

...

Hi,

Let's try to wrap it up, otherwise I may drop the ball somewhere :-)

On 01/22/2014 07:20 AM, Paul Sandoz wrote:

...
...
if (lang == "tr" || lang == "az" || lang == "lt") { // local dependent return toLowerCaseEx(result, firstUpper, locale, true); } // otherwise false is passed to subsequent calls to toLowerCaseEx

?

toLowerCaseEx will also be invoked later (in your another suggestion next), which needs a "localeDependent".

But is not the second call to toLowerCaseEx always invoked with a value of false? 2574 String lang = locale.getLanguage(); 2575 final boolean localeDependent = (lang == "tr" || lang == "az" || lang == "lt"); 2576 if (localeDependent) { 2577 return toLowerCaseEx(result, firstUpper, locale, localeDependent); // <-- localeDependent is true 2578 } 2579 for (int i = firstUpper; i < len; i++) { 2580 int c = (int)value[i]; 2581 if (c >= Character.MIN_HIGH_SURROGATE && c <= Character.MAX_HIGH_SURROGATE || 2582 c == '\u03A3' || // GREEK CAPITAL LETTER SIGMA 2583 (c = Character.toLowerCase(c)) >= Character.MIN_SUPPLEMENTARY_CODE_POINT) { 2584 return toLowerCaseEx(result, i, locale, localeDependent); // <-- localDependent is false 2585 } 2586 result[i] = (char)c; 2587 } 2588 return new String(result, true); <snip>

...

Given the measurement indicates it does not bring in measurable improvement, personally I would prefer to keep the existing one, if you don't have a very strong opinion on this.

No i don't have a strong opinion. Paul.

...

http://cr.openjdk.java.net/~sherman/8032012/

Thanks! -Sherman

Xueming Shen

6:43 p.m.

New subject: RFR: JDK-8032012, , String.toLowerCase/toUpperCase performance improvement

On 02/05/2014 11:09 AM, Paul Sandoz wrote:

...

On Feb 5, 2014, at 6:58 PM, Xueming Shen<xueming.shen@oracle.com> wrote:

...
Hi,

Let's try to wrap it up, otherwise I may drop the ball somewhere :-)

On 01/22/2014 07:20 AM, Paul Sandoz wrote:

...
if (lang == "tr" || lang == "az" || lang == "lt") { // local dependent return toLowerCaseEx(result, firstUpper, locale, true); } // otherwise false is passed to subsequent calls to toLowerCaseEx

?

toLowerCaseEx will also be invoked later (in your another suggestion next), which needs a "localeDependent".

But is not the second call to toLowerCaseEx always invoked with a value of false?

2574 String lang = locale.getLanguage(); 2575 final boolean localeDependent = (lang == "tr" || lang == "az" || lang == "lt"); 2576 if (localeDependent) { 2577 return toLowerCaseEx(result, firstUpper, locale, localeDependent); //<-- localeDependent is true 2578 } 2579 for (int i = firstUpper; i< len; i++) { 2580 int c = (int)value[i]; 2581 if (c>= Character.MIN_HIGH_SURROGATE&& c<= Character.MAX_HIGH_SURROGATE || 2582 c == '\u03A3' || // GREEK CAPITAL LETTER SIGMA 2583 (c = Character.toLowerCase(c))>= Character.MIN_SUPPLEMENTARY_CODE_POINT) { 2584 return toLowerCaseEx(result, i, locale, localeDependent); //<-- localDependent is false 2585 } 2586 result[i] = (char)c; 2587 } 2588 return new String(result, true);

You are absolutely right :-) I will update as suggested. -sherman

Xueming Shen

8:06 p.m.

New subject: RFR: JDK-8032012, , String.toLowerCase/toUpperCase performance improvement

Hi Paul, http://cr.openjdk.java.net/~sherman/8032012/webrev webrev has been updated accordingly. Thanks! -Sherman On 02/05/2014 10:43 AM, Xueming Shen wrote:

...

On 02/05/2014 11:09 AM, Paul Sandoz wrote:

...
On Feb 5, 2014, at 6:58 PM, Xueming Shen<xueming.shen@oracle.com> wrote:

...
Hi,

Let's try to wrap it up, otherwise I may drop the ball somewhere :-)

On 01/22/2014 07:20 AM, Paul Sandoz wrote:

...
if (lang == "tr" || lang == "az" || lang == "lt") { // local dependent return toLowerCaseEx(result, firstUpper, locale, true); } // otherwise false is passed to subsequent calls to toLowerCaseEx

?

toLowerCaseEx will also be invoked later (in your another suggestion next), which needs a "localeDependent".

But is not the second call to toLowerCaseEx always invoked with a value of false?

2574 String lang = locale.getLanguage(); 2575 final boolean localeDependent = (lang == "tr" || lang == "az" || lang == "lt"); 2576 if (localeDependent) { 2577 return toLowerCaseEx(result, firstUpper, locale, localeDependent); //<-- localeDependent is true 2578 } 2579 for (int i = firstUpper; i< len; i++) { 2580 int c = (int)value[i]; 2581 if (c>= Character.MIN_HIGH_SURROGATE&& c<= Character.MAX_HIGH_SURROGATE || 2582 c == '\u03A3' || // GREEK CAPITAL LETTER SIGMA 2583 (c = Character.toLowerCase(c))>= Character.MIN_SUPPLEMENTARY_CODE_POINT) { 2584 return toLowerCaseEx(result, i, locale, localeDependent); //<-- localDependent is false 2585 } 2586 result[i] = (char)c; 2587 } 2588 return new String(result, true);

You are absolutely right :-) I will update as suggested.

-sherman

Ulf Zibis

6 Feb 6 Feb

5:59 p.m.

New subject: RFR: JDK-8032012, , String.toLowerCase/toUpperCase performance improvement

Am 05.02.2014 21:06, schrieb Xueming Shen:

...

http://cr.openjdk.java.net/~sherman/8032012/webrev

webrev has been updated accordingly.

I still more like the break-to-label approach. It looks more logical and saves one comparison. This might count on very very short strings, but I would rename it to something like find1stUpperCase: or scanForUpperCase. As an alternative you could write "if (first >= len)" as counterpart to the last "first < len" operation in the for loop, which therefore should be merged by JIT. (Or use "first != len" in the for loop.) -Ulf

Ulf Zibis

8:26 p.m.

New subject: RFR: JDK-8032012, , String.toLowerCase/toUpperCase performance improvement

To cover this problem, I filed a new language suggestion: https://bugs.openjdk.java.net/browse/JDK-8033813 :-) -Ulf Am 06.02.2014 18:59, schrieb Ulf Zibis:

...

I still more like the break-to-label approach. It looks more logical and saves one comparison. This might count on very very short strings, but I would rename it to something like find1stUpperCase: or scanForUpperCase.

As an alternative you could write "if (first >= len)" as counterpart to the last "first < len" operation in the for loop, which therefore should be merged by JIT. (Or use "first != len" in the for loop.)

-Ulf

Remi Forax

5 Feb 5 Feb

9 p.m.

New subject: RFR: JDK-8032012, , String.toLowerCase/toUpperCase performance improvement

On 02/05/2014 07:43 PM, Xueming Shen wrote:

...

On 02/05/2014 11:09 AM, Paul Sandoz wrote:

...
On Feb 5, 2014, at 6:58 PM, Xueming Shen<xueming.shen@oracle.com> wrote:

...
Hi,

Let's try to wrap it up, otherwise I may drop the ball somewhere :-)

On 01/22/2014 07:20 AM, Paul Sandoz wrote:

...
if (lang == "tr" || lang == "az" || lang == "lt") { // local dependent return toLowerCaseEx(result, firstUpper, locale, true); } // otherwise false is passed to subsequent calls to toLowerCaseEx

?

toLowerCaseEx will also be invoked later (in your another suggestion next), which needs a "localeDependent".

But is not the second call to toLowerCaseEx always invoked with a value of false?

2574 String lang = locale.getLanguage(); 2575 final boolean localeDependent = (lang == "tr" || lang == "az" || lang == "lt"); 2576 if (localeDependent) { 2577 return toLowerCaseEx(result, firstUpper, locale, localeDependent); //<-- localeDependent is true 2578 } 2579 for (int i = firstUpper; i< len; i++) { 2580 int c = (int)value[i]; 2581 if (c>= Character.MIN_HIGH_SURROGATE&& c<= Character.MAX_HIGH_SURROGATE || 2582 c == '\u03A3' || // GREEK CAPITAL LETTER SIGMA 2583 (c = Character.toLowerCase(c))>= Character.MIN_SUPPLEMENTARY_CODE_POINT) { 2584 return toLowerCaseEx(result, i, locale, localeDependent); //<-- localDependent is false 2585 } 2586 result[i] = (char)c; 2587 } 2588 return new String(result, true);

You are absolutely right :-) I will update as suggested.

-sherman

Hi Sherman, the code can be faster if the first loop call toLowerCaseEx in case of a surrogate instead of modifying srcCount because in that case the JIT will see that the increment is constant. Note that with the current code, the performance of toLowerCase if it was never called with a string that contains a surrogate are great but if toLowerCase is called once with a string that contains a surrogate, even if toLowercase is called after with strings that never contain a surrogate performance will be slower than it was. Calling toLowerCaseEx will make a toLowerCase with a surrogate slower but make toLowerCase faster in the common case. cheers, Rémi

Xueming Shen

9:30 p.m.

New subject: RFR: JDK-8032012, , String.toLowerCase/toUpperCase performance improvement

Hi Remi, Good suggestion. Now the "common case" path is much simple and faster :-) I'm seeing a 5%-10% boost for the normal-non-surrogates case. And it appears the bmp+surr mixed is getting faster as well. Though I would assume the it would get slower in case of "no-case-folding" + surrogates. But the common case wins here. http://cr.openjdk.java.net/~sherman/8032012/webrev/ Thanks! -Sherman On 02/05/2014 01:00 PM, Remi Forax wrote:

...

Hi Sherman, the code can be faster if the first loop call toLowerCaseEx in case of a surrogate instead of modifying srcCount because in that case the JIT will see that the increment is constant.

Note that with the current code, the performance of toLowerCase if it was never called with a string that contains a surrogate are great but if toLowerCase is called once with a string that contains a surrogate, even if toLowercase is called after with strings that never contain a surrogate performance will be slower than it was. Calling toLowerCaseEx will make a toLowerCase with a surrogate slower but make toLowerCase faster in the common case.

cheers, Rémi

Remi Forax

10:46 p.m.

New subject: RFR: JDK-8032012, , String.toLowerCase/toUpperCase performance improvement

On 02/05/2014 10:30 PM, Xueming Shen wrote:

...

Hi Remi,

Good suggestion. Now the "common case" path is much simple and faster :-) I'm seeing a 5%-10% boost for the normal-non-surrogates case. And it appears the bmp+surr mixed is getting faster as well. Though I would assume the it would get slower in case of "no-case-folding" + surrogates. But the common case wins here.

http://cr.openjdk.java.net/~sherman/8032012/webrev/

Thanks! -Sherman

nice, cool ! Rémi

...

On 02/05/2014 01:00 PM, Remi Forax wrote:

...
Hi Sherman, the code can be faster if the first loop call toLowerCaseEx in case of a surrogate instead of modifying srcCount because in that case the JIT will see that the increment is constant.

Note that with the current code, the performance of toLowerCase if it was never called with a string that contains a surrogate are great but if toLowerCase is called once with a string that contains a surrogate, even if toLowercase is called after with strings that never contain a surrogate performance will be slower than it was. Calling toLowerCaseEx will make a toLowerCase with a surrogate slower but make toLowerCase faster in the common case.

cheers, Rémi

Ulf Zibis

11:28 p.m.

New subject: RFR: JDK-8032012, , String.toLowerCase/toUpperCase performance improvement

Additionally you could use Character.isSurrogate() and Character.isSupplementaryCode<point() at appropriate places. Both are better optimized for JIT. -Ulf Am 05.02.2014 22:30, schrieb Xueming Shen:

...

Hi Remi,

Good suggestion. Now the "common case" path is much simple and faster :-) I'm seeing a 5%-10% boost for the normal-non-surrogates case. And it appears the bmp+surr mixed is getting faster as well. Though I would assume the it would get slower in case of "no-case-folding" + surrogates. But the common case wins here.

http://cr.openjdk.java.net/~sherman/8032012/webrev/

Xueming Shen

11:57 p.m.

New subject: RFR: JDK-8032012, , String.toLowerCase/toUpperCase performance improvement

On 02/05/2014 03:28 PM, Ulf Zibis wrote:

...

Additionally you could use Character.isSurrogate() and Character.isSupplementaryCode<point() at appropriate places. Both are better optimized for JIT.

j.l.C.isSupplementaryCodePoint() checks up boundary of supp, we probably don't need it here, as the returning code point is either a ERROR or a valid unicode code point. I'm not sure about the j.l.C.isSurrogate(), which takes a char and we have an int here. I would expect the javac will inline the constants for me, but I don't know whether jit can inline and then optimize away the explicit casting i2c. Not a big deal though. -sherman

...

-Ulf

Am 05.02.2014 22:30, schrieb Xueming Shen:

...
Hi Remi,

Good suggestion. Now the "common case" path is much simple and faster :-) I'm seeing a 5%-10% boost for the normal-non-surrogates case. And it appears the bmp+surr mixed is getting faster as well. Though I would assume the it would get slower in case of "no-case-folding" + surrogates. But the common case wins here.

http://cr.openjdk.java.net/~sherman/8032012/webrev/

Ulf Zibis

6 Feb 6 Feb

4:42 p.m.

New subject: RFR: JDK-8032012, , String.toLowerCase/toUpperCase performance improvement

Hi, Am 06.02.2014 00:57, schrieb Xueming Shen:

...

On 02/05/2014 03:28 PM, Ulf Zibis wrote:

...
Additionally you could use Character.isSurrogate() and Character.isSupplementaryCode<point() at appropriate places. Both are better optimized for JIT.

j.l.C.isSupplementaryCodePoint() checks up boundary of supp, we probably don't need it here, as the returning code point is either a ERROR or a valid unicode code point.

Sorry, I was in error. I meant using !isBmpCodePoint() which is codePoint >>> 16 != 0 which is should be slightly faster than codePoint >= Character.MIN_SUPPLEMENTARY_CODE_POINT as the latter needs a 32-bit value to be loaded.

...

I'm not sure about the j.l.C.isSurrogate(), which takes a char and we have an int here. I would expect the javac will inline the constants for me, but I don't know whether jit can inline and then optimize away the explicit casting i2c. Not a big deal though.

Why you use (in) here, you could do the cast later? Yes, IIRC from my HSdis inspection, i2c is a Noop and yes, isSurrogate() is better to read. Additionally I remember, we had discussion if isSurrogate() would be faster with (byte)(ch >>> 11) == (byte)(0xD8 >>> 3) So if isSurrogate() will be optimised in future or even intrinsified, using isSurrogate() should be better than ch >= MIN_HIGH_SURROGATE&& c <= MAX_HIGH_SURROGATE Please note: ch >= MIN_SURROGATE && ch < (MAX_SURROGATE + 1) generally is better than: ch >= MIN_HIGH_SURROGATE&& c <= MAX_HIGH_SURROGATE See: http://bugs.java.com/bugdatabase/view_bug.do?bug_id=6984886 -Ulf

4414

Age (days ago)

4435

Last active (days ago)

List overview

Download

17 comments

4 participants

participants (4)

Paul Sandoz
Remi Forax
Ulf Zibis
Xueming Shen

RFR: JDK-8032012, , String.toLowerCase/toUpperCase performance improvement

tags

participants (4)