Remove indify string concatenation on Inet::getInet4Address #93

franz1981 · 2024-01-12T14:20:24Z

In Quarkus's startup we noticed some allocations due to https://openjdk.org/jeps/280, see

which point to the string concatenation in Inet::getInet4Address.
The method could be optimized by:

using a presized StringBuilder
using an ad hoc method

I've chosen the latter, to avoid the StringBuilder allocation, given that we have knowledge about the parameters ranges ie [0, 255].

What could be done, better, is to save the intermediate byte[] hostBytes allocation to happen as a whole, by leveraging the default EliminateAllocationArraySizeLimit in case OpenJDK escape analysis will do its, job ie creating a fixed size buffer of < 64 bytes (the default value - eg 15 bytes is the max size we could expect for 255.255.255.255).
Obviously this is a dangerous bet, because rely on specific OpenJDK optimizations and when/if escape analysis kicks-in.

dmlloyd · 2024-01-12T16:48:21Z

src/main/java/org/wildfly/common/net/Inet.java

+        encodeUnsignedByte(s3, hostBytes, digitsForS1 + digitsForS2 + 2, digitsForS3);
+        hostBytes[digitsForS1 + digitsForS2 + digitsForS3 + 2] = '.';
+        encodeUnsignedByte(s4, hostBytes, digitsForS1 + digitsForS2 + digitsForS3 + 3, digitsForS4);
+        String hostName = new String(hostBytes, 0);


You can use new String(byte[],...,StandardCharsets.ISO_8859_1) and it should be similar since it uses the same hot path with the charset is that one.

explained at #93 (comment)

dmlloyd · 2024-01-12T16:52:49Z

src/main/java/org/wildfly/common/net/Inet.java

        bytes[0] = (byte) s1;
        bytes[1] = (byte) s2;
        bytes[2] = (byte) s3;
        bytes[3] = (byte) s4;
+        // pre-compute the digits required


You could avoid this (and some redundant branching) with something like:

byte[] hostBytes = new byte[15]; int cnt = encodeUnsignedByte(s1, hostBytes, 0); hostBytes[cnt++] = '.'; cnt += encodeUnsignedByte(s2, hostBytes, cnt); ... String hostName = new String(hostBytes, 0, cnt, StandardCharsets.ISO_8859_1);

and then:

private static int encodeUnsignedByte(int value, byte[] bytes, int offset) { if (value >= 100) { bytes[offset++] = '0' + value / 100; value %= 100; } if (value >= 10) { ... } return offset; }

The redundant branching are there to avoid data dependency among encodeUnsignedByte calls, which could be indipendent from each others thanks to the upfront estimation. But is a minor: I'm opened to change it if you prefer it.

for

String hostName = new String(hostBytes, 0, cnt, StandardCharsets.ISO_8859_1);

there's an opened JDK bug (I've provided the reproducer some time ago) which show that it perform very bad, still, see https://bugs.openjdk.org/browse/JDK-8295496

quoting it

I also included String(byte[], int, int, ISO_8859_1) which disappointingly lags slightly behind the deprecated String(byte[], int, int, int).

That's why I didn't used it.

Additionally, I have sized correctly the byte array and not using any cnt variable to simplify the life of the JDK stub generator and the bound checks generation, there, which proved to be the reason behind the bad performance of https://bugs.openjdk.org/browse/JDK-8295496

dmlloyd · 2024-01-12T17:44:37Z

Also I would suggest instead making these changes to SmallRye Common Net - these two projects should be identical for this class, and it is my intention to deprecate this one and replace it with redirections to SmallRye Common.

franz1981 · 2024-01-15T09:36:26Z

Also I would suggest instead making these changes to SmallRye Common Net - these two projects should be identical for this class, and it is my intention to deprecate this one and replace it with redirections to SmallRye Common.

Perfect, sending the PR there as well!

franz1981 · 2024-01-15T09:38:49Z

src/main/java/org/wildfly/common/net/Inet.java

+    private static void encodeUnsignedByte(int value, byte[] bytes, int offset, int digits) {
+        assert value >= 0 && value <= 255 && digits >= 1 && digits <= 3;
+        if (digits == 3) {
+            bytes[offset + 2] = (byte) ('0' + (value % 10));


@dmlloyd FYI I didn't used the manual "strength reduction" to bit-shifts because:

while the code is interpreted it seems there's no need, is already pretty slow for others reasons

while the code is hot (C2, mostly), the JIT perform the same exact optimization

franz1981 · 2024-01-15T09:39:24Z

@radcortez let me know if this help your startup allocation analysis

franz1981 · 2024-01-15T12:47:22Z

I can close it, to favour one to https://github.com/franz1981/wildfly-common instead

franz1981 · 2024-03-18T15:11:55Z

@radcortez that's why: this has never been merged and relying on smallrye/smallrye-common#275, but it seems 3.8.2 still use this, suggestions @radcortez @dmlloyd ?

radcortez · 2024-03-18T16:29:03Z

The fix is in SR Commons: smallrye/smallrye-common#279

But the delegating code from WF Common to SR Common has not been released:
#100

We can replace the WF API with the SR API in the Converter so we don't have to wait for the release (and update in Quarkus)

franz1981 force-pushed the no_indy branch from 3c39568 to 075869d Compare January 12, 2024 15:30

Remove indify string concatenation on Inet::getInet4Address

e0a6bd3

franz1981 force-pushed the no_indy branch from 075869d to e0a6bd3 Compare January 12, 2024 16:14

dmlloyd requested changes Jan 12, 2024

View reviewed changes

franz1981 commented Jan 15, 2024

View reviewed changes

radcortez approved these changes Jan 15, 2024

View reviewed changes

franz1981 closed this Jan 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove indify string concatenation on Inet::getInet4Address #93

Remove indify string concatenation on Inet::getInet4Address #93

franz1981 commented Jan 12, 2024

dmlloyd Jan 12, 2024

franz1981 Jan 15, 2024

dmlloyd Jan 12, 2024

franz1981 Jan 15, 2024 •

edited

Loading

franz1981 Jan 15, 2024 •

edited

Loading

dmlloyd commented Jan 12, 2024

franz1981 commented Jan 15, 2024

franz1981 Jan 15, 2024

franz1981 commented Jan 15, 2024

franz1981 commented Jan 15, 2024

franz1981 commented Mar 18, 2024

radcortez commented Mar 18, 2024

Remove indify string concatenation on Inet::getInet4Address #93

Remove indify string concatenation on Inet::getInet4Address #93

Conversation

franz1981 commented Jan 12, 2024

dmlloyd Jan 12, 2024

Choose a reason for hiding this comment

franz1981 Jan 15, 2024

Choose a reason for hiding this comment

dmlloyd Jan 12, 2024

Choose a reason for hiding this comment

franz1981 Jan 15, 2024 • edited Loading

Choose a reason for hiding this comment

franz1981 Jan 15, 2024 • edited Loading

Choose a reason for hiding this comment

dmlloyd commented Jan 12, 2024

franz1981 commented Jan 15, 2024

franz1981 Jan 15, 2024

Choose a reason for hiding this comment

franz1981 commented Jan 15, 2024

franz1981 commented Jan 15, 2024

franz1981 commented Mar 18, 2024

radcortez commented Mar 18, 2024

franz1981 Jan 15, 2024 •

edited

Loading

franz1981 Jan 15, 2024 •

edited

Loading