stub: optimize ThreadlessExecutor used for blocking calls #5516

Merged
2 commits merged into grpc:master on Mar 29, 2019

Conversation

@njhill (Contributor) commented Mar 28, 2019

The ThreadlessExecutor currently used for blocking calls uses a LinkedBlockingQueue, which is relatively heavy in terms of both allocations and synchronization overhead (e.g. compared to ConcurrentLinkedQueue). It accounts for ~10% of allocations and ~5% of allocated bytes per call in the TransportBenchmark when using the in-process transport with stats and tracing disabled.

Changing to use a ConcurrentLinkedQueue results in a ~5% speedup of that benchmark.
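
For context, the ThreadlessExecutor lets the application thread that issued a blocking call run the call's callbacks itself rather than borrowing a pool thread. Roughly, modeled loosely on grpc-java's ClientCalls.blockingUnaryCall (a simplified sketch, not the exact library code):

    import com.google.common.util.concurrent.Futures;
    import com.google.common.util.concurrent.ListenableFuture;
    import io.grpc.CallOptions;
    import io.grpc.Channel;
    import io.grpc.ClientCall;
    import io.grpc.MethodDescriptor;
    import io.grpc.stub.ClientCalls;

    static <ReqT, RespT> RespT blockingCallSketch(
        Channel channel, MethodDescriptor<ReqT, RespT> method,
        CallOptions callOptions, ReqT request) throws InterruptedException {
      // Every callback for this call is queued on the ThreadlessExecutor
      // instead of being dispatched to an application executor.
      ThreadlessExecutor executor = new ThreadlessExecutor();
      ClientCall<ReqT, RespT> call =
          channel.newCall(method, callOptions.withExecutor(executor));
      ListenableFuture<RespT> responseFuture =
          ClientCalls.futureUnaryCall(call, request);
      while (!responseFuture.isDone()) {
        // The calling thread blocks here, then runs queued callbacks itself.
        executor.waitAndDrain();
      }
      return Futures.getUnchecked(responseFuture);
    }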

Commit: Replace LinkedBlockingQueue with ConcurrentLinkedQueue and explicit blocking.
@@ -639,20 +642,33 @@ public void onClose(Status status, Metadata trailers) {
      * Waits until there is a Runnable, then executes it and all queued Runnables after it.
      */
     public void waitAndDrain() throws InterruptedException {
-      Runnable runnable = queue.take();
-      while (runnable != null) {
+      Runnable runnable = poll();
Member:

Maybe mark waitAndDrain() with @NotThreadSafe because there must not be two concurrent callers of it.

@njhill (Author):

Sure, the class is only for internal use in an SPSC context.

@carl-mastrangelo (Contributor):

One possible thing you could try is to make this SPSC. I have a POC (originally for SerializingExecutor) here: https://github.com/grpc/grpc-java/pull/3778/files

@njhill (Author):

@carl-mastrangelo sure, I remember seeing that before; it could be worth a try here too. I thought (possibly mistakenly) this would be a simpler change just to circumvent LinkedBlockingQueue, which was the main goal.
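
For reference, a Vyukov-style single-producer/single-consumer linked queue of the general kind such a POC explores might look like the following sketch (illustrative only, not the code in the linked PR):

    // Exactly one producer thread may call offer() and exactly one
    // consumer thread may call poll(); that restriction is what lets
    // head and tail stay plain, single-owner fields.
    final class SpscLinkedQueue<T> {
      private static final class Node<T> {
        T value;
        volatile Node<T> next; // volatile write publishes value to the consumer
        Node(T value) { this.value = value; }
      }

      private Node<T> head = new Node<>(null); // consumer-owned sentinel
      private Node<T> tail = head;             // producer-owned

      void offer(T value) {       // producer thread only
        Node<T> n = new Node<>(value);
        tail.next = n;            // volatile store makes n visible to poll()
        tail = n;
      }

      T poll() {                  // consumer thread only
        Node<T> next = head.next; // volatile load
        if (next == null) {
          return null;            // empty
        }
        T value = next.value;
        next.value = null;        // unlink for GC
        head = next;
        return value;
      }
    }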

Commit: including interruption handling fix
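
A minimal sketch of where the two commits appear to land, assuming LockSupport-based parking for the "explicit blocking" in the first commit message and folding in the interruption-handling fix from the second (illustrative, not the merged code):

    import java.util.concurrent.ConcurrentLinkedQueue;
    import java.util.concurrent.Executor;
    import java.util.concurrent.locks.LockSupport;

    // Lock-free queue plus explicit parking, in place of
    // LinkedBlockingQueue's lock-based take().
    final class ThreadlessExecutor extends ConcurrentLinkedQueue<Runnable>
        implements Executor {

      private volatile Thread waiter;

      /**
       * Waits until there is a Runnable, then executes it and all queued
       * Runnables after it. Single-consumer: must only ever be called
       * from one thread, per the discussion above.
       */
      public void waitAndDrain() throws InterruptedException {
        throwIfInterrupted();
        Runnable runnable = poll();
        if (runnable == null) {
          // Publish ourselves as the waiter, then park until execute()
          // unparks us (or we are interrupted).
          waiter = Thread.currentThread();
          try {
            while ((runnable = poll()) == null) {
              LockSupport.park(this);
              throwIfInterrupted();
            }
          } finally {
            waiter = null;
          }
        }
        do {
          runnable.run();
        } while ((runnable = poll()) != null);
      }

      private static void throwIfInterrupted() throws InterruptedException {
        if (Thread.interrupted()) {
          throw new InterruptedException();
        }
      }

      @Override
      public void execute(Runnable runnable) {
        add(runnable);              // lock-free enqueue
        LockSupport.unpark(waiter); // no-op when waiter is null
      }
    }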
@njhill (Author) commented Mar 29, 2019

Thanks @dapengzhang0 @carl-mastrangelo, have addressed the comments, PTAL

@dapengzhang0 (Member):

LGTM

@dapengzhang0 added the kokoro:run label Mar 29, 2019
@grpc-kokoro removed the kokoro:run label Mar 29, 2019
@carl-mastrangelo (Contributor):

@njhill Can you include your before and after JMH numbers for the commit? We typically include them when making performance optimizations.

@njhill (Author) commented Mar 29, 2019

@carl-mastrangelo I thought I had observed a bigger difference in the non-direct case in other runs, when the system was noisier. I know it's not a huge delta, but there are a couple more similar changes I have in mind which cumulatively add up to maybe ~15% (to be confirmed!).

Before:

Benchmark                         (direct)  (transport)  Mode  Cnt      Score     Error  Units
TransportBenchmark.unaryCall1024      true    INPROCESS  avgt   60   1877.339 ±  46.309  ns/op
TransportBenchmark.unaryCall1024     false    INPROCESS  avgt   60  12680.525 ± 208.684  ns/op

After:

Benchmark                         (direct)  (transport)  Mode  Cnt      Score     Error  Units
TransportBenchmark.unaryCall1024      true    INPROCESS  avgt   60   1779.188 ±  36.769  ns/op
TransportBenchmark.unaryCall1024     false    INPROCESS  avgt   60  12532.470 ± 238.271  ns/op
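
(From these numbers the direct-case improvement is (1877.339 − 1779.188) / 1877.339 ≈ 5.2%, in line with the ~5% above; the non-direct delta is ≈ 1.2%, within the reported error bars.)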

This is with the following changes to default config:

  • Set tracingEnabled and statsEnabled to false in channel and server builders
  • Bumped forks 1 -> 2, iterations 10 -> 30 and changed mode to AverageTime
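
In JMH terms those overrides might look roughly like this (an illustrative sketch; the real benchmark body and the stats/tracing toggles live in grpc-java's TransportBenchmark):

    import java.util.concurrent.TimeUnit;
    import org.openjdk.jmh.annotations.Benchmark;
    import org.openjdk.jmh.annotations.BenchmarkMode;
    import org.openjdk.jmh.annotations.Fork;
    import org.openjdk.jmh.annotations.Measurement;
    import org.openjdk.jmh.annotations.Mode;
    import org.openjdk.jmh.annotations.OutputTimeUnit;
    import org.openjdk.jmh.annotations.Scope;
    import org.openjdk.jmh.annotations.State;

    @BenchmarkMode(Mode.AverageTime)   // mode changed to AverageTime
    @OutputTimeUnit(TimeUnit.NANOSECONDS)
    @Fork(2)                           // forks bumped 1 -> 2
    @Measurement(iterations = 30)      // measurement iterations 10 -> 30
    @State(Scope.Benchmark)
    public class TransportBenchmarkSketch {
      // setup would build the in-process channel/server with stats and
      // tracing disabled, as described above (details omitted)

      @Benchmark
      public void unaryCall1024() {
        // ... issue a blocking unary call with a 1024-byte payload ...
      }
    }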

@carl-mastrangelo (Contributor) left a review:

LGTM

@carl-mastrangelo merged commit 5f88bc4 into grpc:master Mar 29, 2019
@carl-mastrangelo (Contributor):

@njhill merged, thanks!

@dapengzhang0 (Member):

Thanks a lot for your PR @njhill

@njhill deleted the threadless branch March 29, 2019 18:09
@lock bot locked as resolved and limited conversation to collaborators Jun 27, 2019