Conversation

sandersaares

@sandersaares sandersaares commented Jul 22, 2025

Test to accompany bug report #853.

@seanmonstar
Member

Thanks for providing a test case! What you describe in the issue sounds plausible, but I notice that in this test flow control isn't touched, so neither side will grant any more connection window space, and by design it will not be able to make further progress.

);
}

thread::sleep(CHECK_FOR_PROGRESS_INTERVAL);
Member

You'd also want to use something like tokio::time::sleep(CHECK_FOR_PROGRESS_INTERVAL).await to not block any of the other tasks on this thread.
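
A minimal sketch of the suggested change, assuming the check already runs inside an async task; the interval value shown here is only a placeholder for the test's CHECK_FOR_PROGRESS_INTERVAL:

use std::time::Duration;

// Placeholder standing in for the test's actual CHECK_FOR_PROGRESS_INTERVAL.
const CHECK_FOR_PROGRESS_INTERVAL: Duration = Duration::from_millis(100);

async fn wait_before_next_progress_check() {
    // Unlike std::thread::sleep, this yields to the runtime, so other tasks
    // scheduled on the same worker thread keep making progress.
    tokio::time::sleep(CHECK_FOR_PROGRESS_INTERVAL).await;
}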

Author

Good catch - updated.

@sandersaares
Author

sandersaares commented Jul 22, 2025

What you describe in the issue sounds plausible, but I notice that in this test flow control isn't touched, so neither side will grant any more connection window space, and by design it will not be able to make further progress.

It is not clear what you mean by this. Why would flow control need to be touched? The connection has a non-zero window (before it gets allocated to pending_open requests that could not possibly use it, based on the theory above), so all the requests should be transmitted through that window. Am I misunderstanding something?

@seanmonstar
Member

In HTTP/2, there's a window on each stream and on the connection as a whole, and the default is fairly small, just 64 KB. Flow control management with h2 is manual. Without the other "side" giving more window on the connection, no stream will be allowed to send more data.

hyper will manage it for you automatically... and perhaps you do in your larger app. But it is one reason why this specific test will hang. With 10 KB bodies, it should stop around the 7th request (-ish, since prioritization/ordering of DATA frames is not specified).
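
For context, here is a minimal sketch of what manual send-side flow control looks like against h2's SendStream API (reserve_capacity, poll_capacity, send_data); the send_body helper and its shape are illustrative, not code from this test or from hyper:

use std::future::poll_fn;
use bytes::Bytes;

// Hypothetical helper: send one request body on an h2 stream while asking
// for window capacity in pieces instead of assuming it is available.
async fn send_body(
    stream: &mut h2::SendStream<Bytes>,
    mut data: Bytes,
) -> Result<(), h2::Error> {
    while !data.is_empty() {
        // Ask for window; h2 may grant less than requested.
        stream.reserve_capacity(data.len());
        let ready = poll_fn(|cx| stream.poll_capacity(cx)).await;
        match ready {
            None => break, // stream was reset or closed; nothing more to send
            Some(Err(e)) => return Err(e),
            Some(Ok(granted)) => {
                let take = granted.min(data.len());
                if take == 0 {
                    continue;
                }
                let chunk = data.split_to(take);
                let end_of_stream = data.is_empty();
                stream.send_data(chunk, end_of_stream)?;
            }
        }
    }
    Ok(())
}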

@sandersaares
Author

If this test were failing because the connection window is not being extended, it would also be failing with CONCURRENCY = 1, wouldn't it? As configured in the PR, each increment of CONCURRENCY translates to 100 requests of 10 KB, so 1 MB of data, which would exceed a 64 KB window.

Yet with lower values of CONCURRENCY the test appears to pass just fine; even 10,000 requests go through with CONCURRENCY=1.

This suggests to me that the problem is not the lack of connection window extension.

@seanmonstar
Member

OK, I've debugged a bit more, and was reminded that h2 will automatically release the connection window a stream has used once all references to that stream are dropped, which is why WINDOW_UPDATE frames for stream-id=0 are being sent. That clears that part up.

Fiddling with the numbers in your test case, and adding a few more printlns, it seems there is a correlation between streams being refused and eventually the connection window not having capacity. I'll need to investigate that a little more.

@benjaminp

The lack of release_capacity() calls in this test case means it relies entirely on the stream-dropping auto-release to keep the connection window open. From the client's perspective, when CONCURRENCY is sufficiently large, it's possible that all open streams have partially transmitted request bodies and all locally half-closed streams have partially transmitted response bodies. Neither side can make progress because the window only changes when a stream completes. I think this state could explain the deadlock?

@sandersaares
Author

sandersaares commented Aug 27, 2025

Sounds plausible. Forgive my ignorance, but where in the Hyper stack does this "make sure you reserve enough connection window before you try sending something" go? In our real scenario where we found this behavior, we were just using Hyper as the higher layer above h2, and I posted this as an h2 bug simply in order to try to minimize the repro.

Perhaps that was a mistake and I removed important machinery from the picture in my minimization attempt. If we take the full picture of a Hyper-based HTTP/2 client, where would such logic go? I was under the assumption that Hyper would itself manage the different requests on the same connection. Is that assumption false? Or should we be looking at this as a Hyper bug, whereby it is failing to do that?

@seanmonstar
Member

hyper does manage capacity for you automatically, yes. When I investigated a bit last month, I came to believe there's a bug in h2 that is missing either a window update or a release somewhere.

But my current priorities have me looking into other things at the moment.

@sandersaares
Author

sandersaares commented Sep 4, 2025

This is slightly beyond my depth in h2 internals, but I am staring at this block here:

let eos = frame.is_end_stream();
let len = len as usize;
if frame.payload().remaining() > len {
    frame.set_end_stream(false);
}
(eos, len)

That if is entered when we only have enough connection capacity to send part of the payload, not the entire payload. Two questions immediately pop into my head:

  • We buf.take(len) below, but what happens to the rest of the payload? The original frame appears to just be dropped after we .map() it down to only the head of the payload.
  • We clear the "end stream" flag because we are only sending part of the queued frame. However, we still appear to return the previous eos flag for the partial frame, indicating end-of-stream == true. Meanwhile, I do not see any logic that preserves the end-stream flag for the remainder that we are not sending.

It is possible that I am only seeing a partial view - perhaps this logic collaborates with some other logic that takes care of these concerns - but this code looks suspicious to me. Could it be that these "partial frames" are losing their second half, causing some kind of stall whereby these streams never complete?
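
On the first question, here is a small standalone example of the bytes::Buf::take semantics involved (plain bytes crate usage, not h2 code): the bytes beyond the taken head remain inside the Take wrapper and are only reachable through into_inner(), so the caller has to keep or reconstruct the remainder explicitly.

use bytes::{Buf, Bytes};

fn main() {
    let payload = Bytes::from_static(b"hello world");

    // take(5) wraps the buffer and limits reads to its first 5 bytes.
    let mut head = payload.take(5);
    let sent = head.copy_to_bytes(5);
    assert_eq!(&sent[..], &b"hello"[..]);

    // The remainder still exists, but only inside the Take wrapper; if the
    // wrapper is dropped without into_inner(), the rest goes with it.
    let rest = head.into_inner();
    assert_eq!(rest.remaining(), " world".len());
}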

@benjaminp

I was wondering if #860 had an effect on your production version of this issue.

@sandersaares
Author

@benjaminp I gave it a try but was not able to see any difference in outcome. I can see the rationale behind the change - why should things get allocated capacity if they just sit in a queue - but in practice, even if it is a piece of the puzzle, it seems not to be the whole puzzle.

@benjaminp

An update to #860 now fixes another case where connection windows could be starved.

@sandersaares
Author

Tried the latest state of that branch; still no difference in our real workload - the deadlocks occur in the same manner.

@bvinc

bvinc commented Sep 10, 2025

I spent a day looking at this program, since I believed it was probably related to the logical deadlock I was experiencing, which was fixed by #860.

The deadlocks, every time I saw them, were happening because the server side, sending to the client, ran out of connection-level capacity.

Why did the server run out of connection-level capacity? Because the client wasn't sending WINDOW_UPDATE frames for stream id 0.

Why wasn't the client sending WINDOW_UPDATE frames for stream 0? Because the client's recv_flow had window_size=0 and available=0. In that case, the method unclaimed_capacity will always return None. For the client to send a WINDOW_UPDATE, recv_flow.available needs to be higher, and it only gets higher when capacity is released.

Why isn't capacity being released? Because no one is calling the flow_control's release_capacity method in the deadlock example.

So why isn't it deterministic? Because from the client's perspective, its recv_flow capacity is automatically released when a request finishes. If requests are mostly being completed one or two at a time, it won't deadlock. It deadlocks once the server puts all of its connection-level capacity into sending partially-completed responses. From the client's perspective, this is all "claimed capacity", and a WINDOW_UPDATE will never be sent back to the server. Here's an example of some of the logs I produced from this deadlock program:

2025-09-10T00:08:05.220886Z TRACE client:Connection{peer=Client}:poll: h2::proto::streams::recv: recv_data; size=10240; connection=30720; stream=65535
2025-09-10T00:08:05.221035Z TRACE client:Connection{peer=Client}:poll: h2::proto::streams::recv: recv_data; size=10240; connection=20480; stream=65535
2025-09-10T00:08:05.221348Z TRACE client:Connection{peer=Client}:poll: h2::proto::streams::recv: recv_data; size=6145; connection=10240; stream=61440
2025-09-10T00:08:05.221656Z TRACE client:Connection{peer=Client}:poll: h2::proto::streams::recv: recv_data; size=4095; connection=4095; stream=65535
2025-09-10T00:08:05.221953Z TRACE client:Connection{peer=Client}:poll:send_connection_window_update: h2::proto::streams::flow_control: checking if should send connection_window_update, NO! window_size: Window(0) available: Window(0)

So in conclusion, I think this is simply a bug in the deadlock program, as it's not calling release_capacity like it should. This has nothing to do with #860. As a matter of fact, this deadlock program never once attempted to assign capacity to a stream that was in a pending_open state.
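
For reference, a minimal sketch of what the missing release_capacity call looks like when an h2 client drains a response body; the read_body helper is illustrative, not code from this test:

use bytes::Bytes;

// Hypothetical helper: drain an h2 response body and hand the consumed window
// back to the peer as each chunk is processed.
async fn read_body(mut body: h2::RecvStream) -> Result<Vec<Bytes>, h2::Error> {
    let mut chunks = Vec::new();
    while let Some(chunk) = body.data().await {
        let chunk = chunk?;
        // Without this call the consumed bytes stay "claimed" on the client
        // side, no WINDOW_UPDATE for stream 0 is sent, and the peer can
        // eventually run out of connection-level capacity.
        body.flow_control().release_capacity(chunk.len())?;
        chunks.push(chunk);
    }
    Ok(chunks)
}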

@sandersaares
Author

@bvinc thank you for investigating! Your findings make sense and do seem to explain the logical deadlock here. Unfortunately, this means I failed to produce a correct minimal reproduction of our real deadlock (which uses Hyper on top of h2 and so already does the necessary capacity releasing, which I verified). I will attempt a better repro of the problem we are hitting.
