add 20th percentile for server startup timing #133

shaneknapp · 2024-05-16T17:03:21Z

we've found that adding the 20th percentile to server startup times gives us a more realistic view of a typical single-user server startup duration.

this is highly non-critical, but very useful. :)

shaneknapp · 2024-05-16T17:05:05Z

hmm, not sure why the tests are failing...

consideRatio · 2025-11-09T09:44:38Z

I've thought about this quite a bit now, and summarize some insights below.

Whenever a user starts a server, we can only tell that the spawn duration lies between two bucket boundaries, which are defined by JupyterHub's metrics and listed below. Note that le stands for "less than or equal to", and that each entry is a cumulative counter that only ever increases.

jupyterhub_server_spawn_duration_seconds_bucket{le="0.5",status="success"} 0.0
jupyterhub_server_spawn_duration_seconds_bucket{le="1.0",status="success"} 0.0
jupyterhub_server_spawn_duration_seconds_bucket{le="2.5",status="success"} 0.0
jupyterhub_server_spawn_duration_seconds_bucket{le="5.0",status="success"} 0.0
jupyterhub_server_spawn_duration_seconds_bucket{le="10.0",status="success"} 2.0
jupyterhub_server_spawn_duration_seconds_bucket{le="15.0",status="success"} 2.0
jupyterhub_server_spawn_duration_seconds_bucket{le="30.0",status="success"} 2.0
jupyterhub_server_spawn_duration_seconds_bucket{le="60.0",status="success"} 2.0
jupyterhub_server_spawn_duration_seconds_bucket{le="120.0",status="success"} 2.0
jupyterhub_server_spawn_duration_seconds_bucket{le="180.0",status="success"} 2.0
jupyterhub_server_spawn_duration_seconds_bucket{le="300.0",status="success"} 2.0
jupyterhub_server_spawn_duration_seconds_bucket{le="600.0",status="success"} 2.0
jupyterhub_server_spawn_duration_seconds_bucket{le="+Inf",status="success"} 2.0
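To make the bucket reasoning concrete, here is a minimal sketch that turns the cumulative counters listed above into per-bucket counts. The helper name `per_bucket_counts` is made up for illustration, and treating the first bucket's lower bound as 0 is an assumption.

```python
import math

# Cumulative counters copied from the metrics above: (le upper bound, count).
CUMULATIVE = [
    (0.5, 0.0), (1.0, 0.0), (2.5, 0.0), (5.0, 0.0), (10.0, 2.0),
    (15.0, 2.0), (30.0, 2.0), (60.0, 2.0), (120.0, 2.0), (180.0, 2.0),
    (300.0, 2.0), (600.0, 2.0), (math.inf, 2.0),
]


def per_bucket_counts(cumulative):
    """Return (lower, upper, count) for every bucket that holds observations."""
    counts = []
    lower, previous = 0.0, 0.0  # assumption: durations start at 0 seconds
    for upper, count in cumulative:
        in_bucket = count - previous  # observations that fall only in this bucket
        if in_bucket > 0:
            counts.append((lower, upper, in_bucket))
        lower, previous = upper, count
    return counts


print(per_bucket_counts(CUMULATIVE))
```

For this sample the only non-empty bucket is the one from 5 to 10 seconds, holding both spawns. That is all the metric tells us: each spawn took more than 5 and at most 10 seconds.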

Often the panel only reflects a single server startup, and the percentiles then end up as multiple values spread across that one bucket's range. For example, I tried this with five percentiles: 0, 25, 50, 75, and 100.
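To see why a single startup turns into several values, here is a minimal sketch of linear interpolation inside a histogram bucket, assuming the panel derives its percentiles from the cumulative bucket counters in roughly the way Prometheus's `histogram_quantile` does. The function name `estimate_quantile` and its edge-case handling (for example at the 0th percentile) are simplifications made up for illustration, not the exact Prometheus algorithm.

```python
import math

# Cumulative counters copied from the metrics above: (le upper bound, count).
BUCKETS = [
    (0.5, 0.0), (1.0, 0.0), (2.5, 0.0), (5.0, 0.0), (10.0, 2.0),
    (15.0, 2.0), (30.0, 2.0), (60.0, 2.0), (120.0, 2.0), (180.0, 2.0),
    (300.0, 2.0), (600.0, 2.0), (math.inf, 2.0),
]


def estimate_quantile(q, buckets):
    """Estimate the q-quantile (0 <= q <= 1) by interpolating within a bucket."""
    total = buckets[-1][1]
    if total == 0:
        return math.nan  # no observations recorded at all
    rank = q * total
    lower, below = 0.0, 0.0  # assumption: durations start at 0 seconds
    for upper, cumulative in buckets:
        in_bucket = cumulative - below
        # Simplification: skip empty buckets so q=0 lands in the first non-empty one.
        if in_bucket > 0 and cumulative >= rank:
            if math.isinf(upper):
                return lower  # no finite upper bound to interpolate towards
            return lower + (upper - lower) * (rank - below) / in_bucket
        lower, below = upper, cumulative
    return math.nan


for pct in (0, 25, 50, 75, 100):
    print(pct, estimate_quantile(pct / 100, BUCKETS))
```

With both recorded spawns in the 5 to 10 second bucket, the five percentiles come out evenly spread at 5, 6.25, 7.5, 8.75, and 10 seconds, even though the real durations are unknown within that range.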
Sometimes we have multiple server startup times recorded during a single time step, and then it can look like this: [screenshot not preserved]

My current opinion

I think overall, the 99th or 100th percentile represents the worst case, while the 50th percentile (the median) represents the best guess at a typical spawn time. Both of these seem reasonable to me.

I think it could also make sense to see the best case alongside the worst case.

Beyond that, I think it's reasonable to add more points to help see the skew, such as 25 and 75, but we should maintain even spacing between all points, so 0, 25, 50, 75, 100, rather than 0, 20, 50, 100.

shaneknapp · 2025-11-12T18:46:53Z

i'll go ahead and close this in favor of #161

add 20th percentile for server startup timing

a8f45b9

minrk closed this Jul 22, 2025

minrk reopened this Jul 22, 2025

yuvipanda added this to PR triage (experimental) Aug 14, 2025

yuvipanda moved this to Backlog in PR triage (experimental) Aug 14, 2025

jupyterhub-pr-triage-board-bot bot removed this from PR triage (experimental) Nov 5, 2025

jupyterhub-pr-triage-board-bot bot added this to PR triage (experimental) Nov 5, 2025

consideRatio closed this Nov 9, 2025

github-project-automation bot moved this to Done in PR triage (experimental) Nov 9, 2025

consideRatio reopened this Nov 9, 2025

consideRatio mentioned this pull request Nov 9, 2025

jupyterhub dashboard: transition two panels to heatmaps, and tweak existing heatmaps #161

Merged

shaneknapp closed this Nov 12, 2025

jupyterhub-pr-triage-board-bot bot removed this from PR triage (experimental) Nov 12, 2025
