Delay setup_model_metrics() until /metrics is called #327
Conversation
Walkthrough
Sequence Diagram(s)
sequenceDiagram
participant Client
participant MetricsEndpoint
participant SetupModelMetrics
participant PrometheusExporter
Client->>MetricsEndpoint: HTTP GET /metrics
MetricsEndpoint->>SetupModelMetrics: await setup_model_metrics()
SetupModelMetrics-->>MetricsEndpoint: (runs once, subsequent calls await same task)
MetricsEndpoint->>PrometheusExporter: generate_latest(metrics_registry)
PrometheusExporter-->>MetricsEndpoint: metrics data
MetricsEndpoint->>Client: PlainTextResponse(metrics)
Estimated code review effort: 🎯 2 (Simple) | ⏱️ ~8 minutes
The E2E test is not using the proposed changes. This PR deletes the
/retest
@tisnik the E2E test is not using the code from PR, we need to fix that first:
Rebased it on top of main, let's see
8d190e4 to 576341d
This patch moves the setup_model_metrics() call from service startup to the first time the /metrics endpoint is called. This speeds up lightspeed-stack service initialization and also makes lightspeed-stack more resilient to service initialization order, because it no longer requires llama-stack to be started first (setup_model_metrics() tries to connect to llama-stack and fetch the list of models from it).
Signed-off-by: Lucas Alvares Gomes <[email protected]>
build actual image instead of using the actual one
fix tests
c8a8cb7 to 1acf3da
tisnik
left a comment
LGTM
Description
This patch moves the setup_model_metrics() call from service startup to the first time the /metrics endpoint is called. This speeds up lightspeed-stack service initialization and also makes lightspeed-stack more resilient to service initialization order, because it no longer requires llama-stack to be started first (setup_model_metrics() tries to connect to llama-stack and fetch the list of models from it).
Type of change
Related Tickets & Documents
Checklist before requesting a review
Testing
Summary by CodeRabbit
New Features
Bug Fixes
/metrics endpoint to handle setup asynchronously, preventing redundant or repeated setup calls.
Tests