Skip to content

Conversation

ksandowi
Copy link

@ksandowi ksandowi commented Feb 18, 2025

Description

This PR is an extension to #799 with Intel TDX support enabled for all docsum microservices.

Subcharts enabled to be run with Intel TDX:

  • docsum
  • whisper

Issues

n/a

Type of change

  • New feature (non-breaking change which adds new functionality)

Dependencies

n/a

Tests

Tested the deployment of docsum example on the running cluster without and with Intel TDX for:

  • docsum
  • docsum-ui
  • llm-uservice
  • tgi
  • whisper

@ksandowi ksandowi force-pushed the docsum-with-tdx branch 2 times, most recently from 1d5cdaa to e7f14db Compare February 18, 2025 19:42
Copy link
Contributor

@JakubLedworowski JakubLedworowski left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The implementation lacks changes for docsum-ui. Also tgi and llm-uservice are already enabled.
Please fix the following:

  • fix description accordingly to what new component is enabled,
  • correct the formatting of the PR description to match the suggested default format,
  • fix the sign-off to have only one correct entry with valid email address.

# helm install docsum docsum --set global.HUGGINGFACEHUB_API_TOKEN=${HFTOKEN} --values docsum/gaudi-tgi-values.yaml ...
# To use Gaudi device with vLLM
# helm install docsum docsum --set global.HUGGINGFACEHUB_API_TOKEN=${HFTOKEN} --values docsum/gaudi-vllm-values.yaml ..
# To run chatqna with Intel TDX feature
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

docsum

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fixed

# To use Gaudi device with vLLM
# helm install docsum docsum --set global.HUGGINGFACEHUB_API_TOKEN=${HFTOKEN} --values docsum/gaudi-vllm-values.yaml ..
# To run chatqna with Intel TDX feature
# helm install docsum docsum --set global.HUGGINGFACEHUB_API_TOKEN=${HFTOKEN} --set tgi.LLM_MODEL_ID=${MODELNAME} --set docsum.tdxEnabled=true --set docsum.resources.limits.memory=1Gi --set docsum-ui.tdxEnabled=true --set docsum-ui.resources.limits.memory=2Gi --set llm-uservice.tdxEnabled=true --set llm-uservice.resources.limits.memory=4Gi --set tgi.tdxEnabled=true --set tgi.resources.limits.memory=80Gi --set whisper.tdxEnabled=true --set whisper.resources.limits.memory=13Gi
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

docsum-ui not enabled

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It is with this line:
--set docsum-ui.tdxEnabled=true --set docsum-ui.resources.limits.memory=2Gi

krzysztof.sandowicz and others added 3 commits February 19, 2025 13:10
…n docsum README.md

Signed-off-by: Krzysztof Sandowicz <[email protected]>
for more information, see https://pre-commit.ci

Signed-off-by: Krzysztof Sandowicz <[email protected]>
Signed-off-by: Krzysztof Sandowicz <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants