Check Quota Availability Before Deployment

Before deploying the accelerator, ensure sufficient quota availability for the required model.

We recommend increasing the capacity to 100k tokens for optimal performance.

Log in if you have not done so already:

azd auth login


📌 Default Models & Capacities:

gpt-4o:30, gpt-4o-mini:30, gpt-4:30, text-embedding-ada-002:80
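Each entry above pairs a model name with a capacity, separated by a colon. As a minimal sketch of how such a list can be split apart in a shell, assuming the comma-separated form used on the command line, the hypothetical helper `parse_models` below (not taken from quota_check_params.sh) prints one "model capacity" pair per line:

```shell
# Hypothetical helper for illustration; the real script may parse its input differently.
parse_models() {
  # Turn the comma-separated list into one entry per line, then split each entry.
  printf '%s\n' "$1" | tr ',' '\n' | while IFS= read -r entry; do
    [ -n "$entry" ] || continue
    # Text before the last ':' is the model name; text after it is the capacity.
    printf '%s %s\n' "${entry%:*}" "${entry##*:}"
  done
}

parse_models "gpt-4o:30,text-embedding-ada-002:80"
# prints:
#   gpt-4o 30
#   text-embedding-ada-002 80
```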

📌 Default Regions:

eastus, uksouth, eastus2, northcentralus, swedencentral, westus, westus2, southcentralus, canadacentral

Usage Scenarios:

  • No parameters passed → Default models and capacities will be checked in default regions.
  • Only model(s) provided → The script will check for those models in the default regions.
  • Only region(s) provided → The script will check default models in the specified regions.
  • Both models and regions provided → The script will check those models in the specified regions.
  • --verbose passed → Enables detailed logging output for debugging and traceability.
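The fallback behavior in these scenarios can be sketched as follows. This is an illustration only, not the implementation in quota_check_params.sh: the function name `resolve_options` and the variable names are assumptions, and only the default values are taken from this page.

```shell
# Hypothetical sketch of the documented fallback rules; the real script may differ.
DEFAULT_MODELS="gpt-4o:30,gpt-4o-mini:30,gpt-4:30,text-embedding-ada-002:80"
DEFAULT_REGIONS="eastus,uksouth,eastus2,northcentralus,swedencentral,westus,westus2,southcentralus,canadacentral"

resolve_options() {
  models=""
  regions=""
  verbose=false
  while [ "$#" -gt 0 ]; do
    case "$1" in
      --models)
        [ "$#" -ge 2 ] || { echo "--models needs a value" >&2; return 1; }
        models="$2"; shift 2 ;;
      --regions)
        [ "$#" -ge 2 ] || { echo "--regions needs a value" >&2; return 1; }
        regions="$2"; shift 2 ;;
      --verbose)
        verbose=true; shift ;;
      *)
        echo "Unknown option: $1" >&2; return 1 ;;
    esac
  done
  # Anything not supplied falls back to the documented defaults.
  models="${models:-$DEFAULT_MODELS}"
  regions="${regions:-$DEFAULT_REGIONS}"
}

resolve_options --regions eastus,westus
echo "models=$models regions=$regions verbose=$verbose"
```

Here only the regions are supplied, so the models fall back to the defaults and verbose logging stays off.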

Input Formats

Use the --models, --regions, and --verbose options for parameter handling:

✔️ Run without parameters to check default models & regions without verbose logging:

./quota_check_params.sh

✔️ Enable verbose logging:

./quota_check_params.sh --verbose

✔️ Check specific model(s) in default regions:

./quota_check_params.sh --models gpt-4o:30,text-embedding-ada-002:80

✔️ Check default models in specific region(s):

./quota_check_params.sh --regions eastus,westus

✔️ Check specific model(s) in specific region(s):

./quota_check_params.sh --models gpt-4o:30 --regions eastus,westus2

✔️ All parameters combined:

./quota_check_params.sh --models gpt-4:30,text-embedding-ada-002:80 --regions eastus,westus --verbose
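When the model and region lists are assembled elsewhere in a shell session, the comma-separated values these options expect can be built with a small helper. The sketch below is one possible approach; `join_csv` is a hypothetical name, and the final `echo` only prints the resulting command rather than running it.

```shell
# Hypothetical helper: join all arguments with commas.
join_csv() {
  # Inside the subshell, "$*" joins the arguments using the first character of IFS.
  ( IFS=','; printf '%s\n' "$*" )
}

models="$(join_csv gpt-4o:30 text-embedding-ada-002:80)"
regions="$(join_csv eastus westus2)"

# Remove the leading "echo" to actually run the check.
echo ./quota_check_params.sh --models "$models" --regions "$regions"
```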

Sample Output

The final table lists regions with available quota. You can select any of these regions for deployment.

[Image: quota check output]


If using Azure Portal and Cloud Shell

  1. Navigate to the Azure Portal.

  2. Click on Azure Cloud Shell in the top right navigation menu.

  3. Run the appropriate command based on your requirement:

    To check quota for the deployment

    curl -L -o quota_check_params.sh "https://raw.githubusercontent.com/microsoft/Conversation-Knowledge-Mining-Solution-Accelerator/main/infra/scripts/quota_check_params.sh"
    chmod +x quota_check_params.sh
    ./quota_check_params.sh
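Because `curl -L` without `-f` saves whatever the server returns, including an error page, it can be worth a quick sanity check before running the downloaded file. The sketch below assumes the script begins with a `#!` shebang line, which this page does not confirm; `looks_like_script` is a hypothetical helper name.

```shell
# Hypothetical check: succeed only if the file exists and its first line is a shebang.
looks_like_script() {
  [ -f "$1" ] || return 1
  first_line="$(head -n 1 "$1")"
  case "$first_line" in
    '#!'*) return 0 ;;
    *)     return 1 ;;
  esac
}

if looks_like_script quota_check_params.sh; then
  echo "quota_check_params.sh looks like a shell script"
else
  echo "quota_check_params.sh is missing or does not start with a shebang" >&2
fi
```

Adding `-f` to the curl command (`curl -fL -o ...`) makes curl exit with an error on HTTP failures instead of writing the error page to disk.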

If using VS Code or Codespaces

  1. Open the terminal in VS Code or Codespaces.

  2. If you're using VS Code, click the dropdown on the right side of the terminal window, and select Git Bash.

  3. Navigate to the scripts folder where the script files are located and make the script executable:

     cd infra/scripts
     chmod +x quota_check_params.sh
  4. Run the appropriate script based on your requirement:

    To check quota for the deployment

    ./quota_check_params.sh
  5. If you see the error _bash: az: command not found_, install the Azure CLI (the command below targets Debian/Ubuntu-based systems) and log in:

    curl -sL https://aka.ms/InstallAzureCLIDeb | sudo bash
    az login
  6. Rerun the script after installing Azure CLI.
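
The missing-CLI error in step 5 can also be caught up front. A minimal sketch, assuming a POSIX-compatible shell; `require_cmd` is a hypothetical helper and is not part of quota_check_params.sh:

```shell
# Hypothetical pre-flight check: succeed only if the named command is on PATH.
require_cmd() {
  if command -v "$1" >/dev/null 2>&1; then
    return 0
  fi
  echo "Missing required command: $1" >&2
  return 1
}

if ! require_cmd az; then
  echo "Install the Azure CLI, then rerun ./quota_check_params.sh" >&2
fi
```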