Skip to content
@NVIDIA

NVIDIA Corporation

Pinned Loading

  1. cuopt cuopt Public

    GPU accelerated decision optimization

    Cuda 601 100

  2. cuopt-examples cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook 388 62

  3. open-gpu-kernel-modules open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C 16.4k 1.5k

  4. aistore aistore Public

    AIStore: scalable storage for AI applications

    Go 1.7k 228

  5. nvidia-container-toolkit nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go 3.9k 446

  6. GenerativeAIExamples GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook 3.7k 928

Repositories

Showing 10 of 639 repositories
  • KAI-Scheduler Public

    KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale

    NVIDIA/KAI-Scheduler’s past year of commit activity
    Go 956 Apache-2.0 114 23 37 Updated Dec 7, 2025
  • nvidia-resiliency-ext Public

    NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the effective training time by minimizing the downtime due to failures and interruptions.

    NVIDIA/nvidia-resiliency-ext’s past year of commit activity
    Python 239 37 1 15 Updated Dec 7, 2025
  • Megatron-LM Public

    Ongoing research training transformer models at scale

    NVIDIA/Megatron-LM’s past year of commit activity
    Python 14,440 3,350 329 241 Updated Dec 7, 2025
  • accelerated-computing-hub Public

    NVIDIA curated collection of educational resources related to general purpose GPU programming.

    NVIDIA/accelerated-computing-hub’s past year of commit activity
    Jupyter Notebook 925 167 13 4 Updated Dec 7, 2025
  • Fuser Public

    A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

    NVIDIA/Fuser’s past year of commit activity
    C++ 364 69 205 (15 issues need help) 212 Updated Dec 7, 2025
  • TensorRT-LLM Public

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

    NVIDIA/TensorRT-LLM’s past year of commit activity
    Python 12,323 1,917 618 453 Updated Dec 7, 2025
  • cuda-python Public

    CUDA Python: Performance meets Productivity

    NVIDIA/cuda-python’s past year of commit activity
    Python 3,069 227 204 15 Updated Dec 7, 2025
  • k8s-dra-driver-gpu Public

    NVIDIA DRA Driver for GPUs

    NVIDIA/k8s-dra-driver-gpu’s past year of commit activity
    Go 504 Apache-2.0 101 94 27 Updated Dec 7, 2025
  • TensorRT-Incubator Public

    Experimental projects related to TensorRT

    NVIDIA/TensorRT-Incubator’s past year of commit activity
    MLIR 116 19 37 (1 issue needs help) 12 Updated Dec 7, 2025
  • Model-Optimizer Public

    A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

    NVIDIA/Model-Optimizer’s past year of commit activity
    Python 1,613 Apache-2.0 206 69 44 Updated Dec 7, 2025