Description
Your current environment
No module named 'numpy'
× Building editable for vllm (pyproject.toml) did not run successfully.
│ exit code: 1
╰─> [541 lines of output]
/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/torch/_subclasses/functional_tensor.py:276: UserWarning: Failed to initialize NumPy: No module named 'numpy' (Triggered internally at /pytorch/torch/csrc/utils/tensor_numpy.cpp:81.)
cpu = _conversion_method_template(device=torch.device("cpu"))
running editable_wheel
creating /tmp/pip-wheel-a7zxip6g/.tmp-t3dsx5by/vllm.egg-info
writing /tmp/pip-wheel-a7zxip6g/.tmp-t3dsx5by/vllm.egg-info/PKG-INFO
writing dependency_links to /tmp/pip-wheel-a7zxip6g/.tmp-t3dsx5by/vllm.egg-info/dependency_links.txt
writing entry points to /tmp/pip-wheel-a7zxip6g/.tmp-t3dsx5by/vllm.egg-info/entry_points.txt
writing requirements to /tmp/pip-wheel-a7zxip6g/.tmp-t3dsx5by/vllm.egg-info/requires.txt
writing top-level names to /tmp/pip-wheel-a7zxip6g/.tmp-t3dsx5by/vllm.egg-info/top_level.txt
writing manifest file '/tmp/pip-wheel-a7zxip6g/.tmp-t3dsx5by/vllm.egg-info/SOURCES.txt'
reading manifest template 'MANIFEST.in'
adding license file 'LICENSE'
writing manifest file '/tmp/pip-wheel-a7zxip6g/.tmp-t3dsx5by/vllm.egg-info/SOURCES.txt'
creating '/tmp/pip-wheel-a7zxip6g/.tmp-t3dsx5by/vllm-0.9.1.dev5+ga3896c7f0.cu122.dist-info'
creating /tmp/pip-wheel-a7zxip6g/.tmp-t3dsx5by/vllm-0.9.1.dev5+ga3896c7f0.cu122.dist-info/WHEEL
running build_py
running build_ext
-- The CXX compiler identification is GNU 11.4.0
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Check for working CXX compiler: /usr/bin/c++ - skipped
-- Detecting CXX compile features
-- Detecting CXX compile features - done
-- Build type: RelWithDebInfo
-- Target device: cuda
-- Found Python: /home/house365ai/anaconda3/envs/vllm6/bin/python3.10 (found version "3.10.16") found components: Interpreter Development.Module Development.SABIModule
-- Found python matching: /home/house365ai/anaconda3/envs/vllm6/bin/python3.10.
-- Found CUDA: /usr/local/cuda-12.2 (found version "12.2")
-- The CUDA compiler identification is NVIDIA 12.2.91 with host compiler GNU 12.3.0
-- Detecting CUDA compiler ABI info
-- Detecting CUDA compiler ABI info - done
-- Check for working CUDA compiler: /usr/local/cuda-12.2/bin/nvcc - skipped
-- Detecting CUDA compile features
-- Detecting CUDA compile features - done
-- Found CUDAToolkit: /usr/local/cuda-12.2/include (found version "12.2.91")
-- Performing Test CMAKE_HAVE_LIBC_PTHREAD
-- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success
-- Found Threads: TRUE
-- PyTorch: CUDA detected: 12.2
-- PyTorch: CUDA nvcc is: /usr/local/cuda-12.2/bin/nvcc
-- PyTorch: CUDA toolkit directory: /usr/local/cuda-12.2
-- PyTorch: Header version is: 12.2
-- Found Python: /home/house365ai/anaconda3/envs/vllm6/bin/python3.10 (found version "3.10.16") found components: Interpreter
CMake Warning at /tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:140 (message):
Failed to compute shorthash for libnvrtc.so
Call Stack (most recent call first):
/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include)
/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package)
CMakeLists.txt:80 (find_package)
CMake Warning (dev) at /tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/cmake/data/share/cmake-4.0/Modules/FindPackageHandleStandardArgs.cmake:430 (message):
The package name passed to find_package_handle_standard_args() (nvtx3) does
not match the name of the calling package (Caffe2). This can lead to
problems in calling code that expects find_package() result variables
(e.g., `_FOUND`) to follow a certain pattern.
Call Stack (most recent call first):
/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:178 (find_package_handle_standard_args)
/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include)
/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package)
CMakeLists.txt:80 (find_package)
This warning is for project developers. Use -Wno-dev to suppress it.
-- Could NOT find nvtx3 (missing: nvtx3_dir)
CMake Warning at /tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:184 (message):
Cannot find NVTX3, find old NVTX instead
Call Stack (most recent call first):
/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include)
/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package)
CMakeLists.txt:80 (find_package)
-- USE_CUDNN is set to 0. Compiling without cuDNN support
-- USE_CUSPARSELT is set to 0. Compiling without cuSPARSELt support
-- USE_CUDSS is set to 0. Compiling without cuDSS support
-- USE_CUFILE is set to 0. Compiling without cuFile support
-- Autodetected CUDA architecture(s): 8.0 8.0 8.0 8.0 8.0 8.0 8.0 8.0
-- Added CUDA NVCC flags for: -gencode;arch=compute_80,code=sm_80
CMake Warning at /tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:22 (message):
static library kineto_LIBRARY-NOTFOUND not found.
Call Stack (most recent call first):
/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:125 (append_torchlib_if_found)
CMakeLists.txt:80 (find_package)
-- Found Torch: /tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/torch/lib/libtorch.so
-- CUDA target architectures: 8.0
-- CUDA supported target architectures: 8.0
-- FetchContent base directory: /home/house365ai/xxm/vllm6/.deps
-- Enabling cumem allocator extension.
-- CMake Version: 4.0.2
-- CUTLASS 3.9.2
-- Found CUDAToolkit: /usr/local/cuda-12.2/targets/x86_64-linux/include (found version "12.2.91")
-- CUDART: /usr/local/cuda-12.2/lib64/libcudart.so
-- CUDA Driver: /usr/local/cuda-12.2/lib64/libcuda.so
-- NVRTC: /usr/local/cuda-12.2/lib64/libnvrtc.so
-- Default Install Location: install
-- Found Python3: /home/house365ai/anaconda3/envs/vllm6/bin/python3.10 (found suitable version "3.10.16", minimum required is "3.5") found components: Interpreter
-- CUDA Compilation Architectures: 70;72;75;80;86;87;89;90;90a
-- Enable caching of reference results in conv unit tests
-- Enable rigorous conv problem sizes in conv unit tests
-- Grid Dependency Control (GDC) is enabled for SM100 kernels (required for programmatic dependent launches).
-- Using the following NVCC flags:
--expt-relaxed-constexpr
-ftemplate-backtrace-limit=0
-DCUTLASS_TEST_LEVEL=0
-DCUTLASS_TEST_ENABLE_CACHED_RESULTS=1
-DCUTLASS_CONV_UNIT_TEST_RIGOROUS_SIZE_ENABLED=1
-DCUTLASS_DEBUG_TRACE_LEVEL=0
-Xcompiler=-Wconversion
-Xcompiler=-fno-strict-aliasing
-lineinfo
-- Configuring cublas ...
-- cuBLAS Disabled.
-- Configuring cuBLAS ... done.
-- Marlin generation script hash: 0fcae1cd7de8a1a79f7344bb65cf9436
-- Last run Marlin generate script hash:
-- Marlin generation completed successfully.
-- Building Marlin kernels for archs: 8.0
-- Building AllSpark kernels for archs: 8.0
-- Not building scaled_mm_c3x_sm90 as no compatible archs found in CUDA target architectures
-- Not building scaled_mm_c3x_100 as no compatible archs found in CUDA target architectures
-- Building scaled_mm_c2x for archs: 8.0
-- Not building sparse_scaled_mm_c3x as no compatible archs found in CUDA target architectures
-- Not building NVFP4 as no compatible archs were found.
-- Not building CUTLASS MLA as no compatible archs were found.
-- Not building grouped_mm_c3x as no compatible archs found in CUDA target architectures
-- Not building Machete kernels as no compatible archs found in CUDA target architectures
-- Enabling C extension.
-- Marlin MOE generation script hash: 1bcbe1b319bde437f422958bd8f0308c
-- Last run Marlin MOE generate script hash:
-- Marlin MOE generation completed successfully.
-- Building Marlin MOE kernels for archs: 8.0
-- Enabling moe extension.
CMake Warning (dev) at /tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/cmake/data/share/cmake-4.0/Modules/FetchContent.cmake:1564 (cmake_parse_arguments):
The BUILD_COMMAND keyword was followed by an empty string or no value at
all. Policy CMP0174 is not set, so cmake_parse_arguments() will unset the
ARG_BUILD_COMMAND variable rather than setting it to an empty string.
Call Stack (most recent call first):
/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/cmake/data/share/cmake-4.0/Modules/FetchContent.cmake:2145:EVAL:2 (__FetchContent_doPopulation)
/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/cmake/data/share/cmake-4.0/Modules/FetchContent.cmake:2145 (cmake_language)
/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/cmake/data/share/cmake-4.0/Modules/FetchContent.cmake:2384 (__FetchContent_Populate)
cmake/external_projects/flashmla.cmake:30 (FetchContent_MakeAvailable)
CMakeLists.txt:790 (include)
This warning is for project developers. Use -Wno-dev to suppress it.
CMake Warning (dev) at /tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/cmake/data/share/cmake-4.0/Modules/FetchContent.cmake:1564 (cmake_parse_arguments):
The CONFIGURE_COMMAND keyword was followed by an empty string or no value
at all. Policy CMP0174 is not set, so cmake_parse_arguments() will unset
the ARG_CONFIGURE_COMMAND variable rather than setting it to an empty
string.
Call Stack (most recent call first):
/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/cmake/data/share/cmake-4.0/Modules/FetchContent.cmake:2145:EVAL:2 (__FetchContent_doPopulation)
/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/cmake/data/share/cmake-4.0/Modules/FetchContent.cmake:2145 (cmake_language)
/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/cmake/data/share/cmake-4.0/Modules/FetchContent.cmake:2384 (__FetchContent_Populate)
cmake/external_projects/flashmla.cmake:30 (FetchContent_MakeAvailable)
CMakeLists.txt:790 (include)
This warning is for project developers. Use -Wno-dev to suppress it.
-- FlashMLA is available at /home/house365ai/xxm/vllm6/.deps/flashmla-src
[1/9] Creating directories for 'vllm-flash-attn-populate'
[1/9] Performing download step (git clone) for 'vllm-flash-attn-populate'
Cloning into 'vllm-flash-attn-src'...
remote: Enumerating objects: 9073, done.
remote: Counting objects: 100% (7/7), done.
remote: Compressing objects: 100% (4/4), done.
remote: Total 9073 (delta 3), reused 3 (delta 3), pack-reused 9066 (from 2)
Receiving objects: 100% (9073/9073), 8.91 MiB | 401.00 KiB/s, done.
Resolving deltas: 100% (6785/6785), done.
HEAD is now at 8798f27 Add rotary triton operator to vllm_flash_attn (#64)
Submodule 'csrc/composable_kernel' (https://github.com/ROCm/composable_kernel.git) registered for path 'csrc/composable_kernel'
Submodule 'csrc/cutlass' (https://github.com/NVIDIA/cutlass.git) registered for path 'csrc/cutlass'
Cloning into '/home/house365ai/xxm/vllm6/.deps/vllm-flash-attn-src/csrc/composable_kernel'...
Cloning into '/home/house365ai/xxm/vllm6/.deps/vllm-flash-attn-src/csrc/cutlass'...
fatal: unable to access 'https://github.com/NVIDIA/cutlass.git/': GnuTLS recv error (-110): The TLS connection was non-properly terminated.
fatal: clone of 'https://github.com/NVIDIA/cutlass.git' into submodule path '/home/house365ai/xxm/vllm6/.deps/vllm-flash-attn-src/csrc/cutlass' failed
Failed to clone 'csrc/cutlass'. Retry scheduled
Cloning into '/home/house365ai/xxm/vllm6/.deps/vllm-flash-attn-src/csrc/cutlass'...
fatal: unable to access 'https://github.com/NVIDIA/cutlass.git/': Failed to connect to github.com port 443 after 133821 ms: Connection timed out
fatal: clone of 'https://github.com/NVIDIA/cutlass.git' into submodule path '/home/house365ai/xxm/vllm6/.deps/vllm-flash-attn-src/csrc/cutlass' failed
Failed to clone 'csrc/cutlass' a second time, aborting
CMake Error at vllm-flash-attn-subbuild/vllm-flash-attn-populate-prefix/tmp/vllm-flash-attn-populate-gitclone.cmake:75 (message):
Failed to update submodules in:
'/home/house365ai/xxm/vllm6/.deps/vllm-flash-attn-src'
FAILED: vllm-flash-attn-populate-prefix/src/vllm-flash-attn-populate-stamp/vllm-flash-attn-populate-download /home/house365ai/xxm/vllm6/.deps/vllm-flash-attn-subbuild/vllm-flash-attn-populate-prefix/src/vllm-flash-attn-populate-stamp/vllm-flash-attn-populate-download
cd /home/house365ai/xxm/vllm6/.deps && /tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/cmake/data/bin/cmake -DCMAKE_MESSAGE_LOG_LEVEL=VERBOSE -P /home/house365ai/xxm/vllm6/.deps/vllm-flash-attn-subbuild/vllm-flash-attn-populate-prefix/tmp/vllm-flash-attn-populate-gitclone.cmake && /tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/cmake/data/bin/cmake -E touch /home/house365ai/xxm/vllm6/.deps/vllm-flash-attn-subbuild/vllm-flash-attn-populate-prefix/src/vllm-flash-attn-populate-stamp/vllm-flash-attn-populate-download
ninja: build stopped: subcommand failed.
CMake Error at /tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/cmake/data/share/cmake-4.0/Modules/FetchContent.cmake:1918 (message):
Build step for vllm-flash-attn failed: 1
Call Stack (most recent call first):
/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/cmake/data/share/cmake-4.0/Modules/FetchContent.cmake:1609 (__FetchContent_populateSubbuild)
/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/cmake/data/share/cmake-4.0/Modules/FetchContent.cmake:2145:EVAL:2 (__FetchContent_doPopulation)
/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/cmake/data/share/cmake-4.0/Modules/FetchContent.cmake:2145 (cmake_language)
/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/cmake/data/share/cmake-4.0/Modules/FetchContent.cmake:2384 (__FetchContent_Populate)
cmake/external_projects/vllm_flash_attn.cmake:62 (FetchContent_MakeAvailable)
CMakeLists.txt:793 (include)
-- Configuring incomplete, errors occurred!
Traceback (most recent call last):
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/command/editable_wheel.py", line 139, in run
self._create_wheel_file(bdist_wheel)
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/command/editable_wheel.py", line 340, in _create_wheel_file
files, mapping = self._run_build_commands(dist_name, unpacked, lib, tmp)
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/command/editable_wheel.py", line 263, in _run_build_commands
self._run_build_subcommands()
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/command/editable_wheel.py", line 290, in _run_build_subcommands
self.run_command(name)
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/_distutils/cmd.py", line 357, in run_command
self.distribution.run_command(command)
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/dist.py", line 1104, in run_command
super().run_command(command)
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/_distutils/dist.py", line 1021, in run_command
cmd_obj.run()
File "<string>", line 267, in run
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/command/build_ext.py", line 99, in run
_build_ext.run(self)
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/_distutils/command/build_ext.py", line 368, in run
self.build_extensions()
File "<string>", line 229, in build_extensions
File "<string>", line 207, in configure
File "/home/house365ai/anaconda3/envs/vllm6/lib/python3.10/subprocess.py", line 369, in check_call
raise CalledProcessError(retcode, cmd)
subprocess.CalledProcessError: Command '['cmake', '/home/house365ai/xxm/vllm6', '-G', 'Ninja', '-DCMAKE_BUILD_TYPE=RelWithDebInfo', '-DVLLM_TARGET_DEVICE=cuda', '-DVLLM_PYTHON_EXECUTABLE=/home/house365ai/anaconda3/envs/vllm6/bin/python3.10', '-DVLLM_PYTHON_PATH=/tmp/pip-build-env-_pl3h8ke/site:/home/house365ai/anaconda3/envs/vllm6/lib/python310.zip:/home/house365ai/anaconda3/envs/vllm6/lib/python3.10:/home/house365ai/anaconda3/envs/vllm6/lib/python3.10/lib-dynload:/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages:/tmp/pip-build-env-_pl3h8ke/normal/lib/python3.10/site-packages:/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/_vendor', '-DFETCHCONTENT_BASE_DIR=/home/house365ai/xxm/vllm6/.deps', '-DNVCC_THREADS=1', '-DCMAKE_JOB_POOL_COMPILE:STRING=compile', '-DCMAKE_JOB_POOLS:STRING=compile=384', '-DCMAKE_CUDA_COMPILER=/usr/local/cuda-12.2/bin/nvcc']' returned non-zero exit status 1.
/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/_distutils/dist.py:1021: _DebuggingTips: Problem in editable installation.
!!
********************************************************************************
An error happened while installing `vllm` in editable mode.
The following steps are recommended to help debug this problem:
- Try to install the project normally, without using the editable mode.
Does the error still persist?
(If it does, try fixing the problem before attempting the editable mode).
- If you are using binary extensions, make sure you have all OS-level
dependencies installed (e.g. compilers, toolchains, binary libraries, ...).
- Try the latest version of setuptools (maybe the error was already fixed).
- If you (or your project dependencies) are using any setuptools extension
or customization, make sure they support the editable mode.
After following the steps above, if the problem still persists and
you think this is related to how setuptools handles editable installations,
please submit a reproducible example
(see https://stackoverflow.com/help/minimal-reproducible-example) to:
https://github.com/pypa/setuptools/issues
See https://setuptools.pypa.io/en/latest/userguide/development_mode.html for details.
********************************************************************************
!!
cmd_obj.run()
Traceback (most recent call last):
File "/home/house365ai/anaconda3/envs/vllm6/lib/python3.10/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 389, in <module>
main()
File "/home/house365ai/anaconda3/envs/vllm6/lib/python3.10/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 373, in main
json_out["return_val"] = hook(**hook_input["kwargs"])
File "/home/house365ai/anaconda3/envs/vllm6/lib/python3.10/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 303, in build_editable
return hook(wheel_directory, config_settings, metadata_directory)
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/build_meta.py", line 468, in build_editable
return self._build_with_temp_dir(
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/build_meta.py", line 404, in _build_with_temp_dir
self.run_setup()
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/build_meta.py", line 317, in run_setup
exec(code, locals())
File "<string>", line 684, in <module>
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/__init__.py", line 117, in setup
return distutils.core.setup(**attrs)
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/_distutils/core.py", line 186, in setup
return run_commands(dist)
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/_distutils/core.py", line 202, in run_commands
dist.run_commands()
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/_distutils/dist.py", line 1002, in run_commands
self.run_command(cmd)
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/dist.py", line 1104, in run_command
super().run_command(command)
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/_distutils/dist.py", line 1021, in run_command
cmd_obj.run()
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/command/editable_wheel.py", line 139, in run
self._create_wheel_file(bdist_wheel)
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/command/editable_wheel.py", line 340, in _create_wheel_file
files, mapping = self._run_build_commands(dist_name, unpacked, lib, tmp)
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/command/editable_wheel.py", line 263, in _run_build_commands
self._run_build_subcommands()
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/command/editable_wheel.py", line 290, in _run_build_subcommands
self.run_command(name)
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/_distutils/cmd.py", line 357, in run_command
self.distribution.run_command(command)
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/dist.py", line 1104, in run_command
super().run_command(command)
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/_distutils/dist.py", line 1021, in run_command
cmd_obj.run()
File "<string>", line 267, in run
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/command/build_ext.py", line 99, in run
_build_ext.run(self)
File "/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/_distutils/command/build_ext.py", line 368, in run
self.build_extensions()
File "<string>", line 229, in build_extensions
File "<string>", line 207, in configure
File "/home/house365ai/anaconda3/envs/vllm6/lib/python3.10/subprocess.py", line 369, in check_call
raise CalledProcessError(retcode, cmd)
subprocess.CalledProcessError: Command '['cmake', '/home/house365ai/xxm/vllm6', '-G', 'Ninja', '-DCMAKE_BUILD_TYPE=RelWithDebInfo', '-DVLLM_TARGET_DEVICE=cuda', '-DVLLM_PYTHON_EXECUTABLE=/home/house365ai/anaconda3/envs/vllm6/bin/python3.10', '-DVLLM_PYTHON_PATH=/tmp/pip-build-env-_pl3h8ke/site:/home/house365ai/anaconda3/envs/vllm6/lib/python310.zip:/home/house365ai/anaconda3/envs/vllm6/lib/python3.10:/home/house365ai/anaconda3/envs/vllm6/lib/python3.10/lib-dynload:/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages:/tmp/pip-build-env-_pl3h8ke/normal/lib/python3.10/site-packages:/tmp/pip-build-env-_pl3h8ke/overlay/lib/python3.10/site-packages/setuptools/_vendor', '-DFETCHCONTENT_BASE_DIR=/home/house365ai/xxm/vllm6/.deps', '-DNVCC_THREADS=1', '-DCMAKE_JOB_POOL_COMPILE:STRING=compile', '-DCMAKE_JOB_POOLS:STRING=compile=384', '-DCMAKE_CUDA_COMPILER=/usr/local/cuda-12.2/bin/nvcc']' returned non-zero exit status 1.
[end of output]
note: This error originates from a subprocess, and is likely not a problem with pip.
ERROR: Failed building editable for vllm
Failed to build vllm
ERROR: Failed to build installable wheels for some pyproject.toml based projects (vllm)
🐛 Describe the bug
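Running pip install -e . from the vLLM source checkout fails with the output above. The first hard error occurs while CMake fetches vllm-flash-attn: cloning the csrc/cutlass submodule from github.com fails twice (first a GnuTLS recv error, then a connection timeout), so the CMake configure step aborts and the editable build exits with code 1.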
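For anyone hitting the same thing: a possible workaround sketch, assuming the failure is purely a flaky connection to github.com. It retries a command with exponential backoff, which could be pointed at the submodule checkout that failed in the log. The helper name run_with_retries and its parameters are hypothetical, not part of vLLM or pip, and whether a later pip install -e . reuses the populated .deps directory has not been verified here.

```python
import subprocess
import sys
import time


def run_with_retries(cmd, cwd=None, attempts=5, delay=1.0):
    """Run cmd until it exits with status 0 or the attempts run out.

    Returns the number of attempts used on success. If every attempt
    fails, the CalledProcessError from the last attempt is re-raised.
    (Hypothetical helper for illustration only.)
    """
    for attempt in range(1, attempts + 1):
        try:
            subprocess.run(cmd, cwd=cwd, check=True)
            return attempt
        except subprocess.CalledProcessError:
            if attempt == attempts:
                raise
            # Exponential backoff: delay, 2*delay, 4*delay, ...
            time.sleep(delay * 2 ** (attempt - 1))


# Example call (not executed here). The path is copied from the log
# above and must be adjusted to the local checkout:
#
# run_with_retries(
#     ["git", "submodule", "update", "--init", "--recursive"],
#     cwd="/home/house365ai/xxm/vllm6/.deps/vllm-flash-attn-src",
#     attempts=5,
#     delay=10.0,
# )
```

If the retries still time out, the problem is most likely network access to github.com (proxy, firewall, or mirror configuration) rather than anything in the vLLM build itself.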