vLLM for DGX Spark (GB10). After pip install, run: sudo dgx-spark-vllm-install
Project description
DGX Spark vLLM
Pre-built vLLM environment for NVIDIA DGX Spark (GB10, sm_121).
About
This package contains the vLLM environment extracted from NVIDIA's official
nvcr.io/nvidia/vllm:25.09-py3 Docker image, configured for local installation
on DGX Spark without Docker.
Why? The official vLLM doesn't support sm_121 (GB10) yet. NVIDIA's Docker image has patches for Blackwell support, but requires running everything in a container. This package lets you run vLLM natively on DGX Spark.
Requirements
- NVIDIA DGX Spark (GB10 chip, sm_121)
- Ubuntu with CUDA 13.0 installed
- Python 3.12
- aarch64 (ARM64) architecture
Installation
pip install dgx-spark-vllm
sudo dgx-spark-vllm-install
Then restart your shell or run:
source /etc/profile.d/nvidia-venv.sh
Included Versions
| Package | Version |
|---|---|
| PyTorch | 2.9.0a0+50eac811a6.nv25.09 |
| vLLM | 0.10.1.1+381074ae.nv25.09 |
| CUDA | 13.0 |
| Triton | 3.4.0 |
| FlashAttention | 2.7.4 |
| cuDNN | 9.13.0 |
| NCCL | 2.27.7 |
Usage
import torch
print(torch.__version__) # 2.9.0a0+50eac811a6.nv25.09
print(torch.cuda.get_device_name(0)) # NVIDIA GB10
import vllm
print(vllm.__version__) # 0.10.1.1+381074ae.nv25.09
Run vLLM server:
python3 -m vllm.entrypoints.openai.api_server --model <model_name>
What's Installed
/home/srpost/shared/nvidia-venv/- Python packages/opt/hpcx/- HPC-X libraries (UCX, UCC, OpenMPI)/usr/lib/aarch64-linux-gnu/libcudnn*- cuDNN/usr/lib/aarch64-linux-gnu/libnccl*- NCCL/usr/local/lib/libnvpl*- NVPL (NVIDIA Performance Libraries)/etc/ld.so.conf.d/00-hpcx.conf- Library path config/etc/profile.d/nvidia-venv.sh- PYTHONPATH config
License
The installer is Apache 2.0. The installed packages retain their original licenses (PyTorch BSD, vLLM Apache 2.0, NVIDIA components under NVIDIA license).
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file dgx_spark_vllm-25.9.3.tar.gz.
File metadata
- Download URL: dgx_spark_vllm-25.9.3.tar.gz
- Upload date:
- Size: 4.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0b2922d9d8ac1e1d137111e41d2ac0e18d774908c254fb0bc2df89af8ca4ec74
|
|
| MD5 |
6a94a0071a5e8bb25475ea8f6e9b7025
|
|
| BLAKE2b-256 |
355521ecdbf58caba8dd1b9e5c6022bb8b4d2591b8012967301913a40b5a7302
|
File details
Details for the file dgx_spark_vllm-25.9.3-py3-none-any.whl.
File metadata
- Download URL: dgx_spark_vllm-25.9.3-py3-none-any.whl
- Upload date:
- Size: 4.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ced810431d614cb5d7c2b400d5c828738921d12e97e68bf71ff054bdb1055409
|
|
| MD5 |
40ade964d0936c2a406e9597fe56e363
|
|
| BLAKE2b-256 |
793022280f944b896d7fc95fc0915f436019229a6e07a2bcd8242d3dabe3a429
|