A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
We publish nightly wheel packages on https://pypi.nvidia.com, and wheels built against the stable torch release on https://pypi.org. Wheels are published for Python 3.10 and 3.12.
build environment | CUDA 11.8 | CUDA 12.1 | CUDA 12.4 |
---|---|---|---|
torch 2.5 (pypi.org) | nvfuser-cu118-torch25 | nvfuser-cu121-torch25 | nvfuser-cu124-torch25 |
torch nightly (pypi.nvidia.com) | nvfuser-cu118 | nvfuser-cu121 | nvfuser-cu124 |
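The naming pattern in the table can be sketched in a few lines of Python. The helper `nvfuser_package_name` and the constant `SUPPORTED_CUDA` are hypothetical names introduced here for illustration; they are not part of nvfuser and only mirror the rows and columns shown above.

```python
from typing import Optional

# CUDA versions that appear as columns in the table above.
SUPPORTED_CUDA = ("11.8", "12.1", "12.4")


def nvfuser_package_name(cuda: str, stable_torch: Optional[str] = None) -> str:
    """Build a wheel name following the table's naming pattern.

    Hypothetical helper: pass stable_torch (e.g. "2.5") for the pypi.org
    wheels, or leave it as None for the nightly wheels on pypi.nvidia.com.
    """
    if cuda not in SUPPORTED_CUDA:
        raise ValueError(f"no wheel listed for CUDA {cuda}")
    name = "nvfuser-cu" + cuda.replace(".", "")
    if stable_torch is not None:
        # "2.5" becomes the suffix "-torch25"; nightly wheels carry no suffix.
        major, minor = stable_torch.split(".")[:2]
        name += f"-torch{major}{minor}"
    return name


print(nvfuser_package_name("12.1"))         # nvfuser-cu121
print(nvfuser_package_name("11.8", "2.5"))  # nvfuser-cu118-torch25
```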
Note that nvfuser built against a stable torch version isn't compatible with the nightly PyTorch wheel, so pick the package that matches your environment.
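One way to tell which kind of torch build is installed is to inspect its version string. The sketch below assumes that nightly PyTorch wheels carry a PEP 440 development segment such as `.dev20241112` in their version, which is the usual convention but is not stated in this README. The function names are hypothetical.

```python
import re
from importlib import metadata
from typing import Optional


def is_nightly_torch(version: str) -> bool:
    """Return True if the version string has a ".devN" segment (assumed nightly marker)."""
    return re.search(r"\.dev\d+", version) is not None


def installed_torch_version() -> Optional[str]:
    """Return the installed torch version, or None if torch is not installed."""
    try:
        return metadata.version("torch")
    except metadata.PackageNotFoundError:
        return None


version = installed_torch_version()
if version is None:
    print("torch is not installed")
else:
    kind = "nightly" if is_nightly_torch(version) else "stable"
    print(f"torch {version} looks like a {kind} build")
```

If the result is "stable", the `-torchXY` wheels from pypi.org are the ones to look at; if it is "nightly", the wheels from pypi.nvidia.com apply.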
You can install a nightly nvfuser pip package built against the torch nightly code base with `pip install --pre nvfuser-cu121 --extra-index-url https://pypi.nvidia.com`
Because we build against the nightly torch wheel and nightly wheels carry no compatibility promise, we have explicitly marked the nightly torch wheel as an optional dependency. You can choose to install the torch wheel along with the nvfuser package, e.g.
`pip install --pre "nvfuser-cu121[torch]" --extra-index-url https://pypi.nvidia.com`
Note that this may uninstall your local pytorch installation and install the compatible nightly pytorch.
The nvfuser pip wheel built against stable torch releases is published on pypi.org. Pick the CUDA toolkit version that matches your torch installation, e.g. `pip install nvfuser-cu121-torch24`
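The two install paths differ only in their flags, which the following sketch makes explicit. `pip_install_command` is a hypothetical helper, not an nvfuser API. It only assembles the argument list and prints it, so nothing is installed when it runs.

```python
import shlex
import sys

NVIDIA_INDEX = "https://pypi.nvidia.com"


def pip_install_command(package: str, nightly: bool, with_torch: bool = False) -> list:
    """Assemble a pip command line for a nightly or stable nvfuser wheel.

    Hypothetical helper. The [torch] extra is only added for nightly wheels,
    where torch is described as an optional dependency.
    """
    target = f"{package}[torch]" if (nightly and with_torch) else package
    cmd = [sys.executable, "-m", "pip", "install"]
    if nightly:
        # Nightly wheels are pre-releases hosted on the NVIDIA index.
        cmd += ["--pre", target, "--extra-index-url", NVIDIA_INDEX]
    else:
        # Stable wheels come from pypi.org, pip's default index.
        cmd.append(target)
    return cmd


print(shlex.join(pip_install_command("nvfuser-cu121", nightly=True, with_torch=True)))
print(shlex.join(pip_install_command("nvfuser-cu121-torch24", nightly=False)))
```

To actually run one of these commands, the returned list could be passed to `subprocess.run(cmd, check=True)`.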
PyPI: https://pypi.org/project/nvfuser/
Docs: https://github.com/NVIDIA/Fuser/wiki
Supported compilers:
GCC:
We support all "supported releases" of gcc as listed on the official GCC site. As of 6/20/2024, they are:
- gcc 11.4
- gcc 12.4
- gcc 13.2
- gcc 14.1
Clang:
- clang 14+
Supported C++ standard:
- C++17
- C++20
We are actively considering dropping C++17 support.
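A quick pre-build check against the compiler list above might look like the sketch below. It assumes that "supported" can be approximated by the major release series (gcc 11 through 14, clang 14 or newer). That is an interpretation of the list, not a rule stated by the project, and `is_supported_compiler` is a hypothetical helper.

```python
import re

# Major series taken from the list above (gcc 11.4, 12.4, 13.2, 14.1 as of 6/20/2024).
SUPPORTED_GCC_MAJORS = {11, 12, 13, 14}
# The list above says "clang 14+".
MIN_CLANG_MAJOR = 14


def is_supported_compiler(name: str, version: str) -> bool:
    """Check a compiler name and version string against the listed major series.

    Assumption: matching on the major version is enough; minor and patch
    numbers are ignored.
    """
    match = re.match(r"(\d+)", version)
    if match is None:
        raise ValueError(f"cannot parse version: {version!r}")
    major = int(match.group(1))
    if name == "gcc":
        return major in SUPPORTED_GCC_MAJORS
    if name == "clang":
        return major >= MIN_CLANG_MAJOR
    return False


print(is_supported_compiler("gcc", "12.4.0"))    # True
print(is_supported_compiler("clang", "13.0.1"))  # False
```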