NOTE: This directory contains examples for PyCUTLASS, a Python library providing low-level building blocks for emitting CUTLASS C++ kernels. For examples using CUTLASS's Pythonic interface, see the examples/python directory.
Two types of examples are provided:
- Basic examples: minimal examples that illustrate how to set up GEMMs, convolutions, and grouped GEMM operations
- Customizable examples: examples that allow one to specify a variety of template parameters for the given kernel
Before running any of the examples, set up PyCUTLASS by following its installation instructions.
Each of the basic examples can be run as follows:
```shell
# Run the GEMM example
python gemm.py

# Run the Conv2d example
python conv2d.py

# Run the grouped GEMM example
python gemm_grouped.py
```
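To run all three basic examples in one pass, a small driver script can help. The sketch below is not part of PyCUTLASS: `run_examples` and `BASIC_EXAMPLES` are hypothetical names, and it assumes the three scripts sit in the directory you pass in and need no command-line arguments.

```python
import subprocess
import sys
from pathlib import Path

# Script names taken from the commands above; their location is an assumption.
BASIC_EXAMPLES = ["gemm.py", "conv2d.py", "gemm_grouped.py"]


def run_examples(scripts, directory="."):
    """Run each script with the current interpreter.

    Returns a dict mapping script name to its exit code, or to None
    when the script is not present in `directory`.
    """
    results = {}
    for script in scripts:
        if not (Path(directory) / script).is_file():
            results[script] = None
            continue
        # cwd=directory so the script runs from its own directory.
        completed = subprocess.run([sys.executable, script], cwd=directory)
        results[script] = completed.returncode
    return results


def describe(results):
    """Turn the exit-code dict into one human-readable line per script."""
    lines = []
    for script, code in results.items():
        if code is None:
            status = "missing"
        elif code == 0:
            status = "ok"
        else:
            status = f"failed (exit code {code})"
        lines.append(f"{script}: {status}")
    return lines
```

Calling `print("\n".join(describe(run_examples(BASIC_EXAMPLES))))` from this directory prints one status line per example. Using `sys.executable` keeps the examples on the same interpreter (and therefore the same PyCUTLASS installation) as the driver.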
To run the customizable examples, refer to the README in the customizable directory.