Update on "Fix CI after torchmetrics update"
`Accuracy` now takes an argument: https://torchmetrics.readthedocs.io/en/stable/classification/accuracy.html

Upstream change in torchmetrics: Lightning-AI/torchmetrics@20eab43

Somehow this is failing with a SEGFAULT on my A100 (in a triton kernel):

```
#0  0x00007fffc0f62e10 in ?? () from /lib/x86_64-linux-gnu/libcuda.so
#1  0x00007fffc0f9303c in ?? () from /lib/x86_64-linux-gnu/libcuda.so
#2  0x00007fffc0f2ea13 in ?? () from /lib/x86_64-linux-gnu/libcuda.so
#3  0x00007fffc0f94603 in ?? () from /lib/x86_64-linux-gnu/libcuda.so
#4  0x00007fffc119e4a0 in ?? () from /lib/x86_64-linux-gnu/libcuda.so
#5  0x00007fffc0f3728f in ?? () from /lib/x86_64-linux-gnu/libcuda.so
#6  0x00007fffc0f3999f in ?? () from /lib/x86_64-linux-gnu/libcuda.so
#7  0x00007fffc0fdb1c2 in ?? () from /lib/x86_64-linux-gnu/libcuda.so
#8  0x00007fff502234c0 in _launch () from /data/home/XXXXX/.triton/cache/704a3e6949e60326bc68d18a620bee50/layer_norm_fw.so
#9  0x00007fff3c0eea25 in launch () from /data/home/XXXXX/.triton/cache/2cebb5590a024a2e06fe9de08c6b7079/k_dropout_bw.so
#10 0x0000555555698422 in cfunction_call (func=0x7fff3c6e5760, args=<optimized out>, kwargs=<optimized out>) at /usr/local/src/conda/python-3.10.6/Objects/methodobject.c:552
```

[ghstack-poisoned]
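
The call-site change itself is not shown in this message. As a rough sketch only, assuming the new argument is the `task` keyword described on the linked documentation page, a small helper could keep a test suite working against both older and newer metric constructors. The helper `build_metric` and the two stand-in classes are hypothetical and are not part of torchmetrics or of this commit.

```python
from typing import Any, Callable


def build_metric(metric_cls: Callable[..., Any], task: str, **kwargs: Any) -> Any:
    """Construct a metric, passing `task` only if the constructor accepts it.

    Hypothetical helper: `task` is assumed to be the newly required argument.
    """
    try:
        return metric_cls(task=task, **kwargs)
    except (TypeError, ValueError):
        # Assumption: an older constructor rejects the unknown `task`
        # keyword with TypeError or ValueError, so retry without it.
        return metric_cls(**kwargs)


# Stand-in classes so the sketch runs without torchmetrics installed.
class NewStyleMetric:
    def __init__(self, task: str, num_classes: int = 2) -> None:
        self.task = task
        self.num_classes = num_classes


class OldStyleMetric:
    def __init__(self, num_classes: int = 2) -> None:
        self.num_classes = num_classes


new_metric = build_metric(NewStyleMetric, task="multiclass", num_classes=10)
old_metric = build_metric(OldStyleMetric, task="multiclass", num_classes=10)
```

With a real metric class, the same call shape would be `build_metric(Accuracy, task="multiclass", num_classes=10)`. Catching the exception rather than inspecting the signature is a deliberate simplification here; it can mask unrelated constructor errors, so pinning the torchmetrics version may be the cleaner fix.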