Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

bug: tests failing at nvidia-driver-530 #304

Open
1 of 2 tasks
christallire opened this issue Mar 2, 2023 · 0 comments
Open
1 of 2 tasks

bug: tests failing at nvidia-driver-530 #304

christallire opened this issue Mar 2, 2023 · 0 comments

Comments

@christallire
Copy link

Description

I was setting up a fresh pytorch environment and the latest cuda package installs nvidia-driver-530 and it makes kernl tests fail
and prints RuntimeError: CUDA error: misaligned address

also optimize_model at inference time prints the same error message too so I had to comment it out.

Steps to reproduce

Prepare fresh ubuntu environment

  1. sudo apt install nvidia-drivers-520
  2. sudo apt install cuda=11.7.1-1
  3. pip3 install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/cu117
  4. cd ~/kernl/ && pytest

Expected Behavior

tests pass

Actual Behavior

=================================================================================== test session starts ===================================================================================
platform linux -- Python 3.10.9, pytest-7.2.1, pluggy-1.0.0
rootdir: /home/chris/workspace/kernl
collected 2883 items                                                                                                                                                                      

test/test_attention.py ..FEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE [  5%]
EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE

Your environment

  • Ubuntu 22 LTS
  • Linux d42gx-1 5.19.0-35-generic Feat/layernorm #36-Ubuntu SMP PREEMPT_DYNAMIC Fri Feb 3 18:36:56 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux
  • Python 3.10.9
  • pip 23.0.1 from /home/chris/mambaforge/envs/singu/lib/python3.10/site-packages/pip (python 3.10)

Self-service

  • I would be willing to help fix this bug myself.

Code of Conduct

  • I agree to follow this project's Code of Conduct
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant