[Bug] H800 run UT failed. #6
Comments
Hi, unfortunately, I don't have access to any H800s (or any Hopper GPUs, for that matter), so it is a bit hard to test. Which of the matrix shapes are failing, and by how much? Can you perhaps print the result of this line for all test cases, i.e., what is the relative average error?
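For anyone reproducing this, here is a minimal sketch of one common way to compute a relative average error per test case: the mean absolute difference divided by the mean absolute value of the reference. This is an assumption about what is being asked for, not necessarily the exact expression used in the project's test. The helper name `relative_mean_error`, the case names, and the sample values are all hypothetical, and plain Python lists stand in for flattened output matrices.

```python
def relative_mean_error(actual, expected):
    """Mean absolute error divided by the mean absolute reference value."""
    if len(actual) != len(expected):
        raise ValueError("inputs must have the same length")
    if not expected:
        raise ValueError("inputs must be non-empty")
    abs_err = sum(abs(a - e) for a, e in zip(actual, expected))
    abs_ref = sum(abs(e) for e in expected)
    if abs_ref == 0:
        raise ValueError("reference is all zeros; relative error is undefined")
    # Both sums would be divided by the same length, so that factor cancels.
    return abs_err / abs_ref


# Hypothetical flattened outputs for two made-up test cases:
# each entry is (kernel output, reference output).
cases = {
    "shape_a": ([1.0, 2.0, 3.0, 4.0], [1.0, 2.0, 3.0, 4.0]),
    "shape_b": ([1.0, 2.5, 3.0, 4.5], [1.0, 2.0, 3.0, 4.0]),
}

for name, (actual, expected) in cases.items():
    err = relative_mean_error(actual, expected)
    print(f"{name}: relative mean error = {err:.4f}")
```

Printing one number per shape makes it easy to see whether only some shapes are off and whether the error is a small numerical drift or a completely wrong result.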
Yes, if thread_shape = [64, 256], I get the correct result:
@Ageliss Which CUDA version was the failing test run on? Can you retest on the latest CUDA 12.4 and/or PyTorch 2.2.2?