Comments (2)
Not trying to sound flippant, but if it works in rocm 6.1 please use that instead of the older rocm 6.0. Also, gfx900 is unsupported. I'm glad to hear it was working for you on rocm 6.1.
from pytorch.
@thenightterorx I am going to close since it is fixed for you on rocm6.1.
from pytorch.
Related Issues (20)
- torchrun nccl Multi machine and multi card training error: ss1.ss_family == ss2.ss_family. 2 vs 10 HOT 4
- UserWarning: Grad strides do not match bucket view strides. This may indicate grad was not created according to the gradient layout contract, or that the param's strides changed since DDP was constructed. This is not an error, but may impair performance. grad.sizes() = [1, 64, 64, 64], strides() = [1, 4096, 64, 1] bucket_view.sizes() = [1, 64, 64, 64], strides() = [262144, 4096, 64, 1] HOT 1
- binary_cross_entropy_with_logits outputs NaN on -inf, which is inconsistent with documentation HOT 4
- MacOS runner fails at `Complete runner step` HOT 2
- test_autograd_cpp_node flaky in TestCompiledAutograd
- DISABLED test_inplace_custom_op_two_mutated_inputs (__main__.InplacingTests) HOT 2
- DISABLED test_autocast_methods_fp32 (__main__.TestCudaAutocast) HOT 2
- Stop tracking FunctionalTensor bases in Python
- Models that have non-tensor elements in state_dict, are not ONNX-exportable (and not JIT-traceable) HOT 1
- [Inductor][SDPA] `test_sdpa_rewriter_12` broken on A2/A16 GPU
- DataLoader + IterableDataset held up by slowest worker!? HOT 1
- [Dynamo] Eager fallback casued by graph breaks in module hooks HOT 2
- [ONNX] Verify handling of zero output ops HOT 6
- Bug with "make latexpdf"
- No batching rule for aten::repeat_interleave.Tensor
- torch.nn.functional.normalize producing nan values with a large p value and tensor of complex numbers HOT 3
- Compiler is 2x Faster for Input Size 1 Compared to Sizes 2 and Above, Where Forward Pass Times Remain Consistent HOT 9
- [rocm] Unusable torch.ops.aten._scaled_dot_product_flash_attention_backward at 9.6TFLOPs HOT 3
- torch.compile cannot handle torch.Tensor correctly HOT 1
- bitwise_xor does not work for uint32 HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pytorch.