
[original author: mrnikwaws] fixing incorrect stride information on xla tensors #5468

Closed
wants to merge 6 commits

Conversation

aws-kingrj
Collaborator

  • fixes incorrect XLA tensor stride information
  • the incorrect strides break log softmax lowering
  • tested via AWS Neuron internal testing
  • original author: mrnikwaws

@JackCaoG
Collaborator

OK, I think I know what's going on. A PR from a fork cannot use the remote cache, so the build just times out. Let me see if I can bump the timeout.

@JackCaoG
Collaborator

PR is in #5470. Let me also grant you write access so you can create branches on pytorch/xla directly. That way all of the CI will use the Bazel remote cache and run much faster.

@JackCaoG
Collaborator

@aws-kingrj if you rebase, the build should start, run, and pass without hitting the timeout.

@@ -161,6 +161,11 @@ at::IntArrayRef XLATensorImpl::strides_custom() const {
return strides_default();
}
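
For context, a minimal sketch of what the added lines in strides_custom() might look like, assuming the fix mirrors the lazy metadata refresh XLATensorImpl already performs for sizes; the SetupSizeProperties() call here is an assumption for illustration, not confirmed code from this PR:

at::IntArrayRef XLATensorImpl::strides_custom() const {
  // Hypothetical sketch: refresh the cached size/stride metadata from the
  // current XLA shape before returning it, so callers never observe stale
  // strides (assumes SetupSizeProperties() recomputes sizes_and_strides_).
  const_cast<XLATensorImpl*>(this)->SetupSizeProperties();
  return strides_default();
}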


You mentioned this breaks log softmax lowering. Can you add a test that will fail without this change? The change in general makes sense, but without a test it is hard to make sure this is correct, and we cannot prevent it from regressing again.
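
A sketch of the kind of regression test being asked for, in the existing AtenXlaTensorTest style; the test name, shapes, and the log_softmax repro are assumptions for illustration, not code from this PR:

TEST_F(AtenXlaTensorTest, TestLogSoftmaxStrides) {
  // Hypothetical regression test: an XLA tensor should report the same
  // strides as its CPU counterpart; stale stride metadata would make this
  // comparison (and lowerings that depend on it) fail.
  torch::Tensor input =
      torch::rand({2, 3, 4}, torch::TensorOptions(torch::kFloat));
  torch::Tensor output = torch::log_softmax(input, /*dim=*/1);
  ForEachDevice([&](const torch::Device& device) {
    torch::Tensor xla_input = CopyToDevice(input, device);
    torch::Tensor xla_output = torch::log_softmax(xla_input, /*dim=*/1);
    EXPECT_EQ(output.strides().vec(), xla_output.strides().vec());
    AllClose(output, xla_output);
  });
}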

@@ -3700,6 +3700,19 @@ TEST_F(AtenXlaTensorTest, TestPowIntExponent) {
ExpectCounterNotChanged("aten::.*", cpp_test::GetIgnoredCounters());
}

TEST_F(AtenXlaTensorTest, TestPowFloatScalarBaseIntExponent) {

I think the commit is a bit messed up; this seems to be from another PR.
