[original author: mrnikwaws] fixing incorrect stride information on xla tensors #5486

aws-kingrj · 2023-08-23T17:35:00Z

fixing incorrect xla tensor stride information
breaks log softmax lowering
tested on AWS Neuron internal testing
original author mrnikwaws
old PR [original author: mrnikwaws] fixing incorrect stride information on xla tensors #5468

aws-kingrj · 2023-08-23T17:35:33Z

Sorry, had to create a new PR because of the rebase conflicts

JackCaoG · 2023-08-23T17:37:03Z

torch_xla/csrc/tensor_impl.cpp

@@ -161,6 +161,11 @@ at::IntArrayRef XLATensorImpl::strides_custom() const {
  return strides_default();
 }

+c10::SymIntArrayRef XLATensorImpl::sym_strides_custom() const {


can we add a test? It is a bit hard for me to tell what this pr is fixing and how it affect user.

Let me check in with Ryan and create a test.

This issue occurs when pytorch checks the strides of a tensor have the same rank as the shape of a tensor. By default XLA returns strides of one. This will cause log_softmax to fail lowering (based on the input XLA tensor failing an assertion in pytorch code prior to lowering), so using this lowering should form a simple test.

(test_env) ubuntu@ip-172-31-63-138:~/waldronn/asr$ python Python 3.8.10 (default, May 26 2023, 14:05:08) [GCC 9.4.0] on linux Type "help", "copyright", "credits" or "license" for more information. >>> import torch >>> import torch_neuronx >>> example_input = torch.rand(1, 80, 643, dtype=torch.float) >>> print(example_input.stride()) (51440, 643, 1) >>> example_input = example_input.to('xla') >>> print(example_input.stride()) (1,)

With the change the strides will match

great, can we make this a test case? You can add it somewhere in https://github.com/pytorch/xla/blob/master/test/test_operations.py#L907 test_operation and create a new test for it.

aws-kingrj · 2023-11-20T19:10:26Z

This has already been fixed in 2.1, after doing testing with log softmax lowering and getting the correct stride information

aws-kingrj and others added 4 commits August 18, 2023 14:29

fixing incorrect stride information on xla tensors

c748abb

fixing linter issues

bbc97e1

fixing spacing

6a2f3c6

Merge branch 'pytorch:master' into xla_tensor_stride

e80b9b2

aws-kingrj requested a review from JackCaoG August 23, 2023 17:35

JackCaoG reviewed Aug 23, 2023

View reviewed changes

aws-kingrj mentioned this pull request Aug 24, 2023

AWS Neuron PyTorch XLA Changes #5465

Open

aws-kingrj closed this Nov 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[original author: mrnikwaws] fixing incorrect stride information on xla tensors #5486

[original author: mrnikwaws] fixing incorrect stride information on xla tensors #5486

aws-kingrj commented Aug 23, 2023

aws-kingrj commented Aug 23, 2023

JackCaoG Aug 23, 2023

mrnikwaws Aug 31, 2023 •

edited

Loading

mrnikwaws Aug 31, 2023

JackCaoG Sep 5, 2023 •

edited

Loading

aws-kingrj commented Nov 20, 2023

[original author: mrnikwaws] fixing incorrect stride information on xla tensors #5486

[original author: mrnikwaws] fixing incorrect stride information on xla tensors #5486

Conversation

aws-kingrj commented Aug 23, 2023

aws-kingrj commented Aug 23, 2023

JackCaoG Aug 23, 2023

Choose a reason for hiding this comment

mrnikwaws Aug 31, 2023 • edited Loading

Choose a reason for hiding this comment

mrnikwaws Aug 31, 2023

Choose a reason for hiding this comment

JackCaoG Sep 5, 2023 • edited Loading

Choose a reason for hiding this comment

aws-kingrj commented Nov 20, 2023

mrnikwaws Aug 31, 2023 •

edited

Loading

JackCaoG Sep 5, 2023 •

edited

Loading