Remove cast of y_true to y_pred data type in sparse categorical cross entropy loss #15015

old-school-kid · 2021-07-28T09:54:56Z

Casting y_true to the y_pred data type can give an erroneous loss. For example, with mean squared error, if y_true is > 2**32 and the y_pred data type is int32, the resulting loss will be wrong.
A detailed notebook demonstrates the error, which also occurs in the mean absolute error, categorical cross entropy, and sparse categorical cross entropy losses, and contains the proposed changes.

The issue was raised here
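
To make the failure mode above concrete, here is a minimal pure-Python sketch. It assumes the narrowing cast to int32 wraps around in two's complement fashion; `cast_to_int32` and `mean_squared_error` are hypothetical helpers written for illustration, not Keras or TensorFlow functions, and the real cast may behave differently on some devices.

```python
def cast_to_int32(value: int) -> int:
    """Emulate a wrapping (two's complement) narrowing cast to int32.

    Assumption: the real cast truncates to the low 32 bits.
    """
    return ((value + 2**31) % 2**32) - 2**31


def mean_squared_error(y_true, y_pred):
    """Plain-Python MSE over two equal-length sequences."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


y_true = [2**32 + 5]  # label outside the int32 range
y_pred = [7]          # prediction that fits in int32

# Loss computed on the original labels.
loss_without_cast = mean_squared_error(y_true, y_pred)

# Loss computed after the labels are narrowed to int32 first.
y_true_cast = [cast_to_int32(v) for v in y_true]
loss_with_cast = mean_squared_error(y_true_cast, y_pred)

print(y_true_cast)        # the label has silently changed
print(loss_without_cast)  # very large loss
print(loss_with_cast)     # small, misleading loss
```

Under that wrapping assumption the label changes before the loss is computed, so the loss looks small even though the prediction is far from the true value.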

qlzh727

I think the cast is mainly there to address dtype mismatches, e.g. int labels and float predictions, which are quite common in normal use. The 2**32 case seems very extreme, and one we would rarely hit. Can we change the PR to only cast the dtype if the label and prediction are in different dtype categories?
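
One way to read this suggestion is sketched below in plain Python. The dtype name sets and the helpers `dtype_category` and `should_cast_labels` are hypothetical, chosen only to illustrate the rule "cast only across categories"; they are not part of the Keras API, and this is not what the PR ended up doing.

```python
# Hypothetical dtype groupings, keyed by dtype name.
INTEGER_DTYPES = {"int8", "int16", "int32", "int64",
                  "uint8", "uint16", "uint32", "uint64"}
FLOAT_DTYPES = {"float16", "bfloat16", "float32", "float64"}


def dtype_category(name: str) -> str:
    """Return 'integer' or 'floating' for a known dtype name."""
    if name in INTEGER_DTYPES:
        return "integer"
    if name in FLOAT_DTYPES:
        return "floating"
    raise ValueError(f"Unknown dtype name: {name}")


def should_cast_labels(label_dtype: str, pred_dtype: str) -> bool:
    """Cast labels only when they sit in a different category than predictions."""
    return dtype_category(label_dtype) != dtype_category(pred_dtype)


print(should_cast_labels("int64", "float32"))  # int labels, float predictions
print(should_cast_labels("int64", "int32"))    # same category, no cast
```

With this rule, int64 labels would not be narrowed to int32, but they would still be cast to a float prediction dtype, so large integer labels could still lose precision in that case.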

qlzh727

I think we should only remove the cast for sparse_categorical_crossentropy(), since the label value could be large, depending on the dimension of the prediction. For the rest of them, like binary_crossentropy or categorical_crossentropy, the label value is either one-hot or just 0 and 1, which won't be affected by casting.

Also since backend.sparse_categorical_crossentropy will cast the y_true to int64 anyway, removing the y_true cast here is correct.

keras/keras/backend.py

Line 4952 in 00518dc

target = cast(target, 'int64')
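
The concern about large sparse labels can be illustrated with a small sketch. It assumes y_pred is float16, so the removed cast would have converted the integer class indices to float16 before the backend converted them back to int64. The standard library `struct` format `"e"` is IEEE 754 half precision; `round_trip_float16` is a hypothetical helper used only for this demonstration.

```python
import struct


def round_trip_float16(value: int) -> float:
    """Store a value as IEEE 754 half precision and read it back."""
    return struct.unpack("<e", struct.pack("<e", float(value)))[0]


# Sparse class indices for a model with several thousand classes.
labels = [5, 2048, 2049, 4099]

# What the labels become after float16 and back to an integer.
as_float16 = [int(round_trip_float16(label)) for label in labels]

# Labels whose class index was changed by the round trip.
changed = [(orig, new) for orig, new in zip(labels, as_float16) if orig != new]

print(as_float16)
print(changed)
```

Small indices survive the round trip, but indices above 2048 can be rounded to a neighbouring class, which is why 0/1 and one-hot labels are unaffected while sparse labels are not.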

qlzh727 · 2021-07-28T18:50:40Z

keras/losses.py

@@ -1536,7 +1536,7 @@ def categorical_hinge(y_true, y_pred):
    Categorical hinge loss values.
  """
  y_pred = tf.convert_to_tensor(y_pred)
-  y_true = tf.cast(y_true, y_pred.dtype)


I think you should keep this cast here, since y_true is expected to be either {-1, +1} or {0, 1} (i.e. a one-hot-encoded tensor). See the docstring.

We probably want to fix the example in the docstring, since y_true there is given in the range 0-3.

Yeah made the changes as requested.

qlzh727

Thanks for the fix.

Remove all the casts of y_true to y_pred data type

6bc2571

google-cla bot added the cla: yes label Jul 28, 2021

old-school-kid mentioned this pull request Jul 28, 2021

y_true gets casted to y_pred.dtype in every losses #15014

Closed

rmothukuru mentioned this pull request Jul 28, 2021

SparseCategoricalCrossentropy and Mixed Precision Training #15012

Closed

gbaned self-assigned this Jul 28, 2021

gbaned requested a review from qlzh727 July 28, 2021 14:03

qlzh727 requested changes Jul 28, 2021

Removed cast only in sparse categorical cross entropy loss

ca3da0c

old-school-kid changed the title ~~Remove all the casts of y_true to y_pred data type~~ Remove cast of y_true to y_pred data type in sparse categorical cross entropy loss Jul 28, 2021

qlzh727 reviewed Jul 28, 2021

Removed cast only in sparse categorical cross entropy loss

f175038

qlzh727 approved these changes Jul 28, 2021

qlzh727 added the kokoro:force-run label Jul 28, 2021

kokoro-team removed the kokoro:force-run label Jul 28, 2021

qlzh727 added the ready to pull label Jul 28, 2021

copybara-service bot merged commit a9b4be7 into keras-team:master Jul 29, 2021

old-school-kid mentioned this pull request Jul 30, 2021

remove cast of y_true to y_pred data type in sparse categorical cross… tensorflow/tensorflow#51053

Closed
