New design of the transformer API to support causal and masked pre-training approaches #1008
Conversation
Sara, does our new Transformer API already support dense tensors for inputs and targets instead of RaggedTensor?
The dataloader provides dense tensors for sequential features in some cases (as summarized in this ADR), as illustrated in the sketch after this list:
- In the current dataloader API, if `value_count.max is not None` and `is_ragged == False`
- In the future dataloader API, if `is_ragged == False`
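For illustration, here is a minimal sketch (not the actual dataloader code) of the shapes a model would receive in the two cases, assuming a toy batch of item-id sequences and a padding length that stands in for `value_count.max`:

```python
import tensorflow as tf

# is_ragged == True: variable-length sequences arrive as a RaggedTensor.
ragged_ids = tf.ragged.constant([[1, 2, 3], [4, 5]], dtype=tf.int64)
print(ragged_ids.shape)  # (2, None)

# is_ragged == False with a known value_count.max: sequences arrive dense,
# padded (here with 0) to the fixed maximum length.
max_len = 4  # stands in for value_count.max
dense_ids = ragged_ids.to_tensor(default_value=0, shape=[None, max_len])
print(dense_ids.shape)  # (2, 4)
```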
```python
# losses does not support RaggedVariantTensor on GPU:
prediction = prediction.flat_values
if isinstance(target, tf.RaggedTensor):
    target = target.flat_values
```
As you are flattening the values here to 1D, is there a way to reshape the losses output back into a RaggedTensor? Otherwise the 1D loss will not match the sample weights, which can be either 1D or 2D (ragged).
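One possible way to do that (a sketch, not the PR's implementation) is to compute the element-wise loss on the flattened values and then re-attach the original row partitions with `RaggedTensor.with_flat_values`, so the loss is ragged again and lines up with 2D ragged sample weights:

```python
import tensorflow as tf

target = tf.ragged.constant([[1.0, 0.0, 1.0], [0.0, 1.0]])
prediction = tf.ragged.constant([[0.9, 0.2, 0.8], [0.1, 0.7]])

# 1D loss computed on the flattened values, as in the diff above.
flat_loss = tf.keras.losses.binary_crossentropy(
    tf.expand_dims(target.flat_values, -1),
    tf.expand_dims(prediction.flat_values, -1),
)

# Re-attach the row partitions of the target so the loss is 2D (ragged)
# again and can be weighted element-wise by ragged sample weights.
ragged_loss = target.with_flat_values(flat_loss)
print(ragged_loss.shape)  # (2, None)
```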
Force-pushed from a02d8d1 to 20a40d7
Thank you for checking out the PR! This PR only addresses how to support the different masking approaches in the TransformerBlock, but we still need to work on extending the SequenceTransforms to support dense tensors as inputs (as mentioned in this ADR).
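For context, here is a generic sketch of the two pre-training approaches the PR title refers to; the tensor names and the `mask_id` / `-1` conventions are illustrative assumptions, not the Merlin Models API:

```python
import tensorflow as tf

seq = tf.constant([[10, 11, 12, 13]])  # toy item-id sequence, shape (1, 4)

# Causal (CLM-style): predict each item from the ones before it, i.e. the
# targets are the inputs shifted by one position.
clm_inputs, clm_targets = seq[:, :-1], seq[:, 1:]

# Masked (MLM-style): randomly replace some positions with a [MASK] id and
# predict only the masked positions.
mask_id = 0  # hypothetical reserved id
mask = tf.random.uniform(tf.shape(seq)) < 0.2  # mask ~20% of positions
mlm_inputs = tf.where(mask, mask_id, seq)
mlm_targets = tf.where(mask, seq, -1)  # -1 marks positions excluded from the loss
```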
Force-pushed from 5b5ef80 to a92bdc2
Closing, as this was a placeholder for the tutorial image.
This is a placeholder to support the Transformer API for the GTC 2023 tutorial. This branch is rebased on release-23.02.
For the latest work intended to be merged into the main branch, please refer to #1022.