
Fix BroadcastToSequence to enable context features in sequential models #991

Merged
merged 6 commits into from
Feb 17, 2023

Conversation

Member

@gabrielspmoreira gabrielspmoreira commented Feb 17, 2023

Fixes #989

Goals ⚽

This PR fixes BroadcastToSequence, which was not working properly in some cases.

Implementation Details 🚧

  • BroadcastToSequence did not work in graph mode because contextual features were expanded to match the shape of sequential features with logic like the following snippet. This made the first dim (batch size) of the resulting tensor fixed, while the other sequential tensors had a first dim of None (because the batch size might vary).
sequence_length = inputs["a_sequential_feature"].row_lengths()
broadcasted_contextual_feature = tf.RaggedTensor.from_row_lengths(
    tf.repeat(inputs["a_contextual_feature"], sequence_length, axis=0),
    sequence_length,
)

The solution I found to keep the first dim of the expanded context feature tensor matching the other sequential features was to use tf.ones_like() to create a (ragged) tensor matching the sequential feature shape and then multiply it by the context feature (equivalent to repeating it), as in the following example:

first_seq_feature_name = list(seq_features_shapes.keys())[0]
non_seq_target[fname] = tf.ones_like(
    inputs[first_seq_feature_name][..., :1]
) * tf.expand_dims(inputs["a_contextual_feature"], -1)
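The broadcast trick can be demonstrated with a minimal eager-mode sketch (made-up feature values; the seq_features_shapes / non_seq_target bookkeeping from the snippet above is omitted). Note how the ones tensor inherits the ragged shape of the sequential feature, so multiplying by the expanded context value is equivalent to repeating it per timestep:

```python
import tensorflow as tf

# Hypothetical batch of 2 examples: a ragged sequential feature and a
# per-example (contextual) scalar feature
seq = tf.ragged.constant([[1.0, 2.0, 3.0], [4.0]])  # shape (2, None)
context = tf.constant([10.0, 20.0])                 # shape (2,)

# ones_like preserves the sequential feature's (possibly dynamic) batch dim;
# multiplying by the expanded context broadcasts it across the ragged dim
broadcast = tf.ones_like(seq) * tf.expand_dims(context, -1)
# broadcast.to_list() == [[10.0, 10.0, 10.0], [20.0]]
```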

This PR also includes other fixes:

  • Continuous block now forwards masking (supports_masking=True)
  • BroadcastToSequence was refactored to separate out the logic that checks whether the sequential and context features in the input match the schema; exceptions are now raised when they don't.
  • BroadcastToSequence.compute_mask() no longer calls self._broadcast(), and its logic was simplified: the expanded contextual feature mask should match the sequential feature mask.
  • SequenceTargetAsInput now returns a tuple instead of a Prediction, so that Keras can better align the inputs and targets output by call() and compute_mask() of child classes.
  • In ProcessList, scalar features (is_list=False, is_ragged=False) are now reshaped to 2D (batch size, 1), as the last dim was None in graph mode and caused issues when concatenating.
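As a minimal sketch of the last point (with a made-up feature value, not the actual ProcessList code), reshaping a 1D scalar feature to (batch size, 1) makes its last dim statically known so it can be concatenated with other 2D features:

```python
import tensorflow as tf

# Hypothetical scalar (non-list) feature as output by the dataloader: shape (batch,)
scalar_feature = tf.constant([3.0, 5.0, 7.0])

# Reshape to (batch, 1) so the last dim is defined even in graph mode,
# allowing concatenation with other 2D feature tensors
feature_2d = tf.reshape(scalar_feature, (-1, 1))
```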

Testing Details 🔍

Added new tests covering BroadcastToSequence usage as a post of InputBlockV2, with both categorical and continuous context (non-sequential) features, and also testing it in a Transformer model trained with masked language modeling.

@github-actions

Documentation preview

https://nvidia-merlin.github.io/models/review/pr-991

@gabrielspmoreira gabrielspmoreira self-assigned this Feb 17, 2023
@gabrielspmoreira gabrielspmoreira added the bug Something isn't working label Feb 17, 2023
@gabrielspmoreira gabrielspmoreira changed the title Fix BroadcastToSequence Fix BroadcastToSequence to enable context features in sequential models Feb 17, 2023
@@ -101,6 +101,13 @@ def call(self, inputs: TabularData, **kwargs) -> TabularData:
elif isinstance(val, tf.RaggedTensor):
ragged = val
else:
# Expanding / setting last dim of non-list features to be 1D
Member

Is this relevant to ProcessList? Intuitively, ProcessList sounds like it might only be transforming list features.

Member Author

That is a good point @oliverholworthy . The ProcessList is currently used at core places to ensure the features are in good shape for models. When the change making dataloader outputs scalars as 1D happens, we will also need this fix that makes scalars 2D (batch size, 1) for models.
What if we rename ProcessList to PrepareFeatures and have it as a generic block that works as a translation layer between dataloader and models (used in the same places ProcessList is currently used)?

Member

The rename sounds like a reasonable thing to do and better matches its purpose. It can be in another PR if preferred.

Member Author

@oliverholworthy I have created another separate PR just for renaming ProcessList to PrepareFeatures: PR #992

@rnyak rnyak requested a review from marcromeyn February 17, 2023 15:22
@rnyak rnyak added this to the Merlin 23.02 milestone Feb 17, 2023
@gabrielspmoreira gabrielspmoreira merged commit 6af5b83 into main Feb 17, 2023
sararb pushed a commit that referenced this pull request Feb 28, 2023
…ls (#991)

* Fixed error that was causing the broadcasted context feature to have fixed size first dim in graph mode and not being compatible with the ragged sequential features

* Enforcing non-list (scalar) features to be 2D (batch size, 1) if 1D or with last dim undefined (which happens in graph mode)

* Making Continuous support_masking=True (to cascade mask)

* Changing BroadcastToSequence to fix some issues and simplify the masking

* Fixed tests

* Fixed test
Labels
bug Something isn't working
Successfully merging this pull request may close these issues.

[BUG] BroadcastToSequence class does not work properly to broadcast the scalar inputs to list inputs
4 participants