
assert torch.all(all_layers[-1] == last_layer_features) #2044

Closed
HAWLYQ opened this issue Apr 22, 2020 · 1 comment

HAWLYQ commented Apr 22, 2020

❓ Questions and Help

Before asking:

  1. search the issues.
  2. search the docs.

What is your question?

Hi @myleott, I encountered an AssertionError when running the demo code to extract features from RoBERTa: the last-layer features are not the same as all_layers[-1].

Code

import torch
roberta = torch.hub.load('pytorch/fairseq', 'roberta.large')
tokens = roberta.encode('Hello world!')
last_layer_features = roberta.extract_features(tokens)
all_layers = roberta.extract_features(tokens, return_all_hiddens=True)
assert torch.all(all_layers[-1] == last_layer_features)

What have you tried?

The size of last_layer_features and the length of all_layers are both correct.
The outputs of last_layer_features and all_layers[-1] are shown below:

(screenshot: the values of last_layer_features and all_layers[-1] differ slightly)

What's your environment?

  • fairseq Version (e.g., 1.0 or master): 0.9.0
  • PyTorch Version (e.g., 1.0): 1.4.0
  • OS (e.g., Linux): Linux
  • How you installed fairseq (pip, source): pip
  • Build command you used (if compiling from source):
  • Python version: 3.6
  • CUDA/cuDNN version: 10.0
  • GPU models and configuration: -
  • Any other relevant information:

myleott commented May 13, 2020

You need to call roberta.eval() first to disable dropout; otherwise the two calls will return slightly different results.

This works for me:

import torch
roberta = torch.hub.load('pytorch/fairseq', 'roberta.large')
roberta.eval()  # <-- this disables dropout
tokens = roberta.encode('Hello world!')
last_layer_features = roberta.extract_features(tokens)
all_layers = roberta.extract_features(tokens, return_all_hiddens=True)
assert torch.all(all_layers[-1] == last_layer_features)
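The effect of eval() can be illustrated with plain PyTorch, independent of fairseq (a minimal sketch using torch.nn.Dropout, not the RoBERTa model itself):

```python
import torch

model = torch.nn.Dropout(p=0.5)
x = torch.ones(5)

# In training mode, dropout randomly zeroes elements (and rescales the
# rest), so two forward passes on the same input generally differ.
model.train()
train_out1 = model(x)
train_out2 = model(x)

# In eval mode, dropout is an identity op, so outputs are deterministic
# and equal to the input.
model.eval()
eval_out1 = model(x)
eval_out2 = model(x)
assert torch.equal(eval_out1, eval_out2)
assert torch.equal(eval_out1, x)
```

The same reasoning applies to the RoBERTa issue above: each extract_features call runs a separate forward pass, so with dropout active the two passes sample different dropout masks and the assertion fails.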

@myleott myleott closed this as completed May 13, 2020
facebook-github-bot pushed a commit that referenced this issue Jul 10, 2021
…ne Translation" (#2044)

Summary:
# Before submitting

- [ ] Was this discussed/approved via a Github issue? (no need for typos, doc improvements)
- [ ] Did you read the [contributor guideline](https://github.com/pytorch/fairseq/blob/master/CONTRIBUTING.md)?
- [ ] Did you make sure to update the docs?
- [ ] Did you write any new necessary tests?

## What does this PR do?
Release the code for the paper "Discriminative Reranking for Neural Machine Translation"

## PR review
Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in Github issues there's a high chance it will not be merged.

## Did you have fun?
Make sure you had fun coding!

Pull Request resolved: fairinternal/fairseq-py#2044

Reviewed By: michaelauli

Differential Revision: D29628590

Pulled By: an918tw

fbshipit-source-id: 7a52602d495b736573187cc721829aa545d24770