
TF BERT not FP16 compatible? #3320

Closed
volker42maru opened this issue Mar 18, 2020 · 10 comments · Fixed by #6648

Comments

@volker42maru

🐛 Bug

Information

Model I am using (Bert, XLNet ...): TFBertForQuestionAnswering

Language I am using the model on (English, Chinese ...): English

The problem arises when using:

  • my own modified scripts:

The task I am working on is:

  • an official GLUE/SQuAD task: SQuAD

To reproduce

Simple example to reproduce error:

import tensorflow as tf
from transformers import TFBertForQuestionAnswering

# turn on mp (fp16 operations)
tf.keras.mixed_precision.experimental.set_policy('mixed_float16')

model = TFBertForQuestionAnswering.from_pretrained('bert-base-uncased')

The error occurs here:
transformers/modeling_tf_bert.py", line 174, in _embedding
embeddings = inputs_embeds + position_embeddings + token_type_embeddings

And this is the error:
tensorflow.python.framework.errors_impl.InvalidArgumentError: cannot compute AddV2 as input #1(zero-based) was expected to be a half tensor but is a float tensor [Op:AddV2] name: tf_bert_for_question_answering/bert/embeddings/add/
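
The mismatch itself is easy to reproduce in isolation (a minimal sketch, not from the original report): TensorFlow refuses to add tensors of different float dtypes.

import tensorflow as tf

a = tf.ones((2, 2), dtype=tf.float16)  # like inputs_embeds under mixed precision
b = tf.ones((2, 2), dtype=tf.float32)  # like the fp32 position embeddings
c = a + b  # raises InvalidArgumentError: cannot compute AddV2 ... half vs. float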

Expected behavior

I want to use TF BERT with mixed precision (for faster inference on Tensor Core GPUs). I know that full fp16 does not work out of the box, because the model weights would need to be in fp16 as well. Mixed precision, however, should work, because only the operations run in fp16 while the weights stay in fp32.
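
To illustrate (a minimal sketch with a toy Dense layer, not from the original report): under the mixed_float16 policy, Keras keeps the variables in fp32 and only runs the computation in fp16, which is why the pretrained fp32 checkpoint should still load fine.

import tensorflow as tf

# Same policy as in the reproduction above.
tf.keras.mixed_precision.experimental.set_policy('mixed_float16')

dense = tf.keras.layers.Dense(4)
out = dense(tf.ones((1, 4)))

print(dense.kernel.dtype)  # float32 -- variables are stored in full precision
print(out.dtype)           # float16 -- the computation runs in half precision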

Instead, I get a dtype error. It seems the model is not fp16 compatible yet? Will this be fixed in the future?

Environment info

  • transformers version: 2.5.0
  • Platform: Ubuntu 16.04
  • Python version: 3.6.9
  • PyTorch version (GPU?): 1.4.0 (GPU)
  • Tensorflow version (GPU?): 2.1.0 (GPU)
  • Using GPU in script?: sort of
  • Using distributed or parallel set-up in script?: nope
@bamps53

bamps53 commented Apr 16, 2020

I've faced the same issue. Maybe the data type is hard-coded somewhere? Have you found a solution?

@rzepinskip

Tried this on Colab TPU, same error.

@ben74

ben74 commented May 6, 2020

Same here, would be convenient as hell :)

@patrickvonplaten
Contributor

I'm having the same error with transformers version 2.11.0 as well.
Here is some code to easily reproduce the error:

#!/usr/bin/env python3
from transformers import TFBertModel, BertTokenizer
from tensorflow.keras.mixed_precision import experimental as mixed_precision

policy = mixed_precision.Policy('mixed_float16')
mixed_precision.set_policy(policy)

tok = BertTokenizer.from_pretrained("bert-base-uncased")
model = TFBertModel.from_pretrained("bert-base-uncased")
input_ids = tok("The dog is cute", return_tensors="tf").input_ids
model(input_ids)  # throws error on GPU

@patrickvonplaten patrickvonplaten mentioned this issue Jun 18, 2020
@chrisabbott

chrisabbott commented Jun 20, 2020

Encountering the same issue here:

import tensorflow as tf
from transformers.modeling_tf_distilbert import TFDistilBertModel

tf.keras.mixed_precision.experimental.set_policy('mixed_float16')
model = TFDistilBertModel.from_pretrained('distilbert-base-uncased')

@patrickvonplaten
Contributor

Put this issue on my TF ToDo-List :-)

@Hazarapet

+1

@QixinLi

QixinLi commented Aug 7, 2020

Hi @patrickvonplaten, has this problem been fixed?
I got the same error recently with version 3.0.2.

@patrickvonplaten
Contributor

This is still an open problem... I haven't found the time yet to take a look! I will link this issue to the TF projects.

@xuxingya

This is already solved in the new version:

position_embeddings = tf.cast(self.position_embeddings(position_ids), inputs_embeds.dtype)
token_type_embeddings = tf.cast(self.token_type_embeddings(token_type_ids), inputs_embeds.dtype)
embeddings = inputs_embeds + position_embeddings + token_type_embeddings
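
The same casting pattern can be demonstrated standalone (an illustrative sketch with made-up shapes, not code from the library):

import tensorflow as tf

inputs_embeds = tf.random.normal((1, 5, 8), dtype=tf.float16)  # fp16 activations under mixed precision
position_table = tf.Variable(tf.random.normal((512, 8)))       # fp32 embedding weights
position_ids = tf.range(5)

# Cast the fp32 lookup to the activations' dtype before adding, as in the fix above.
position_embeddings = tf.cast(tf.gather(position_table, position_ids), inputs_embeds.dtype)
embeddings = inputs_embeds + position_embeddings  # dtypes match, no AddV2 error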
