Comments (3)
What's the performance like with JAX? Please try it.
Thanks for the suggestion. Here is our situation. Previously, we were running a BERT pretraining job with TensorFlow 2.11 and TensorFlow Models (TFM), with XLA, on multiple GPUs. This took about an hour.
We are migrating to Keras 3 and testing:
- With Keras 3.3.3, JAX, Keras NLP, XLA, and data parallelism, we are seeing about a 50% increase in training time.
- With Keras 3.3.3, TF, Keras NLP, TF Distribute, and no XLA, we are seeing about a 200% increase in training time. (We also need to reduce the batch size when XLA is not enabled.)
The training setup is modified as follows:
- Before, we used `add_loss()`. Now, we specify the loss in `compile()`.
- Keras 3 only reports a single combined loss as a metric, so we duplicate these losses as metrics in order to see the separate MLM and NSP losses.
- Keras 3 does not support `add_metric()`, so we also moved these to `compile()`.
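To make the second and third points concrete, here is a minimal sketch of wiring per-head losses through `compile()` and duplicating them as metrics to recover separate loss reporting. The tiny model and the head names `mlm`/`nsp` are illustrative stand-ins, not our actual BERT trunk:

```python
import numpy as np
import keras

# Toy two-headed model standing in for the BERT trunk with MLM and NSP heads.
inputs = keras.Input(shape=(4,))
x = keras.layers.Dense(8, activation="relu")(inputs)
mlm_out = keras.layers.Dense(3, activation="softmax", name="mlm")(x)
nsp_out = keras.layers.Dense(2, activation="softmax", name="nsp")(x)
model = keras.Model(inputs, [mlm_out, nsp_out])

# Per-head losses go through compile() instead of add_loss(); duplicating the
# same loss functions as per-head metrics restores separate MLM/NSP reporting.
model.compile(
    optimizer="adam",
    loss={
        "mlm": keras.losses.SparseCategoricalCrossentropy(),
        "nsp": keras.losses.SparseCategoricalCrossentropy(),
    },
    metrics={
        "mlm": [keras.metrics.SparseCategoricalCrossentropy(name="mlm_loss_metric")],
        "nsp": [keras.metrics.SparseCategoricalCrossentropy(name="nsp_loss_metric")],
    },
)

# One tiny training step to show the separate loss metrics in the history.
x_data = np.random.rand(16, 4).astype("float32")
y = {"mlm": np.random.randint(0, 3, 16), "nsp": np.random.randint(0, 2, 16)}
history = model.fit(x_data, y, epochs=1, verbose=0)
```

After `fit()`, `history.history` contains the combined `loss` plus the duplicated per-head loss metrics, which is how we track MLM and NSP losses separately.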
FYI - we removed the extra projections from the heads of the NLP trunks, and now JAX performance is about the same as TF 2.