Since the DeepLearningExamples uses fairseq to build the Transformer model, <p dir

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

How to costumize my own Transformer model? about deeplearningexamples HOT 3 CLOSED

nvidia commented on May 10, 2024

How to costumize my own Transformer model?

from deeplearningexamples.

Comments (3)

jbaczek commented on May 10, 2024

Transformer model inherits from FairseqModel. It is implemented in fairseq/models/transformer.py. Your custom model has to inherit from FairseqModel (which is defined in fairseq/models/fairseq_model.py. You have to implement build_model function which is called in taksk.build_model function. Next you have to register model with decorator @register_model. For any for predefined configurations of your custom model use decorator @register_model_architecture. I believe that after reading these two files everything will be clear.

from deeplearningexamples.

yaoyiran commented on May 10, 2024

@jbaczek Thanks for your answer! A follow-up question: if I wrote my own model e.g. fairseq/models/mymodel.py or I directly modify and overwrite fairseq/models/transformer.py, do I need to run “pip install -e . ” or "python setup.py install" again? Does fairseq/models/mymodel.py rely on setup.py?

from deeplearningexamples.

jbaczek commented on May 10, 2024

No, setup.py builds only C++ and CUDA extensions. Your code should work without rebuilding the fairseq.

from deeplearningexamples.

How to costumize my own Transformer model? about deeplearningexamples HOT 3 CLOSED

Comments (3)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent