We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Is it possible to implement weight share of input and output embedding? It will save a lot of params for a small model!
Activity