VIDEO DOI: https://doi.org/10.48448/9cea-q259

technical paper

NAACL 2021

June 07, 2021

Live on Underline

On the Transformer Growth for Progressive BERT Training

Please log in to leave a comment

Downloads

PaperTranscript English (automatic)

Next from NAACL 2021

Attention Head Masking for Inference Time Content Selection in Abstractive Summarization
technical paper

Attention Head Masking for Inference Time Content Selection in Abstractive Summarization

NAACL 2021

Shuyang Cao
Shuyang Cao and 1 other author

07 June 2021

Similar lecture

Neural Machine Translation without Embeddings
technical paper

Neural Machine Translation without Embeddings

NAACL 2021

Omer LevyUri Shaham
Uri Shaham and 1 other author

07 June 2021