Training Tips for the Transformer Model
The Prague Bulletin of Mathematical Linguistics
DOI: 10.2478/pralin-2018-0002
Date
April 1, 2018
Authors
Publisher
Walter de Gruyter GmbH