buruzaemon
commited on
Commit
•
c19c52c
1
Parent(s):
9f85ee2
Update README.md
Browse files
README.md
CHANGED
@@ -30,6 +30,8 @@ The training and evaluation data come straight from the `train` and `validation`
|
|
30 |
|
31 |
## Training procedure
|
32 |
|
|
|
|
|
33 |
### Training hyperparameters
|
34 |
|
35 |
The following hyperparameters were used during training:
|
|
|
30 |
|
31 |
## Training procedure
|
32 |
|
33 |
+
Please see page 224 in Chapter 8: Making Transformers Efficient in Production, Natural Language Processing with Transformers, May 2022.
|
34 |
+
|
35 |
### Training hyperparameters
|
36 |
|
37 |
The following hyperparameters were used during training:
|