saattrupdan
commited on
Commit
•
54c4dd4
1
Parent(s):
b6b9edd
update model card README.md
Browse files
README.md
CHANGED
@@ -1,5 +1,5 @@
|
|
1 |
---
|
2 |
-
license:
|
3 |
tags:
|
4 |
- generated_from_trainer
|
5 |
model-index:
|
@@ -12,10 +12,10 @@ should probably proofread and complete it, then remove this comment. -->
|
|
12 |
|
13 |
# contract-ner-model-da
|
14 |
|
15 |
-
This model is a fine-tuned version of [
|
16 |
It achieves the following results on the evaluation set:
|
17 |
-
- Loss: 0.
|
18 |
-
- Micro F1: 0.
|
19 |
|
20 |
## Model description
|
21 |
|
@@ -42,61 +42,32 @@ The following hyperparameters were used during training:
|
|
42 |
- total_train_batch_size: 32
|
43 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
44 |
- lr_scheduler_type: linear
|
45 |
-
- lr_scheduler_warmup_steps:
|
46 |
- num_epochs: 500
|
47 |
|
48 |
### Training results
|
49 |
|
50 |
-
| Training Loss | Epoch
|
51 |
-
|
52 |
-
| 0.
|
53 |
-
| 0.
|
54 |
-
| 0.
|
55 |
-
| 0.
|
56 |
-
| 0.
|
57 |
-
| 0.
|
58 |
-
| 0.
|
59 |
-
| 0.
|
60 |
-
| 0.
|
61 |
-
| 0.
|
62 |
-
| 0.
|
63 |
-
| 0.
|
64 |
-
| 0.
|
65 |
-
| 0.
|
66 |
-
| 0.0004 | 66.66 | 3000 | 0.0218 | 0.8937 |
|
67 |
-
| 0.0003 | 71.11 | 3200 | 0.0214 | 0.9039 |
|
68 |
-
| 0.0003 | 75.55 | 3400 | 0.0251 | 0.8958 |
|
69 |
-
| 0.0003 | 79.99 | 3600 | 0.0210 | 0.8943 |
|
70 |
-
| 0.0004 | 84.44 | 3800 | 0.0227 | 0.8995 |
|
71 |
-
| 0.0002 | 88.88 | 4000 | 0.0226 | 0.9116 |
|
72 |
-
| 0.0002 | 93.33 | 4200 | 0.0209 | 0.9121 |
|
73 |
-
| 0.0001 | 97.77 | 4400 | 0.0225 | 0.9105 |
|
74 |
-
| 0.0003 | 102.22 | 4600 | 0.0278 | 0.9039 |
|
75 |
-
| 0.0003 | 106.66 | 4800 | 0.0223 | 0.9009 |
|
76 |
-
| 0.0002 | 111.11 | 5000 | 0.0274 | 0.9123 |
|
77 |
-
| 0.0003 | 115.55 | 5200 | 0.0236 | 0.9123 |
|
78 |
-
| 0.0001 | 119.99 | 5400 | 0.0235 | 0.9059 |
|
79 |
-
| 0.0001 | 124.44 | 5600 | 0.0220 | 0.9165 |
|
80 |
-
| 0.0002 | 128.88 | 5800 | 0.0265 | 0.9021 |
|
81 |
-
| 0.0001 | 133.33 | 6000 | 0.0225 | 0.9029 |
|
82 |
-
| 0.0002 | 137.77 | 6200 | 0.0249 | 0.9166 |
|
83 |
-
| 0.0004 | 142.22 | 6400 | 0.0192 | 0.9128 |
|
84 |
-
| 0.0002 | 146.66 | 6600 | 0.0181 | 0.9313 |
|
85 |
-
| 0.0001 | 151.11 | 6800 | 0.0191 | 0.9269 |
|
86 |
-
| 0.0001 | 155.55 | 7000 | 0.0224 | 0.9154 |
|
87 |
-
| 0.0002 | 159.99 | 7200 | 0.0192 | 0.9171 |
|
88 |
-
| 0.0002 | 164.44 | 7400 | 0.0202 | 0.9217 |
|
89 |
-
| 0.0001 | 168.88 | 7600 | 0.0185 | 0.9171 |
|
90 |
-
| 0.0001 | 173.33 | 7800 | 0.0207 | 0.9201 |
|
91 |
-
| 0.0001 | 177.77 | 8000 | 0.0226 | 0.9121 |
|
92 |
-
| 0.0001 | 182.22 | 8200 | 0.0208 | 0.9191 |
|
93 |
-
| 0.0002 | 186.66 | 8400 | 0.0248 | 0.9184 |
|
94 |
-
| 0.0002 | 191.11 | 8600 | 0.0213 | 0.9217 |
|
95 |
|
96 |
|
97 |
### Framework versions
|
98 |
|
99 |
- Transformers 4.11.3
|
100 |
-
- Pytorch 1.
|
101 |
-
- Datasets 1.
|
102 |
- Tokenizers 0.10.3
|
|
|
1 |
---
|
2 |
+
license: mit
|
3 |
tags:
|
4 |
- generated_from_trainer
|
5 |
model-index:
|
|
|
12 |
|
13 |
# contract-ner-model-da
|
14 |
|
15 |
+
This model is a fine-tuned version of [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) on the None dataset.
|
16 |
It achieves the following results on the evaluation set:
|
17 |
+
- Loss: 0.0026
|
18 |
+
- Micro F1: 0.9297
|
19 |
|
20 |
## Model description
|
21 |
|
|
|
42 |
- total_train_batch_size: 32
|
43 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
44 |
- lr_scheduler_type: linear
|
45 |
+
- lr_scheduler_warmup_steps: 919
|
46 |
- num_epochs: 500
|
47 |
|
48 |
### Training results
|
49 |
|
50 |
+
| Training Loss | Epoch | Step | Validation Loss | Micro F1 |
|
51 |
+
|:-------------:|:-----:|:----:|:---------------:|:--------:|
|
52 |
+
| 0.8971 | 0.24 | 200 | 0.0205 | 0.0 |
|
53 |
+
| 0.0173 | 0.48 | 400 | 0.0100 | 0.2921 |
|
54 |
+
| 0.0092 | 0.73 | 600 | 0.0065 | 0.7147 |
|
55 |
+
| 0.0063 | 0.97 | 800 | 0.0046 | 0.8332 |
|
56 |
+
| 0.0047 | 1.21 | 1000 | 0.0047 | 0.8459 |
|
57 |
+
| 0.0042 | 1.45 | 1200 | 0.0039 | 0.8694 |
|
58 |
+
| 0.0037 | 1.69 | 1400 | 0.0035 | 0.8888 |
|
59 |
+
| 0.0032 | 1.93 | 1600 | 0.0035 | 0.8840 |
|
60 |
+
| 0.0025 | 2.18 | 1800 | 0.0029 | 0.8943 |
|
61 |
+
| 0.0023 | 2.42 | 2000 | 0.0024 | 0.9104 |
|
62 |
+
| 0.0023 | 2.66 | 2200 | 0.0032 | 0.8808 |
|
63 |
+
| 0.0021 | 2.9 | 2400 | 0.0022 | 0.9338 |
|
64 |
+
| 0.0018 | 3.14 | 2600 | 0.0020 | 0.9315 |
|
65 |
+
| 0.0015 | 3.39 | 2800 | 0.0026 | 0.9297 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
66 |
|
67 |
|
68 |
### Framework versions
|
69 |
|
70 |
- Transformers 4.11.3
|
71 |
+
- Pytorch 1.8.1+cu101
|
72 |
+
- Datasets 1.12.1
|
73 |
- Tokenizers 0.10.3
|