PereLluis13 commited on
Commit
2c13e4b
1 Parent(s): 732e354

Training in progress, step 23000

Browse files
.ipynb_checkpoints/README-checkpoint.md ADDED
@@ -0,0 +1,158 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - ca
4
+ license: apache-2.0
5
+ tags:
6
+ - automatic-speech-recognition
7
+ - mozilla-foundation/common_voice_8_0
8
+ - collectivat/tv3_parla
9
+ - projecte-aina/parlament_parla
10
+ - generated_from_trainer
11
+ - robust-speech-event
12
+ datasets:
13
+ - mozilla-foundation/common_voice_8_0
14
+ - collectivat/tv3_parla
15
+ - projecte-aina/parlament_parla
16
+ model-index:
17
+ - name: wav2vec2-xls-r-300m-ca
18
+ results:
19
+ - task:
20
+ name: Speech Recognition
21
+ type: automatic-speech-recognition
22
+ dataset:
23
+ name: mozilla-foundation/common_voice_8_0 ca
24
+ type: mozilla-foundation/common_voice_8_0
25
+ args: ca
26
+ metrics:
27
+ - name: Test WER
28
+ type: wer
29
+ value: 0.15636874077301
30
+ - name: Test CER
31
+ type: cer
32
+ value: 0.04086725403909639
33
+ - task:
34
+ name: Speech Recognition
35
+ type: automatic-speech-recognition
36
+ dataset:
37
+ name: projecte-aina/parlament_parla ca
38
+ type: projecte-aina/parlament_parla
39
+ args: clean
40
+ metrics:
41
+ - name: Test WER
42
+ type: wer
43
+ value: 0.09940385143350199
44
+ - name: Test CER
45
+ type: cer
46
+ value: 0.026906712890009454
47
+ - task:
48
+ name: Speech Recognition
49
+ type: automatic-speech-recognition
50
+ dataset:
51
+ name: collectivat/tv3_parla ca
52
+ type: collectivat/tv3_parla
53
+ args: ca
54
+ metrics:
55
+ - name: Test WER
56
+ type: wer
57
+ value: 0.27349193517342263
58
+ - name: Test CER
59
+ type: cer
60
+ value: 0.11571091827304163
61
+ ---
62
+
63
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
64
+ should probably proofread and complete it, then remove this comment. -->
65
+
66
+ # wav2vec2-xls-r-300m-ca
67
+
68
+ This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the MOZILLA-FOUNDATION/COMMON_VOICE_8_0 - CA dataset.
69
+ It achieves the following results on the evaluation set:
70
+ - Loss: 0.2549
71
+ - Wer: 0.1573
72
+
73
+ ## Model description
74
+
75
+ More information needed
76
+
77
+ ## Intended uses & limitations
78
+
79
+ More information needed
80
+
81
+ ## Training and evaluation data
82
+
83
+ More information needed
84
+
85
+ ## Training procedure
86
+
87
+ ### Training hyperparameters
88
+
89
+ The following hyperparameters were used during training:
90
+ - learning_rate: 7.5e-05
91
+ - train_batch_size: 32
92
+ - eval_batch_size: 32
93
+ - seed: 42
94
+ - gradient_accumulation_steps: 4
95
+ - total_train_batch_size: 128
96
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
97
+ - lr_scheduler_type: linear
98
+ - lr_scheduler_warmup_steps: 2000
99
+ - num_epochs: 12.0
100
+ - mixed_precision_training: Native AMP
101
+
102
+ ### Training results
103
+
104
+ | Training Loss | Epoch | Step | Validation Loss | Wer |
105
+ |:-------------:|:-----:|:-----:|:---------------:|:------:|
106
+ | 6.2099 | 0.09 | 500 | 3.4125 | 1.0 |
107
+ | 2.9961 | 0.18 | 1000 | 2.9224 | 1.0 |
108
+ | 2.2147 | 0.26 | 1500 | 0.6521 | 0.5568 |
109
+ | 1.3017 | 0.35 | 2000 | 0.3153 | 0.2761 |
110
+ | 1.1196 | 0.44 | 2500 | 0.2444 | 0.2367 |
111
+ | 1.0712 | 0.53 | 3000 | 0.2324 | 0.2132 |
112
+ | 1.052 | 0.62 | 3500 | 0.2173 | 0.2032 |
113
+ | 1.2813 | 2.13 | 4000 | 0.3326 | 0.2099 |
114
+ | 1.2365 | 2.4 | 4500 | 0.3224 | 0.2003 |
115
+ | 1.2193 | 2.66 | 5000 | 0.3198 | 0.1957 |
116
+ | 1.2072 | 2.93 | 5500 | 0.3063 | 0.1933 |
117
+ | 1.213 | 3.2 | 6000 | 0.3051 | 0.1980 |
118
+ | 1.2074 | 3.46 | 6500 | 0.3012 | 0.1879 |
119
+ | 1.1918 | 3.73 | 7000 | 0.2947 | 0.1829 |
120
+ | 1.1893 | 4.0 | 7500 | 0.2895 | 0.1807 |
121
+ | 1.1751 | 4.26 | 8000 | 0.2878 | 0.1776 |
122
+ | 1.1628 | 4.53 | 8500 | 0.2835 | 0.1731 |
123
+ | 1.1577 | 4.79 | 9000 | 0.2816 | 0.1761 |
124
+ | 1.1448 | 5.06 | 9500 | 0.2757 | 0.1740 |
125
+ | 1.1407 | 5.33 | 10000 | 0.2768 | 0.1798 |
126
+ | 1.1401 | 5.59 | 10500 | 0.2780 | 0.1816 |
127
+ | 1.1333 | 5.86 | 11000 | 0.2748 | 0.1750 |
128
+ | 1.1571 | 6.13 | 11500 | 0.2808 | 0.1708 |
129
+ | 1.1505 | 6.39 | 12000 | 0.2726 | 0.1692 |
130
+ | 1.1519 | 6.66 | 12500 | 0.2749 | 0.1654 |
131
+ | 1.136 | 6.93 | 13000 | 0.2765 | 0.1643 |
132
+ | 1.1326 | 7.19 | 13500 | 0.2706 | 0.1668 |
133
+ | 1.1342 | 7.46 | 14000 | 0.2665 | 0.1638 |
134
+ | 1.1286 | 7.72 | 14500 | 0.2669 | 0.1636 |
135
+ | 1.1243 | 7.99 | 15000 | 0.2619 | 0.1623 |
136
+ | 1.1173 | 8.26 | 15500 | 0.2652 | 0.1604 |
137
+ | 1.1129 | 8.52 | 16000 | 0.2610 | 0.1598 |
138
+ | 1.1091 | 8.79 | 16500 | 0.2608 | 0.1584 |
139
+ | 1.1053 | 9.06 | 17000 | 0.2633 | 0.1664 |
140
+ | 1.1004 | 9.32 | 17500 | 0.2594 | 0.1662 |
141
+ | 1.0995 | 9.59 | 18000 | 0.2623 | 0.1569 |
142
+ | 1.0964 | 9.86 | 18500 | 0.2624 | 0.1597 |
143
+ | 1.09 | 10.12 | 19000 | 0.2577 | 0.1578 |
144
+ | 1.089 | 10.39 | 19500 | 0.2574 | 0.1531 |
145
+ | 1.0864 | 10.66 | 20000 | 0.2556 | 0.1546 |
146
+ | 1.0806 | 10.92 | 20500 | 0.2548 | 0.1583 |
147
+ | 1.0842 | 11.19 | 21000 | 0.2550 | 0.1542 |
148
+ | 1.0805 | 11.45 | 21500 | 0.2561 | 0.1524 |
149
+ | 1.0722 | 11.72 | 22000 | 0.2540 | 0.1566 |
150
+ | 1.0763 | 11.99 | 22500 | 0.2549 | 0.1572 |
151
+
152
+
153
+ ### Framework versions
154
+
155
+ - Transformers 4.16.0.dev0
156
+ - Pytorch 1.10.1+cu102
157
+ - Datasets 1.18.1
158
+ - Tokenizers 0.11.0
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:38f9952471847b9dbd693d34fa642974ebb6a016e7677a8ccfb3e3458f45e32a
3
  size 1262112241
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:63649cccccbceaad36a1cd242db8bf5c80d0c57ac644ae13e78ead1877a3bbb6
3
  size 1262112241
runs/Feb01_18-08-21_job-336a688f-553a-4e6e-83b3-ad5d10274b51/1643741534.116655/events.out.tfevents.1643741534.job-336a688f-553a-4e6e-83b3-ad5d10274b51.3348585.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:24cb9f2a1f8cd9f07b463f6996a54a600e48f99b5d21f1cabc83dc60826e1698
3
+ size 4814
runs/Feb01_18-08-21_job-336a688f-553a-4e6e-83b3-ad5d10274b51/events.out.tfevents.1643741534.job-336a688f-553a-4e6e-83b3-ad5d10274b51.3348585.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d3db160cb23b387164fb7a23090b94320918cb2d6876ad129b005da6435fdf85
3
+ size 5230
special_tokens_map.json CHANGED
@@ -1 +1 @@
1
- {"bos_token": "<s>", "eos_token": "</s>", "unk_token": "[UNK]", "pad_token": "[PAD]", "additional_special_tokens": [{"content": "<s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "</s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "<s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "</s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "<s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "</s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "<s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "</s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "<s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "</s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "<s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "</s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "<s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "</s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}]}
 
1
+ {"bos_token": "<s>", "eos_token": "</s>", "unk_token": "[UNK]", "pad_token": "[PAD]", "additional_special_tokens": [{"content": "<s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "</s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "<s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "</s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "<s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "</s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "<s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "</s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "<s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "</s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "<s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "</s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "<s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "</s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "<s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}, {"content": "</s>", "single_word": false, "lstrip": false, "rstrip": false, "normalized": true}]}
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:17c7ac8ef3fa5cd56e5588b26ef042771a921380343f4f907f550dba5411e925
3
  size 3055
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:39bcfb99d35cbd0bb67a8e20a452004a5f564b287b8dc35791fc7479970c3713
3
  size 3055