End of training

Browse files

Files changed (3) hide show

README.md +10 -19
model.safetensors +1 -1
runs/May30_15-00-33_tzuchichen/events.out.tfevents.1717052478.tzuchichen.47.2 +2 -2

README.md CHANGED Viewed

@@ -15,8 +15,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/git-base](https://huggingface.co/microsoft/git-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0629
-- Wer Score: 6.3548
 ## Model description
@@ -36,33 +36,24 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size: 2
-- eval_batch_size: 2
 - seed: 42
 - gradient_accumulation_steps: 2
-- total_train_batch_size: 4
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 50
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch   | Step | Validation Loss | Wer Score |
 |:-------------:|:-------:|:----:|:---------------:|:---------:|
-| 7.2993        | 3.7037  | 50   | 4.4559          | 11.1452   |
-| 2.2534        | 7.4074  | 100  | 0.4051          | 0.5       |
-| 0.1148        | 11.1111 | 150  | 0.0457          | 0.4032    |
-| 0.0153        | 14.8148 | 200  | 0.0451          | 0.4032    |
-| 0.0108        | 18.5185 | 250  | 0.0486          | 0.5161    |
-| 0.009         | 22.2222 | 300  | 0.0488          | 0.4516    |
-| 0.0066        | 25.9259 | 350  | 0.0522          | 0.7419    |
-| 0.0054        | 29.6296 | 400  | 0.0538          | 0.4032    |
-| 0.0033        | 33.3333 | 450  | 0.0575          | 5.8548    |
-| 0.0018        | 37.0370 | 500  | 0.0604          | 4.9839    |
-| 0.0009        | 40.7407 | 550  | 0.0625          | 6.2097    |
-| 0.0008        | 44.4444 | 600  | 0.0629          | 6.3226    |
-| 0.0007        | 48.1481 | 650  | 0.0629          | 6.3548    |
 ### Framework versions

 This model is a fine-tuned version of [microsoft/git-base](https://huggingface.co/microsoft/git-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0481
+- Wer Score: 3.1327
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - gradient_accumulation_steps: 2
+- total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 15
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch   | Step | Validation Loss | Wer Score |
 |:-------------:|:-------:|:----:|:---------------:|:---------:|
+| 0.0029        | 3.5714  | 50   | 0.0447          | 3.9381    |
+| 0.0007        | 7.1429  | 100  | 0.0473          | 3.3097    |
+| 0.0002        | 10.7143 | 150  | 0.0476          | 4.3274    |
+| 0.0001        | 14.2857 | 200  | 0.0481          | 3.1327    |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:eef469242a42fe52a0e6c6965bcf6857503a4cf3838a429ec9514e738bbe39fd
 size 706516040

 version https://git-lfs.github.com/spec/v1
+oid sha256:cbb81a8b8ea7afa83b07b15ab843e553d9dd2cf7a3058b51872981e71f884052
 size 706516040

runs/May30_15-00-33_tzuchichen/events.out.tfevents.1717052478.tzuchichen.47.2 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:365490875edf9e8be1fd3460186b356583dbd756eec033b24922707c1e33ab1b
-size 6473

 version https://git-lfs.github.com/spec/v1
+oid sha256:9c521da382e296003f0ceca3e16f7993c6ed13e3b43211a226447749d2545f09
+size 7362