buruzaemon
commited on
Commit
•
6257ac8
1
Parent(s):
25f5a4a
Fine-tuned student model training completed
Browse files- README.md +1 -28
- config.json +0 -1
- pytorch_model.bin +1 -1
- runs/Jan29_11-46-24_t4-us-west4-b-n1-standard-16/1706537121.2245193/events.out.tfevents.1706537121.t4-us-west4-b-n1-standard-16.53568.31 +3 -0
- runs/Jan29_11-46-24_t4-us-west4-b-n1-standard-16/1706537185.5430374/events.out.tfevents.1706537185.t4-us-west4-b-n1-standard-16.53568.33 +3 -0
- runs/Jan29_11-46-24_t4-us-west4-b-n1-standard-16/events.out.tfevents.1706536423.t4-us-west4-b-n1-standard-16.53568.27 +2 -2
- runs/Jan29_11-46-24_t4-us-west4-b-n1-standard-16/events.out.tfevents.1706537185.t4-us-west4-b-n1-standard-16.53568.32 +3 -0
- training_args.bin +1 -1
README.md
CHANGED
@@ -4,22 +4,9 @@ tags:
|
|
4 |
- generated_from_trainer
|
5 |
datasets:
|
6 |
- clinc_oos
|
7 |
-
metrics:
|
8 |
-
- accuracy
|
9 |
model-index:
|
10 |
- name: distilbert-base-uncased-finetuned-clinc
|
11 |
-
results:
|
12 |
-
- task:
|
13 |
-
name: Text Classification
|
14 |
-
type: text-classification
|
15 |
-
dataset:
|
16 |
-
name: clinc_oos
|
17 |
-
type: clinc_oos
|
18 |
-
args: plus
|
19 |
-
metrics:
|
20 |
-
- name: Accuracy
|
21 |
-
type: accuracy
|
22 |
-
value: 0.9180645161290323
|
23 |
---
|
24 |
|
25 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
@@ -28,9 +15,6 @@ should probably proofread and complete it, then remove this comment. -->
|
|
28 |
# distilbert-base-uncased-finetuned-clinc
|
29 |
|
30 |
This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the clinc_oos dataset.
|
31 |
-
It achieves the following results on the evaluation set:
|
32 |
-
- Loss: 0.7719
|
33 |
-
- Accuracy: 0.9181
|
34 |
|
35 |
## Model description
|
36 |
|
@@ -57,17 +41,6 @@ The following hyperparameters were used during training:
|
|
57 |
- lr_scheduler_type: linear
|
58 |
- num_epochs: 5
|
59 |
|
60 |
-
### Training results
|
61 |
-
|
62 |
-
| Training Loss | Epoch | Step | Validation Loss | Accuracy |
|
63 |
-
|:-------------:|:-----:|:----:|:---------------:|:--------:|
|
64 |
-
| No log | 1.0 | 318 | 3.2882 | 0.7426 |
|
65 |
-
| 3.7861 | 2.0 | 636 | 1.8744 | 0.8381 |
|
66 |
-
| 3.7861 | 3.0 | 954 | 1.1567 | 0.8958 |
|
67 |
-
| 1.6922 | 4.0 | 1272 | 0.8569 | 0.9132 |
|
68 |
-
| 0.9055 | 5.0 | 1590 | 0.7719 | 0.9181 |
|
69 |
-
|
70 |
-
|
71 |
### Framework versions
|
72 |
|
73 |
- Transformers 4.16.2
|
|
|
4 |
- generated_from_trainer
|
5 |
datasets:
|
6 |
- clinc_oos
|
|
|
|
|
7 |
model-index:
|
8 |
- name: distilbert-base-uncased-finetuned-clinc
|
9 |
+
results: []
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
10 |
---
|
11 |
|
12 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
|
15 |
# distilbert-base-uncased-finetuned-clinc
|
16 |
|
17 |
This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the clinc_oos dataset.
|
|
|
|
|
|
|
18 |
|
19 |
## Model description
|
20 |
|
|
|
41 |
- lr_scheduler_type: linear
|
42 |
- num_epochs: 5
|
43 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
44 |
### Framework versions
|
45 |
|
46 |
- Transformers 4.16.2
|
config.json
CHANGED
@@ -320,7 +320,6 @@
|
|
320 |
"n_heads": 12,
|
321 |
"n_layers": 6,
|
322 |
"pad_token_id": 0,
|
323 |
-
"problem_type": "single_label_classification",
|
324 |
"qa_dropout": 0.1,
|
325 |
"seq_classif_dropout": 0.2,
|
326 |
"sinusoidal_pos_embds": false,
|
|
|
320 |
"n_heads": 12,
|
321 |
"n_layers": 6,
|
322 |
"pad_token_id": 0,
|
|
|
323 |
"qa_dropout": 0.1,
|
324 |
"seq_classif_dropout": 0.2,
|
325 |
"sinusoidal_pos_embds": false,
|
pytorch_model.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 268320042
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:1aa1b972d6e736a33a8be818b4bac211d7755c977000d6e45f3cebcc3a420496
|
3 |
size 268320042
|
runs/Jan29_11-46-24_t4-us-west4-b-n1-standard-16/1706537121.2245193/events.out.tfevents.1706537121.t4-us-west4-b-n1-standard-16.53568.31
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:bcdf80bb399ebcec046e461016fcff1075386ab06c6f003517f24e2be773a2f3
|
3 |
+
size 4893
|
runs/Jan29_11-46-24_t4-us-west4-b-n1-standard-16/1706537185.5430374/events.out.tfevents.1706537185.t4-us-west4-b-n1-standard-16.53568.33
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:70230837289e56834f0b7955ead7027330216a381ae5407f74898cbe5601ce0f
|
3 |
+
size 4893
|
runs/Jan29_11-46-24_t4-us-west4-b-n1-standard-16/events.out.tfevents.1706536423.t4-us-west4-b-n1-standard-16.53568.27
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:4228cb3cdf2a1c13dfbe8b91254496e6857b6ab026aad06a6a479e398c964f32
|
3 |
+
size 48059
|
runs/Jan29_11-46-24_t4-us-west4-b-n1-standard-16/events.out.tfevents.1706537185.t4-us-west4-b-n1-standard-16.53568.32
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:0d3fd3611d60f664cc86d1f151687f4dedd2e5c75c1b9e3a0ab3a5ab520ee275
|
3 |
+
size 15523
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 3576
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:7c92aa77a1096d0c2d447caf69ebe89d82e04d2b8afdf17dfd61061ff88c8abd
|
3 |
size 3576
|