autoevaluator HF staff commited on
Commit
b1c0256
1 Parent(s): 4a8bb46

Add evaluation results on the samsum config and test split of samsum

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the samsum config and test split of the [samsum](https://huggingface.co/datasets/samsum) dataset by

@pszemraj

, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-samsum-samsum-29813b-2390574811).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=samsum).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=samsum).

Files changed (1) hide show
  1. README.md +34 -1
README.md CHANGED
@@ -9,7 +9,40 @@ datasets:
9
  - stacked-summaries/stacked-samsum-1024
10
  model-index:
11
  - name: flan-t5-large-stacked-samsum1024-WIP3
12
- results: []
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
13
  ---
14
 
15
 
 
9
  - stacked-summaries/stacked-samsum-1024
10
  model-index:
11
  - name: flan-t5-large-stacked-samsum1024-WIP3
12
+ results:
13
+ - task:
14
+ type: summarization
15
+ name: Summarization
16
+ dataset:
17
+ name: samsum
18
+ type: samsum
19
+ config: samsum
20
+ split: test
21
+ metrics:
22
+ - name: ROUGE-1
23
+ type: rouge
24
+ value: 47.6682
25
+ verified: true
26
+ - name: ROUGE-2
27
+ type: rouge
28
+ value: 23.3053
29
+ verified: true
30
+ - name: ROUGE-L
31
+ type: rouge
32
+ value: 39.7678
33
+ verified: true
34
+ - name: ROUGE-LSUM
35
+ type: rouge
36
+ value: 43.259
37
+ verified: true
38
+ - name: loss
39
+ type: loss
40
+ value: 2.372586965560913
41
+ verified: true
42
+ - name: gen_len
43
+ type: gen_len
44
+ value: 17.4237
45
+ verified: true
46
  ---
47
 
48