klasocki commited on
Commit
c3fa42a
1 Parent(s): 72fb4d7

Add the evaluation notebook

Browse files
Files changed (3) hide show
  1. README.md +7 -1
  2. notebooks/evaluation.ipynb +0 -0
  3. setup.py +1 -0
README.md CHANGED
@@ -18,7 +18,7 @@ app_port: 8000
18
 
19
  `git push hub` to deploy to huggingface hub, after adding a remote
20
 
21
- Multi-stage build brings down the size from 9GB+ to around 7GB.
22
  Less not possible most likely, due to the size of torch and models.
23
 
24
  Reported token classification F1 scores on commas for different languages, on a political speeches' dataset:
@@ -27,3 +27,9 @@ Reported token classification F1 scores on commas for different languages, on a
27
  |---------|--------|--------|---------|
28
  | 0.819 | 0.945 | 0.831 | 0.798 |
29
 
 
 
 
 
 
 
 
18
 
19
  `git push hub` to deploy to huggingface hub, after adding a remote
20
 
21
+ Multi-stage build brings down the size from 9GB+ to around 7GB.
22
  Less not possible most likely, due to the size of torch and models.
23
 
24
  Reported token classification F1 scores on commas for different languages, on a political speeches' dataset:
 
27
  |---------|--------|--------|---------|
28
  | 0.819 | 0.945 | 0.831 | 0.798 |
29
 
30
+ Evaluation of the baseline model on the wikitext-103-raw-v1 validation dataset:
31
+
32
+ | precision | recall | F1 | support |
33
+ |-----------|--------|------|---------|
34
+ | 0.79 | 0.71 | 0.75 | 10079 |
35
+
notebooks/evaluation.ipynb ADDED
The diff for this file is too large to render. See raw diff
 
setup.py CHANGED
@@ -21,6 +21,7 @@ setup(
21
  extras_require={
22
  'training': [
23
  'datasets==2.14.4',
 
24
  'notebook'
25
  ],
26
  'test': [
 
21
  extras_require={
22
  'training': [
23
  'datasets==2.14.4',
24
+ 'seqeval'
25
  'notebook'
26
  ],
27
  'test': [