NBoukachab committed on
Commit
909eea7
•
1 Parent(s): 6a90b01

Add model files

Files changed (7)
  1. .gitattributes +5 -0
  2. README.md +29 -0
  3. cfg +3 -0
  4. model +3 -0
  5. moves +3 -0
  6. tokenizer +3 -0
  7. vectors +3 -0
.gitattributes CHANGED
@@ -32,3 +32,8 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+model filter=lfs diff=lfs merge=lfs -text
+cfg filter=lfs diff=lfs merge=lfs -text
+moves filter=lfs diff=lfs merge=lfs -text
+tokenizer filter=lfs diff=lfs merge=lfs -text
+vectors filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,3 +1,32 @@
 ---
+library_name: Spacy
 license: mit
+tags:
+- Spacy
+language:
+- 'lat'
 ---
+
+## Model description
+
+The model has been trained using the Spacy library on the [NorHand](https://zenodo.org/record/5600884) document annotations.
+
+# Cite us!
+
+```bibtex
+@inproceedings{10.1007/978-3-031-06555-2_29,
+author = {Monroc, Claire Bizon and Miret, Blanche and Bonhomme, Marie-Laurence and Kermorvant, Christopher},
+title = {A Comprehensive Study Of Open-Source Libraries For Named Entity Recognition On Handwritten Historical Documents},
+year = {2022},
+isbn = {978-3-031-06554-5},
+publisher = {Springer-Verlag},
+address = {Berlin, Heidelberg},
+url = {https://doi.org/10.1007/978-3-031-06555-2_29},
+doi = {10.1007/978-3-031-06555-2_29},
+abstract = {In this paper, we propose an evaluation of several state-of-the-art open-source natural language processing (NLP) libraries for named entity recognition (NER) on handwritten historical documents: spaCy, Stanza and Flair. The comparison is carried out on three low-resource multilingual datasets of handwritten historical documents: HOME (a multilingual corpus of medieval charters), Balsac (a corpus of parish records from Quebec), and Esposalles (a corpus of marriage records in Catalan). We study the impact of the document recognition processes (text line detection and handwriting recognition) on the performance of the NER. We show that current off-the-shelf NER libraries yield state-of-the-art results, even on low-resource languages or multilingual documents using multilingual models. We show, in an end-to-end evaluation, that text line detection errors have a greater impact than handwriting recognition errors. Finally, we also report state-of-the-art results on the public Esposalles dataset.},
+booktitle = {Document Analysis Systems: 15th IAPR International Workshop, DAS 2022, La Rochelle, France, May 22–25, 2022, Proceedings},
+pages = {429–444},
+numpages = {16},
+keywords = {Text line detection, Named entity recognition, Handwritten historical documents},
+location = {La Rochelle, France}
+}
cfg ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b9cf14ffd96ed50ffbfbfcc7b98a7417a087ef2d4ea86a67099bbb4ad9c72ae5
+size 354
model ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:5bcc3aace622daba2e44ff658f25612b0941bbcaab0cb5ed68652dce12dbc7ad
+size 4003764
moves ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:bf6d131d83b3572cf12b7223e19ec79dd0410c13f458215c0d506a5aaed2b67f
+size 157
tokenizer ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:7549ddabd3937f9c40d12a9d45493174821cd1fe10156dae3299ad226982835e
+size 19566
vectors ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:14772b683e726436d5948ad3fff2b43d036ef2ebbe3458aafed6004e05a40706
+size 128
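
Note: the five added files (`cfg`, `model`, `moves`, `tokenizer`, `vectors`) are stored as Git LFS pointers, so the diff shows only a `version`, `oid` and `size` line for each of them, not the model data itself. The snippet below is a minimal sketch of how such a pointer could be read in Python. The names `LfsPointer` and `parse_lfs_pointer` are hypothetical helpers introduced for illustration and are not part of Git LFS or of this repository. The sample text is copied from the `cfg` pointer above.

```python
from typing import NamedTuple


class LfsPointer(NamedTuple):
    """Fields of a Git LFS pointer file (hypothetical helper type)."""
    version: str    # spec URL, e.g. https://git-lfs.github.com/spec/v1
    algorithm: str  # hash algorithm named before the colon in the oid line
    digest: str     # hex digest of the real file content
    size: int       # size of the real file in bytes


def parse_lfs_pointer(text: str) -> LfsPointer:
    """Parse the space-separated key/value lines of a pointer file."""
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue  # ignore blank lines
        key, _, value = line.partition(" ")
        fields[key] = value.strip()
    algorithm, _, digest = fields["oid"].partition(":")
    return LfsPointer(fields["version"], algorithm, digest, int(fields["size"]))


# Pointer content of the `cfg` file, as shown in the diff above.
CFG_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:b9cf14ffd96ed50ffbfbfcc7b98a7417a087ef2d4ea86a67099bbb4ad9c72ae5
size 354
"""

pointer = parse_lfs_pointer(CFG_POINTER)
print(pointer.algorithm, pointer.size)
```

Comparing `pointer.size` and `pointer.digest` against a downloaded file is one way to check that the real content was fetched and not the pointer text. This sketch does not fetch anything itself.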