File size: 1,751 Bytes
418534d
 
 
 
 
 
4a70227
 
 
 
 
418534d
 
 
 
 
e39226a
 
4a70227
418534d
40c44e5
4a70227
40c44e5
418534d
40c44e5
418534d
40c44e5
418534d
6c8ff29
418534d
40c44e5
4e8a8df
270d9e1
6c8ff29
 
 
 
 
40c44e5
418534d
 
 
 
4e8a8df
270d9e1
4e8a8df
 
418534d
4cb9f00
418534d
4e8a8df
859137b
4cb9f00
859137b
 
4e8a8df
 
 
 
 
 
 
 
 
859137b
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
---
library_name: PyLaia
license: mit
tags:
- PyLaia
- PyTorch
- atr
- htr
- ocr
- modern
- handwritten
metrics:
- CER
- WER
language:
- fr
datasets:
- Teklia/rimes-2011-lines
pipeline_tag: image-to-text
---

# PyLaia - RIMES

This model performs Handwritten Text Recognition in French.

## Model description

The model has been trained using the PyLaia library on the [RIMES](https://teklia.com/research/rimes-database/) dataset.

Training images were resized with a fixed height of 128 pixels, keeping the original aspect ratio.

| set   | lines   | 
|:----- | ------: | 
| train | 10,188  |
| val   |  1,138  |
| test  |    778  |

An external 6-gram character language model can be used to improve recognition. The language model is trained on the text from the RIMES training set.

## Evaluation results

The model achieves the following results:

| set   | Language model | CER (%)    | WER (%) | lines   |
|:------|:---------------| ----------:| -------:|--------:|
| test  | no             | 4.53       | 15.06   |    778  |
| test  | yes            | 3.47       | 10.20   |    778  |

## How to use?

Please refer to the [PyLaia documentation](https://atr.pages.teklia.com/pylaia/usage/prediction/) to use this model.

## Cite us!

```bibtex
@inproceedings{pylaia2024,
    author = {Tarride, Solène and Schneider, Yoann and Generali-Lince, Marie and Boillet, Mélodie and Abadie, Bastien and Kermorvant, Christopher},
    title = {{Improving Automatic Text Recognition with Language Models in the PyLaia Open-Source Library}},
    booktitle = {Document Analysis and Recognition - ICDAR 2024},
    year = {2024},
    publisher = {Springer Nature Switzerland},
    address = {Cham},
    pages = {387--404},
    isbn = {978-3-031-70549-6}
}
```