---
library_name: PyLaia
license: mit
tags:
- PyLaia
- PyTorch
- atr
- htr
- ocr
- modern
- handwritten
metrics:
- CER
- WER
language:
- fr
datasets:
- Teklia/rimes-2011-lines
pipeline_tag: image-to-text
---
# PyLaia - RIMES
This model performs Handwritten Text Recognition in French.
## Model description
The model has been trained using the PyLaia library on the [RIMES](https://teklia.com/research/rimes-database/) dataset.
Training images were resized to a fixed height of 128 pixels, preserving the original aspect ratio.
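As a rough illustration of the resizing rule above, the following sketch computes the output dimensions for a line image. The function name `resized_dimensions` and the rounding of the width to the nearest pixel are assumptions made for this example; the card does not specify how PyLaia rounds or interpolates.

```python
def resized_dimensions(width: int, height: int, target_height: int = 128) -> tuple[int, int]:
    """Return (new_width, new_height) for a fixed-height, aspect-preserving resize.

    Hypothetical helper: rounding to the nearest pixel is an assumption.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    # Same scale factor on both axes keeps the aspect ratio unchanged.
    scale = target_height / height
    new_width = max(1, round(width * scale))
    return new_width, target_height


# A 1200x96 line image is scaled by 128/96, giving a 1600-pixel width.
print(resized_dimensions(1200, 96))  # (1600, 128)
```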
| set | lines |
| ----- | ------: |
| train | 10,188 |
| val | 1,138 |
| test | 778 |
An external 6-gram character language model can be used to improve recognition. The language model is trained on the text from the RIMES training set.
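To make the idea of a 6-gram character language model concrete, here is a toy sketch that counts character 6-grams and estimates the probability of the next character from the previous five. It is only an unsmoothed illustration, not the way the released language model was built: the helper names, the `^` and `$` padding symbols, and the two sample lines are all assumptions, and a practical model would rely on a dedicated toolkit with smoothing.

```python
from collections import Counter


def char_ngram_counts(lines, order=6):
    """Count character n-grams of the given order over a list of text lines."""
    counts = Counter()
    for line in lines:
        # Hypothetical padding symbols marking line start and line end.
        padded = "^" * (order - 1) + line + "$"
        for i in range(len(padded) - order + 1):
            counts[padded[i:i + order]] += 1
    return counts


def next_char_probability(counts, context, char):
    """Maximum-likelihood estimate of P(char | context), without smoothing."""
    total = sum(c for gram, c in counts.items() if gram[:-1] == context)
    return counts[context + char] / total if total else 0.0


# Illustrative training text, not taken from RIMES.
counts = char_ngram_counts(["bonjour madame", "bonjour monsieur"])
print(next_char_probability(counts, "our m", "a"))  # 0.5
```

During decoding, scores of this kind are combined with the optical model's output so that character sequences that are plausible in French are preferred.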
## Evaluation results
The model achieves the following results on the RIMES test set, with and without the language model:
| set | Language model | CER (%) | WER (%) | lines |
| ------|:---------------| ----------:| -------:|--------:|
| test | no | 4.53 | 15.06 | 778 |
| test | yes | 3.47 | 10.20 | 778 |
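For readers unfamiliar with the metrics, CER and WER are edit-distance error rates computed over characters and words respectively. The sketch below shows one common way to compute them; aggregating errors over the whole set before dividing by the total reference length, and splitting words on whitespace, are assumptions, since the card does not state the exact normalization used for the reported figures.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (substitute, insert, delete)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        curr = [i]
        for j, h in enumerate(hyp, 1):
            curr.append(min(prev[j] + 1,              # deletion
                            curr[j - 1] + 1,          # insertion
                            prev[j - 1] + (r != h)))  # substitution or match
        prev = curr
    return prev[-1]


def error_rate(pairs, tokenize):
    """Total edit errors divided by total reference length, as a percentage."""
    errors = sum(edit_distance(tokenize(ref), tokenize(hyp)) for ref, hyp in pairs)
    total = sum(len(tokenize(ref)) for ref, _ in pairs)
    return 100.0 * errors / total


def cer(pairs):
    return error_rate(pairs, list)  # tokens are characters


def wer(pairs):
    return error_rate(pairs, str.split)  # tokens are whitespace-separated words


# Illustrative (reference, prediction) pair, not taken from RIMES.
pairs = [("bonjour madame", "bonjour madane")]
print(f"CER = {cer(pairs):.2f}%, WER = {wer(pairs):.2f}%")  # CER = 7.14%, WER = 50.00%
```

A single wrong character costs one character error but a whole word error, which is why WER is consistently higher than CER in the table.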
## How to use?
Please refer to the [PyLaia documentation](https://atr.pages.teklia.com/pylaia/usage/prediction/) to use this model.
## Cite us!
```bibtex
@inproceedings{pylaia2024,
author = {Tarride, Solène and Schneider, Yoann and Generali-Lince, Marie and Boillet, Mélodie and Abadie, Bastien and Kermorvant, Christopher},
title = {{Improving Automatic Text Recognition with Language Models in the PyLaia Open-Source Library}},
booktitle = {Document Analysis and Recognition - ICDAR 2024},
year = {2024},
publisher = {Springer Nature Switzerland},
address = {Cham},
pages = {387--404},
isbn = {978-3-031-70549-6}
}
```