File size: 661 Bytes
40c44e5
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
# PyLaia Rimes

## Datasets

Trained on text-lines from the [Rimes 2011 dataset](https://teklia.com/research/rimes-database/).

| split  | N lines |
|--------|--------:|
| train  | 10,188  |
| val    |  1,138  |
| test   |    778  |

## Results

* Fixed line height: 128 pixels
* Language model: 6-gram character model trained on the training set with KenLM

| Model             			  | val CER | test CER | val WER | test WER |
|:--------------------------------|--------:|---------:|--------:|---------:|
| Model without LM 	              | 4.55  	| 4.53 	   | 14.39   | 15.06    |
| Model with LM (`weight = 1.5`)  | 3.68    | 3.47 	   | 10.01 	 | 10.20    |