Update README.md
Browse files
README.md
CHANGED
@@ -1,25 +1,25 @@
|
|
1 |
---
|
2 |
license: mit
|
3 |
---
|
4 |
-
**
|
5 |
|
6 |
-
+
|
7 |
```bash
|
8 |
pip install ctranslate2~=3.20.0
|
9 |
```
|
10 |
```bash
|
11 |
pip install subword-nmt
|
12 |
```
|
13 |
-
+
|
14 |
```bash
|
15 |
subword-nmt apply-bpe -c gl-detok10k.code < input_file.txt > input_file_bpe.txt
|
16 |
```
|
17 |
|
18 |
-
+
|
19 |
```bash
|
20 |
python3 trans_ct2.py ct2_detok-gl-zh input_file_bpe.txt >output_file_bpe.txt
|
21 |
```
|
22 |
-
+
|
23 |
|
24 |
```bash
|
25 |
cat out_test_bpe.txt | sed "s/@@ //g" > output_file.txt
|
|
|
1 |
---
|
2 |
license: mit
|
3 |
---
|
4 |
+
**How to translate with this model**
|
5 |
|
6 |
+
+ Install [Python 3.9](https://www.python.org/downloads/release/python-390/) + ctranslate 2 + subword-nmt
|
7 |
```bash
|
8 |
pip install ctranslate2~=3.20.0
|
9 |
```
|
10 |
```bash
|
11 |
pip install subword-nmt
|
12 |
```
|
13 |
+
+ tokenization with BPE:
|
14 |
```bash
|
15 |
subword-nmt apply-bpe -c gl-detok10k.code < input_file.txt > input_file_bpe.txt
|
16 |
```
|
17 |
|
18 |
+
+ Translating an input_text using ct2_detok-gl-zh:
|
19 |
```bash
|
20 |
python3 trans_ct2.py ct2_detok-gl-zh input_file_bpe.txt >output_file_bpe.txt
|
21 |
```
|
22 |
+
+ DeBPEar output txt:
|
23 |
|
24 |
```bash
|
25 |
cat out_test_bpe.txt | sed "s/@@ //g" > output_file.txt
|