boda commited on
Commit
ed4d191
1 Parent(s): 0e55bc2

add results to readme

Browse files
Files changed (1) hide show
  1. README.md +43 -27
README.md CHANGED
@@ -1,31 +1,33 @@
1
  ---
2
  language:
3
- - ar
4
-
5
- thumbnail: "url to a thumbnail used in social sharing"
6
  tags:
7
- - ner
8
- - token-classification
9
- - Arabic-NER
10
-
11
  metrics:
12
- - accuracy
13
- - f1
14
- - precision
15
- - recall
16
-
17
  widget:
18
- - text: "النجم محمد صلاح لاعب المنتخب المصري يعيش في مصر بالتحديد من نجريج, الشرقية"
19
- example_title: "Mohamed Salah"
20
- - text: "انا ساكن في حدايق الزتون و بدرس في جامعه عين شمس"
21
- example_title: "Egyptian Dialect"
22
- - text: "يقع نهر الأمازون في قارة أمريكا الجنوبية"
23
- example_title: "Standard Arabic"
24
-
25
  datasets:
26
- - Fine-grained-Arabic-Named-Entity-Corpora
 
27
  ---
28
 
 
 
 
 
29
  # Arabic Named Entity Recognition
30
 
31
  This project is made to enrich the Arabic Named Entity Recognition(ANER). Arabic is a tough language to deal with and has alot of difficulties.
@@ -37,9 +39,26 @@ Here's the paper that contains all the details for our model, our approach, and
37
 
38
  - [ANER Paper](https://drive.google.com/file/d/1jJn3iWqOeLzaNvO-6aKfgidzJlWOtvti/view?usp=sharing)
39
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
40
  # Usage
41
 
42
- The model is available in HuggingFace model page under the name: [boda/ANER](https://huggingface.co/boda/ANER). Checkpoints are available only in PyTorch at the time.
43
 
44
  ### Use in python:
45
 
@@ -51,15 +70,12 @@ tokenizer = AutoTokenizer.from_pretrained("boda/ANER")
51
  model = AutoModelForTokenClassification.from_pretrained("boda/ANER")
52
  ```
53
 
54
- # Dataset
55
-
56
- - [Fine-grained Arabic Named Entity Corpora](https://fsalotaibi.kau.edu.sa/Pages-Arabic-NE-Corpora.aspx)
57
 
58
  # Acknowledgments
59
 
60
- Thanks for [Arabert](https://github.com/aub-mind/arabert) for providing the Arabic Bert model, which we used as a base model for our work.
61
 
62
- We also would like to thank [Prof. Fahd Saleh S Alotaibi](https://fsalotaibi.kau.edu.sa/Pages-Arabic-NE-Corpora.aspx) at Faculty of Computing and Information Technology King Abdulaziz University, for providing the dataset which we used to train our model with.
63
 
64
  # Contacts
65
 
@@ -67,4 +83,4 @@ We also would like to thank [Prof. Fahd Saleh S Alotaibi](https://fsalotaibi.kau
67
 
68
  - [LinkedIn](linkedin.com/in/boda-sadalla)
69
  - [Github](https://github.com/BodaSadalla98)
70
- - <boda998@yahoo.com>
 
1
  ---
2
  language:
3
+ - ar
4
+ - fr
5
+ thumbnail: url to a thumbnail used in social sharing
6
  tags:
7
+ - ner
8
+ - token-classification
9
+ - Arabic-NER
 
10
  metrics:
11
+ - accuracy
12
+ - f1
13
+ - precision
14
+ - recall
 
15
  widget:
16
+ - text: النجم محمد صلاح لاعب المنتخب المصري يعيش في مصر بالتحديد من نجريج, الشرقية
17
+ example_title: Mohamed Salah
18
+ - text: انا ساكن في حدايق الزتون و بدرس في جامعه عين شمس
19
+ example_title: Egyptian Dialect
20
+ - text: يقع نهر الأمازون في قارة أمريكا الجنوبية
21
+ example_title: Standard Arabic
 
22
  datasets:
23
+ - Fine-grained-Arabic-Named-Entity-Corpora
24
+ pipeline_tag: token-classification
25
  ---
26
 
27
+
28
+
29
+
30
+
31
  # Arabic Named Entity Recognition
32
 
33
  This project is made to enrich the Arabic Named Entity Recognition(ANER). Arabic is a tough language to deal with and has alot of difficulties.
 
39
 
40
  - [ANER Paper](https://drive.google.com/file/d/1jJn3iWqOeLzaNvO-6aKfgidzJlWOtvti/view?usp=sharing)
41
 
42
+
43
+
44
+ # Dataset
45
+
46
+ - [Fine-grained Arabic Named Entity Corpora](https://fsalotaibi.kau.edu.sa/Pages-Arabic-NE-Corpora.aspx)
47
+
48
+
49
+ # Evaluation results
50
+
51
+ The model achieves the following results:
52
+
53
+ | Dataset | WikiFANE Gold | WikiFANE Gold | WikiFANE Gold | NewsFANE Gold | NewsFANE Gold | NewsFANE Gold
54
+ |:--------:|:-------:|:-------:|:------:|:------:|:---------:|:------:|
55
+ | (metric) | (Recall) | (Precision) | (F1) | (Recall) | (Precision) | (F1)
56
+ | | 87.0 | 90.5 | 88.7 | 78.1 | 77.4 | 77.7
57
+
58
+
59
  # Usage
60
 
61
+ The model is available on the HuggingFace model page under the name: [boda/ANER](https://huggingface.co/boda/ANER). Checkpoints are available only in PyTorch at the time.
62
 
63
  ### Use in python:
64
 
 
70
  model = AutoModelForTokenClassification.from_pretrained("boda/ANER")
71
  ```
72
 
 
 
 
73
 
74
  # Acknowledgments
75
 
76
+ Thanks to [Arabert](https://github.com/aub-mind/arabert) for providing the Arabic Bert model, which we used as a base model for our work.
77
 
78
+ We also would like to thank [Prof. Fahd Saleh S Alotaibi](https://fsalotaibi.kau.edu.sa/Pages-Arabic-NE-Corpora.aspx) at the Faculty of Computing and Information Technology King Abdulaziz University, for providing the dataset which we used to train our model with.
79
 
80
  # Contacts
81
 
 
83
 
84
  - [LinkedIn](linkedin.com/in/boda-sadalla)
85
  - [Github](https://github.com/BodaSadalla98)
86
+ - <bodasadallah@yahoo.com>