ajitrajasekharan committed
Commit c22c86d
1 Parent(s): fbfb0f8
Update README.md
README.md CHANGED
@@ -20,11 +20,16 @@ tags:
 ---
 
 
-This model was pretrained from scratch
+This model was pretrained from scratch using a custom vocabulary on the following corpora
+- Pubmed
+- Clinical trials corpus
+- and a small subset of Bookcorpus
 
 This pretrained model was used to do NER as is, **with no fine-tuning** as described [in this post](https://ajitrajasekharan.github.io/2021/01/02/my-first-post.html). [Towards Data Science review](https://twitter.com/TDataScience/status/1486300137366466560?s=20)
 
-[Github link](https://github.com/ajitrajasekharan/unsupervised_NER) to perform NER using this model in an ensemble with bert-base cased
+[Github link](https://github.com/ajitrajasekharan/unsupervised_NER) to perform NER using this model in an ensemble with bert-base cased.
+
+The ensemble detects 69 entity subtypes (17 broad entity groups)
 
 <img src="https://ajitrajasekharan.github.io/images/1.png" width="600">
 