pioNER

Introduced by Ghukasyan et al. in pioNER: Datasets and Baselines for Armenian Named Entity Recognition

The pioNER corpus provides gold-standard and automatically generated named-entity datasets for the Armenian language. The automatically generated corpus is generated from Wikipedia. The gold-standard set is a collection of over 250 news articles from iLur.am with manual named-entity annotation. It includes sentences from political, sports, local and world news, and is comparable in size with the test sets of other languages.

Source: https://github.com/ispras-texterra/pioner

Homepage