Named Entity Recognition (NER) |
Top Previous Next |
Named entity recognition (NER) is the process of automatically classifying different unique identifiers known as named entities (NE), according to a predefined set of types. In the second part of the NCHLT Text Phase II project, automatic NER systems for each of the languages were developed to form, in most cases, baseline systems that could be used for other development projects, or as starting points from which to improve NER systems for the South African languages. Although several different techniques have been shown to be accurate approaches to NER classification, it was decided to use linear-chain conditional random fields (CRFs) with L2 regularisation, since this has been shown to be an effective and scalable technique to solve sequence labelling problems in the NER domain.
See Annotation Tag Sets for tag details.
Evaluation results for NER: (As reported in Eiselen, R., 2016. Government Domain Named Entity Recognition for South African Languages. In LREC.)
|