Towards Improving Neural Named Entity Recognition with Gazetteers

This paper trains an auxiliary tagger (the sub-tagger) to predict the BILOU tags generated by gazetteers and concatenates the sub-tagger's feature representation to an NER tagger. The authors report a score of 92.75 on CoNLL03.
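
For readers unfamiliar with the setup, here is a minimal sketch of how BILOU tags could be generated from a gazetteer by greedy longest-match lookup. This is an illustration under stated assumptions, not the paper's actual procedure: the function name `gazetteer_bilou`, the `max_len` window, the lowercased matching, and the toy gazetteer entries are all hypothetical.

```python
from typing import Dict, List, Tuple


def gazetteer_bilou(
    tokens: List[str],
    gazetteer: Dict[Tuple[str, ...], str],
    max_len: int = 5,  # assumed cap on entry length, not taken from the paper
) -> List[str]:
    """Tag tokens with BILOU labels via greedy longest-match gazetteer lookup."""
    tags = ["O"] * len(tokens)
    i = 0
    while i < len(tokens):
        matched = False
        # Try the longest candidate span starting at position i first.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            span = tuple(t.lower() for t in tokens[i:i + n])
            label = gazetteer.get(span)
            if label is None:
                continue
            if n == 1:
                tags[i] = f"U-{label}"  # Unit-length entity
            else:
                tags[i] = f"B-{label}"  # Beginning
                for j in range(i + 1, i + n - 1):
                    tags[j] = f"I-{label}"  # Inside
                tags[i + n - 1] = f"L-{label}"  # Last
            i += n
            matched = True
            break
        if not matched:
            i += 1  # no entry starts here; the token stays "O" (Outside)
    return tags


# Hypothetical toy gazetteer: lowercased token tuples mapped to entity types.
toy_gazetteer = {
    ("new", "york", "city"): "LOC",
    ("new", "york"): "LOC",
    ("obama",): "PER",
}
tokens = ["Obama", "visited", "New", "York", "City", "today"]
tags = gazetteer_bilou(tokens, toy_gazetteer)
print(list(zip(tokens, tags)))
```

Tags of this kind are what the sub-tagger would be trained to predict; how the paper resolves overlapping or ambiguous matches is not covered by this sketch.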


  • Is the sub-tagger trained separately or jointly with the main tagger?
  • The gazetteers have nothing to do with semi-Markov conditional random fields. The gazetteer embeddings can be concatenated to any model.
  • The appendix is missing from the PDF. It does not compile in \LaTeX.
  • The gazetteer type mapping table for CoNLL03 is copied from the OntoNotes one. It is unclear how such a careless mistake made it into an ACL paper.
  • Overall quality is very low.
