我运行的是readme里面的example:
import hanlp
text = "我的希望是希望和平"
tokenizer = hanlp.load('LARGE_ALBERT_BASE')
tagger = hanlp.load(hanlp.pretrained.pos.CTB9_POS_ALBERT_BASE)
ner_recog = hanlp.load(hanlp.pretrained.ner.MSRA_NER_BERT_BASE_ZH)
tokens = tokenizer(text)
tag = tagger(tokens)
ner = ner_recog(list(text))
tok_tag = [(a, b) for a, b in zip(tokens, tag)]
print(tok_tag)
输出的结果是:
[('我', 'PN'), ('的', 'DEG'), ('希望', 'NN'), ('是', 'NN'), ('希望', 'VC'), ('和平', 'NN')]
貌似输入是token的list,但是解析出来却是按照单个字进行标注并且解析的?