opened 03:05PM - 22 Aug 20 UTC
closed 03:20PM - 22 Aug 20 UTC
Describe the bug
运行测试 Demo https://github.com/hankcs/HanLP/blob/1.x/src/test/java/com/hankcs/demo/DemoURLRecognition.java ,发现不能识别出带中文顶级域名的网址。
Code to reproduce the issue
String text =
"HanLP的项目地址是https://github.com/hankcs/HanLP," +
"发布地址是https://github.com/hankcs/HanLP/releases," +
"我有时候会在www.hankcs.com上面发布一些消息," +
"我的微博是http://weibo.com/hankcs/,会同步推送hankcs.com的新闻。" +
...
bug
wontfix