v1.5.3 Happy New Year

hankcs released this 31 Dec 04:09


7d11036

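The tokenizer's thread count for multithreaded segmentation now defaults to the number of CPU cores on the system
Index mode now lets you choose the minimum granularity of segmentation results: #670
Recognize numbers with thousands separators; fix toString() in BaseNode
Fine-tuned the person name recognition model and ngram; revised the supplementary Modern Chinese dictionary and the Simplified/Traditional Chinese dictionaries
Made word2vec command-line argument parsing compatible with the original implementation: #699
Corrected the CRF Tag method: #703
Fixed a word2vec caching issue: #718
New word discovery filtering now uses LinkedList: #724
Model loading failures now uniformly throw new IllegalArgumentException; see #477 #116
The data package is still compatible with data-for-1.5.3.zip: mainland China netdisk or overseas link, md5=cadc96db94c3df070855706bb0f8429e
To get the latest data package, fork the repository and git clone https://github.com/YourName/HanLP.git.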
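
Since an md5 is published for data-for-1.5.3.zip, it may be worth checking the download before unpacking it. The following is a minimal sketch using only the JDK, not part of HanLP itself; the class name `Md5Check`, its methods, and the default file location are hypothetical and chosen for illustration.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical helper, not part of HanLP: verifies a downloaded data package.
class Md5Check {
    // Checksum published in these release notes for data-for-1.5.3.zip.
    static final String EXPECTED_MD5 = "cadc96db94c3df070855706bb0f8429e";

    // Streams the input through MD5 and returns the digest as lowercase hex.
    static String md5Hex(InputStream in) {
        try {
            MessageDigest digest = MessageDigest.getInstance("MD5");
            byte[] buffer = new byte[8192];
            int read;
            while ((read = in.read(buffer)) != -1) {
                digest.update(buffer, 0, read);
            }
            StringBuilder hex = new StringBuilder();
            for (byte b : digest.digest()) {
                hex.append(String.format("%02x", b & 0xff));
            }
            return hex.toString();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        } catch (NoSuchAlgorithmException e) {
            // MD5 is required on every Java platform, so this is not expected.
            throw new IllegalStateException(e);
        }
    }

    // Returns true if the file's MD5 equals the expected hex string.
    static boolean matches(Path file, String expectedMd5) {
        try (InputStream in = Files.newInputStream(file)) {
            return md5Hex(in).equalsIgnoreCase(expectedMd5);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // Assumption: the zip sits in the working directory unless a path is given.
        Path zip = Paths.get(args.length > 0 ? args[0] : "data-for-1.5.3.zip");
        System.out.println(matches(zip, EXPECTED_MD5) ? "MD5 OK" : "MD5 mismatch");
    }
}
```

Run it as `java Md5Check path/to/data-for-1.5.3.zip`; a mismatch usually points to an incomplete or corrupted download.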

The Portable version has also been upgraded to v1.5.3

        <dependency>
            <groupId>com.hankcs</groupId>
            <artifactId>hanlp</artifactId>
            <version>portable-1.5.3</version>
        </dependency>

🎉 Thanks to all contributors and to every user who offered valuable suggestions in the issues!
