100+ Chinese Word Vectors 上百种预训练中文词向量
-
Updated
Oct 30, 2023 - Python
100+ Chinese Word Vectors 上百种预训练中文词向量
pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation
Jiagu深度学习自然语言处理工具 知识图谱关系抽取 中文分词 词性标注 命名实体识别 情感分析 新词发现 关键词 文本摘要 文本聚类
中文分词
Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
MONPA 罔拍是一個提供正體中文斷詞、詞性標註以及命名實體辨識的多任務模型
g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese
A PyTorch implementation of a BiLSTM \ BERT \ Roberta (+ BiLSTM + CRF) model for Chinese Word Segmentation (中文分词) .
基于深度学习的自然语言处理库
一个轻量且功能全面的中文分词器,帮助学生了解分词器的工作原理。MicroTokenizer: A lightweight Chinese tokenizer designed for educational and research purposes. Provides a practical, hands-on approach to understanding NLP concepts, featuring multiple tokenization algorithms and customizable models. Ideal for students, researchers, and NLP enthusiasts..
Some experiments about Machine Learning
Source code for an ACL2017 paper on Chinese word segmentation
Source codes for paper "Neural Networks Incorporating Dictionaries for Chinese Word Segmentation", AAAI 2018
利用深度学习实现中文分词
A convenient Chinese word segmentation tool 简便中文分词器
Open Source State-of-the-art Chinese Word Segmentation System with BiLSTM and ELMo. https://arxiv.org/abs/1901.05816
基于深度学习的自然语言处理库
Sub-Character Representation Learning
Berserker - BERt chineSE woRd toKenizER
Multiple Character Embeddings for Chinese Word Segmentation, ACL 2019
Add a description, image, and links to the chinese-word-segmentation topic page so that developers can more easily learn about it.
To associate your repository with the chinese-word-segmentation topic, visit your repo's landing page and select "manage topics."