A generative speech model for daily dialogue.
-
Updated
Jan 7, 2025 - Python
A generative speech model for daily dialogue.
📙 中华新华字典数据库。包括歇后语,成语,词语,汉字。
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
A linting tool for Chinese language.
Chinese safety prompts for evaluating and improving the safety of LLMs. 中文安全prompts,用于评估和提升大模型的安全性。
Rime Cantonese input schema | 粵語拼音輸入方案
A framework for cleaning Chinese dialog data
Learn, read, write and practice Mandarin by drawing strokes in Anki Desktop, AnkiDroid and AnkiMobile with audio of HSK 2.0 (HSK1-6) and HSK 3.0 (HSK 1-9) characters.
收集非普通話漢語和古漢語的中州韻輸入法拼音方案 Collection of phonetic spelling schemas for Sinitic languages and dialects
Discovering magic squares in Tang Dynasty poems
中文词典 / 中文詞典。Chinese / Chinese-English dictionaries.
Python scraper for Language Pods such as Japanesepod101.com 👹 🗾 🍣 Compatible with Japanese, Chinese, French, German, Italian, Korean, Portuguese, Russian, Spanish and many more! ✨
CJK computer science terms comparison / 中日韓電腦科學術語對照 / 日中韓のコンピュータ科学の用語対照 / 한·중·일 전산학 용어 대조
Python toolkit for Chinese Language Understanding(CLUE) Evaluation benchmark
solidity-by-example 教程中文翻译|@Web3-Club
Free Human Language Learning Resources
Từ điển tiếng Việt dành cho máy đọc sách Kindle, Kobo, Pocketbook v.v.
简繁转换 簡繁轉換 Python implementation of StarCC, the next generation of Simplified-Traditional Chinese conversion framework
文本去重
Add a description, image, and links to the chinese-language topic page so that developers can more easily learn about it.
To associate your repository with the chinese-language topic, visit your repo's landing page and select "manage topics."