language-resources

Star

Here are 14 public repositories matching this topic...

telegram-zhCN / telegram-language-resources

Star

Source strings and zh-CN translate resources of Telegram

translation telegram language-resources

Updated Dec 13, 2020
Python

motazsaad / tweets-collector

Star

Collect tweets (tweets corpus) using Twitter API. Collection can be based on hashtags, keywords, geographical location

nlp json tweets twitter-api corpus language-resources twitter-corpus collect-tweets tweet-collector

Updated Nov 4, 2019
Python

singletongue / japanese-bert

Star

BERT models with tokenization for Japanese texts.

nlp japanese-language mecab bert language-resources

Updated Nov 15, 2019
Python

CoEDL / hermes

Star

💬 Cross-platform application for the creation of language resources from ELAN linguistic analysis files, or from scratch.

python research university pyqt5 python3 linguistics hermes computational-linguistics elan eaf language-resources linguistics-field

Updated May 1, 2019
Python

ufal / universal-segmentations

Star

Build scripts for the UniSegments collection of morphologically segmented lexicons for many languages

morphology language-resources morpheme-segmentation

Updated Aug 6, 2023
Python

Dugong-Chinese / chinese-resource-app

Star

This is a web application that will serve to be the community-driven go-to site for finding Chinese resources and learning Mandarin.

language-learning chinese language-resources mandarin mandarin-chinese

Updated Oct 4, 2020
Python

lukyjanek / universal-derivations

Star

The scripts for compiling the Universal Derivations collections of harmonised word-formation resources for multiple langugaes.

morphology nlp-resources language-resources word-formation universal-derivations uder-collection

Updated Nov 16, 2021
Python

CoolCat467 / Localization-Translation-Utility

Star

Script for simplifying the process of translating MineOS Language (.lang) files

language json translation opencomputers python3 translate language-resources mineos coolcat467

Updated Jan 7, 2025
Python

beviah / ezglot

Star

Selected data processing scripts including language agnostic multilingual wiktionary parser

multilingual dictionary extractor templates pronunciation levenshtein-distance wikitext ipa similarity-measures language-resources wiktionary thesaurus-data wiktionary-parser wiktionary-data wiktionary-tool wiktionary-dataset word-distance

Updated Mar 31, 2024
Python

udaycruise2903 / kaDienshonhia-digitalisation

Star

This repo contaings PDF, text and manually edited files of ka Dienshonhia dictionary digitalisation work

pdf dictionary scripts utf-8 text-processing language-resources digitalisation khasi text-to-excel

Updated Aug 2, 2022
Python

Stavre / Dict.cc-parsing

Star

Simple parser for Dict.cc dictionary

dictionaries dictionary language-resources

Updated Aug 31, 2022
Python

Español: cree un conjunto de datos de tarjetas flash a partir de un archivo .txt. Palabra, significado, etimología, ejemplos, clase. English: create Flash Cards dataset from a .txt file. Word, meaning, etymology, examples, class.

python json-data flashcards data-structures dataset webscraping language-resources csv-data

Updated Sep 15, 2022
Python

mosesab / Language-Text-Extraction-

Star

Gets text and extracts sentences in a language from text using that language's lexicon.

nlp natural-language-processing corpus python3 python-programming english languages text-processing language-resources language-processing python-standard-library corpus-processing corpus-search

Updated Sep 26, 2021
Python

tlu-dt-nlp / EstGEC-L2-Corpus

Star

Estonian Grammatical Error Correction (GEC) test and development corpus that contains L2 learner texts error-annotated in the M2 format.

annotation corpus error-corpora estonian-language language-resources benchmark-datasets gold-standard grammatical-error-correction

Updated Dec 5, 2024
Python

Improve this page

Add a description, image, and links to the language-resources topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the language-resources topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

language-resources

Here are 14 public repositories matching this topic...

telegram-zhCN / telegram-language-resources

motazsaad / tweets-collector

singletongue / japanese-bert

CoEDL / hermes

ufal / universal-segmentations

Dugong-Chinese / chinese-resource-app

lukyjanek / universal-derivations

CoolCat467 / Localization-Translation-Utility

beviah / ezglot

udaycruise2903 / kaDienshonhia-digitalisation

Stavre / Dict.cc-parsing

Nicolamunozi / FC_SV_txt

mosesab / Language-Text-Extraction-

tlu-dt-nlp / EstGEC-L2-Corpus

Improve this page

Add this topic to your repo