veld_data__eltec_conllu_stats

Statistics on CoNLL-U data produced by UDPipe inference on the ELTeC corpora.

This repo and its data are the output of the following VELD chain repo: https://github.com/veldhub/veld_chain__eltec_udpipe_inference

statistics

count_token

The number of tokens in each file (token definition: https://universaldependencies.org/format.html)
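As an illustration only, token counting over a CoNLL-U file could be sketched as below. The function name `count_tokens` and the sample sentence are hypothetical, and the choice to skip multiword-token ranges (IDs like `1-2`) and empty nodes (IDs like `1.1`) is an assumption; the chain repo may count differently.

```python
def count_tokens(conllu_text: str) -> int:
    """Count word lines in a CoNLL-U string (hypothetical helper)."""
    count = 0
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line.strip() or line.startswith("#"):
            continue
        token_id = line.split("\t")[0]
        # Assumption: only plain integer IDs count as tokens, so
        # multiword ranges ("1-2") and empty nodes ("1.1") are skipped.
        if token_id.isdigit():
            count += 1
    return count


# Made-up sample in the 10-column CoNLL-U layout.
sample = (
    "# text = Dogs bark.\n"
    "1\tDogs\tdog\tNOUN\t_\tNumber=Plur\t2\tnsubj\t_\t_\n"
    "2\tbark\tbark\tVERB\t_\tMood=Ind|Tense=Pres\t0\troot\t_\tSpaceAfter=No\n"
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_\n"
)
print(count_tokens(sample))
```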

count_lemma_total

The number of unique lemmas (lemma definition: https://universaldependencies.org/format.html)
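A minimal sketch of unique-lemma counting, assuming lemmas are read from the third CoNLL-U column (LEMMA) and compared as exact strings. The function name `count_unique_lemmas` and the sample data are hypothetical.

```python
def count_unique_lemmas(conllu_text: str) -> int:
    """Count distinct LEMMA values in a CoNLL-U string (hypothetical helper)."""
    lemmas = set()
    for line in conllu_text.splitlines():
        columns = line.split("\t")
        # Word lines have 10 columns and a plain integer ID;
        # comments and blank lines fail this check and are skipped.
        if len(columns) == 10 and columns[0].isdigit():
            lemmas.add(columns[2])  # column 3 is LEMMA
    return len(lemmas)


# Made-up sample: "the" and "dog" each occur twice as lemmas.
lemma_sample = (
    "# text = The dogs saw the dog.\n"
    "1\tThe\tthe\tDET\t_\t_\t2\tdet\t_\t_\n"
    "2\tdogs\tdog\tNOUN\t_\tNumber=Plur\t3\tnsubj\t_\t_\n"
    "3\tsaw\tsee\tVERB\t_\tTense=Past\t0\troot\t_\t_\n"
    "4\tthe\tthe\tDET\t_\t_\t5\tdet\t_\t_\n"
    "5\tdog\tdog\tNOUN\t_\tNumber=Sing\t3\tobj\t_\tSpaceAfter=No\n"
    "6\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_\n"
)
print(count_unique_lemmas(lemma_sample))
```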

count_lemma_normalized_by_token

count_lemma_total divided by count_token, so that the lemma count is expressed relative to the overall token count.
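The division can be sketched as follows. The function name `normalize_lemma_count`, the handling of an empty file, and the numbers in the usage line are illustrative assumptions, not values from the data.

```python
def normalize_lemma_count(count_lemma_total: int, count_token: int) -> float:
    """Return unique lemmas per token (hypothetical helper)."""
    # Assumption: a file without tokens gets 0.0 rather than raising an error.
    if count_token == 0:
        return 0.0
    return count_lemma_total / count_token


# Made-up numbers: 1500 unique lemmas over 6000 tokens.
print(normalize_lemma_count(1500, 6000))
```

A value close to 1 would mean that almost every token introduces a new lemma, while a value close to 0 indicates heavy lemma reuse.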

count_pos

The number of occurrences of each part-of-speech tag (POS definition: https://universaldependencies.org/u/pos/index.html)
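One way to tally part-of-speech tags, assuming they are taken from the fourth CoNLL-U column (UPOS), which is what the linked definition describes. The function name `count_pos_tags` and the sample are hypothetical.

```python
from collections import Counter


def count_pos_tags(conllu_text: str) -> Counter:
    """Tally UPOS tags in a CoNLL-U string (hypothetical helper)."""
    tags = Counter()
    for line in conllu_text.splitlines():
        columns = line.split("\t")
        # Keep only word lines: 10 columns and a plain integer ID.
        if len(columns) == 10 and columns[0].isdigit():
            tags[columns[3]] += 1  # column 4 is UPOS
    return tags


# Made-up sample in the 10-column CoNLL-U layout.
pos_sample = (
    "# text = The dogs bark.\n"
    "1\tThe\tthe\tDET\t_\t_\t2\tdet\t_\t_\n"
    "2\tdogs\tdog\tNOUN\t_\tNumber=Plur\t3\tnsubj\t_\t_\n"
    "3\tbark\tbark\tVERB\t_\tTense=Pres\t0\troot\t_\tSpaceAfter=No\n"
    "4\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_\n"
)
for tag, n in sorted(count_pos_tags(pos_sample).items()):
    print(tag, n)
```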

count_feat

The number of occurrences of each feature tag (feature definition: https://universaldependencies.org/u/feat/index.html)
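A sketch of feature counting, assuming features are read from the sixth CoNLL-U column (FEATS), where pairs are separated by `|` and `_` marks an empty value. Counting full `Name=Value` pairs rather than feature names alone is an assumption, and `count_features` is a hypothetical name.

```python
from collections import Counter


def count_features(conllu_text: str) -> Counter:
    """Tally Name=Value feature pairs in a CoNLL-U string (hypothetical helper)."""
    features = Counter()
    for line in conllu_text.splitlines():
        columns = line.split("\t")
        # Keep only word lines: 10 columns and a plain integer ID.
        if len(columns) != 10 or not columns[0].isdigit():
            continue
        feats = columns[5]  # column 6 is FEATS
        if feats == "_":
            continue  # "_" means the word has no features
        # Assumption: each full "Name=Value" pair is counted separately.
        features.update(feats.split("|"))
    return features


# Made-up sample in the 10-column CoNLL-U layout.
feat_sample = (
    "# text = Dogs bark.\n"
    "1\tDogs\tdog\tNOUN\t_\tNumber=Plur\t2\tnsubj\t_\t_\n"
    "2\tbark\tbark\tVERB\t_\tMood=Ind|Number=Plur|Tense=Pres\t0\troot\t_\tSpaceAfter=No\n"
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_\n"
)
for feature, n in sorted(count_features(feat_sample).items()):
    print(feature, n)
```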
