Skip to content
/ wag Public

WaG - install your own word profile generator out of diverse data resources

License

Notifications You must be signed in to change notification settings

czcorpus/wag

Folders and files

NameName
Last commit message
Last commit date
Feb 28, 2025
Mar 11, 2025
Jun 9, 2020
Feb 19, 2025
Feb 9, 2021
Jan 4, 2022
Feb 26, 2025
Mar 25, 2025
Feb 26, 2025
May 26, 2021
Jul 30, 2021
Jul 30, 2021
Apr 27, 2022
Apr 8, 2021
Oct 19, 2018
Feb 4, 2025
Feb 26, 2025
Oct 20, 2021
Jul 30, 2021
Feb 26, 2025
Apr 8, 2021
Mar 13, 2025
Mar 13, 2025
Mar 3, 2025
Mar 3, 2025
Mar 3, 2025
Mar 3, 2025
Mar 3, 2025
Mar 3, 2025

Repository files navigation

WaG - Word at a Glance

WaG screenshot

  1. Use existing corpus/data search and retrieval software as backend,
  2. Obtain and compile information about:
    1. single word,
    2. two or more words compared with each other,
    3. word translation.
  3. explore text metadata statistics, time-based trends, word cloud-based data and many more,
  4. combine statistics from different corpora,
  5. Use results of a resource as an input for other resource.

Currently supported resources

WaG KonText MQuery NoSkE Treq Clarin FCS Datamuse ElasticSearch LCC
collocations 🚧
concFilter
concordance
freqBar
freqComparison
freqPie
geoAreas
multiWordGeoAreas
html
matchingDocuments
mergeCorpFreq
speeches
syntacticColls
timeDistrib
multiWordtimeDistrib
translations
treqSubsets
wordForms 🚧
wordFreq 🚧
wordSim 🚧

Requirements

WaG is able to run either as a self-hosted application or within a compatible web page. For the self-hosted variant the following is needed:

  • Node.JS + NPM package manager
  • HTTP proxy server (Nginx, HAProxy, Apache)
  • a core word frequency database:

Please refer for more information to the INSTALL.md.

How to cite WaG

Tomáš Machálek (2020): Word at a Glance: Modular Word Profile Aggregator. In: Proceedings of LREC 2020, s. 7011–7016.

@InProceedings{machalek2020lrec,
 author = {Tomáš Machálek},
 title = "{Word at a Glance: Modular Word Profile Aggregator.}",
 booktitle = {Proceedings of the Twelfth International Conference on Language Resources and Evaluation (LREC 2020)},
 year = {2020},
 publisher = {European Language Resources Association (ELRA)},
 language = {english}
}