Skip to content

Commit

Permalink
Feat: Updated README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
SebastianCB-dev committed Oct 18, 2022
1 parent 0e4d876 commit c72e6c8
Showing 1 changed file with 34 additions and 1 deletion.
35 changes: 34 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,7 +7,40 @@ Lo que se hace:
* punctuaction sign removal
* Hashtags and mentions removal

#### Data
# Módulos
* emoji
* nltk
* sklearn
* spacy
* stanza
* wheel
* hunspell
* numpy
* gensim
* pyemd
```
pip install emoji nltk sklearn spacy stanza wheel hunspell numpy gensim pyemd
```
o
```
pip3 install emoji nltk sklearn spacy stanza wheel hunspell numpy gensim pyemd
```

# Errores:
1. Error al instalar hunspell:
- Se solventa instalando libhunspell-dev (Ubuntu)

```
sudo apt-get install libhunspell-dev
```
2. Error con modelo 'es_core_news_md' de Spacy:
- Se solventa descargandolo:

```
python -m spacy download es_core_news_md
pytho3 -m spacy download es_core_news_md
```
## Data
Para entrenar mas corpus al modelo
```
model.build_vocab(data)
Expand Down

0 comments on commit c72e6c8

Please sign in to comment.