Resolve annotators and pipelines consuming lots of driver RAM #69

saif-ellafi · 2017-12-18T08:07:15Z

Description

Annotators, specifically the Vivekn Sentiment Analysis, is consuming lots of driver RAM due to standard scala collections containing model information. This becomes a storage both inside pipelines and when reading the models back. We need to let this information flow from disk instead

Expected Behavior

Current Behavior

Possible Solution

Steps to Reproduce

Context

Your Environment

Version used:
Browser Name and version:
Operating System and version (desktop or mobile):
Link to your project:

aleksei-ai · 2017-12-26T11:51:12Z

@saifjsl Please try my fix with Kryo Serialization. Let me know is it works for you https://github.com/JohnSnowLabs/spark-nlp/tree/vivekn_sentiment_model_kryo_serialization

saif-ellafi · 2018-01-08T21:17:56Z

#78

saif-ellafi added the enhancement label Dec 19, 2017

aleksei-ai self-assigned this Dec 25, 2017

aleksei-ai closed this as completed Dec 26, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Resolve annotators and pipelines consuming lots of driver RAM #69

Resolve annotators and pipelines consuming lots of driver RAM #69

saif-ellafi commented Dec 18, 2017

aleksei-ai commented Dec 26, 2017

saif-ellafi commented Jan 8, 2018

Resolve annotators and pipelines consuming lots of driver RAM #69

Resolve annotators and pipelines consuming lots of driver RAM #69

Comments

saif-ellafi commented Dec 18, 2017

Description

Expected Behavior

Current Behavior

Possible Solution

Steps to Reproduce

Context

Your Environment

aleksei-ai commented Dec 26, 2017

saif-ellafi commented Jan 8, 2018