From 0c3f43c0d3e210f3940f7266fe84426900a6294e Mon Sep 17 00:00:00 2001
From: riya
Date: Wed, 22 Nov 2017 13:11:18 +0530
Subject: [PATCH] added new notebook

---
 docs/notebooks.html | 32 +++++++++++++++++++++++++++++---
 1 file changed, 29 insertions(+), 3 deletions(-)

diff --git a/docs/notebooks.html b/docs/notebooks.html
index 00d897bf3f053a..0685d4a39ebc22 100644
--- a/docs/notebooks.html
+++ b/docs/notebooks.html
@@ -79,8 +79,16 @@

-
-

Notebook

+
+

Notebook

+
+

Sentiment Analysis using PySpark JSL NLP

+

+ The following code builds a complete machine learning pipeline that extracts sentiment from a target distributed DataFrame (or Dataset).
+ It uses the annotators that sentiment analysis requires, plus an optional one: the spell checker, whose output is also of type Token, so it can be injected between the tokens and the sentiment annotator that consumes them.
+ ViveknSentimentAnalysis is the sentiment annotator, and it is trained on two corpora: one containing positive sentiment examples and the other containing negative examples.
+ A few additional parameters fine-tune the pipeline, and the Finisher transformer ensures that the final result is exactly the sentiment result for every row.
+
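The stage ordering described above can be illustrated with a toy, pure-Python sketch. This is not the Spark NLP API and not the notebook's code: every function name here (`tokenize`, `spell_check`, `train_sentiment`, `score_sentiment`, `run_pipeline`) is hypothetical, and the scorer is a plain Naive Bayes with add-one smoothing standing in for the real sentiment annotator. It only shows how a token-to-token spell checker can sit between tokenization and a sentiment model trained on a positive and a negative corpus, with a final step that keeps just the label per row.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def spell_check(tokens, corrections):
    """Optional stage: tokens in, tokens out, so it can be slotted in freely."""
    return [corrections.get(tok, tok) for tok in tokens]


def train_sentiment(positive_corpus, negative_corpus):
    """Count token frequencies in the two training corpora (toy trainer)."""
    pos = Counter(tok for line in positive_corpus for tok in tokenize(line))
    neg = Counter(tok for line in negative_corpus for tok in tokenize(line))
    return {"pos": pos, "neg": neg, "vocab": set(pos) | set(neg)}


def score_sentiment(tokens, model):
    """Compare Naive Bayes log-likelihoods with add-one smoothing."""
    vocab_size = len(model["vocab"])
    log_prob = {}
    for label in ("pos", "neg"):
        total = sum(model[label].values())
        log_prob[label] = sum(
            math.log((model[label][tok] + 1) / (total + vocab_size))
            for tok in tokens
        )
    # Ties fall back to "positive" in this sketch.
    return "positive" if log_prob["pos"] >= log_prob["neg"] else "negative"


def run_pipeline(rows, model, corrections=None):
    """Tokenize, optionally spell-check, score, and keep only the label."""
    results = []
    for text in rows:
        tokens = tokenize(text)
        if corrections:
            tokens = spell_check(tokens, corrections)
        # "Finisher" step: emit just the sentiment label for each row.
        results.append(score_sentiment(tokens, model))
    return results


model = train_sentiment(
    ["great movie loved it", "wonderful and good"],
    ["terrible movie hated it", "awful and bad"],
)
labels = run_pipeline(["I loved this wonderful film", "awful, I hated it"], model)
corrected = run_pipeline(["I hatd it"], model, corrections={"hatd": "hated"})
```

Because the spell checker returns the same kind of value it receives, `run_pipeline` can skip it without changing anything downstream, which is the property the description relies on.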

#Imports
 import sys
@@ -149,16 +157,34 @@ 

Notebook

sentiment_data.show()
+
+
+
+

Spark Summit Europe 2017

+