Skip to content

Spark NLP 5.0.1: Patch release

Compare
Choose a tag to compare
@maziyarpanahi maziyarpanahi released this 18 Jul 20:18
· 361 commits to master since this release
2b2f93c

πŸ“’ Overview

Spark NLP 5.0.1 πŸš€ is a patch release with bug fixes and other improvements. We want to thank our community for their valuable feedback, feature requests, and contributions. Our Models Hub now contains over 18,000+ free and truly open-source models & pipelines. πŸŽ‰


πŸ› Bug Fixes & Enhancements

  • Fix multiLabel param issue in XXXForSequenceClassitication and XXXForZeroShotClassification annotators
  • Add the missing threshold param to all XXXForSequenceClassitication in Python
  • Fix issue with passing spark.driver.cores config as a param into start() function in Python and Scala
  • Fix 600+ models' cards on Models Hub with duplicated code snippets
  • Add new notebooks to export BERT, DistilBERT, RoBERTa, and DeBERTa models to ONNX format

πŸ““ New Notebooks

Spark NLP Notebooks Colab
BertEmbeddings HuggingFace in Spark NLP - BERT BERT
DistilBertEmbeddings HuggingFace in Spark NLP - DistilBERT DistilBERT
RoBertaEmbeddings HuggingFace in Spark NLP - RoBERTa RoBERTa
DeBertaEmbeddings HuggingFace in Spark NLP - DeBERTa DeBERTa

πŸ“– Documentation


❀️ Community support

  • Slack For live discussion with the Spark NLP community and the team
  • GitHub Bug reports, feature requests, and contributions
  • Discussions Engage with other community members, share ideas,
    and show off how you use Spark NLP!
  • Medium Spark NLP articles
  • JohnSnowLabs official Medium
  • YouTube Spark NLP video tutorials

Installation

Python

#PyPI

pip install spark-nlp==5.0.1

Spark Packages

spark-nlp on Apache Spark 3.0.x, 3.1.x, 3.2.x, 3.3.x, and 3.4.x (Scala 2.12):

spark-shell --packages com.johnsnowlabs.nlp:spark-nlp_2.12:5.0.1

pyspark --packages com.johnsnowlabs.nlp:spark-nlp_2.12:5.0.1

GPU

spark-shell --packages com.johnsnowlabs.nlp:spark-nlp-gpu_2.12:5.0.1

pyspark --packages com.johnsnowlabs.nlp:spark-nlp-gpu_2.12:5.0.1

Apple Silicon (M1 & M2)

spark-shell --packages com.johnsnowlabs.nlp:spark-nlp-silicon_2.12:5.0.1

pyspark --packages com.johnsnowlabs.nlp:spark-nlp-silicon_2.12:5.0.1

AArch64

spark-shell --packages com.johnsnowlabs.nlp:spark-nlp-aarch64_2.12:5.0.1

pyspark --packages com.johnsnowlabs.nlp:spark-nlp-aarch64_2.12:5.0.1

Maven

spark-nlp on Apache Spark 3.0.x, 3.1.x, 3.2.x, 3.3.x, and 3.4.x:

<dependency>
    <groupId>com.johnsnowlabs.nlp</groupId>
    <artifactId>spark-nlp_2.12</artifactId>
    <version>5.0.1</version>
</dependency>

spark-nlp-gpu:

<dependency>
    <groupId>com.johnsnowlabs.nlp</groupId>
    <artifactId>spark-nlp-gpu_2.12</artifactId>
    <version>5.0.1</version>
</dependency>

spark-nlp-silicon:

<dependency>
    <groupId>com.johnsnowlabs.nlp</groupId>
    <artifactId>spark-nlp-silicon_2.12</artifactId>
    <version>5.0.1</version>
</dependency>

spark-nlp-aarch64:

<dependency>
    <groupId>com.johnsnowlabs.nlp</groupId>
    <artifactId>spark-nlp-aarch64_2.12</artifactId>
    <version>5.0.1</version>
</dependency>

FAT JARs

What's Changed

Full Changelog: 5.0.0...5.0.1