Release/430 release candidate #13346

Merged: 84 commits, Feb 9, 2023


maziyarpanahi and others added 30 commits December 25, 2022 10:11
- delimiter id is actually correct and doesn't need any offset
- anything that can be used by other DL engines
- there are some features that are exclusively designed by using TensorFlow so they can stay in tensorflow package
- sometimes the import optimization doesn't follow scalafmt rules
…notator' into SPARKNLP-695-refactor-ml-module
- actually sentencepiece does use TensorFlow to load the SP model so it must stay in tensorflow package
- io is also mostly used in loading TensorFlow models
- Spark 3.3.1 is now a default package for our APIs
- GCP storage is updated to 2.16.0 from 2.15.0
- It needs to assert with `"The deserializer is not supported: need a(n) \"ARRAY\" field but got \"STRING\"."`
Removed the duplicated definition of method `setWeightedDistPath` from `ContextSpellCheckerApproach`.
C-K-Loan and others added 7 commits January 28, 2023 16:34
Co-authored-by: Maziyar Panahi <maziyar.panahi@iscpif.fr>
* SPARKNLP-728 Verifies whether the model already exists in S3 before unzipping when cache_pretrained is defined as an S3 bucket

* SPARKNLP-728 Verifies whether the model already exists in GCP before unzipping when cache_pretrained is defined as a GCP bucket
* [skip ci] SPARKNLP-709: Add Documentation
- SwinForImageClassification, HubertForCTC
- Added additional example for ViTForImageClassification

* Fix some PyDocs and add ignore pattern to sphinx

* Resolved some warnings for Sphinx

* Improve WordSegmenter Docs

* Resolve sphinx Warnings

* Remove typo

* SPARKNLP-709: Add documentation for ZeroShotNer
* SPARKNLP-734 Enable params argument in spark_nlp.start()

* SPARKNLP-734 Revert cache_folder, log_folder and cluster_tmp_dir for backward compatibility
maziyarpanahi and others added 20 commits February 1, 2023 14:43
* Implement DateMatcher annotator

* Add Date2Chunk annotator and unit test to Python

- Move Token2Chunk to the base module
- Add Date2Chunk to the Python APIs
- Add Date2Chunk unit tests
* SPARKNLP-733: Fix loadSavedModel for private buckets
- also resolves warnings

* SPARKNLP-733: Check region for S3 loadSavedModel
* [skip ci] SPARK-NLP-721: New example notebooks

* SPARK-NLP-721: Changed Notebook Links
* Doc Id column implementation

* Including tests

* Access tuple and not cleared variable for the doc and the sentence
* Update code style [skip test]

* Refactor m1 to silicon

- README/doc
- dependencies (spark-nlp-silicon)
- func (apple_silicon=True/true)
- no matter what, IntelliJ cannot be made to skip formatting *.ipynb notebooks! Neither excluding them from the formatter nor ignoring the file type helps
- the www.johnsnowlabs.com/slack-redirect DNS redirect is forcing people to have an @johnsnowlabs.com email address in order to register
- using the raw invitation URL fixes this issue
maziyarpanahi merged commit 281c0af into master Feb 9, 2023
KshitizGIT deleted the release/430-release-candidate branch March 2, 2023 11:09
Labels
bug-fix, dependencies, documentation, DON'T MERGE, enhancement, models_hub, new model, new-feature
7 participants