Skip to content

Commit

Permalink
2024-09-08-distilbert_base_uncased_finetuned_imdb_chrischang80_en (#1…
Browse files Browse the repository at this point in the history
…4396)

* Add model 2024-09-09-roberta_finetuned_subjqa_movies_2_malizade_pipeline_en

* Add model 2024-09-09-deberta_v3_large__sst2__train_8_9_en

* Add model 2024-09-08-helsinki_english_spanish_fine_tune_opus100_pipeline_en

* Add model 2024-09-09-exalt_baseline_en

* Add model 2024-09-09-roberta_finetuned_squadcovid_en

* Add model 2024-09-09-kaz_roberta_base_ft_qa_english_maltese_tonga_tonga_islands_kaz_kk

* Add model 2024-09-09-roberta_large_few_shot_k_1024_finetuned_squad_seed_2_pipeline_en

* Add model 2024-09-09-roberta_base_squad2_squad_k5_e3_full_finetune_pipeline_en

* Add model 2024-09-09-babyberta_french_wikipedia_french_masking_finetuned_qasrl_lielbin_en

* Add model 2024-09-09-babyberta_french_wikipedia_french_masking_finetuned_qasrl_lielbin_pipeline_en

* Add model 2024-09-09-roberta_finetuned_subjqa_chennaiqa_expanded_pipeline_en

* Add model 2024-09-09-roberta_base_mod_quoref_pipeline_en

* Add model 2024-09-09-robbert_dutch_base_squad_dutch_pipeline_en

* Add model 2024-09-09-roberta_base_finetuned_squadv2_vubacktracking_pipeline_en

* Add model 2024-09-09-nepal_bhasa_model_en

* Add model 2024-09-09-ellis_qa_en

* Add model 2024-09-04-distilbert_base_uncased_finetuned_emotion_wzy1924561588_en

* Add model 2024-09-09-roberta_vmw_mrqa_s_en

* Add model 2024-09-07-opus_maltese_italian_english_finetuned_5000_italian_tonga_tonga_islands_english_en

* Add model 2024-09-09-dzoqamodel_en

* Add model 2024-09-06-mnli_microsoft_deberta_v3_base_seed_2_pipeline_en

* Add model 2024-09-04-akai_ner_pipeline_en

* Add model 2024-09-09-distilbert_base_uncased_finetuned_squad_sachinsingh31_en

* Add model 2024-09-09-burmese_awesome_qa_model_selbl_en

* Add model 2024-09-09-burmese_awesome_qa_model_prinslayy_16_en

* Add model 2024-09-09-question_answering_model_vishnun0027_pipeline_en

* Add model 2024-09-09-distilbert_base_uncased_finetuned_clickbait_detection_en

* Add model 2024-09-09-spanish_catalan_pipeline_en

* Add model 2024-09-09-burmese_awesome_qa_model_0uma_en

* Add model 2024-09-09-distilbert_base_uncased_finetuned_squad_azyren_pipeline_en

* Add model 2024-09-09-distilbert_base_japanese_cased_jaquad_pipeline_en

* Add model 2024-09-06-bert_checkpoint_980000_en

* Add model 2024-09-09-distilbert_base_uncased_finetuned_imdb_accelerate_strawhatdrag0n_pipeline_en

* Add model 2024-09-09-finetuning_sentiment_model_3000_samples_navazpv_pipeline_en

* Add model 2024-09-08-recipes_trainer_n_sentences_per_recipe_3_sep_true_en

* Add model 2024-09-07-sent_patana_chilean_spanish_bert_pipeline_es

* Add model 2024-09-08-dummy_model_subhasree_pipeline_en

* Add model 2024-09-06-dummy_model_sapphirejade_en

* Add model 2024-09-07-xlm_roberta_base_finetuned_panx_english_bessho_en

* Add model 2024-09-09-msmarco_distilbert_word2vec256k_mlm_400k_pipeline_en

* Add model 2024-09-09-distilbert_base_uncased_finetuned_imdb_sechmo_en

* Add model 2024-09-09-distillbert_base_spanish_uncased_finetuned_imdb_en

* Add model 2024-09-09-distilbert_base_cased_fine_tuned_blbooksgenre_en

* Add model 2024-09-09-distilbert_base_uncased_finetuned_imdb_accelerate_rezakakooee_pipeline_en

* Add model 2024-09-09-distilbert_base_uncased_finetuned_imdb_accelerate_rezakakooee_en

* Add model 2024-09-09-distilbert_base_uncased_finetuned_imdb_guydebruyn_en

* Add model 2024-09-09-discord_distilbert_en

* Add model 2024-09-09-distilbert_base_uncased_finetuned_imdb_accelerate_achakr37_en

* Add model 2024-09-06-whisper_small_spanish_nemo_unified_2024_07_02_15_19_06_pipeline_en

* Add model 2024-09-09-distilbert_qa_flat_N_max_en

* Add model 2024-09-09-robertalex_bsc_lt_pipeline_es

* Add model 2024-09-04-roberta_base_squad2_finetuned_roberta_pipeline_en

* Add model 2024-09-09-results_alexhv_en

* Add model 2024-09-08-brand_tone_of_voice_en

* Add model 2024-09-09-q2d_re_5_en

* Add model 2024-09-07-xlm_roberta_base_finetuned_panx_all_cataluna84_pipeline_en

* Add model 2024-09-09-qa_model_bytesizedllm_en

* Add model 2024-09-09-mdebertav3_subjectivity_dutch_nl

* Add model 2024-09-08-best_model_yelp_polarity_32_21_en

* Add model 2024-09-09-mdebertav3_subjectivity_dutch_pipeline_nl

* Add model 2024-09-09-deberta_v3_large_survey_fluency_rater_all_gpt4_en

* Add model 2024-09-09-deberta_v3_large_survey_fluency_rater_all_gpt4_pipeline_en

* Add model 2024-09-09-mdeberta_v3_base_rte_10_pipeline_en

* Add model 2024-09-09-qqp_microsoft_deberta_v3_large_seed_1_en

* Add model 2024-09-04-sent_jmedroberta_base_sentencepiece_pipeline_ja

* Add model 2024-09-09-qqp_microsoft_deberta_v3_large_seed_1_pipeline_en

* Add model 2024-09-09-007_microsoft_deberta_v3_base_finetuned_yahoo_80_20k_pipeline_en

* Add model 2024-09-09-yelp_polarity_microsoft_deberta_v3_large_seed_1_pipeline_en

* Add model 2024-09-09-deberta_v3_xsmall_emotion_en

* Add model 2024-09-09-argureviews_aspect_deberta_v1_en

* Add model 2024-09-09-deberta_v3_xsmall_emotion_pipeline_en

* Add model 2024-09-07-burmese_awesome_qa_model_krayray_pipeline_en

* Add model 2024-09-09-roberta_crypto_profiling_task1_deberta_en

* Add model 2024-09-09-deberta_docnli_sentencelevel_nofeatures_en

* Add model 2024-09-09-deberta_v3_base_hate_speech_offensive_en

* Add model 2024-09-07-helsinki_opus_german_english_fine_tuned_wmt16_finetuned_src_tonga_tonga_islands_trg_pipeline_en

* Add model 2024-09-09-fine_tuned_deberta_category_by_notes_synthetic_en

* Add model 2024-09-09-deberta_v3_small_finetuned_rte_pipeline_en

* Add model 2024-09-09-fine_tuned_deberta_category_by_notes_synthetic_pipeline_en

* Add model 2024-09-07-distilbert_qasports_basketball_pipeline_en

* Add model 2024-09-08-sent_bioformer_8l_en

* Add model 2024-09-09-qqp_microsoft_deberta_v3_large_seed_2_pipeline_en

* Add model 2024-09-09-deprem_mdeberta_binary_tr

* Add model 2024-09-09-job_listing_relevance_model_en

* Add model 2024-09-09-roberta_qa_base_squad2_tamil_qna_3e_en

* Add model 2024-09-09-roberta_finetuned_squad_shortcut_token_before_answer_start_en

* Add model 2024-09-08-opus_maltese_english_russian_finetuned_english_tonga_tonga_islands_russian_slimamel_pipeline_en

* Add model 2024-09-09-mpnet_pd_books_en

* Add model 2024-09-08-burmese_awesome_setfit_model_leerobert_pipeline_en

* Add model 2024-09-09-roberta_qa_base_squad2_tamil_qna_3e_pipeline_en

* Add model 2024-09-08-chungli_ao_bert_model_pipeline_en

* Add model 2024-09-08-deberta_amazon_reviews_v1_bweb771_pipeline_en

* Add model 2024-09-09-mpnetforclassification_pipeline_en

* Add model 2024-09-07-burmese_qa_model_rosa_alvarez_pipeline_en

* Add model 2024-09-09-translate_model_error_v0_4_mitrashatru_en

* Add model 2024-09-09-mpnet_base_apple_iphone_northern_sami_reviews_en

* Add model 2024-09-06-finetuned_opusmt_english_hindi_gujarati_en

* Add model 2024-09-08-dummy_model_edward47_pipeline_en

* Add model 2024-09-08-dummy_model_edward47_en

* Add model 2024-09-09-deberta_v3_base_finetuned_uf_ner_6x_0type_v1_pipeline_en

* Add model 2024-09-07-sent_bert_1890_1900_pipeline_en

* Add model 2024-09-09-tmp_trainer_juncodh_en

* Add model 2024-09-03-sent_xlm_roberta_base_xlmberttest_en

* Add model 2024-09-09-bert_embeddings_gbert_large_de

* Add model 2024-09-09-distilbert_base_uncased_finetuned_squad_sachinsingh31_pipeline_en

* Add model 2024-09-09-petbert_en

* Add model 2024-09-07-qa_model_study_1_pipeline_en

* Add model 2024-09-09-helsinki_nlp_finetuned_russian_tonga_tonga_islands_english_pipeline_en

* Add model 2024-09-09-bert_embeddings_gbert_large_pipeline_de

* Add model 2024-09-09-distiltesttodelete_en

* Add model 2024-09-09-electra_embeddings_delectra_generator_ko

* Add model 2024-09-07-random_initialization_kde4_english_tonga_tonga_islands_french_en

* Add model 2024-09-09-nlp4web_group80_pipeline_en

* Add model 2024-09-07-sent_bangla_bert_base_pipeline_bn

* Add model 2024-09-05-distilbert_base_uncased_emotion_ft_0703_en

* Add model 2024-09-09-burmese_awesome_qa_model_bloomlonely_en

* Add model 2024-09-07-dummy_model_rudytzhan_pipeline_en

* Add model 2024-09-07-r_t_sms_lm_en

* Add model 2024-09-08-deberta_v3_small_chaiblend_pipeline_en

* Add model 2024-09-08-multilingual_distilbert_intent_classification_pipeline_xx

* Add model 2024-09-09-danish_mrm8488_distilroberta_finetuned_financial_news_sentiment_analysis_nlp_feup_en

* Add model 2024-09-09-distilbert_base_cased_fine_tuned_blbooksgenre_pipeline_en

* Add model 2024-09-09-burmese_awesome_qa_model_tealeafs_pipeline_en

* Add model 2024-09-08-opus_maltese_romance_english_finetuned_npomo_english_5_epochs_en

* Add model 2024-09-09-roberta_pubmed_en

* Add model 2024-09-09-dummy_model_ericchu000_pipeline_en

* Add model 2024-09-09-burmese_awesome_qa_model_tealeafs_en

* Add model 2024-09-09-ruroberta_large_pipeline_ru

* Add model 2024-09-09-araroberta_sanskrit_saskta_pipeline_ar

* Add model 2024-09-07-bert_b09_en

* Add model 2024-09-07-distilbert_kazakh_ner_2_en

* Add model 2024-09-07-ner_legal_german_de

* Add model 2024-09-09-mdeberta_v3_base_rte_10_en

* Add model 2024-09-07-dummy_model_arunm7_pipeline_en

* Add model 2024-09-06-qa_redaction_nov1_18_pipeline_en

* Add model 2024-09-08-xlm_roberta_base_finetuned_panx_german_nitin1690_pipeline_en

* Add model 2024-09-09-facets_gpt_1234_en

* Add model 2024-09-09-sroberta_f_pipeline_hr

* Add model 2024-09-09-xlm_roberta_base_final_vietnam_aug_insert_bert_1_pipeline_en

* Add model 2024-09-09-yelp_polarity_microsoft_deberta_v3_large_seed_1_en

* Add model 2024-09-04-deberta_v2_large_conll2003_inca_v1_latin_fe_v1_pipeline_en

* Add model 2024-09-08-stego_classifier_checkpoint_epoch_10_2024_07_26_14_26_52_pipeline_en

* Add model 2024-09-07-burmese_awesome_qa_model_ayushij074_en

* Add model 2024-09-09-deberta_v3_base_fever_en

* Add model 2024-09-09-rulebert_v0_1_k4_pipeline_it

* Add model 2024-09-04-distilbert_base_uncased_finetuned_ner_layath_en

* Add model 2024-09-08-mlm_distilbert_base_uncased_finetuned_game_titles_accelerate_en

* Add model 2024-09-09-finetuned_random6_3epochs_pipeline_en

* Add model 2024-09-06-deberta_v3_base_rocstories_test_en

* Add model 2024-09-09-maltese_align_finetuned_sum3_thai_tonga_tonga_islands_english_pipeline_en

* Add model 2024-09-08-lenu_polish_en

* Add model 2024-09-07-burmese_ner_model_veronica1608_en

* Add model 2024-09-08-trainedsentiment_en

* Add model 2024-09-08-bert_finetuned_sentiment_classification_yelp_pipeline_en

* Add model 2024-09-09-babyberta_aochildes_2_5m_aochildes_french_without_masking_seed3_finetuned_squad_pipeline_en

* Add model 2024-09-08-dummy_model_8mly_pipeline_en

* Add model 2024-09-09-kaz_roberta_base_ft_qa_english_maltese_tonga_tonga_islands_kaz_pipeline_kk

* Add model 2024-09-06-useless_model_try_1_pipeline_en

* Add model 2024-09-04-ner_model_techme_pipeline_en

* Add model 2024-09-04-roberta_human_label_en

* Add model 2024-09-09-medicalquestionanswering_en

* Add model 2024-09-09-babyberta_wikipedia_french_run3_with_masking_finetuned_french_squad_pipeline_en

* Add model 2024-09-07-burmese_awesome_qa_model_jafs1986_pipeline_en

* Add model 2024-09-07-burmese_awesome_qa_model_khadidja22_en

* Add model 2024-09-07-distilbert_base_uncased_finetuned_imdb_cervino44_en

* Add model 2024-09-07-translation_vietnamese_english_official_en

* Add model 2024-09-07-embedding_finetuned_model_v2_en

* Add model 2024-09-09-roberta_base_dofla_pipeline_en

* Add model 2024-09-09-xlm_roberta_base_final_mixed_aug_replace_bert_2_en

* Add model 2024-09-08-dummy_model_ccyr119_pipeline_en

* Add model 2024-09-04-delivery_balanced_distilbert_base_uncased_v2_pipeline_en

* Add model 2024-09-08-all_mpnet_base_128_20_mnr_en

* Add model 2024-09-09-mpnet_base_apple_iphone_northern_sami_reviews_pipeline_en

* Add model 2024-09-06-albert_bbc_news_pipeline_en

* Add model 2024-09-08-roberta_base_emolit_en

* Add model 2024-09-09-electra_qa_base_discriminator_finetuned_squad_en

* Add model 2024-09-08-distilbert_finetuned_squadv2_dungquarkquark_pipeline_en

* Add model 2024-09-07-opus_maltese_romance_english_finetuned_npomo_english_15_epochs_en

* Add model 2024-09-09-deneme_model_eng_en

* Add model 2024-09-09-iwslt17_marian_small_ctx4_cwd2_english_french_pipeline_en

* Add model 2024-09-08-sent_afro_xlmr_base_finetuned_kintweetsb_pipeline_en

* Add model 2024-09-09-deberta_attr_score_140_pipeline_en

* Add model 2024-09-09-distilbert_beekeeping_qanda_model_en

* Add model 2024-09-09-squad_qa_model_horyekhunley_en

* Add model 2024-09-09-burmese_awesome_qa_model_selbl_pipeline_en

* Add model 2024-09-09-opus_english_polish_finetuning_opus_pipeline_en

* Add model 2024-09-09-opus_english_polish_finetuning_opus_en

* Add model 2024-09-09-mnli_microsoft_deberta_v3_large_seed_3_pipeline_en

* Add model 2024-09-09-biobert_huner_gene_v1_pipeline_en

* Add model 2024-09-09-bert_base_multilingual_cased_ner_xx

* Add model 2024-09-09-qa_callback_pipeline_en

* Add model 2024-09-09-burmese_awesome_qa_model_0uma_pipeline_en

* Add model 2024-09-09-distilbert_static_malware_detection_pipeline_en

* Add model 2024-09-09-nuzhny_english_tonga_tonga_islands_ru2_en

* Add model 2024-09-07-opus_maltese_german_english_finetuned_german_tonga_tonga_islands_english_second_felipetanios_pipeline_en

* Add model 2024-09-09-biomedbert_finetuned_pico_adishingote_pipeline_en

* Add model 2024-09-09-arabicwojood_flatner_ar

* Add model 2024-09-08-dbert_model_02_pipeline_en

* Add model 2024-09-01-roberta_classifier_emotion_english_distil_base_en

* Add model 2024-09-06-burmese_fine_tuning_opus_maltese_english_vietnamese_model_pipeline_en

* Add model 2024-09-07-distilbert_ner_andrembcosta_en

* Add model 2024-09-09-hesroberta_pipeline_en

* Add model 2024-09-09-ruroberta_large_ru

* Add model 2024-09-08-brand_tone_of_voice_pipeline_en

* Add model 2024-09-06-burmese_awesome_qa_model_lizhealey_pipeline_en

* Add model 2024-09-08-xlm_roberta_base_finetuned_panx_german_mlagrand_en

* Add model 2024-09-06-bert_fda_nutrition_ner_pipeline_en

* Add model 2024-09-06-dummy_model_jp1773hsu_en

* Add model 2024-09-07-sent_indo_aryan_xlm_r_base_gu

* Add model 2024-09-08-esg_bert_cnn_pipeline_en

* Add model 2024-09-08-xlm_roberta_base_finetuned_marc_english_giannipinelli_pipeline_en

* Add model 2024-09-07-xlm_roberta_base_finetuned_panx_german_cicimen_en

* Add model 2024-09-09-mdeberta_v3_base_claim_detection_pipeline_en

* Add model 2024-09-09-marian_finetuned_kde4_english_tonga_tonga_islands_french_hypurci_en

* Add model 2024-09-09-mdeberta_v3_base_faquad_nli_pipeline_pt

* Add model 2024-09-07-southern_sotho_all_mpnet_finetuned_french_1000_en

* Add model 2024-09-09-furina_seed42_eng_amh_hau_basic_0_0001_pipeline_en

* Add model 2024-09-09-all_mpnet_base_v2_margin_1_epoch_1_pipeline_en

* Add model 2024-09-07-pubchem10m_smiles_bpe_390k_pipeline_en

* Add model 2024-09-04-albert_japanese_v2_pipeline_en

* Add model 2024-09-07-burmese_distilbert_model_prasadavidi_en

* Add model 2024-09-08-distilbert_cord_ner_en

* Add model 2024-09-09-albert_irony_en

* Add model 2024-09-07-ner_model_ep2_en

* Add model 2024-09-07-roberta_spanish_clinical_trials_misc_ents_ner_pipeline_en

* Add model 2024-09-06-deberta_v3_large_survey_nepal_bhasa_fact_related_passage_rater_gpt4_pipeline_en

* Add model 2024-09-09-roberta_finetuned_subjqa_movies_2_malizade_en

* Add model 2024-09-08-sent_xlm_r_with_transliteration_max_en

* Add model 2024-09-09-deberta_v3_small_finetuned_rte_en

* Add model 2024-09-06-working_pipeline_en

* Add model 2024-09-08-sent_hing_mbert_mixed_pipeline_hi

* Add model 2024-09-06-topic_topic_random0_seed2_bernice_en

* Add model 2024-09-09-xlm_roberta_base_balance_vietnam_aug_replace_bert_pipeline_en

* Add model 2024-09-06-distilbert_base_uncased_question_answering_en

* Add model 2024-09-08-distilbert_base_uncased_finetuned_emotion_malay_5_0_pipeline_en

* Add model 2024-09-09-xlm_roberta_base_final_mixed_aug_replace_bert_pipeline_en

* Add model 2024-09-07-sent_rxbert_v1_pipeline_en

* Add model 2024-09-07-sent_zabantu_sot_ven_170m_ve

* Add model 2024-09-05-predicting_misdirection_pipeline_en

* Add model 2024-09-09-gefs_language_detector_en

* Add model 2024-09-09-burmese_awesome_wnut_model_catbult_en

* Add model 2024-09-09-roberta_squad_finetuned_en

* Add model 2024-09-09-babyberta_wikipedia1_1_25m_wikipedia_french1_25m_without_masking_seed3_finetuned_squad_pipeline_en

* Add model 2024-09-07-bsc_bio_ehr_spanish_cantemist_ner_en

* Add model 2024-09-09-marian_finetuned_kde4_english_tonga_tonga_islands_french_lyk0013_pipeline_en

* Add model 2024-09-09-roberta_finetuned_subjqa_movies_2_vishwasbhushanb_en

* Add model 2024-09-07-burmese_awesome_wnut_all_place_pipeline_en

* Add model 2024-09-08-bert_base_yelp_reviews_en

* Add model 2024-09-05-reward_model_deberta_v3_unit_test_en

* Add model 2024-09-09-marian_finetuned_kde4_english_tonga_tonga_islands_french_lyk0013_en

---------

Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>
  • Loading branch information
jsl-models and ahmedlone127 authored Sep 9, 2024
1 parent 3892863 commit 2df9cfd
Show file tree
Hide file tree
Showing 1,783 changed files with 144,465 additions and 0 deletions.
Original file line number Diff line number Diff line change
@@ -0,0 +1,69 @@
---
layout: model
title: English finetuned_bge_embeddings_v3_base_v1_5_pipeline pipeline BGEEmbeddings from austinpatrickm
author: John Snow Labs
name: finetuned_bge_embeddings_v3_base_v1_5_pipeline
date: 2024-09-01
tags: [en, open_source, pipeline, onnx]
task: Embeddings
language: en
edition: Spark NLP 5.5.0
spark_version: 3.0
supported: true
annotator: PipelineModel
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained BGEEmbeddings, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`finetuned_bge_embeddings_v3_base_v1_5_pipeline` is a English model originally trained by austinpatrickm.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/finetuned_bge_embeddings_v3_base_v1_5_pipeline_en_5.5.0_3.0_1725229508197.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/finetuned_bge_embeddings_v3_base_v1_5_pipeline_en_5.5.0_3.0_1725229508197.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
```python

pipeline = PretrainedPipeline("finetuned_bge_embeddings_v3_base_v1_5_pipeline", lang = "en")
annotations = pipeline.transform(df)

```
```scala

val pipeline = new PretrainedPipeline("finetuned_bge_embeddings_v3_base_v1_5_pipeline", lang = "en")
val annotations = pipeline.transform(df)

```
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|finetuned_bge_embeddings_v3_base_v1_5_pipeline|
|Type:|pipeline|
|Compatibility:|Spark NLP 5.5.0+|
|License:|Open Source|
|Edition:|Official|
|Language:|en|
|Size:|387.1 MB|

## References

https://huggingface.co/austinpatrickm/finetuned_bge_embeddings_v3_base_v1.5

## Included Models

- DocumentAssembler
- BGEEmbeddings
Original file line number Diff line number Diff line change
@@ -0,0 +1,94 @@
---
layout: model
title: English roberta_classifier_emotion_english_distil_base RoBertaForSequenceClassification from j-hartmann
author: John Snow Labs
name: roberta_classifier_emotion_english_distil_base
date: 2024-09-01
tags: [en, open_source, onnx, sequence_classification, roberta]
task: Text Classification
language: en
edition: Spark NLP 5.4.2
spark_version: 3.0
supported: true
engine: onnx
annotator: RoBertaForSequenceClassification
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained RoBertaForSequenceClassification model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`roberta_classifier_emotion_english_distil_base` is a English model originally trained by j-hartmann.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/roberta_classifier_emotion_english_distil_base_en_5.4.2_3.0_1725167903299.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/roberta_classifier_emotion_english_distil_base_en_5.4.2_3.0_1725167903299.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
```python

documentAssembler = DocumentAssembler() \
.setInputCol('text') \
.setOutputCol('document')

tokenizer = Tokenizer() \
.setInputCols(['document']) \
.setOutputCol('token')

sequenceClassifier = RoBertaForSequenceClassification.pretrained("roberta_classifier_emotion_english_distil_base","en") \
.setInputCols(["documents","token"]) \
.setOutputCol("class")

pipeline = Pipeline().setStages([documentAssembler, tokenizer, sequenceClassifier])
data = spark.createDataFrame([["I love spark-nlp"]]).toDF("text")
pipelineModel = pipeline.fit(data)
pipelineDF = pipelineModel.transform(data)

```
```scala

val documentAssembler = new DocumentAssembler()
.setInputCols("text")
.setOutputCols("document")

val tokenizer = new Tokenizer()
.setInputCols(Array("document"))
.setOutputCol("token")

val sequenceClassifier = RoBertaForSequenceClassification.pretrained("roberta_classifier_emotion_english_distil_base", "en")
.setInputCols(Array("documents","token"))
.setOutputCol("class")

val pipeline = new Pipeline().setStages(Array(documentAssembler, tokenizer, sequenceClassifier))
val data = Seq("I love spark-nlp").toDS.toDF("text")
val pipelineModel = pipeline.fit(data)
val pipelineDF = pipelineModel.transform(data)

```
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|roberta_classifier_emotion_english_distil_base|
|Compatibility:|Spark NLP 5.4.2+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[document, token]|
|Output Labels:|[class]|
|Language:|en|
|Size:|308.8 MB|

## References

https://huggingface.co/j-hartmann/emotion-english-distilroberta-base
94 changes: 94 additions & 0 deletions docs/_posts/ahmedlone127/2024-09-02-sent_muril_base_cased_en.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,94 @@
---
layout: model
title: English sent_muril_base_cased BertSentenceEmbeddings from google
author: John Snow Labs
name: sent_muril_base_cased
date: 2024-09-02
tags: [en, open_source, onnx, sentence_embeddings, bert]
task: Embeddings
language: en
edition: Spark NLP 5.5.0
spark_version: 3.0
supported: true
engine: onnx
annotator: BertSentenceEmbeddings
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained BertSentenceEmbeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`sent_muril_base_cased` is a English model originally trained by google.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/sent_muril_base_cased_en_5.5.0_3.0_1725274394892.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/sent_muril_base_cased_en_5.5.0_3.0_1725274394892.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
```python

documentAssembler = DocumentAssembler() \
.setInputCol("text") \
.setOutputCol("document")

sentenceDL = SentenceDetectorDLModel.pretrained("sentence_detector_dl", "xx") \
.setInputCols(["document"]) \
.setOutputCol("sentence")

embeddings = BertSentenceEmbeddings.pretrained("sent_muril_base_cased","en") \
.setInputCols(["sentence"]) \
.setOutputCol("embeddings")

pipeline = Pipeline().setStages([documentAssembler, sentenceDL, embeddings])
data = spark.createDataFrame([["I love spark-nlp"]]).toDF("text")
pipelineModel = pipeline.fit(data)
pipelineDF = pipelineModel.transform(data)

```
```scala

val documentAssembler = new DocumentAssembler()
.setInputCol("text")
.setOutputCol("document")

val sentenceDL = SentenceDetectorDLModel.pretrained("sentence_detector_dl", "xx")
.setInputCols(Array("document"))
.setOutputCol("sentence")

val embeddings = BertSentenceEmbeddings.pretrained("sent_muril_base_cased","en")
.setInputCols(Array("sentence"))
.setOutputCol("embeddings")

val pipeline = new Pipeline().setStages(Array(documentAssembler, sentenceDL, embeddings))
val data = Seq("I love spark-nlp").toDF("text")
val pipelineModel = pipeline.fit(data)
val pipelineDF = pipelineModel.transform(data)

```
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|sent_muril_base_cased|
|Compatibility:|Spark NLP 5.5.0+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[sentence]|
|Output Labels:|[embeddings]|
|Language:|en|
|Size:|890.4 MB|

## References

https://huggingface.co/google/muril-base-cased
Original file line number Diff line number Diff line change
@@ -0,0 +1,69 @@
---
layout: model
title: English bislama_all_bs320_vanilla_finetuned_webnlg2020_relevance_pipeline pipeline MPNetEmbeddings from teven
author: John Snow Labs
name: bislama_all_bs320_vanilla_finetuned_webnlg2020_relevance_pipeline
date: 2024-09-03
tags: [en, open_source, pipeline, onnx]
task: Embeddings
language: en
edition: Spark NLP 5.5.0
spark_version: 3.0
supported: true
annotator: PipelineModel
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained MPNetEmbeddings, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`bislama_all_bs320_vanilla_finetuned_webnlg2020_relevance_pipeline` is a English model originally trained by teven.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/bislama_all_bs320_vanilla_finetuned_webnlg2020_relevance_pipeline_en_5.5.0_3.0_1725350188221.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/bislama_all_bs320_vanilla_finetuned_webnlg2020_relevance_pipeline_en_5.5.0_3.0_1725350188221.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
```python

pipeline = PretrainedPipeline("bislama_all_bs320_vanilla_finetuned_webnlg2020_relevance_pipeline", lang = "en")
annotations = pipeline.transform(df)

```
```scala

val pipeline = new PretrainedPipeline("bislama_all_bs320_vanilla_finetuned_webnlg2020_relevance_pipeline", lang = "en")
val annotations = pipeline.transform(df)

```
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|bislama_all_bs320_vanilla_finetuned_webnlg2020_relevance_pipeline|
|Type:|pipeline|
|Compatibility:|Spark NLP 5.5.0+|
|License:|Open Source|
|Edition:|Official|
|Language:|en|
|Size:|407.3 MB|

## References

https://huggingface.co/teven/bi_all_bs320_vanilla_finetuned_WebNLG2020_relevance

## Included Models

- DocumentAssembler
- MPNetEmbeddings
Loading

0 comments on commit 2df9cfd

Please sign in to comment.