Commit

2024-09-15-roberta_base_epoch_30_pipeline_en (#14402)
* Add model 2024-09-11-film_sec_en

* Add model 2024-09-13-bertoso_en

* Add model 2024-09-12-resume_classifier_jerry124_pipeline_en

* Add model 2024-09-12-seq2tag_grammar_en

* Add model 2024-09-11-pruebamodelotfm_roberta_in_pipeline_en

* Add model 2024-09-16-studientage_translate_pipeline_en

* Add model 2024-09-18-xlm_roberta_base_finetuned_panx_french_abdelkareem_en

* Add model 2024-09-16-distilbert_base_uncased_finetuned_squad_riaraju_pipeline_en

* Add model 2024-09-18-xlm_roberta_base_finetuned_panx_english_kata958_pipeline_en

* Add model 2024-09-13-xlm_roberta_base_finetuned_panx_english_zardian_pipeline_en

* Add model 2024-09-18-xlm_roberta_base_finetuned_panx_french_reinoudbosch_en

* Add model 2024-09-18-burmese_awesome_model_sklug_en

* Add model 2024-09-18-burmese_awesome_model_mostafa2032020_pipeline_en

* Add model 2024-09-18-financial_sentiment_model_1000_samples_pipeline_en

* Add model 2024-09-18-classification_4_kfold_v1_en

* Add model 2024-09-18-distilbert_base_uncased_finetuned_emotion_xysj2012_en

* Add model 2024-09-18-xlm_roberta_base_finetuned_panx_english_praboda_pipeline_en

* Add model 2024-09-18-xlm_roberta_base_finetuned_panx_italian_sungwoo1_en

* Add model 2024-09-18-xlm_roberta_base_finetuned_panx_italian_sungwoo1_pipeline_en

* Add model 2024-09-18-cat_ner_iw_4_pipeline_en

* Add model 2024-09-11-practica2_arturogl_pipeline_en

* Add model 2024-09-16-whisper_tiny_finetuned_minds14_lightmourne_en

* Add model 2024-09-17-chinese_roberta_wwm_ext_3_0_8_pipeline_en

* Add model 2024-09-18-xlm_roberta_base_finetuned_panx_german_rlpeter70_pipeline_en

* Add model 2024-09-14-fine_tuned_albert_emotion_pipeline_en

* Add model 2024-09-18-xlm_roberta_base_finetuned_panx_english_jamie613_pipeline_en

* Add model 2024-09-18-xlm_roberta_base_finetuned_panx_german_cykim_pipeline_en

* Add model 2024-09-15-babyberta_wikipedia1_2_5m_aochildes_2_5m_without_masking_seed3_finetuned_squad_pipeline_en

* Add model 2024-09-18-xlm_roberta_base_finetuned_panx_italian_kiechu_en

* Add model 2024-09-17-chinese_distilbert_finetuned_squadv2_en

* Add model 2024-09-13-bert_suicidal_content_detection_pipeline_en

* Add model 2024-09-18-xlm_roberta_base_finetuned_panx_italian_kiechu_pipeline_en

* Add model 2024-09-18-all_roberta_large_v1_travel_16_16_5_oos_pipeline_en

* Add model 2024-09-14-test_whisper_samagradatagov_en

* Add model 2024-09-11-0_00001_0_9_a98zhang_pipeline_en

* Add model 2024-09-06-distilbert_base_uncased_finetuned_imdb_hlm1234_en

* Add model 2024-09-12-xlm_roberta_base_finetuned_panx_french_agvelu_en

* Add model 2024-09-14-clasificador_trek_albert_en

* Add model 2024-09-15-db_mc_2b_84_5_en

* Add model 2024-09-16-burmese_awesome_qa_model_2_koustavhazra_pipeline_en

* Add model 2024-09-18-halacha_siman_seif_classifier_nepal_bhasa_pipeline_en

* Add model 2024-09-18-xlm_roberta_base_trimmed_french_30000_xnli_french_pipeline_en

* Add model 2024-09-13-brwac_v1_4__checkpoint12_en

* Add model 2024-09-08-distilbert_base_uncased_finetuned_imdb_sabbasi_11_en

* Add model 2024-09-11-practicanlp_890mari_pipeline_en

* Add model 2024-09-06-qa_synthetic_data_only_18_aug_xlm_roberta_base_en

* Add model 2024-09-18-xlm_roberta_base_finetuned_marc_english_tkesonia_en

* Add model 2024-09-17-queryner_augmented_data_bert_base_uncased_en

* Add model 2024-09-16-finetuned_adversarial_paraphrase_model_test_pipeline_en

* Add model 2024-09-17-auro_2_en

* Add model 2024-09-18-cross_encoder_v0_en

* Add model 2024-09-17-whisper_tiny_ga2en_v1_4_pipeline_ga

* Add model 2024-09-18-predict_perception_xlmr_blame_object_pipeline_en

* Add model 2024-09-12-whisper_small_bemba_zambia_csikasote_en

* Add model 2024-09-06-burmese_awesome_wnut_ganeither_en

* Add model 2024-09-10-bert_classifier_prot_bfd_membrane_en

* Add model 2024-09-15-fine_tuned_resume_model_pipeline_en

* Add model 2024-09-18-language_detection_fine_tuned_on_xlm_roberta_base_junaidali_en

* Add model 2024-09-18-roberta_large_finetuned_ner_single_label_en

* Add model 2024-09-16-bae_roberta_base_mrpc_5_en

* Add model 2024-09-15-medical_tiny_english_1_0v_check_train_en

* Add model 2024-09-18-bsc_bio_ehr_spanish_ctebmsp_es

* Add model 2024-09-14-xlm_roberta_base_finetuned_panx_english_yezune_pipeline_en

* Add model 2024-09-11-burmese_awesome_qa_model_hhjingbo_en

* Add model 2024-09-14-bsc_bio_ehr_spanish_symptemist_fasttext_85_ner_pipeline_en

* Add model 2024-09-12-emoji_emoji_random0_seed2_roberta_base_pipeline_en

* Add model 2024-09-18-distilbert_lr_linear_pipeline_en

* Add model 2024-09-18-models_vantaa32_pipeline_en

* Add model 2024-09-11-fine_tuning_roberta_pipeline_en

* Add model 2024-09-11-hate_hate_random0_seed2_twitter_roberta_base_dec2020_pipeline_en

* Add model 2024-09-18-sent_bert_khmer_small_uncased_tokenized_en

* Add model 2024-09-13-testvulmodel_pipeline_en

* Add model 2024-09-16-finetuned_sentiment_analysis_modell_en

* Add model 2024-09-09-custommodel_pipeline_en

* Add model 2024-09-17-qa_model_spanish_pipeline_en

* Add model 2024-09-15-xlm_roberta_base_finetuned_panx_english_100yen_en

* Add model 2024-09-11-xlm_roberta_base_finetuned_panx_german_french_jbreunig_en

* Add model 2024-09-08-deep_1_en

* Add model 2024-09-10-distilbert_base_uncased_finetuned_emotion_mohamedahmedae_pipeline_en

* Add model 2024-09-16-medical_english_french_9_5_5_epcohs_en

* Add model 2024-09-09-bertovosentneg2_pipeline_en

* Add model 2024-09-09-burmese_awesome_qa_model_punit_tiwari_en

* Add model 2024-09-15-whisper_small_hindi_graminvoice_hi

* Add model 2024-09-18-distilbert_sanskrit_saskta_glue_experiment_data_aug_wnli_256_pipeline_en

* Add model 2024-09-13-mdeberta_nuclear_tweets_pipeline_en

* Add model 2024-09-15-bert_base_uncased_ep_1_29_b_32_lr_8e_07_dp_0_5_swati_0_southern_sotho_true_fh_false_hs_0_pipeline_en

* Add model 2024-09-17-dl_xlm_roberta_base10_pipeline_en

* Add model 2024-09-17-distilled_roberta_en

* Add model 2024-09-17-cl_style_1_1_epoch_recipe_pretrained_roberta_base_squadv2_en

* Add model 2024-09-10-distilbert_base_uncased_finetuned_emotion_cocabuton_en

* Add model 2024-09-18-xlm_roberta_base_finetuned_panx_german_lulu0630_en

* Add model 2024-09-08-news_en

* Add model 2024-09-14-xlm_roberta_base_finetuned_panx_turkish_english_en

* Add model 2024-09-15-sent_bertugues_base_portuguese_cased_pipeline_pt

* Add model 2024-09-15-xlm_roberta_base_tweet_sentiment_german_trimmed_german_60000_en

* Add model 2024-09-18-distilbert_base_uncased_finetuned_emotion_by_sohsou_pipeline_en

* Add model 2024-09-18-burmese_awesome_model_mostafa2032020_en

* Add model 2024-09-18-burmese_awesome_qa_model_zzzalo_pipeline_en

* Add model 2024-09-14-whisper_small_uzbek_mirodil_pipeline_uz

* Add model 2024-09-16-bert_test_antti97b_pipeline_en

* Add model 2024-09-18-xlm_roberta_base_finetuned_panx_french_songys_en

* Add model 2024-09-10-xlm_roberta_base_finetuned_panx_german_xrverse_en

* Add model 2024-09-18-distilbert_base_uncased_finetuned_squad_kfleki555_en

* Add model 2024-09-18-distilbert_qa_pytorch_seed_en

* Add model 2024-09-11-gopal_finetuned_russian_tonga_tonga_islands_english_pipeline_en

* Add model 2024-09-15-finetuning_sentiment_model_3000_samples_sethanderson2_en

* Add model 2024-09-17-distilbert_base_uncased_finetuned_squad_ggaleano_en

* Add model 2024-09-16-indonesia_emotion_roberta_en

* Add model 2024-09-18-bertin_roberta_fine_tuned_text_classification_slovene_data_augmentation_test_3_en

* Add model 2024-09-18-roberta_similarity_mudasiryasin_en

* Add model 2024-09-18-discriminator_paraphrase_pipeline_en

* Add model 2024-09-18-roberta_base_finetuned_dow_pipeline_en

* Add model 2024-09-14-bert_finetuned_ner_alban12_en

* Add model 2024-09-14-whisper_base_tonga_tonga_islands_pf10h_pipeline_en

* Add model 2024-09-18-nerd_nerd_random3_seed2_twitter_roberta_base_2021_124m_en

* Add model 2024-09-18-roberta_base_fact_updates_rishavranaut_pipeline_en

* Add model 2024-09-18-roberta_large_hoax_classifier_def_v1_pipeline_en

* Add model 2024-09-18-cold_fusion_finetuned_convincingness_ibm_en

* Add model 2024-09-18-nace2_level1_26_en

* Add model 2024-09-18-topic_topic_random1_seed2_twitter_roberta_base_2022_154m_en

* Add model 2024-09-18-minilmv2_l6_h384_r_ocr_quality_en

* Add model 2024-09-18-norms_establish_check_reproducibility_16_en

* Add model 2024-09-18-roberta_hate_speech_dynabench_r4_target_advasary_debiased_without_dialect_pipeline_en

* Add model 2024-09-18-mtwitter_roberta_base_model_reviewingcls_r3_en

* Add model 2024-09-18-roberta_base_hoax_classifier_v3_defs_en

* Add model 2024-09-18-mnlp_adversarial_pipeline_en

* Add model 2024-09-18-roberta_hate_speech_dynabench_r4_target_advasary_debiased_without_dialect_en

* Add model 2024-09-18-trial_model_aaronw4477_pipeline_en

* Add model 2024-09-17-xlm_roberta_base_finetuned_panx_english_jaemin12_en

* Add model 2024-09-15-distilbert_base_uncased_finetuned_emotion_edmonds0_en

* Add model 2024-09-03-tosroberta_base_pipeline_en

* Add model 2024-09-18-sentiment_sentiment_small_random1_seed0_bertweet_large_en

* Add model 2024-09-11-indonesian_distilbert_base_cased_finetuned_indonlu_en

* Add model 2024-09-06-address_emnet_en

* Add model 2024-09-14-sent_bert_one_en

* Add model 2024-09-17-whisper_small_arabic_monaf3_pipeline_ar

* Add model 2024-09-10-nerd_nerd_random3_seed0_twitter_roberta_base_jun2021_en

* Add model 2024-09-13-marian_finetuned_kde4_english_tonga_tonga_islands_french_accelerate_delayedkarma_pipeline_en

* Add model 2024-09-16-distilbert_finetuned_squadv2_nctuananh_en

* Add model 2024-09-12-deberta_v3_base_test_watiforall_pipeline_en

* Add model 2024-09-10-checkpoint_50000_en

* Add model 2024-09-09-all_mpnet_base_v2_kunwooshin_en

* Add model 2024-09-16-200mil_codeberta_small_v1_pipeline_en

* Add model 2024-09-17-burmese_qa_model_yadah_en

* Add model 2024-09-13-distilbert_base_uncased_finetuned_squad_sandy50422gmail_pipeline_en

* Add model 2024-09-18-distilbert_base_uncased_finetuned_imdb_venkat_shadeslayer_en

* Add model 2024-09-18-distilbert_base_uncased_finetuned_imdb_venkat_shadeslayer_pipeline_en

* Add model 2024-09-18-sent_bert_khmer_small_uncased_tokenized_pipeline_en

* Add model 2024-09-09-roberta_ethics_test_shuffled_en

* Add model 2024-09-12-xlm_roberta_base_finetuned_panx_italian_v3rx2000_en

* Add model 2024-09-17-xlm_roberta_base_finetuned_panx_italian_lortigas_pipeline_en

* Add model 2024-09-18-bert_squad_mimirmtd_en

* Add model 2024-09-12-roberta_spendcategory_classifier_pipeline_en

* Add model 2024-09-15-marta_t5_gpt_pipeline_en

* Add model 2024-09-18-distilbert_base_uncased_finetuned_squad_songhyundong_pipeline_en

* Add model 2024-09-18-distilbert_finetuned_squadv2_duchaha_en

* Add model 2024-09-18-distilbert_finetuned_squadv2_duchaha_pipeline_en

* Add model 2024-09-18-sent_luxembert_pipeline_en

* Add model 2024-09-18-sent_conflibert_cont_cased_en

* Add model 2024-09-17-bert_base_squad_ahcene_ikram_pipeline_en

* Add model 2024-09-17-marefa_maltese_english_arabic_parallel_10k_splitted_translated_cosine_pipeline_en

* Add model 2024-09-10-distilbert_base_uncased_finetuned_imdb_kalexa2_en

* Add model 2024-09-15-model4_en

* Add model 2024-09-09-burmese_awesome_qa_model_alcalazans_pipeline_en

* Add model 2024-09-17-whisper_medium_serbian_v3_sr

* Add model 2024-09-11-burmese_first_setfit_hyperparam_4epochs_pipeline_en

* Add model 2024-09-09-nlp4web_group80_en

* Add model 2024-09-16-toxicity_judge_pipeline_en

* Add model 2024-09-12-answer_equivalence_tiny_bert_zli12321_pipeline_en

* Add model 2024-09-12-mariaj_en

* Add model 2024-09-11-fine_tuned_model_mpnet_qa_pipeline_en

* Add model 2024-09-16-reddit_title_sanskrit_saskta_3000_samples_en

* Add model 2024-09-08-roberta_fine_tuned_on_newsmstc_02_split_en

* Add model 2024-09-17-bert_base_german_cased_finetuned_squad_pipeline_en

* Add model 2024-09-18-xlm_roberta_base_finetuned_panx_german_lulu0630_pipeline_en

* Add model 2024-09-18-distilbert_base_uncased_finetuned_emotion_suraj101_pipeline_en

* Add model 2024-09-15-coarse_url_classifier_pipeline_en

* Add model 2024-09-18-xlm_roberta_base_finetuned_panx_english_philosucker_pipeline_en

* Add model 2024-09-16-sent_minilm_l_12_stackoverflow_en

* Add model 2024-09-18-distilbert_base_uncased_odm_zphr_0st1sd_ut72ut1large1pfxnf_simsp400_clean200_pipeline_en

* Add model 2024-09-09-opus_maltese_finetuned_english_spanish_pipeline_en

* Add model 2024-09-18-xlm_roberta_base_finetuned_panx_french_hhffxx_en

* Add model 2024-09-14-masked_language_model_pavi156_pipeline_en

* Add model 2024-09-13-bert_ner_en

* Add model 2024-09-17-n_distilbert_imdb_padding70model_pipeline_en

* Add model 2024-09-17-cross_encoder_stsb_distilroberta_base_v1___2024_09_11_21_23_06_pipeline_en

* Add model 2024-09-18-38650821_a0f5_4236_aa69_8f167a79306a_pipeline_en

* Add model 2024-09-18-distilled_bert_finetuned_squadv2_en

* Add model 2024-09-11-finetuned_adversarial_paraphrase_modell_pipeline_en

* Add model 2024-09-09-xlm_roberta_base_language_detection_xx

* Add model 2024-09-18-esg_classification_bert_all_data_0509_other_v2_pipeline_en

* Add model 2024-09-18-sad_en

* Add model 2024-09-18-fine_tune_embeddnew_sih_2_pipeline_en

* Add model 2024-09-11-distilbert_base_uncased_2_en

* Add model 2024-09-18-fine_tune_roberta_exist_fine_grained_en

* Add model 2024-09-16-xlm_roberta_emotion_detector_pipeline_en

* Add model 2024-09-13-declutr_model_emanuals_en

* Add model 2024-09-18-yelp_polarity_roberta_base_seed_3_pipeline_en

* Add model 2024-09-18-model_6_3_en

* Add model 2024-09-18-model_6_3_pipeline_en

* Add model 2024-09-17-xlm_roberta_base_finetuned_panx_french_amartyobanerjee_en

* Add model 2024-09-18-trial_model_aaronw4477_en

* Add model 2024-09-18-minilmv2_l6_h384_r_ocr_quality_pipeline_en

* Add model 2024-09-17-classifier_chapter4_pcuenq_en

* Add model 2024-09-13-perspective_utilitarian_deberta_01_pipeline_en

* Add model 2024-09-15-finetuning_sentiment_model_300_samples_clawdiawhiskerwitz_en

* Add model 2024-09-18-xlm_roberta_base_8_en

* Add model 2024-09-18-readabert_hindi_hi

* Add model 2024-09-18-bluebert_en

* Add model 2024-09-18-mitre_tactic_bert_case_based_en

* Add model 2024-09-17-roberta_base_ner_demo_oyunbaatar_mn

* Add model 2024-09-18-bsc_bio_ehr_spanish_ctebmsp_pipeline_es

* Add model 2024-09-17-sent_distilbert_base_uncased_finetuned_the_fire_flower_pipeline_en

* Add model 2024-09-17-whisper_small_turkish_sercan_tr

* Add model 2024-09-16-emotion_classification_a2ran_en

* Add model 2024-09-18-test_trainer_xysmalobia_en

* Add model 2024-09-12-xlm_roberta_base_finetuned_panx_german_mohit1707_pipeline_en

* Add model 2024-09-10-distilbert_base_uncased_meta_rd_pipeline_en

* Add model 2024-09-18-bert_turkish_turkish_tweet_pipeline_tr

* Add model 2024-09-13-spanish_english_orig_en

* Add model 2024-09-15-bert_ft_fin_txn_clf_v0_1_en

* Add model 2024-09-09-opus_maltese_english_romanian_finetuned_english_tonga_tonga_islands_romanian_shweta41_pipeline_en

* Add model 2024-09-11-burmese_awesome_qa_model_aidenmoon_pipeline_en

* Add model 2024-09-14-whisper_medium_korean_v0_1_2_ko

* Add model 2024-09-15-text_clf_model_v03_en

* Add model 2024-09-17-whispercheckpoints_pipeline_en

* Add model 2024-09-17-dsn_afrispeech_jajsmith_pipeline_en

* Add model 2024-09-15-sent_romanian_sentence_bert_base_uncased_v1_ro

* Add model 2024-09-17-fine_tuned_albert_large_v2_pipeline_en

* Add model 2024-09-16-platzi_distilroberta_base_mrpc_glue_tommasory_pipeline_en

* Add model 2024-09-11-mdebertav3_subjectivity_italian_pipeline_it

* Add model 2024-09-16-distilbert_finetuned_squadv2_tienhuynh_pipeline_en

* Add model 2024-09-17-bangla_bert_base_finetuned_brqa_confirmation_en

* Add model 2024-09-17-fine_tuned_albert_large_v2_en

* Add model 2024-09-16-gsarti_opus_maltese_tc_english_polish_opus100_accelerate_finetune_en

* Add model 2024-09-14-comave_large_english_en

* Add model 2024-09-14-whisper_small_kazakh_pipeline_hi

* Add model 2024-09-18-sent_condenser_bert_large_uncased_en

* Add model 2024-09-18-distilbert_base_uncased_finetuned_squad_d5716d28_alex_atelo_en

* Add model 2024-09-13-all_roberta_large_v1_travel_3_16_5_pipeline_en

* Add model 2024-09-18-distilroberta_financial_sentiment_model_2500_samples_fine_tune_en

* Add model 2024-09-18-platzi_distilroberta_base_mrpc_glue_carlos_venegas_en

* Add model 2024-09-18-roberta_large_temp_classifier_bootstrapped_en

* Add model 2024-09-18-finetuning_distilroberta_model_3000_samples_pipeline_en

* Add model 2024-09-18-cold_fusion_itr13_seed4_pipeline_en

* Add model 2024-09-18-hate_hate_random3_seed0_twitter_roberta_base_2019_90m_en

* Add model 2024-09-18-twitter_roberta_base_sentiment_finetuned_wuyue1987_pipeline_en

* Add model 2024-09-18-twitter_roberta_base_sentiment_finetuned_wuyue1987_en

* Add model 2024-09-18-platzi_distilroberta_base_mrpc_glue_luis_rascon_en

* Add model 2024-09-12-prototipo_4_emi_pipeline_en

---------

Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>
jsl-models and ahmedlone127 authored Sep 18, 2024
1 parent bf76942 commit ba5ac12
Showing 1,549 changed files with 124,185 additions and 4 deletions.
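
Each `Add model` entry in the commit message appears to follow the post-slug pattern `<date>-<model_name>_<language>`, the same shape as the post file names in this diff. A minimal sketch for splitting such an entry, assuming the last underscore-separated token is always the language code; `ModelEntry` and `parse_entry` are illustrative names and not part of the repository:

```python
import re
from dataclasses import dataclass
from datetime import date

# Assumed pattern: "<YYYY-MM-DD>-<model_name>_<lang>". The language is taken
# to be the final underscore-separated token (for example "en", "xx", "hi").
ENTRY_RE = re.compile(r"^(\d{4})-(\d{2})-(\d{2})-(.+)_([a-z]{2,3})$")
PREFIX = "* Add model "


@dataclass(frozen=True)
class ModelEntry:
    published: date
    name: str
    lang: str
    is_pipeline: bool


def parse_entry(line: str) -> ModelEntry:
    """Parse one '* Add model <slug>' line, or a bare slug."""
    slug = line.strip()
    if slug.startswith(PREFIX):
        slug = slug[len(PREFIX):].strip()
    match = ENTRY_RE.match(slug)
    if match is None:
        raise ValueError(f"unrecognized entry: {line!r}")
    year, month, day, name, lang = match.groups()
    return ModelEntry(
        published=date(int(year), int(month), int(day)),
        name=name,
        lang=lang,
        # Pipeline posts in this commit carry a "_pipeline" name suffix.
        is_pipeline=name.endswith("_pipeline"),
    )


entry = parse_entry("* Add model 2024-09-17-whisper_tiny_ga2en_v1_4_pipeline_ga")
print(entry.name, entry.lang, entry.is_pipeline)
```

Grouping the parsed entries by `lang` or `is_pipeline` gives a quick overview of what a bulk commit like this one adds.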
@@ -0,0 +1,92 @@
---
layout: model
title: English BertForQuestionAnswering Cased model (from wskhanh)
author: John Snow Labs
name: bert_qa_wskhanh_finetuned_squad
date: 2024-09-02
tags: [en, open_source, bert, question_answering, onnx]
task: Question Answering
language: en
edition: Spark NLP 5.5.0
spark_version: 3.0
supported: true
engine: onnx
annotator: BertForQuestionAnswering
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained BertForQuestionAnswering model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. `bert-finetuned-squad` is an English model originally trained by `wskhanh`.

## Predicted Entities



{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/bert_qa_wskhanh_finetuned_squad_en_5.5.0_3.0_1725312737462.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/bert_qa_wskhanh_finetuned_squad_en_5.5.0_3.0_1725312737462.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
```python
import sparknlp
from sparknlp.base import MultiDocumentAssembler
from sparknlp.annotator import BertForQuestionAnswering
from pyspark.ml import Pipeline

spark = sparknlp.start()

# Wrap the raw question and context columns as Spark NLP documents
Document_Assembler = MultiDocumentAssembler()\
    .setInputCols(["question", "context"])\
    .setOutputCols(["document_question", "document_context"])

# Load the pretrained question-answering model
Question_Answering = BertForQuestionAnswering.pretrained("bert_qa_wskhanh_finetuned_squad","en")\
    .setInputCols(["document_question", "document_context"])\
    .setOutputCol("answer")\
    .setCaseSensitive(True)

pipeline = Pipeline(stages=[Document_Assembler, Question_Answering])

data = spark.createDataFrame([["What's my name?","My name is Clara and I live in Berkeley."]]).toDF("question", "context")

result = pipeline.fit(data).transform(data)
```
```scala
val Document_Assembler = new MultiDocumentAssembler()
.setInputCols(Array("question", "context"))
.setOutputCols(Array("document_question", "document_context"))

val Question_Answering = BertForQuestionAnswering.pretrained("bert_qa_wskhanh_finetuned_squad","en")
.setInputCols(Array("document_question", "document_context"))
.setOutputCol("answer")
.setCaseSensitive(true)

val pipeline = new Pipeline().setStages(Array(Document_Assembler, Question_Answering))

val data = Seq(("What's my name?", "My name is Clara and I live in Berkeley.")).toDF("question", "context")

val result = pipeline.fit(data).transform(data)
```
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|bert_qa_wskhanh_finetuned_squad|
|Compatibility:|Spark NLP 5.5.0+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[document_question, document_context]|
|Output Labels:|[answer]|
|Language:|en|
|Size:|403.7 MB|

## References

- https://huggingface.co/wskhanh/bert-finetuned-squad
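
The Download and S3 links in this card point at the same archive. Its file name appears to encode the model name, language, Spark NLP version, Spark version, and a millisecond timestamp. The sketch below splits the name under that assumed layout; the layout is inferred from the links and front matter in this diff rather than from Spark NLP documentation, and `parse_artifact` is a hypothetical helper:

```python
import re
from datetime import datetime, timezone
from urllib.parse import urlparse

# Assumed layout: <name>_<lang>_<sparknlp_version>_<spark_version>_<epoch_ms>.zip
ARTIFACT_RE = re.compile(
    r"^(?P<name>.+)_(?P<lang>[a-z]{2,3})_(?P<nlp>\d+\.\d+\.\d+)"
    r"_(?P<spark>\d+\.\d+)_(?P<ts>\d{13})\.zip$"
)


def parse_artifact(url: str) -> dict:
    """Split a model archive URL (https:// or s3://) into its parts."""
    filename = urlparse(url).path.rsplit("/", 1)[-1]
    match = ARTIFACT_RE.match(filename)
    if match is None:
        raise ValueError(f"unexpected archive name: {filename!r}")
    # The trailing number is interpreted as milliseconds since the Unix epoch.
    uploaded = datetime.fromtimestamp(int(match["ts"]) / 1000, tz=timezone.utc)
    return {
        "name": match["name"],
        "lang": match["lang"],
        "sparknlp_version": match["nlp"],
        "spark_version": match["spark"],
        "uploaded": uploaded,
    }


info = parse_artifact(
    "s3://auxdata.johnsnowlabs.com/public/models/"
    "bert_qa_wskhanh_finetuned_squad_en_5.5.0_3.0_1725312737462.zip"
)
print(info["name"], info["lang"], info["uploaded"].date())
```

Read this way, the timestamp decodes to the same day as the card's `date: 2024-09-02` field, which supports the millisecond interpretation.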
70 changes: 70 additions & 0 deletions docs/_posts/ahmedlone127/2024-09-03-tosroberta_base_pipeline_en.md
@@ -0,0 +1,70 @@
---
layout: model
title: English tosroberta_base_pipeline pipeline RoBertaForSequenceClassification from CodeHima
author: John Snow Labs
name: tosroberta_base_pipeline
date: 2024-09-03
tags: [en, open_source, pipeline, onnx]
task: Text Classification
language: en
edition: Spark NLP 5.5.0
spark_version: 3.0
supported: true
annotator: PipelineModel
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained RoBertaForSequenceClassification, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. `tosroberta_base_pipeline` is an English model originally trained by CodeHima.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/tosroberta_base_pipeline_en_5.5.0_3.0_1725369893934.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/tosroberta_base_pipeline_en_5.5.0_3.0_1725369893934.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
```python

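from sparknlp.pretrained import PretrainedPipeline

# `df` is assumed to be an existing Spark DataFrame with the pipeline's input column
pipeline = PretrainedPipeline("tosroberta_base_pipeline", lang = "en")
annotations = pipeline.transform(df)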

```
```scala

val pipeline = new PretrainedPipeline("tosroberta_base_pipeline", lang = "en")
val annotations = pipeline.transform(df)

```
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|tosroberta_base_pipeline|
|Type:|pipeline|
|Compatibility:|Spark NLP 5.5.0+|
|License:|Open Source|
|Edition:|Official|
|Language:|en|
|Size:|429.8 MB|

## References

https://huggingface.co/CodeHima/TOSRoberta-base

## Included Models

- DocumentAssembler
- TokenizerModel
- RoBertaForSequenceClassification
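
The front matter of this card and its path in the diff (`docs/_posts/ahmedlone127/2024-09-03-tosroberta_base_pipeline_en.md`) seem to be tied together by the `date`, `name`, and `language` fields. A rough sketch of checking that relationship; `read_front_matter` and `expected_post_path` are hypothetical helpers, the path template is inferred from this one example, and the reader below is deliberately not a full YAML parser:

```python
def read_front_matter(text: str) -> dict:
    """Read flat `key: value` pairs between the first two `---` lines.

    Nested keys and YAML typing are ignored; surrounding double quotes
    are stripped from values.
    """
    lines = text.strip().splitlines()
    if not lines or lines[0].strip() != "---":
        raise ValueError("missing opening '---'")
    fields = {}
    for line in lines[1:]:
        if line.strip() == "---":
            return fields
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip().strip('"')
    raise ValueError("missing closing '---'")


def expected_post_path(fields: dict, author_dir: str = "ahmedlone127") -> str:
    """Build the post path assumed to correspond to a card's front matter."""
    slug = f"{fields['date']}-{fields['name']}_{fields['language']}"
    return f"docs/_posts/{author_dir}/{slug}.md"


card = """---
layout: model
title: English tosroberta_base_pipeline pipeline RoBertaForSequenceClassification from CodeHima
name: tosroberta_base_pipeline
date: 2024-09-03
language: en
---
"""

fields = read_front_matter(card)
print(expected_post_path(fields))
```

A check like this could flag cards whose file name and metadata drift apart, which is easy to miss in a commit that touches over a thousand files.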
94 changes: 94 additions & 0 deletions docs/_posts/ahmedlone127/2024-09-04-est_roberta_et.md
@@ -0,0 +1,94 @@
---
layout: model
title: Estonian est_roberta CamemBertEmbeddings from EMBEDDIA
author: John Snow Labs
name: est_roberta
date: 2024-09-04
tags: [et, open_source, onnx, embeddings, camembert]
task: Embeddings
language: et
edition: Spark NLP 5.5.0
spark_version: 3.0
supported: true
engine: onnx
annotator: CamemBertEmbeddings
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained CamemBertEmbeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. `est_roberta` is an Estonian model originally trained by EMBEDDIA.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/est_roberta_et_5.5.0_3.0_1725442328963.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/est_roberta_et_5.5.0_3.0_1725442328963.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
```python

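import sparknlp
from sparknlp.base import DocumentAssembler
from sparknlp.annotator import Tokenizer, CamemBertEmbeddings
from pyspark.ml import Pipeline

spark = sparknlp.start()

# Turn the raw text column into Spark NLP documents
documentAssembler = DocumentAssembler() \
    .setInputCol("text") \
    .setOutputCol("document")

tokenizer = Tokenizer() \
    .setInputCols(["document"]) \
    .setOutputCol("token")

# Load the pretrained Estonian embeddings model
embeddings = CamemBertEmbeddings.pretrained("est_roberta","et") \
    .setInputCols(["document", "token"]) \
    .setOutputCol("embeddings")

pipeline = Pipeline().setStages([documentAssembler, tokenizer, embeddings])
data = spark.createDataFrame([["I love spark-nlp"]]).toDF("text")
pipelineModel = pipeline.fit(data)
pipelineDF = pipelineModel.transform(data)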

```
```scala

val documentAssembler = new DocumentAssembler()
.setInputCol("text")
.setOutputCol("document")

val tokenizer = new Tokenizer()
.setInputCols(Array("document"))
.setOutputCol("token")

val embeddings = CamemBertEmbeddings.pretrained("est_roberta","et")
.setInputCols(Array("document", "token"))
.setOutputCol("embeddings")

val pipeline = new Pipeline().setStages(Array(documentAssembler, tokenizer, embeddings))
val data = Seq("I love spark-nlp").toDF("text")
val pipelineModel = pipeline.fit(data)
val pipelineDF = pipelineModel.transform(data)

```
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|est_roberta|
|Compatibility:|Spark NLP 5.5.0+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[document, token]|
|Output Labels:|[camembert]|
|Language:|et|
|Size:|277.9 MB|

## References

https://huggingface.co/EMBEDDIA/est-roberta
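
The Model Information table in each card uses a two-column `|key:|value|` layout, so it can be read back into a dictionary for comparison against the front matter. A small sketch under that assumption; `parse_model_info` is an illustrative helper, not something shipped with Spark NLP or the docs site:

```python
def parse_model_info(table: str) -> dict:
    """Collect `|Key:|Value|` rows into a dict, skipping everything else."""
    info = {}
    for line in table.splitlines():
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        # Skip attribute lines, blank lines, and separator rows like |---|---|.
        if len(cells) != 2 or set(cells[0]) <= {"-"}:
            continue
        key, value = cells
        info[key.rstrip(":")] = value
    return info


table = """{:.table-model}
|---|---|
|Model Name:|est_roberta|
|Compatibility:|Spark NLP 5.5.0+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[document, token]|
|Output Labels:|[camembert]|
|Language:|et|
|Size:|277.9 MB|
"""

info = parse_model_info(table)
print(info["Model Name"], info["Language"], info["Size"])
```

Note that the table lists `camembert` as the output label while the usage example names its output column `embeddings`; a parsed table makes that kind of mismatch easier to spot across many cards.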
@@ -0,0 +1,86 @@
---
layout: model
title: English lithuanian_un_data_fine_coarse_english MPNetEmbeddings from dell-research-harvard
author: John Snow Labs
name: lithuanian_un_data_fine_coarse_english
date: 2024-09-04
tags: [en, open_source, onnx, embeddings, mpnet]
task: Embeddings
language: en
edition: Spark NLP 5.5.0
spark_version: 3.0
supported: true
engine: onnx
annotator: MPNetEmbeddings
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained MPNetEmbeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. `lithuanian_un_data_fine_coarse_english` is an English model originally trained by dell-research-harvard.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/lithuanian_un_data_fine_coarse_english_en_5.5.0_3.0_1725470007829.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/lithuanian_un_data_fine_coarse_english_en_5.5.0_3.0_1725470007829.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
```python

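import sparknlp
from sparknlp.base import DocumentAssembler
from sparknlp.annotator import MPNetEmbeddings
from pyspark.ml import Pipeline

spark = sparknlp.start()

# Turn the raw text column into Spark NLP documents
documentAssembler = DocumentAssembler() \
    .setInputCol("text") \
    .setOutputCol("document")

# Load the pretrained sentence-level embeddings model
embeddings = MPNetEmbeddings.pretrained("lithuanian_un_data_fine_coarse_english","en") \
    .setInputCols(["document"]) \
    .setOutputCol("embeddings")

pipeline = Pipeline().setStages([documentAssembler, embeddings])
data = spark.createDataFrame([["I love spark-nlp"]]).toDF("text")
pipelineModel = pipeline.fit(data)
pipelineDF = pipelineModel.transform(data)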

```
```scala

val documentAssembler = new DocumentAssembler()
.setInputCol("text")
.setOutputCol("document")

val embeddings = MPNetEmbeddings.pretrained("lithuanian_un_data_fine_coarse_english","en")
.setInputCols(Array("document"))
.setOutputCol("embeddings")

val pipeline = new Pipeline().setStages(Array(documentAssembler, embeddings))
val data = Seq("I love spark-nlp").toDF("text")
val pipelineModel = pipeline.fit(data)
val pipelineDF = pipelineModel.transform(data)

```
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|lithuanian_un_data_fine_coarse_english|
|Compatibility:|Spark NLP 5.5.0+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[document]|
|Output Labels:|[mpnet]|
|Language:|en|
|Size:|406.8 MB|

## References

https://huggingface.co/dell-research-harvard/lt-un-data-fine-coarse-en