Skip to content

Commit

Permalink
2023-09-13-bert_base_uncased_issues_128_juandeun_en (#13981)
Browse files Browse the repository at this point in the history
* Add model 2023-09-13-b_fb_sms_lm_en

* Add model 2023-09-13-kannada_bert_kn

* Add model 2023-09-13-bert_base_uncased_finetuned_bert_auto7_en

* Add model 2023-09-13-betonews_tweetcontext_en

* Add model 2023-09-13-bert_twitter_hashtag_en

* Add model 2023-09-13-bert_base_cased_finetuned_bert_auto7_en

* Add model 2023-09-13-telugu_bert_te

* Add model 2023-09-13-arab_bert_en

* Add model 2023-09-13-bert_base_uncased_finetuned_bert_mlm_en

* Add model 2023-09-13-ct_pubmedbert_re_en

* Add model 2023-09-13-mybert_mini_500k_en

* Add model 2023-09-13-malayalam_bert_ml

* Add model 2023-09-13-mybert_mini_1m_en

* Add model 2023-09-13-phrase_bert_finetuned_imdb_en

* Add model 2023-09-13-bert_large_uncased_whole_word_masking_finetuned_bert_mlm_en

* Add model 2023-09-13-berdou_200k_en

* Add model 2023-09-13-tamil_bert_ta

* Add model 2023-09-13-legal_hebert_en

* Add model 2023-09-13-bert_base_uncased_finetuned_imdb_sarmila_en

* Add model 2023-09-13-berdou_500k_en

* Add model 2023-09-13-bert_large_uncased_finetuned_bert_mlm5_en

* Add model 2023-09-13-alephbertgimmel_small_128_he

* Add model 2023-09-13-gujarati_bert_gu

* Add model 2023-09-13-alberti_bert_base_multilingual_cased_flax_community_xx

* Add model 2023-09-13-bert_base_japanese_ssuw_ja

* Add model 2023-09-13-bert_base_cased_finetuned_bert_mlm5_en

* Add model 2023-09-13-roberta_base_culinary_en

* Add model 2023-09-13-bert_base_uncased_swahili_sw

* Add model 2023-09-13-assamese_bert_as

* Add model 2023-09-13-cocodr_base_msmarco_warmup_en

* Add model 2023-09-13-shangpin_pre_training_en

* Add model 2023-09-13-bert_large_cased_whole_word_masking_finetuned_bert_mlm6_en

* Add model 2023-09-13-reddit_bert_text2_en

* Add model 2023-09-13-legal_indobert_pytorch_v4_en

* Add model 2023-09-13-bert_base_uncased_finetuned_bert_mlm9_en

* Add model 2023-09-13-reddit_bert_text3_en

* Add model 2023-09-13-odia_bert_or

* Add model 2023-09-13-luxembert_en

* Add model 2023-09-13-mbert_deen_en

* Add model 2023-09-13-reddit_bert_text4_en

* Add model 2023-09-13-bengali_bert_bn

* Add model 2023-09-13-reddit_bert_text_10_en

* Add model 2023-09-13-kcbert_large_finetuned_en

* Add model 2023-09-13-reddit_bert_text_20_en

* Add model 2023-09-13-punjabi_bert_pa

* Add model 2023-09-13-bert_base_uncased_issues_128_cj_mills_en

* Add model 2023-09-13-bert_base_uncased_finetuned_himani_gen_mlm_en

* Add model 2023-09-13-bert_large_cased_sigir_support_refute_norwegian_label_40_2nd_test_lr10_8_0_en

* Add model 2023-09-13-reddit_bert_text_5_en

* Add model 2023-09-13-dapbert_en

* Add model 2023-09-13-bert_base_uncased_finetuned_himani_gen_mlm_1_en

* Add model 2023-09-13-youtube_bert_en

* Add model 2023-09-13-pretrained_kyw_e1_en

* Add model 2023-09-13-dapscibert_en

* Add model 2023-09-13-bert_base_uncased_finetuned_himani_gen_mlm_12_en

* Add model 2023-09-13-youtube_bert_10_en

* Add model 2023-09-13-bert_large_cased_sigir_support_refute_norwegian_label_40_2nd_test_lr10_8_1_en

* Add model 2023-09-13-bert_pretraining_gaudi_2_batch_size_32_en

* Add model 2023-09-13-model1_en

* Add model 2023-09-13-klue_base_finetuned_en

* Add model 2023-09-13-me_bert_mr

* Add model 2023-09-13-bert_cluster_en

* Add model 2023-09-13-medical_bio_bert2_en

* Add model 2023-09-13-bert_base_uncased_finetuned_himani_gen_mlm_13_en

* Add model 2023-09-13-me_bert_mixed_mr

* Add model 2023-09-13-bert_large_cased_finetuned_hkdse_english_paper4_en

* Add model 2023-09-13-dabert_multi_en

* Add model 2023-09-13-bert_base_uncased_finetuned_himani_gen_mlm_14_en

* Add model 2023-09-13-bert_base_spanish_amvv_uncased_en

* Add model 2023-09-13-manubert_en

* Add model 2023-09-13-greeksocialbert_base_greek_uncased_v1_el

* Add model 2023-09-13-bert_base_uncased_finetuned_himani_gen_mlm_15_en

* Add model 2023-09-13-bert_base_pashto_v1_ps

* Add model 2023-09-13-parlbert_german_v1_de

* Add model 2023-09-13-bert_pretraining_gaudi_2_batch_size_64_en

* Add model 2023-09-13-bert_base_cased_portuguese_c_corpus_en

* Add model 2023-09-13-testc8_1_en

* Add model 2023-09-13-klue_bert_epoch3_en

* Add model 2023-09-13-bert_base_stackoverflow_comments_1m_en

* Add model 2023-09-13-testc8_2_en

* Add model 2023-09-13-bert_base_stackoverflow_comments_2m_en

* Add model 2023-09-13-kcbert_base_finetuned_en

* Add model 2023-09-13-bert_base_arabic_miner_en

* Add model 2023-09-14-bert_base_greek_uncased_v5_finetuned_polylex_malagasy_en

* Add model 2023-09-14-dummy_model_linbo_en

* Add model 2023-09-14-bert_base_code_comments_en

* Add model 2023-09-14-bert_base_greek_uncased_v6_finetuned_polylex_malagasy_en

* Add model 2023-09-14-bert_base_uncased_narsil_en

* Add model 2023-09-14-bertugues_base_portuguese_cased_pt

* Add model 2023-09-14-bert_large_stackoverflow_comments_1m_en

* Add model 2023-09-14-retromae_msmarco_distill_en

* Add model 2023-09-14-archaeobert_en

* Add model 2023-09-14-klue_bert_mlm_en

* Add model 2023-09-14-bert_base_uncased_issues_128_mabrouk_en

* Add model 2023-09-14-legalbert_large_1.7m_1_en

* Add model 2023-09-14-telugu_bert_scratch_te

* Add model 2023-09-14-muril_base_cased_en

* Add model 2023-09-14-malayalam_bert_scratch_ml

* Add model 2023-09-14-weights_bert_mlm_epoch50_en

* Add model 2023-09-14-bert_base_cased_conversational_finetuned_wallisian_en

* Add model 2023-09-14-mbert_squad_en

* Add model 2023-09-14-gujarati_bert_scratch_gu

* Add model 2023-09-14-9.4aistudy_en

* Add model 2023-09-14-bert_base_uncased_issues_128_veeps_en

* Add model 2023-09-14-bert_base_german_europeana_td_cased_en

* Add model 2023-09-14-kannada_bert_scratch_kn

* Add model 2023-09-14-bert_base_uncased_issues_128_bh8648_en

* Add model 2023-09-14-awesome_align_with_corsican_xx

* Add model 2023-09-14-bert_base_kor_v1_en

* Add model 2023-09-14-bert_base_uncased_finetuned_wallisian_en

* Add model 2023-09-14-test_dushen_en

* Add model 2023-09-14-bert_base_uncased_finetuned_wallisian_lower_en

* Add model 2023-09-14-legalbert_large_1.7m_2_en

* Add model 2023-09-14-domain_adapted_arbert_goudma_bert_en

* Add model 2023-09-14-medbert_512_norwegian_duplicates_de

* Add model 2023-09-14-closure_system_door_inne_bert_base_uncased_en

* Add model 2023-09-14-gepabert_de

* Add model 2023-09-14-bert_large_cased_sigir_support_refute_norwegian_label_40_2nd_test_lr10_8_5_en

* Add model 2023-09-14-bert_system_en

* Add model 2023-09-14-mergedistill_base_cased_anneal_en

* Add model 2023-09-14-aligner_english_vietnamese_en

* Add model 2023-09-14-medbit_r3_plus_it

* Add model 2023-09-14-bert_large_cased_sigir_support_refute_norwegian_label_40_2nd_test_lr10_8_6_en

* Add model 2023-09-14-mymodel_en

* Add model 2023-09-14-door_inner_with_sa_bert_base_uncased_en

* Add model 2023-09-14-public_models_en

* Add model 2023-09-14-frpile_mlm_en

* Add model 2023-09-14-radbert_en

* Add model 2023-09-14-bert_large_cased_sigir_support_refute_norwegian_label_40_2nd_test_lr10_8_7_en

* Add model 2023-09-14-bert_application_en

* Add model 2023-09-14-legal_hebert_ft_en

* Add model 2023-09-14-mlm_20230416_003_1_en

* Add model 2023-09-14-vatestnew_en

* Add model 2023-09-14-dzarabert_ar

* Add model 2023-09-14-bert_large_cased_sigir_support_refute_norwegian_label_40_2nd_test_lr10_8_10_en

* Add model 2023-09-14-alephbertgimmel_base_512_he

* Add model 2023-09-14-mergedistill_base_cased_anneal_v4_en

* Add model 2023-09-14-mvr_squad_bert_base_multilingual_cased_xx

* Add model 2023-09-14-mlm_20230416_003_2_en

* Add model 2023-09-14-medbit_it

* Add model 2023-09-14-bert_base_uncased_mlm_scirepeval_fos_chemistry_en

* Add model 2023-09-14-medruberttiny2_ru

* Add model 2023-09-14-bert_base_uncased_issues_128_abhilashawasthi_en

* Add model 2023-09-14-bert_large_cased_sigir_support_refute_norwegian_label_40_2nd_test_lr10_8_11_en

* Add model 2023-09-14-sagorbert_nwp_finetuning_test2_en

* Add model 2023-09-14-bert_base_uncased_reviews_128_en

* Add model 2023-09-14-biobit_it

* Add model 2023-09-14-bert_base_uncased_issues_128_reaverlee_en

* Add model 2023-09-14-bert_nlp_project_imdb_en

* Add model 2023-09-14-biomedvlp_cxr_bert_general_en

* Add model 2023-09-14-bert_base_uncased_finetuned_char_hangman_en

* Add model 2023-09-14-clinicaltrialbiobert_en

* Add model 2023-09-14-bert_large_cased_sigir_support_refute_norwegian_label_40_2nd_test_lr10_8_12_en

* Add model 2023-09-14-mlperf_inference_bert_pytorch_fp32_squad_v1.1_en

* Add model 2023-09-14-bert_large_cased_sigir_support_refute_norwegian_label_40_2nd_test_lr10_8_13_en

* Add model 2023-09-14-bert_base_bookcorpus_en

* Add model 2023-09-14-autotrain_acc_keys_2347073860_en

* Add model 2023-09-14-ucb_bert_finetunned_en

* Add model 2023-09-14-bert_nlp_project_google_en

* Add model 2023-09-14-bert_base_wikitext_en

* Add model 2023-09-14-bert_large_cased_sigir_support_refute_norwegian_label_40_2nd_test_lr10_8_20_en

* Add model 2023-09-14-splade_cocondenser_selfdistil_naver_en

* Add model 2023-09-14-jobs_pretraining_model_en

* Add model 2023-09-14-logion_50k_wordpiece_en

* Add model 2023-09-14-splade_cocondenser_ensembledistil_en

* Add model 2023-09-14-bert_based_ner_models_en

* Add model 2023-09-14-model_imdb_finetuned_en

* Add model 2023-09-14-bnlp_tokenizer_paraphrase_mlm_bert_900001_en

* Add model 2023-09-14-gbert_large_finetuned_cust_en

* Add model 2023-09-14-project3_model_en

* Add model 2023-09-14-bert_base_cased_finetuned_chemistry_en

* Add model 2023-09-14-sagorbert_nwp_finetuning_test4_en

* Add model 2023-09-14-bert_base_uncased_mlp_scirepeval_chemistry_large_en

* Add model 2023-09-14-skc_mlm_german_torch_de

* Add model 2023-09-14-kw_pubmed_1000_0.0003_en

* Add model 2023-09-14-test_bert_base_uncased_en

* Add model 2023-09-14-test_bert_base_spanish_wwm_cased_finetuned_ultrasounds_en

* Add model 2023-09-14-akkbert_en

* Add model 2023-09-14-kw_pubmed_1000_0.00006_en

* Add model 2023-09-14-tiny_mlm_imdb_en

* Add model 2023-09-14-tiny_mlm_tweet_en

* Add model 2023-09-14-kw_pubmed_1000_0.000006_en

* Add model 2023-09-14-oyo_bert_base_yo

* Add model 2023-09-14-mini_mlm_tweet_en

* Add model 2023-09-14-small_mlm_tweet_en

* Add model 2023-09-14-gbert_large_finetuned_cust18_en

* Add model 2023-09-14-bert_ucb_v1_en

* Add model 2023-09-14-mini_mlm_imdb_en

* Add model 2023-09-14-louribert_en

* Add model 2023-09-14-medium_mlm_tweet_en

* Add model 2023-09-14-applicationbert_en

* Add model 2023-09-14-base_mlm_tweet_en

* Add model 2023-09-14-bertimbau_pt

* Add model 2023-09-14-small_mlm_imdb_en

* Add model 2023-09-14-vbert_2021_base_en

* Add model 2023-09-14-louribert_more_tokens_saeid7776_en

* Add model 2023-09-14-model_saeid7776_en

* Add model 2023-09-14-model_v02_en

* Add model 2023-09-14-bert_base_uncased_duplicate_en

* Add model 2023-09-14-bert_base_minipile_128_en

* Add model 2023-09-14-gbert_base_finetuned_twitter_janst_en

* Add model 2023-09-14-bert_large_nordic_pile_1m_steps_en

* Add model 2023-09-14-bert_large_nordic_pile_1m_steps_sv

* Add model 2023-09-14-bibert_v0.1_en

* Add model 2023-09-14-bert_base_bangla_finetuned_summarization_dataset_en

* Add model 2023-09-14-incorporation_of_company_related_factual_knowledge_into_pre_trained_language_models_en

* Add model 2023-09-14-bert_multilang_finetune_bangla_summarization_dataset_en

* Add model 2023-09-14-bert_base_uncased_finetuned_wikitext_en

* Add model 2023-09-14-antismetisim1_finetuned_mlm_en

* Add model 2023-09-14-parlbert_german_law_de

* Add model 2023-09-14-dictabert_seg_he

* Add model 2023-09-14-dictabert_he

* Add model 2023-09-14-dictabert_morph_he

* Add model 2023-09-14-scholarbert_100_64bit_en

* Add model 2023-09-14-coronasentana_en

* Add model 2023-09-14-gbert_large_autopart_en

* Add model 2023-09-14-itd_bert_en

* Add model 2023-09-14-itd_longformer_en

* Add model 2023-09-14-lumbarradiologyreports_en

* Add model 2023-09-14-bert_base_german_cased_mlm_basque_chemistry_regulation_en

* Add model 2023-09-14-bert_base_spanish_wwm_cased_finetuned_peppa_pig_en

* Add model 2023-09-14-bert_base_spanish_wwm_cased_finetuned_wine_reviews_spanish_en

* Add model 2023-09-14-antismetisimlargedata_finetuned_mlm_en

* Add model 2023-09-14-word_ethical_ko

* Add model 2023-09-14-bert_base_cased_finetuned_wallisian_whisper_1ep_en

* Add model 2023-09-14-bert_base_cased_finetuned_wallisian_whisper_2ep_en

* Add model 2023-09-14-bert_base_cased_finetuned_wallisian_whisper_3ep_en

* Add model 2023-09-14-bert_base_cased_finetuned_wallisian_whisper_4ep_en

* Add model 2023-09-14-bert_base_cased_finetuned_wallisian_whisper_5ep_en

* Add model 2023-09-14-bert_base_cased_finetuned_wallisian_whisper_6ep_en

* Add model 2023-09-14-bert_base_cased_finetuned_wallisian_whisper_7ep_en

* Add model 2023-09-14-bert_base_cased_finetuned_wallisian_whisper_8ep_en

* Add model 2023-09-14-bert_base_cased_finetuned_wallisian_whisper_9ep_en

* Add model 2023-09-14-bert_base_cased_finetuned_wallisian_whisper_10ep_en

* Add model 2023-09-14-bert_base_cased_finetuned_wallisian_manual_1ep_en

* Add model 2023-09-14-bert_base_uncased_finetuned_wallisian_manual_1ep_lower_en

* Add model 2023-09-14-bert_base_cased_finetuned_wallisian_manual_2ep_en

* Add model 2023-09-14-bert_base_uncased_finetuned_wallisian_manual_2ep_lower_en

* Add model 2023-09-14-bert_base_cased_finetuned_wallisian_manual_3ep_en

* Add model 2023-09-14-bert_base_uncased_finetuned_wallisian_manual_3ep_lower_en

* Add model 2023-09-14-bert_base_cased_finetuned_wallisian_manual_4ep_en

* Add model 2023-09-14-bert_base_uncased_finetuned_wallisian_manual_4ep_lower_en

* Add model 2023-09-14-bert_base_cased_finetuned_wallisian_manual_5ep_en

* Add model 2023-09-14-bert_base_uncased_finetuned_wallisian_manual_5ep_lower_en

* Add model 2023-09-14-bert_base_cased_finetuned_wallisian_manual_6ep_en

* Add model 2023-09-14-bert_base_uncased_finetuned_wallisian_manual_6ep_lower_en

* Add model 2023-09-14-bert_base_cased_finetuned_wallisian_manual_7ep_en

* Add model 2023-09-14-bert_base_uncased_finetuned_wallisian_manual_7ep_lower_en

* Add model 2023-09-14-bert_base_cased_finetuned_wallisian_manual_8ep_en

* Add model 2023-09-14-bert_base_uncased_finetuned_wallisian_manual_8ep_lower_en

---------

Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>
  • Loading branch information
jsl-models and ahmedlone127 authored Sep 14, 2023
1 parent 6ec2297 commit d6f3fe5
Show file tree
Hide file tree
Showing 505 changed files with 46,965 additions and 0 deletions.
Original file line number Diff line number Diff line change
@@ -0,0 +1,93 @@
---
layout: model
title: English bert_base_dutch_cased_finetuned_mark BertEmbeddings from markverschuren
author: John Snow Labs
name: bert_base_dutch_cased_finetuned_mark
date: 2023-09-12
tags: [bert, en, open_source, fill_mask, onnx]
task: Embeddings
language: en
edition: Spark NLP 5.1.1
spark_version: 3.0
supported: true
engine: onnx
annotator: BertEmbeddings
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained BertEmbeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`bert_base_dutch_cased_finetuned_mark` is a English model originally trained by markverschuren.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/bert_base_dutch_cased_finetuned_mark_en_5.1.1_3.0_1694551719944.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/bert_base_dutch_cased_finetuned_mark_en_5.1.1_3.0_1694551719944.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
```python


document_assembler = DocumentAssembler() \
.setInputCol("text") \
.setOutputCol("documents")


embeddings =BertEmbeddings.pretrained("bert_base_dutch_cased_finetuned_mark","en") \
.setInputCols(["documents","token"]) \
.setOutputCol("embeddings")

pipeline = Pipeline().setStages([document_assembler, embeddings])

pipelineModel = pipeline.fit(data)

pipelineDF = pipelineModel.transform(data)

```
```scala


val document_assembler = new DocumentAssembler()
.setInputCol("text")
.setOutputCol("embeddings")

val embeddings = BertEmbeddings
.pretrained("bert_base_dutch_cased_finetuned_mark", "en")
.setInputCols(Array("documents","token"))
.setOutputCol("embeddings")

val pipeline = new Pipeline().setStages(Array(document_assembler, embeddings))

val pipelineModel = pipeline.fit(data)

val pipelineDF = pipelineModel.transform(data)


```
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|bert_base_dutch_cased_finetuned_mark|
|Compatibility:|Spark NLP 5.1.1+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[documents, token]|
|Output Labels:|[embeddings]|
|Language:|en|
|Size:|406.8 MB|

## References

https://huggingface.co/markverschuren/bert-base-dutch-cased-finetuned-mark
93 changes: 93 additions & 0 deletions docs/_posts/ahmedlone127/2023-09-12-legal_bert_small_uncased_en.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,93 @@
---
layout: model
title: English legal_bert_small_uncased BertEmbeddings from nlpaueb
author: John Snow Labs
name: legal_bert_small_uncased
date: 2023-09-12
tags: [bert, en, open_source, fill_mask, onnx]
task: Embeddings
language: en
edition: Spark NLP 5.1.1
spark_version: 3.0
supported: true
engine: onnx
annotator: BertEmbeddings
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained BertEmbeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`legal_bert_small_uncased` is a English model originally trained by nlpaueb.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/legal_bert_small_uncased_en_5.1.1_3.0_1694561644609.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/legal_bert_small_uncased_en_5.1.1_3.0_1694561644609.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
```python


document_assembler = DocumentAssembler() \
.setInputCol("text") \
.setOutputCol("documents")


embeddings =BertEmbeddings.pretrained("legal_bert_small_uncased","en") \
.setInputCols(["documents","token"]) \
.setOutputCol("embeddings")

pipeline = Pipeline().setStages([document_assembler, embeddings])

pipelineModel = pipeline.fit(data)

pipelineDF = pipelineModel.transform(data)

```
```scala


val document_assembler = new DocumentAssembler()
.setInputCol("text")
.setOutputCol("embeddings")

val embeddings = BertEmbeddings
.pretrained("legal_bert_small_uncased", "en")
.setInputCols(Array("documents","token"))
.setOutputCol("embeddings")

val pipeline = new Pipeline().setStages(Array(document_assembler, embeddings))

val pipelineModel = pipeline.fit(data)

val pipelineDF = pipelineModel.transform(data)


```
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|legal_bert_small_uncased|
|Compatibility:|Spark NLP 5.1.1+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[documents, token]|
|Output Labels:|[embeddings]|
|Language:|en|
|Size:|130.6 MB|

## References

https://huggingface.co/nlpaueb/legal-bert-small-uncased
93 changes: 93 additions & 0 deletions docs/_posts/ahmedlone127/2023-09-13-adopted_bert_base_cased_en.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,93 @@
---
layout: model
title: English adopted_bert_base_cased BertEmbeddings from sivanravid
author: John Snow Labs
name: adopted_bert_base_cased
date: 2023-09-13
tags: [bert, en, open_source, fill_mask, onnx]
task: Embeddings
language: en
edition: Spark NLP 5.1.1
spark_version: 3.0
supported: true
engine: onnx
annotator: BertEmbeddings
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained BertEmbeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`adopted_bert_base_cased` is a English model originally trained by sivanravid.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/adopted_bert_base_cased_en_5.1.1_3.0_1694617850169.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/adopted_bert_base_cased_en_5.1.1_3.0_1694617850169.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
```python


document_assembler = DocumentAssembler() \
.setInputCol("text") \
.setOutputCol("documents")


embeddings =BertEmbeddings.pretrained("adopted_bert_base_cased","en") \
.setInputCols(["documents","token"]) \
.setOutputCol("embeddings")

pipeline = Pipeline().setStages([document_assembler, embeddings])

pipelineModel = pipeline.fit(data)

pipelineDF = pipelineModel.transform(data)

```
```scala


val document_assembler = new DocumentAssembler()
.setInputCol("text")
.setOutputCol("embeddings")

val embeddings = BertEmbeddings
.pretrained("adopted_bert_base_cased", "en")
.setInputCols(Array("documents","token"))
.setOutputCol("embeddings")

val pipeline = new Pipeline().setStages(Array(document_assembler, embeddings))

val pipelineModel = pipeline.fit(data)

val pipelineDF = pipelineModel.transform(data)


```
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|adopted_bert_base_cased|
|Compatibility:|Spark NLP 5.1.1+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[documents, token]|
|Output Labels:|[embeddings]|
|Language:|en|
|Size:|403.6 MB|

## References

https://huggingface.co/sivanravid/adopted-bert-base-cased
93 changes: 93 additions & 0 deletions docs/_posts/ahmedlone127/2023-09-13-aivengers_bert_finetuned_en.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,93 @@
---
layout: model
title: English aivengers_bert_finetuned BertEmbeddings from dkqp
author: John Snow Labs
name: aivengers_bert_finetuned
date: 2023-09-13
tags: [bert, en, open_source, fill_mask, onnx]
task: Embeddings
language: en
edition: Spark NLP 5.1.1
spark_version: 3.0
supported: true
engine: onnx
annotator: BertEmbeddings
article_header:
type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained BertEmbeddings model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP.`aivengers_bert_finetuned` is a English model originally trained by dkqp.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/aivengers_bert_finetuned_en_5.1.1_3.0_1694620043636.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/aivengers_bert_finetuned_en_5.1.1_3.0_1694620043636.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
```python


document_assembler = DocumentAssembler() \
.setInputCol("text") \
.setOutputCol("documents")


embeddings =BertEmbeddings.pretrained("aivengers_bert_finetuned","en") \
.setInputCols(["documents","token"]) \
.setOutputCol("embeddings")

pipeline = Pipeline().setStages([document_assembler, embeddings])

pipelineModel = pipeline.fit(data)

pipelineDF = pipelineModel.transform(data)

```
```scala


val document_assembler = new DocumentAssembler()
.setInputCol("text")
.setOutputCol("embeddings")

val embeddings = BertEmbeddings
.pretrained("aivengers_bert_finetuned", "en")
.setInputCols(Array("documents","token"))
.setOutputCol("embeddings")

val pipeline = new Pipeline().setStages(Array(document_assembler, embeddings))

val pipelineModel = pipeline.fit(data)

val pipelineDF = pipelineModel.transform(data)


```
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|aivengers_bert_finetuned|
|Compatibility:|Spark NLP 5.1.1+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[documents, token]|
|Output Labels:|[embeddings]|
|Language:|en|
|Size:|665.0 MB|

## References

https://huggingface.co/dkqp/AiVENGERS_BERT_FineTuned
Loading

0 comments on commit d6f3fe5

Please sign in to comment.