2023-11-20-distilled_indobert_classification_en (#14074)
* Add model 2023-11-22-burmese_awesome_wnut_model_liujunshi_en

* Add model 2023-11-22-burmese_awesome_wnut_model_7dberry_en

* Add model 2023-11-22-tryner_tabert_1k_en

* Add model 2023-11-22-distilbert_base_multilingual_cased_finetuned_ner_xx

* Add model 2023-11-22-loc_dataset_en

* Add model 2023-11-22-biomedical_ner_all_anonimization_try_6_en

* Add model 2023-11-22-meow_tagging_en

* Add model 2023-11-22-roberta_large_ner_model_mimic_top10_en

* Add model 2023-11-22-token_fine_tunned_flipkart_2_en

* Add model 2023-11-22-testingmodel_mn

* Add model 2023-11-22-numberprediction_en

* Add model 2023-11-22-token_classification_test_en

* Add model 2023-11-22-taner_1k_indic_glue_en

* Add model 2023-11-22-burmese_awesome_wnut_model_ni4z_en

* Add model 2023-11-22-burmese_awesome_wnut_model_lathashree01_en

* Add model 2023-11-22-rg_distilbert_augmanted_signatures_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_mankness_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_michelebern_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_nerea06_en

* Add model 2023-11-22-hun_wnut_modell_en

* Add model 2023-11-22-test_ner_finetuned_ner_en

* Add model 2023-11-22-ner_testing_1_en

* Add model 2023-11-22-distil_added_voca_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_haneen77_en

* Add model 2023-11-22-distilbert_bio_pv_superset_en

* Add model 2023-11-22-tryner_tabert_500_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_vutt_en

* Add model 2023-11-22-distilbert_fresh_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_chenyixin1986_en

* Add model 2023-11-22-230615_wnut_model_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_chunk_en

* Add model 2023-11-22-distilbert_base_multilingual_cased_finetuned_ner__dataset_ner_heb_standard_labels_xx

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_grantitdhcka_en

* Add model 2023-11-22-taner_500_naamapdam_fine_tuned_en

* Add model 2023-11-22-finetuned_ner_finegrained_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_yilingwawa_en

* Add model 2023-11-22-burmese_ebm_model_biobert_en

* Add model 2023-11-22-burmese_awesome_wnut_model_spleonard1_en

* Add model 2023-11-22-burmese_awesome_wnut_model_alayaran_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_mariahabib_en

* Add model 2023-11-22-few_nerd_en

* Add model 2023-11-22-insertion_prop_015_correct_data_en

* Add model 2023-11-22-sayula_popoluca_test_model_1_en

* Add model 2023-11-22-distilbert_base_uncased_mlm_scirepeval_fos_chemistry_tokencls_battery_en

* Add model 2023-11-22-biomedical_ner_all_anonimization_try_8_anonimization_try_9_en

* Add model 2023-11-22-try_connll_finetuned_ner_en

* Add model 2023-11-22-biomed_ner_en

* Add model 2023-11-22-tryner_tabert_2k_en

* Add model 2023-11-22-finetuned_model_en

* Add model 2023-11-22-distilbert_base_multilingual_cased_finetuned_ner__dataset_ner_heb_small_xx

* Add model 2023-11-22-jl_distilbert_german_finetuned_ner_en

* Add model 2023-11-22-distilkobert_finetuned_ner_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_nicgh3_en

* Add model 2023-11-22-burmese_awesome_wnut_model_vsufiy_en

* Add model 2023-11-22-tbert_ner_test_en

* Add model 2023-11-22-insertion_prop05_vocab_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_duyduong9htv_en

* Add model 2023-11-22-burmese_awesome_wnut_model_shadman_rohan_en

* Add model 2023-11-22-burmese_awesome_wnut_model_sinchir0_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_jgraves_en

* Add model 2023-11-22-ner_loc_en

* Add model 2023-11-22-taner_500_v2_en

* Add model 2023-11-22-testrun_model_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_bjfxs_en

* Add model 2023-11-22-clinico_finetuned_en

* Add model 2023-11-22-vietai_asm1_ner_en

* Add model 2023-11-22-taner_2k_indic_glue_en

* Add model 2023-11-22-rg_ner_for_emails_en

* Add model 2023-11-22-pautas_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_3_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_longxiang_en

* Add model 2023-11-22-nlp_hiba2_distemist_fine_tuned_biobert_pretrained_model_en

* Add model 2023-11-22-burmese_awesome_wnut_model_danstinga_en

* Add model 2023-11-22-dogebooch_biomedical_ner_all_datasets_4_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_orthogonal_orca_en

* Add model 2023-11-22-distilkobert_kemofact_0925_en

* Add model 2023-11-22-biomedical_ner_all_anonimization_try_7_en

* Add model 2023-11-22-burmese_awesome_address_tokenizer_model_v7_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_eca1g19_en

* Add model 2023-11-22-directquote_chunktext_distilbert_en

* Add model 2023-11-22-taner_4k_indic_glue_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_krag57_en

* Add model 2023-11-22-distilkobert_kemofact_efe_0927_en

* Add model 2023-11-22-punjabi_distilbert_ner_en

* Add model 2023-11-22-hueta_finetuned_1_en

* Add model 2023-11-22-ner_our_base_model_en

* Add model 2023-11-22-distilbert_finetuned_ner_s800_en

* Add model 2023-11-22-test_ner3_en

* Add model 2023-11-22-autotrain_test_ner_75401139975_en

* Add model 2023-11-22-tokenclass_wnut_en

* Add model 2023-11-22-burmese_awesome_wnut_model_gwd77777_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_andrewlitv_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_abdullahf129_en

* Add model 2023-11-22-insertion_prop_05_correct_data_en

* Add model 2023-11-22-distilbert_pabloguinea_en

* Add model 2023-11-22-burmese_awesome_reconstructor_model_en

* Add model 2023-11-22-quote_model_delta_en

* Add model 2023-11-22-token_fine_tunned_flipkart_2_gl7_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_zakria_en

* Add model 2023-11-22-burmese_awesome_wnut_model_longxiang_en

* Add model 2023-11-22-burmese_awesome_wnut_model_me11997_en

* Add model 2023-11-22-burmese_awesome_wnut_model_yannhabib_en

* Add model 2023-11-22-ner_distillbert_ner_en

* Add model 2023-11-22-burmese_awesome_wnut_model_jessicaassis_en

* Add model 2023-11-22-token_fine_tunned_flipkart_en

* Add model 2023-11-22-burmese_awesome_wnut_model_longmark_en

* Add model 2023-11-22-entity_extraction_not_evaluated_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_xsf_en

* Add model 2023-11-22-datos_ner_en

* Add model 2023-11-22-token_fine_tunned_flipkart_2_galician_en

* Add model 2023-11-22-color_extraction_2023_02_09_v2_finetuned_ner_en

* Add model 2023-11-22-sayula_popoluca_test_model_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_vijays2_en

* Add model 2023-11-22-burmese_awesome_wnut_model_claudehotline_en

* Add model 2023-11-22-distilbert_base_multilingual_cased_finetuned_ner__dataset_ner_heb_xx

* Add model 2023-11-22-consejo_ner_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_arifdknt_en

* Add model 2023-11-22-burmese_awesome_wnut_model_alicenkbaytop_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_mke10_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_sumanc_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_recruitment_eval_en

* Add model 2023-11-22-burmese_ner_model_jimi11_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_kisma_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_reugene_en

* Add model 2023-11-22-burmese_awesome_wnut_model_davidliu1110_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_slhoefel_en

* Add model 2023-11-22-claims_data_model_jlandis_en

* Add model 2023-11-22-sara_model_en

* Add model 2023-11-22-insertion_prop05_ls01_en

* Add model 2023-11-22-token_final_tunned_en

* Add model 2023-11-22-model_output_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_pivolan_en

* Add model 2023-11-22-burmese_awesome_wnut_model_ggouda_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_yilmazasl_en

* Add model 2023-11-22-tryner_tabert_4k_en

* Add model 2023-11-22-burmese_awesome_wnut_model_maunilvyas_en

* Add model 2023-11-22-burmese_awesome_wnut_model_andyrasika_en

* Add model 2023-11-22-burmese_awesome_wnut_model_sofa566_en

* Add model 2023-11-22-burmese_awesome_wnut_model_stepa_en

* Add model 2023-11-22-clinico_finetuned_augmented1_en

* Add model 2023-11-22-ner_distillbert_ner_tags_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_novik_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner1_en

* Add model 2023-11-22-distilbert_base_uncased_marfinbirt_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_turhana_en

* Add model 2023-11-22-dbert_finetuned_ct_2023_en

* Add model 2023-11-22-wikineural_multilingual_ner_xx

* Add model 2023-11-22-results_raucusreno_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_jasminebatra_en

* Add model 2023-11-22-burmese_awesome_wnut_model_terhdavid_en

* Add model 2023-11-22-hueta_finetuned_en

* Add model 2023-11-22-distilbert_base_cased_finetuned_ner_all_en

* Add model 2023-11-22-burmese_awesome_wnut_model_dimitriish_en

* Add model 2023-11-22-burmese_awesome_wnut_model_edthomasset_en

* Add model 2023-11-22-distilbert_base_cased_ner_trained_on_synthea_en

* Add model 2023-11-22-wnut_model_navendux_en

* Add model 2023-11-22-color_extraction_2023_02_10_v2_finetuned_ner_en

* Add model 2023-11-22-biomedical_ner_all_anonimization_try_2_en

* Add model 2023-11-22-rg_distilbert_big_data_en

* Add model 2023-11-22-tmp_trainer_en

* Add model 2023-11-22-modelworking4_en

* Add model 2023-11-22-burmese_awesome_wnut_model3_en

* Add model 2023-11-22-color_extraction_2023_02_10_v3_finetuned_ner_en

* Add model 2023-11-22-burmese_awesome_wnut_model_eitanli_en

* Add model 2023-11-22-finetune_wnut_model_en

* Add model 2023-11-22-tutorial_en

* Add model 2023-11-22-burmese_awesome_wnut_model_vsombhane_en

* Add model 2023-11-22-burmese_awesome_wnut_model_idriska_en

* Add model 2023-11-22-distilbert_finetuned_ner_copious_en

* Add model 2023-11-22-dimensions_extraction_2023_02_10_v0_en

* Add model 2023-11-22-modelworkingmanualdata2_en

* Add model 2023-11-22-burmese_awesome_wnut_model_xsf_en

* Add model 2023-11-22-copilot_wnut_model_en

* Add model 2023-11-22-burmese_awesome_wnut_model_yuliang555_en

* Add model 2023-11-22-burmese_awesome_wnut_model_thrushwanth_en

* Add model 2023-11-22-wolof_finetuned_ner_accelerate_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_pha_en

* Add model 2023-11-22-burmese_awesome_wnut_model_zinoli_en

* Add model 2023-11-22-increase_exp_en

* Add model 2023-11-22-burmese_awesome_wnut_model_anyuanay_en

* Add model 2023-11-22-biomedical_ner_all_anonimization_try_8_en

* Add model 2023-11-22-ner_model_v1_en

* Add model 2023-11-22-burmese_awesome_wnut_model_ramikassouf_en

* Add model 2023-11-22-taner_1k_naamapdam_fine_tuned_en

* Add model 2023-11-22-modelworking6_copy_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_linh101201_en

* Add model 2023-11-22-distilbert_base_cased_finetuned_ner_linh101201_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_model2_ner_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_thun11_en

* Add model 2023-11-22-modelworking3_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_model1_ner_en

* Add model 2023-11-22-test4_en

* Add model 2023-11-22-affilgood_ner_test_en

* Add model 2023-11-22-panda_ner_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_model3_ner_en

* Add model 2023-11-22-burmese_awesome_wnut_model_sravanipilla_en

* Add model 2023-11-22-wnut_model_realgon_en

* Add model 2023-11-22-burmese_awesome_wnut_model_shaohantian_en

* Add model 2023-11-22-ecobert_finetuned_ner_s800_en

* Add model 2023-11-22-burmese_test2_wnut_model_en

* Add model 2023-11-22-burmese_awesome_wnut_model_muibk_en

* Add model 2023-11-22-burmese_awesome_wnut_model_viktaradynets_en

* Add model 2023-11-22-burmese_awesome_wnut_model_alexisdpc_en

* Add model 2023-11-22-copilot_namanj_model_en

* Add model 2023-11-22-taner_1k_en

* Add model 2023-11-22-burmese_first_model_en

* Add model 2023-11-22-burmese_awesome_wnut_model_suhasparray_en

* Add model 2023-11-22-bert_multilingual_ner_xx

* Add model 2023-11-22-burmese_awesome_wnut_model_prudhvirazz_en

* Add model 2023-11-22-modelworking2_en

* Add model 2023-11-22-taner_500_en

* Add model 2023-11-22-burmese_awesome_wnut_model_sameerakoppana_en

* Add model 2023-11-22-ecobert_finetuned_ner_copious_en

* Add model 2023-11-22-medhack_en

* Add model 2023-11-22-burmese_awesome_wnut_model_nadeemraja_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_samih1974_en

* Add model 2023-11-22-moodprediction_en

* Add model 2023-11-22-burmese_awesome_wnut_model_kadir0_en

* Add model 2023-11-22-burmese_awesome_wnut_model_emmanuelq2_en

* Add model 2023-11-22-tryner_2k_en

* Add model 2023-11-22-spanish_ner_en

* Add model 2023-11-22-sophie_spanish_implementation_en

* Add model 2023-11-22-modelworkingmanualdata_en

* Add model 2023-11-22-burmese_awesome_pakner_model_en

* Add model 2023-11-22-burmese_awesome_wnut_model2_en

* Add model 2023-11-22-burmese_awesome_wnut_model_darrenhinde_en

* Add model 2023-11-22-social_groups_second_try_giladh_en

* Add model 2023-11-22-distilbert_base_cased_finetuned_ner_t2_g2_en

* Add model 2023-11-22-tiny_random_distilbertfortokenclassification_hf_tiny_model_private_en

* Add model 2023-11-22-burmese_awesome_wnut_model_mamuninfo_en

* Add model 2023-11-22-claims_data_model_mjokich_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_rohanv123_en

* Add model 2023-11-22-modelworkingmanualdata3_en

* Add model 2023-11-22-nlp_p4_en

* Add model 2023-11-22-burmese_nlp_model_en

* Add model 2023-11-22-burmese_awesome_wnut_model_hefeng0_en

* Add model 2023-11-22-tryner_4k_en

* Add model 2023-11-22-color_extraction_2023_02_10_v1_finetuned_ner_en

* Add model 2023-11-22-distilbert_base_multilingual_cased_ner_demo_amarsanaa1525_xx

* Add model 2023-11-22-burmese_awesome_wnut_model_atheer174_en

* Add model 2023-11-22-burmese_awesome_wnut_model_yyyy1992_en

* Add model 2023-11-22-burmese_awesome_wnut_model_1_en

* Add model 2023-11-22-basic_wnut_en

* Add model 2023-11-22-test_train_model_en

* Add model 2023-11-22-burmese_test_big_ner_model_en

* Add model 2023-11-22-burmese_awesome_wnut_model_iftisyed_en

* Add model 2023-11-22-portuguese_traing_en

* Add model 2023-11-22-burmese_awesome_wnut_model_blakemaster24_en

* Add model 2023-11-22-test2_en

* Add model 2023-11-22-distilbert_base_uncased_finetuned_ner_calin_en

---------

Co-authored-by: ahmedlone127 <ahmedlone127@gmail.com>
jsl-models and ahmedlone127 authored Nov 22, 2023
1 parent 5c5433d commit c6f8db4
Showing 1,028 changed files with 96,682 additions and 0 deletions.
@@ -0,0 +1,97 @@
---
layout: model
title: English 4_way_detection_prop_16_distilbert DistilBertForSequenceClassification from ultra-coder54732
author: John Snow Labs
name: 4_way_detection_prop_16_distilbert
date: 2023-11-20
tags: [bert, en, open_source, sequence_classification, onnx]
task: Text Classification
language: en
edition: Spark NLP 5.2.0
spark_version: 3.0
supported: true
engine: onnx
annotator: DistilBertForSequenceClassification
article_header:
  type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained DistilBertForSequenceClassification model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. `4_way_detection_prop_16_distilbert` is an English model originally trained by ultra-coder54732.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/4_way_detection_prop_16_distilbert_en_5.2.0_3.0_1700496105654.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/4_way_detection_prop_16_distilbert_en_5.2.0_3.0_1700496105654.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}
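
The download links above appear to follow a `<name>_<language>_<Spark NLP version>_<Spark version>_<timestamp>.zip` pattern. The sketch below splits such a filename into its parts. The layout, and the reading of the trailing 13 digits as epoch milliseconds, are assumptions inferred from the URLs in this card rather than a documented contract, and `parse_artifact` and `ModelArtifact` are hypothetical names used only for illustration.

```python
import re
from datetime import datetime, timezone
from typing import NamedTuple


class ModelArtifact(NamedTuple):
    name: str
    language: str
    spark_nlp_version: str
    spark_version: str
    published_utc: datetime


# Assumed layout: <name>_<lang>_<x.y.z>_<x.y>_<13-digit epoch millis>.zip
_ARTIFACT_RE = re.compile(
    r"^(?P<name>.+)_(?P<lang>[a-z]{2,3})"
    r"_(?P<nlp>\d+\.\d+\.\d+)_(?P<spark>\d+\.\d+)_(?P<ts>\d{13})\.zip$"
)


def parse_artifact(url: str) -> ModelArtifact:
    """Split a model archive URL (https:// or s3://) into its components."""
    filename = url.rsplit("/", 1)[-1]
    match = _ARTIFACT_RE.match(filename)
    if match is None:
        raise ValueError(f"unexpected artifact name: {filename!r}")
    # Interpret the trailing digits as milliseconds since the Unix epoch (assumption).
    published = datetime.fromtimestamp(int(match["ts"]) / 1000, tz=timezone.utc)
    return ModelArtifact(
        name=match["name"],
        language=match["lang"],
        spark_nlp_version=match["nlp"],
        spark_version=match["spark"],
        published_utc=published,
    )
```

Calling `parse_artifact` on the S3 URI above should yield the same model name, language, and versions that the front matter lists, which makes it a cheap consistency check when many cards are generated in bulk.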

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
```python
import sparknlp
from sparknlp.base import DocumentAssembler
from sparknlp.annotator import Tokenizer, DistilBertForSequenceClassification
from pyspark.ml import Pipeline

# Start (or reuse) a Spark session with Spark NLP available
spark = sparknlp.start()

document_assembler = DocumentAssembler()\
    .setInputCol("text")\
    .setOutputCol("document")

tokenizer = Tokenizer()\
    .setInputCols("document")\
    .setOutputCol("token")

sequenceClassifier = DistilBertForSequenceClassification.pretrained("4_way_detection_prop_16_distilbert","en")\
    .setInputCols(["document","token"])\
    .setOutputCol("class")

pipeline = Pipeline().setStages([document_assembler, tokenizer, sequenceClassifier])

data = spark.createDataFrame([["PUT YOUR STRING HERE"]]).toDF("text")

result = pipeline.fit(data).transform(data)

```
```scala
import spark.implicits._
import com.johnsnowlabs.nlp.base._
import com.johnsnowlabs.nlp.annotator._
import org.apache.spark.ml.Pipeline

val documentAssembler = new DocumentAssembler()
    .setInputCol("text")
    .setOutputCol("document")

val tokenizer = new Tokenizer()
    .setInputCols("document")
    .setOutputCol("token")

val sequenceClassifier = DistilBertForSequenceClassification.pretrained("4_way_detection_prop_16_distilbert","en")
    .setInputCols(Array("document","token"))
    .setOutputCol("class")

// Stage names must match the vals defined above
val pipeline = new Pipeline().setStages(Array(documentAssembler, tokenizer, sequenceClassifier))

val data = Seq("PUT YOUR STRING HERE").toDS.toDF("text")

val result = pipeline.fit(data).transform(data)

```
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|4_way_detection_prop_16_distilbert|
|Compatibility:|Spark NLP 5.2.0+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[documents, token]|
|Output Labels:|[class]|
|Language:|en|
|Size:|249.5 MB|

## References

https://huggingface.co/ultra-coder54732/4-way-detection-prop-16-distilbert
93 changes: 93 additions & 0 deletions docs/_posts/ahmedlone127/2023-11-20-app_en.md
@@ -0,0 +1,93 @@
---
layout: model
title: English app DistilBertForTokenClassification from pierrerappolt-okta
author: John Snow Labs
name: app
date: 2023-11-20
tags: [bert, en, open_source, token_classification, onnx]
task: Named Entity Recognition
language: en
edition: Spark NLP 5.2.0
spark_version: 3.0
supported: true
engine: onnx
annotator: DistilBertForTokenClassification
article_header:
  type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained DistilBertForTokenClassification model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. `app` is an English model originally trained by pierrerappolt-okta.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/app_en_5.2.0_3.0_1700519714968.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/app_en_5.2.0_3.0_1700519714968.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
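```python
import sparknlp
from sparknlp.base import DocumentAssembler
from sparknlp.annotator import Tokenizer, DistilBertForTokenClassification
from pyspark.ml import Pipeline

spark = sparknlp.start()

documentAssembler = DocumentAssembler() \
    .setInputCol("text") \
    .setOutputCol("documents")

# Produces the "token" column the classifier reads
tokenizer = Tokenizer() \
    .setInputCols(["documents"]) \
    .setOutputCol("token")

tokenClassifier = DistilBertForTokenClassification.pretrained("app","en") \
    .setInputCols(["documents","token"]) \
    .setOutputCol("ner")

pipeline = Pipeline().setStages([documentAssembler, tokenizer, tokenClassifier])

data = spark.createDataFrame([["PUT YOUR STRING HERE"]]).toDF("text")

pipelineModel = pipeline.fit(data)

pipelineDF = pipelineModel.transform(data)

```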
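```scala
import spark.implicits._
import com.johnsnowlabs.nlp.base._
import com.johnsnowlabs.nlp.annotator._
import org.apache.spark.ml.Pipeline

val documentAssembler = new DocumentAssembler()
    .setInputCol("text")
    .setOutputCol("documents")

// Produces the "token" column the classifier reads
val tokenizer = new Tokenizer()
    .setInputCols("documents")
    .setOutputCol("token")

val tokenClassifier = DistilBertForTokenClassification
    .pretrained("app", "en")
    .setInputCols(Array("documents","token"))
    .setOutputCol("ner")

val pipeline = new Pipeline().setStages(Array(documentAssembler, tokenizer, tokenClassifier))

val data = Seq("PUT YOUR STRING HERE").toDS.toDF("text")

val pipelineModel = pipeline.fit(data)

val pipelineDF = pipelineModel.transform(data)

```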
</div>
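
The `ner` column holds one label per token. Assuming the model emits IOB-style tags (`B-X`, `I-X`, `O`), which this card does not state, adjacent tokens can be merged into entity spans. The following is a minimal pure-Python sketch of that grouping; `group_entities` is a hypothetical helper, and the tokens and labels in the usage note are made up for illustration. Inside a pipeline, Spark NLP's `NerConverter` annotator is the usual way to do this.

```python
from typing import List, Optional, Tuple


def group_entities(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Merge IOB-tagged tokens into (entity text, label) pairs."""
    entities: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_label: Optional[str] = None

    def flush() -> None:
        # Emit the entity collected so far, if any.
        if current_tokens and current_label is not None:
            entities.append((" ".join(current_tokens), current_label))

    for token, tag in zip(tokens, tags):
        prefix, _, label = tag.partition("-")
        if prefix == "B" or (prefix == "I" and label != current_label):
            # A B- tag, or an I- tag with a different label, starts a new entity.
            flush()
            current_tokens, current_label = [token], label
        elif prefix == "I":
            # Continuation of the entity currently being built.
            current_tokens.append(token)
        else:
            # "O" (or any unrecognised tag) closes the current entity.
            flush()
            current_tokens, current_label = [], None

    flush()
    return entities
```

For example, `group_entities(["Ada", "Lovelace", "visited", "London"], ["B-PER", "I-PER", "O", "B-LOC"])` groups the first two tokens into one `PER` entity and the last token into a `LOC` entity.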

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|app|
|Compatibility:|Spark NLP 5.2.0+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[documents, token]|
|Output Labels:|[ner]|
|Language:|en|
|Size:|247.2 MB|

## References

https://huggingface.co/pierrerappolt-okta/app
@@ -0,0 +1,97 @@
---
layout: model
title: English background_distilebert_2023_02_21_19_08 DistilBertForSequenceClassification from leeju
author: John Snow Labs
name: background_distilebert_2023_02_21_19_08
date: 2023-11-20
tags: [bert, en, open_source, sequence_classification, onnx]
task: Text Classification
language: en
edition: Spark NLP 5.2.0
spark_version: 3.0
supported: true
engine: onnx
annotator: DistilBertForSequenceClassification
article_header:
  type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained DistilBertForSequenceClassification model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. `background_distilebert_2023_02_21_19_08` is an English model originally trained by leeju.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/background_distilebert_2023_02_21_19_08_en_5.2.0_3.0_1700483305055.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/background_distilebert_2023_02_21_19_08_en_5.2.0_3.0_1700483305055.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
```python
import sparknlp
from sparknlp.base import DocumentAssembler
from sparknlp.annotator import Tokenizer, DistilBertForSequenceClassification
from pyspark.ml import Pipeline

# Start (or reuse) a Spark session with Spark NLP available
spark = sparknlp.start()

document_assembler = DocumentAssembler()\
    .setInputCol("text")\
    .setOutputCol("document")

tokenizer = Tokenizer()\
    .setInputCols("document")\
    .setOutputCol("token")

sequenceClassifier = DistilBertForSequenceClassification.pretrained("background_distilebert_2023_02_21_19_08","en")\
    .setInputCols(["document","token"])\
    .setOutputCol("class")

pipeline = Pipeline().setStages([document_assembler, tokenizer, sequenceClassifier])

data = spark.createDataFrame([["PUT YOUR STRING HERE"]]).toDF("text")

result = pipeline.fit(data).transform(data)

```
```scala
import spark.implicits._
import com.johnsnowlabs.nlp.base._
import com.johnsnowlabs.nlp.annotator._
import org.apache.spark.ml.Pipeline

val documentAssembler = new DocumentAssembler()
    .setInputCol("text")
    .setOutputCol("document")

val tokenizer = new Tokenizer()
    .setInputCols("document")
    .setOutputCol("token")

val sequenceClassifier = DistilBertForSequenceClassification.pretrained("background_distilebert_2023_02_21_19_08","en")
    .setInputCols(Array("document","token"))
    .setOutputCol("class")

// Stage names must match the vals defined above
val pipeline = new Pipeline().setStages(Array(documentAssembler, tokenizer, sequenceClassifier))

val data = Seq("PUT YOUR STRING HERE").toDS.toDF("text")

val result = pipeline.fit(data).transform(data)

```
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|background_distilebert_2023_02_21_19_08|
|Compatibility:|Spark NLP 5.2.0+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[documents, token]|
|Output Labels:|[class]|
|Language:|en|
|Size:|250.9 MB|

## References

https://huggingface.co/leeju/background-distilebert_2023-02-21_19-08
93 changes: 93 additions & 0 deletions docs/_posts/ahmedlone127/2023-11-20-bert_b07_en.md
@@ -0,0 +1,93 @@
---
layout: model
title: English bert_b07 DistilBertForTokenClassification from LazzeKappa
author: John Snow Labs
name: bert_b07
date: 2023-11-20
tags: [bert, en, open_source, token_classification, onnx]
task: Named Entity Recognition
language: en
edition: Spark NLP 5.2.0
spark_version: 3.0
supported: true
engine: onnx
annotator: DistilBertForTokenClassification
article_header:
  type: cover
use_language_switcher: "Python-Scala-Java"
---

## Description

Pretrained DistilBertForTokenClassification model, adapted from Hugging Face and curated to provide scalability and production-readiness using Spark NLP. `bert_b07` is an English model originally trained by LazzeKappa.

{:.btn-box}
<button class="button button-orange" disabled>Live Demo</button>
<button class="button button-orange" disabled>Open in Colab</button>
[Download](https://s3.amazonaws.com/auxdata.johnsnowlabs.com/public/models/bert_b07_en_5.2.0_3.0_1700521668306.zip){:.button.button-orange.button-orange-trans.arr.button-icon}
[Copy S3 URI](s3://auxdata.johnsnowlabs.com/public/models/bert_b07_en_5.2.0_3.0_1700521668306.zip){:.button.button-orange.button-orange-trans.button-icon.button-copy-s3}

## How to use



<div class="tabs-box" markdown="1">
{% include programmingLanguageSelectScalaPythonNLU.html %}
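```python
import sparknlp
from sparknlp.base import DocumentAssembler
from sparknlp.annotator import Tokenizer, DistilBertForTokenClassification
from pyspark.ml import Pipeline

spark = sparknlp.start()

documentAssembler = DocumentAssembler() \
    .setInputCol("text") \
    .setOutputCol("documents")

# Produces the "token" column the classifier reads
tokenizer = Tokenizer() \
    .setInputCols(["documents"]) \
    .setOutputCol("token")

tokenClassifier = DistilBertForTokenClassification.pretrained("bert_b07","en") \
    .setInputCols(["documents","token"]) \
    .setOutputCol("ner")

pipeline = Pipeline().setStages([documentAssembler, tokenizer, tokenClassifier])

data = spark.createDataFrame([["PUT YOUR STRING HERE"]]).toDF("text")

pipelineModel = pipeline.fit(data)

pipelineDF = pipelineModel.transform(data)

```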
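```scala
import spark.implicits._
import com.johnsnowlabs.nlp.base._
import com.johnsnowlabs.nlp.annotator._
import org.apache.spark.ml.Pipeline

val documentAssembler = new DocumentAssembler()
    .setInputCol("text")
    .setOutputCol("documents")

// Produces the "token" column the classifier reads
val tokenizer = new Tokenizer()
    .setInputCols("documents")
    .setOutputCol("token")

val tokenClassifier = DistilBertForTokenClassification
    .pretrained("bert_b07", "en")
    .setInputCols(Array("documents","token"))
    .setOutputCol("ner")

val pipeline = new Pipeline().setStages(Array(documentAssembler, tokenizer, tokenClassifier))

val data = Seq("PUT YOUR STRING HERE").toDS.toDF("text")

val pipelineModel = pipeline.fit(data)

val pipelineDF = pipelineModel.transform(data)

```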
</div>

{:.model-param}
## Model Information

{:.table-model}
|---|---|
|Model Name:|bert_b07|
|Compatibility:|Spark NLP 5.2.0+|
|License:|Open Source|
|Edition:|Official|
|Input Labels:|[documents, token]|
|Output Labels:|[ner]|
|Language:|en|
|Size:|505.4 MB|

## References

https://huggingface.co/LazzeKappa/BERT_B07