# mbert

Here are 32 public repositories matching this topic...

csebuetnlp / banglabert

This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla" accepted in Findings of the Annual Conference of the North American Chap…

named-entity-recognition document-classification natural-language-inference bert sentiment-classification textual-entailment emotion-classification bangla-nlp bengali-language-processing bengali-natural-language-processing multilingual-models bengali-nlp bert-fine-tuning xlm-roberta mbert bangla-language-processing bangla-natural-language-processing banglabert

Updated Jan 24, 2023
Python

cambridgeltl / ContrastiveBLI

Improving Word Translation via Two-Stage Contrastive Learning (ACL 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-Lingual Word Embeddings.

information-retrieval machine-translation word-embeddings pytorch self-learning word-alignment bilingual-word-embedding bilingual-lexicon-extraction fasttext-embeddings cross-lingual-embeddings mbert contrastive-learning low-resource-machine-translation bilingual-lexicon-induction cross-lingual-word-embedding word-translation cross-lingual-word-embeddings bilingual-dictionary-induction

Updated Aug 12, 2024
Python

lirondos / lazaro

An observatory of anglicism usage in the Spanish press

corpus linguistics spanish crf-model bilstm-crf mbert spanish-newswire anglicisms borrowings

Updated Feb 7, 2024
Python

ishan00 / meta-learning-for-multi-task-multilingual

Official Repository for the paper titled "Meta-Learning for Effective Multi-task and Multilingual Modelling" accepted at EACL 2021

transformers named-entity-recognition question-answering natural-language-inference reptile multi-task-learning paraphrase-identification meta-learning multilingual-models part-of-speech-tagging mbert

Updated Jul 27, 2021
Python

negar-foroutan / multiLMs-lang-neutral-subnets

[EMNLP 2022] Discovering Language-neutral Sub-networks in Multilingual Language Models.

mt5 lottery-ticket-hypothesis mbert cross-lingual-transfer multilingual-language-models multilingual-nlp

Updated Apr 1, 2024
Python

Mukaffi28 / Vashantor-A-Large-scale-Multilingual-Benchmark-Dataset

A Large-scale Multilingual Benchmark Dataset for Automated Translation of Bangla Regional Dialects to Bangla Language

machine-translation neural-machine-translation mt5 mbert banglat5 bangla-bert-base regional-dialects

Updated Feb 4, 2024
Jupyter Notebook

fatemafaria142 / MultiBanFakeDetect-An-Extensive-Benchmark-Dataset-for-Multimodal-Bangla-Fake-News-Detection

This study introduces MultiBanFakeDetect, a novel multimodal dataset for Bangla fake news detection, combining textual and visual information. It features TextFakeNet for text analysis and MultiFusionFake for integrating multimodal data.

benchmark dataset resnet-101 multimodal-dataset xlm-roberta mbert fake-news-detection early-fusion late-fusion under-resourced-language fusion-techniques densenet-169 intermediate-fusion

Updated Aug 8, 2024
Jupyter Notebook

juletx / multilingual-question-answering

Zero-shot and Translation Experiments on XQuAD, MLQA and TyDiQA

translation machine-translation question-answering squad bert zero-shot roberta mbert xlm-r mlqa xquad multilingual-bert translate-train tydiqa translate-test

Updated Jun 14, 2022
Jupyter Notebook

BassaniRiccardo / ICEBERT

ICEBERT: Interlingual-Clusters Enhanced BERT. A BERT-like model trained on clusters of monolingual subwords.

clustering tokenization subword-segmentation mbert

Updated Jan 10, 2022
Python

fatemafaria142 / Large-Language-Models-Over-Transformer-Models-for-Bangla-NLI

This research examines the performance of Large Language Models (GPT-3.5 Turbo and Gemini 1.5 Pro) in Bengali Natural Language Inference, comparing them with state-of-the-art models using the XNLI dataset. It explores zero-shot and few-shot scenarios to evaluate their efficacy in low-resource settings.

bengali natural-language-inference low-resource-languages distilbert mbert pretrained-language-models banglabert large-language-models

Updated May 8, 2024
Jupyter Notebook

elsheikh21 / cross-natural-language-inference

ZeroShot XNLI

transformers torch xlm xnli xlm-roberta mbert

Updated Sep 2, 2020
Python

DiFronzo / Multilingual-Models

mBERT and XLM-R for encoding of Scandinavian languages

multilingual python language transformers python3 pytorch xlm-roberta mbert xlm-r scandinavian

Updated Dec 14, 2022
Python

Elijas / lithuanian-text-summarization-model

A deployed model that summarizes Lithuanian text using artificial neural networks, Transformers, and mBERT.

nlp pytorch summarization ann language-model bert streamlit mbert

Updated May 11, 2021
Python

peterzee-tsien / LING484-COMP599-Final-Projects

Using a hypothesis from historical linguistics, we found a way to improve the performance of multilingual transformers with a limited amount of data.

swahili yoruba ner pos-tagger fine-tuning mbert multilingual-bert wolof

Updated Apr 27, 2022
Jupyter Notebook

AditiBagora / Hasoc2021CodeMix

HASOC2021: Subtask 2 a) Codemix Challenge. Contains baselines and a hierarchical approach that extracts the relevant context useful for classifying hostile tweets in English-Hindi code-mixed data obtained from Twitter.

tensorflow transformers torch feature-extraction mlp nlp-machine-learning fine-tuning xlm-roberta mbert

Updated Feb 20, 2022
Jupyter Notebook

michaelpeterhoffmann / masterthesis

Multilingual hate speech detection for German, Italian and Spanish Social Media Posts #machine learning #classifier

transformer transfer-learning bert svm-classifier mbert xlmroberta

Updated Nov 30, 2023
Jupyter Notebook

Koharu24 / mBERT-Unaligned-fine-tuning-for-a-cross-lingual-RD-of-untranslatable-terms

This is a project proposal to implement Yan et al.'s (2020) mBERT-Unaligned for cross-lingual RDs with Japanese, German and Italian untranslatable terms

nlp-machine-learning unaligned cross-linguistic-data reverse-dictionary mbert

Updated Aug 9, 2023

MusfiqDehan / Multilingual-Sentence-Alignments-Demo

Align parallel sentences in 104 languages with the help of mBERT and LaBSE

mbert multilingual-bert labse multilingual-alignment

Updated Mar 7, 2024
Python

NasserMohamedEid / Text-AI-Detection

nlp bert mt5 streamlit mbert arabert roberta-model llm

Updated Mar 20, 2024
Jupyter Notebook

SKG24 / VIVARAN_chatbot_Supreme-court-hackathon

A concept for an AI-powered chatbot that helps people understand the Indian government and its laws. To reach a larger audience, it supports all the constitutional languages.

ai chatbot ml supreme-court huggingface mbert

Updated Sep 12, 2024
