Transformers compete on imbalanced product classification data

The task is to classify each product into its respective "browse node".
Product data such as "Product name", "Product description", "Brand Name", and "Brand Description" is given.
As on e-commerce websites, the number of products in each category varies widely, and this imbalance makes machine-learning categorization both challenging and interesting.
Hence, the performance of transformer models is explored on this imbalanced data.
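
To make "highly varied" concrete, the imbalance can be summarized by counting examples per class and comparing the largest class to the smallest. The sketch below is illustrative only: `imbalance_summary` is a hypothetical helper, and the label values are made up rather than taken from the dataset.

```python
from collections import Counter


def imbalance_summary(labels):
    """Summarize class imbalance for a non-empty list of class labels."""
    counts = Counter(labels)
    largest = max(counts.values())
    smallest = min(counts.values())
    return {
        "num_classes": len(counts),
        "largest": largest,
        "smallest": smallest,
        # Ratio of the most frequent class to the least frequent one;
        # 1.0 means perfectly balanced.
        "ratio": largest / smallest,
    }


# Made-up labels for illustration: class 0 is four times as common as class 1.
example_labels = [0] * 8 + [1] * 2
summary = imbalance_summary(example_labels)
```

A ratio far above 1 indicates that accuracy alone can be misleading, since a model can score well by favoring the large classes.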

1. The dataset is explored and pre-processed. [EDA and preprocessing]
2. The pre-processed dataset is tokenized batchwise using Rust-based tokenizers and saved as pickle files. [tokenization]
3. Each transformer is trained on its respective tokenized dataset; the metrics are logged and then visualized as shown below. [training]

4. The top performer is retrained using focal loss, and the improvement is shown below.
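
For readers unfamiliar with focal loss, here is a minimal, framework-free sketch of the multi-class form, FL(p_t) = -alpha_t * (1 - p_t)^gamma * log(p_t), where p_t is the predicted probability of the true class. It is not the training code used in this repository: the function name, the `gamma=2.0` default, and the optional per-class `alpha` weights are assumptions chosen for illustration.

```python
import math


def focal_loss(logits, target, gamma=2.0, alpha=None):
    """Focal loss for one example, given raw class scores and the true class index.

    gamma=2.0 is an assumed default; alpha is an optional list of
    per-class weights (also an assumption, not taken from the repository).
    """
    # Numerically stable log-softmax for the target class.
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    log_p_t = logits[target] - log_z
    p_t = math.exp(log_p_t)

    weight = 1.0 if alpha is None else alpha[target]
    # (1 - p_t) ** gamma shrinks the loss of well-classified examples,
    # so hard (often minority-class) examples dominate the total loss.
    return -weight * (1.0 - p_t) ** gamma * log_p_t


def mean_focal_loss(batch_logits, targets, gamma=2.0, alpha=None):
    """Average focal loss over a non-empty batch of examples."""
    losses = [
        focal_loss(logits, target, gamma, alpha)
        for logits, target in zip(batch_logits, targets)
    ]
    return sum(losses) / len(losses)


# A confidently correct example contributes far less than a confidently wrong one.
easy = focal_loss([5.0, 0.0], target=0)
hard = focal_loss([0.0, 5.0], target=0)
```

With `gamma=0` and no `alpha`, the expression reduces to ordinary cross-entropy, which makes it easy to check an implementation against a known baseline.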

[1] Surana S, "Amazon Product Browse Node Classification Data", Kaggle Datasets. [link]
