Revisiting Anwesha: Enhancing Personalised and Natural Search in Bangla

About Anwesha

Anwesha is a semantic search tool over Bangla documents.

To learn more about Anwesha in its current form, kindly visit here.

This work addresses the existing limitations of Anwesha. For details, kindly refer to the paper cited below.
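As a rough illustration of one direction described in the paper's abstract (tf-idf candidate retrieval followed by reranking on the sentences most similar to the query), the sketch below shows the two-stage idea in plain Python. It is not the authors' implementation. The names `tokenise`, `tfidf_candidates`, `rerank`, `toy_embed` and `VOCAB` are hypothetical, `toy_embed` is only a bag-of-words stand-in for a real sentence encoder such as LaBSE, and scoring a document by the mean of its top `top_m` sentence similarities is an assumption made for this example.

```python
import math
import re
from collections import Counter
from typing import Callable, List, Sequence


def tokenise(text: str) -> List[str]:
    # Whitespace tokenisation with punctuation stripped (including the Bangla danda).
    return [t for t in (w.strip(".,!?;:।") for w in text.lower().split()) if t]


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def tfidf_candidates(query: str, docs: List[str], k: int) -> List[int]:
    """Stage 1: return indices of the top-k documents by summed tf-idf of query terms."""
    tokenised = [tokenise(d) for d in docs]
    n_docs = len(docs)
    doc_freq = Counter(t for toks in tokenised for t in set(toks))
    scored = []
    for idx, toks in enumerate(tokenised):
        if not toks:
            continue
        tf = Counter(toks)
        score = sum(
            (tf[t] / len(toks)) * (math.log((1 + n_docs) / (1 + doc_freq[t])) + 1.0)
            for t in tokenise(query)
        )
        if score > 0:  # keep only documents that share a term with the query
            scored.append((score, idx))
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [idx for _, idx in scored[:k]]


def rerank(
    query: str,
    docs: List[str],
    candidates: List[int],
    embed: Callable[[str], Sequence[float]],
    top_m: int = 2,
) -> List[int]:
    """Stage 2: reorder candidates by the mean of their top_m sentence similarities."""
    q_vec = embed(query)
    ranked = []
    for idx in candidates:
        sentences = [s.strip() for s in re.split(r"[।.?!]", docs[idx]) if s.strip()]
        sims = sorted((cosine(q_vec, embed(s)) for s in sentences), reverse=True)
        top = sims[:top_m]
        ranked.append((sum(top) / len(top) if top else 0.0, idx))
    ranked.sort(key=lambda pair: (-pair[0], pair[1]))
    return [idx for _, idx in ranked]


# Hypothetical stand-in for a sentence encoder: counts over a tiny fixed vocabulary.
VOCAB = ["river", "flood", "rain", "cricket", "match"]


def toy_embed(text: str) -> List[float]:
    counts = Counter(tokenise(text))
    return [float(counts[w]) for w in VOCAB]
```

In practice, `toy_embed` would be replaced by a function that returns contextual sentence embeddings, and the documents and queries would be in Bangla. Calling `rerank(query, docs, tfidf_candidates(query, docs, k), embed)` chains the two stages.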

Citing

If you are using any of the resources, please cite the following paper:

@inproceedings{das-etal-2022-revisiting,
    title = "Revisiting Anwesha:Enhancing Personalised and Natural Search in {B}angla",
    author = "Das, Arup  and
      Acharya, Joyojyoti  and
      Kundu, Bibekananda  and
      Chakraborti, Sutanu",
    booktitle = "Proceedings of the 19th International Conference on Natural Language Processing (ICON)",
    month = dec,
    year = "2022",
    address = "New Delhi, India",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.icon-main.24",
    pages = "183--193",
    abstract = "Bangla is a low-resource, highly agglutinative language. Thus it is challenging to facilitate an effective search of Bangla documents. We have created a gold standard dataset containing query document relevance pairs for evaluation purposes. We utilise Named Entities to improve the retrieval effectiveness of traditional Bangla search algorithms. We suggest a reasonable starting model for leveraging implicit preference feedback based on the user search behaviour to enhance the results retrieved by the Explicit Semantic Analysis (ESA) approach. We use contextual sentence embeddings obtained via Language-agnostic BERT Sentence Embedding (LaBSE) to rerank the candidate documents retrieved by the traditional search algorithms (tf-idf) based on the top sentences that are most relevant to the query. This paper presents our empirical findings across these directions and critically analyses the results.",
}

Research Team

#	Name	Contact	LinkedIn/Website
1	Arup Das (AIDB, IITM)	cs20s016@smail.iitm.ac.in	https://www.linkedin.com/in/arup-das-90033a153/
2	Joyojyoti Acharya (AIDB, IITM)	cs21m024@smail.iitm.ac.in	https://www.linkedin.com/in/joy-iitm/
3	Bibekananda Kundu (CDAC)	bibekananda.kundu@gmail.com	https://www.linkedin.com/in/bibekananda-kundu-51205434/
4	Sutanu Chakraborti (AIDB, IITM)	sutanuc@cse.iitm.ac.in	http://www.cse.iitm.ac.in/~sutanuc/

Acknowledgement

We acknowledge the following alumni of the AIDB Laboratory, IIT Madras, for their invaluable contributions during the initial development stages of Anwesha.

#	Name	Contact	LinkedIn/Website
1	Lokasis Ghorai (AIDB, IITM)	cs20m033@smail.iitm.ac.in	https://www.linkedin.com/in/lokasis-ghorai-6b073a146/
2	Arjun Kumar Gupta (AIDB, IITM)	cs20m015@smail.iitm.ac.in	https://www.linkedin.com/in/arjun-kumar-gupta-10898117a/


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Revisiting Anwesha: Enhancing Personalised and Natural Search in Bangla

About Anwesha

Citing

Research Team

Acknowledgement

About

Releases

Packages

Contributors 2

Languages

ArupDas15/Revisiting_Anwesha

Folders and files

Latest commit

History

Repository files navigation

Revisiting Anwesha: Enhancing Personalised and Natural Search in Bangla

About Anwesha

Citing

Research Team

Acknowledgement

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages