[FEATURE][DocBot] Generate responses with our RAG pipeline #83

dtaivpp · 2023-11-01T20:15:23Z

Is your feature request related to a problem?

At the moment we have several pieces of the RAG pipeline built, but now we need to pull them all together.

What solution would you like?

Our DocBot class will call docbot.language_model:generate_response (see #80). generate_response will need to query our cohere-index and collect the response to return to the user.

https://opensearch.org/docs/latest/ml-commons-plugin/conversational-search/#using-the-pipeline

Note: interaction_size is the number of previous chats to send as context, and context_size is the number of results from our search that we will send through.

The most challenging part of this PR is that the query section will need to use a neural search in order to find the most relevant documents: https://opensearch.org/docs/latest/search-plugins/neural-text-search/#step-4-search-the-index-using-neural-search . The model_id that we need to reference there is the MODEL_ID used by our ingestion pipeline.
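A minimal sketch of the neural clause described above, written as a plain Python dict builder. The helper name build_neural_query and the default k are hypothetical, and reading MODEL_ID from an environment variable is an assumption about how the ingestion pipeline's model ID is shared. The content_embedding field name is taken from the example request later in this thread.

```python
import os
from typing import Any


def build_neural_query(question: str, model_id: str, k: int = 5) -> dict[str, Any]:
    """Return a `neural` query clause for the content_embedding field.

    `k` is the number of nearest neighbours to retrieve (hypothetical default).
    """
    return {
        "neural": {
            "content_embedding": {
                "query_text": question,
                "model_id": model_id,
                "k": k,
            }
        }
    }


# Assumption: the ingestion pipeline's model ID is exposed as the MODEL_ID env var.
model_id = os.environ.get("MODEL_ID", "<model-id-from-ingestion-pipeline>")
query = build_neural_query("How do I enable segment replication", model_id)
```

The returned dict can be placed under "query" directly, or inside a hybrid query alongside a match clause.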


LucasWang750 · 2023-11-01T20:15:53Z

I would like to take this issue

dtaivpp · 2023-12-15T19:23:05Z

Here is an example of what the search request for the generation pipeline looks like:

GET /docbot/_search
{
  "_source": {
    "exclude": [
      "content_embedding"
    ]
  },
  "query": {
    "hybrid": {
      "queries": [
        {
          "match": {
            "content": {
              "query": "How do I enable segment replication"
            }
          }
        },
        {
          "neural": {
            "content_embedding": {
              "query_text": "How do I enable segment replication",
              "model_id": "Z8VpCYwBKF5Jo_eo10QE",
              "k": 5
            }
          }
        }
      ]
    }
  },
  "ext": {
    "generative_qa_parameters": {
      "llm_model": "gpt-3.5-turbo",
      "llm_question": "How do I enable segment replication",
      "conversation_id": "JcVbCYwBKF5Jo_eoe0TD",
      "context_size": 3,
      "interaction_size": 3,
      "timeout": 45
    }
  }
}

We will need to pass in the model ID, conversation ID, and the question. Then, when we are processing the result, the answer is at response["ext"]["retrieval_augmented_generation"]["answer"].
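As a rough sketch of the two steps above, the helpers below build the generative_qa_parameters block from the question and conversation ID, and pull the answer out of a search response. The function names are hypothetical, the defaults simply mirror the example request, and returning None when the answer is missing is an assumption rather than settled DocBot behavior.

```python
from typing import Any, Optional


def build_generative_qa_ext(
    question: str,
    conversation_id: str,
    llm_model: str = "gpt-3.5-turbo",
    context_size: int = 3,
    interaction_size: int = 3,
    timeout: int = 45,
) -> dict[str, Any]:
    """Build the `ext` section of the search request (defaults copied from the example)."""
    return {
        "generative_qa_parameters": {
            "llm_model": llm_model,
            "llm_question": question,
            "conversation_id": conversation_id,
            "context_size": context_size,
            "interaction_size": interaction_size,
            "timeout": timeout,
        }
    }


def extract_answer(response: dict[str, Any]) -> Optional[str]:
    """Return response["ext"]["retrieval_augmented_generation"]["answer"], or None if absent."""
    rag = response.get("ext", {}).get("retrieval_augmented_generation", {})
    return rag.get("answer")


# Stand-in response shaped like the path described above (not real output).
sample_response = {"ext": {"retrieval_augmented_generation": {"answer": "example answer"}}}
answer = extract_answer(sample_response)
```

Keeping both helpers free of any client calls means generate_response can be tested without a running cluster; the actual search call is left to whichever client DocBot already uses.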

dtaivpp added the enhancement and untriaged labels Nov 1, 2023

dtaivpp removed the untriaged label Nov 1, 2023

dtaivpp assigned LucasWang750 Nov 1, 2023

dtaivpp added the good first issue, help wanted, and OSCI labels Nov 1, 2023

dtaivpp unassigned LucasWang750 Dec 15, 2023

ortegaa32 mentioned this issue Dec 16, 2023

Attempt to generate response in Docbot #121

Open
