[Obs AI Assistant] Error when using ollama model locally #204116

Closed
neptunian opened this issue Dec 12, 2024 · 4 comments · Fixed by #206739
Assignees: arturoliduena
Labels: bug (Fixes for quality problems that affect the customer experience), Team:AI Infra (AppEx AI Infrastructure Team), Team:Obs AI Assistant (Observability AI Assistant)

Comments

@neptunian
Contributor

neptunian commented Dec 12, 2024

Similar to the bug described in #204014, the model (llama 3.2) isn't being passed to chat completions, and the request fails with an unexpected error:

[2024-12-12T14:02:39.743-05:00][WARN ][plugins.actions.gen-ai] action execution failure: .gen-ai:8608ccb6-4e2a-4045-a729-ab4b556ea5ad: Llama: an error occurred while running the action: Status code: 400. Message: API Error: Bad Request - model is required; retry: true
[2024-12-12T14:02:39.744-05:00][ERROR][plugins.observabilityAIAssistant.service] Error: Unexpected error
at createInferenceInternalError (elastic/kibana/x-pack/platform/packages/shared/ai-infra/inference-common/src/errors.ts:49:10)
at elastic/kibana/x-pack/platform/plugins/shared/inference/server/chat_complete/adapters/openai/openai_adapter.ts:68:92
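
For context, this is reproducible directly against Ollama's OpenAI-compatible chat completions endpoint: omitting "model" from the payload produces the same 400 "model is required" seen in the connector log. A minimal sketch, assuming a default local Ollama install on port 11434 (the URL and model name here are illustrative, not part of the original report):

async function reproduce() {
  // Assumed endpoint of a local Ollama instance exposing the OpenAI-compatible API.
  const url = 'http://localhost:11434/v1/chat/completions';

  // No "model" in the payload -> the provider answers 400 "model is required",
  // which is the error surfaced by the connector above.
  const withoutModel = await fetch(url, {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({ messages: [{ role: 'user', content: 'hello' }] }),
  });
  console.log(withoutModel.status); // 400

  // The same request with an explicit model succeeds.
  const withModel = await fetch(url, {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({
      model: 'llama3.2',
      messages: [{ role: 'user', content: 'hello' }],
    }),
  });
  console.log(withModel.status); // 200
}

reproduce().catch(console.error);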

@neptunian neptunian added Team:AI Infra AppEx AI Infrastructure Team Team:Obs AI Assistant Observability AI Assistant labels Dec 12, 2024
@elasticmachine
Contributor

Pinging @elastic/appex-ai-infra (Team:AI Infra)

@elasticmachine
Contributor

Pinging @elastic/obs-ai-assistant (Team:Obs AI Assistant)

@neptunian neptunian added the bug Fixes for quality problems that affect the customer experience label Dec 12, 2024
@pgayvallet
Contributor

pgayvallet commented Dec 19, 2024

> the model isn't being passed to chat completions

So this part is working as expected. The model is not a parameter of the inference APIs, and it is not sent to the stack connector.

The stack connector is in charge of passing the information from the config to the provider during the remote call:

const executeBody = getRequestWithStreamOption(
  this.provider,
  this.url,
  body,
  stream,
  // forward the defaultModel from the connector configuration when present
  ...('defaultModel' in this.config ? [this.config.defaultModel] : [])
);

However, looking at the code forging the request, it seems that the defaultModel is only being passed for the OpenAI provider type and not for the others:

export function getRequestWithStreamOption(
  provider: string,
  url: string,
  body: string,
  stream: boolean,
  defaultModel?: string
): string {
  switch (provider) {
    case OpenAiProviderType.OpenAi:
      // defaultModel is only forwarded for the native OpenAI provider
      return openAiGetRequestWithStreamOption(url, body, stream, defaultModel!);
    case OpenAiProviderType.AzureAi:
      return azureAiGetRequestWithStreamOption(url, body, stream);
    case OpenAiProviderType.Other:
      // "Other" (the provider type used for Ollama here) never receives the default model
      return otherOpenAiGetRequestWithStreamOption(body, stream);
    default:
      return body;
  }
}

The right approach for the fix seems to be to adapt getRequestWithStreamOption and otherOpenAiGetRequestWithStreamOption; a sketch of that adaptation follows.
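
A minimal sketch of what that adaptation could look like, assuming defaultModel is simply threaded through and injected into the JSON body when the caller did not set a model (the signature change and injection logic are illustrative; the actual diff in #204934 may differ):

// Hypothetical sketch of the adapted helper: inject the connector's default
// model into the JSON body when the payload does not specify one, and set
// the stream flag as before.
function otherOpenAiGetRequestWithStreamOption(
  body: string,
  stream: boolean,
  defaultModel?: string
): string {
  const parsed = JSON.parse(body);
  return JSON.stringify({
    ...parsed,
    stream,
    // fall back to the connector's default model when none was provided
    ...(parsed.model == null && defaultModel ? { model: defaultModel } : {}),
  });
}

// In getRequestWithStreamOption, the "Other" branch would then pass it through:
//   case OpenAiProviderType.Other:
//     return otherOpenAiGetRequestWithStreamOption(body, stream, defaultModel);

// Example: a payload without a model picks up the connector's defaultModel.
console.log(
  otherOpenAiGetRequestWithStreamOption(
    JSON.stringify({ messages: [{ role: 'user', content: 'hello' }] }),
    true,
    'llama3.2'
  )
); // -> {"messages":[...],"stream":true,"model":"llama3.2"}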

@pgayvallet
Contributor

So, I opened #204934, which will take care of the issue in the connector.

That fixes the general error and allows Kibana to communicate properly with Ollama.

However, there seems to be something wrong with what the o11y assistant is doing, as the stream gets closed in the middle and fails with something that seems related to title generation:

Screen.Recording.2024-12-19.at.14.04.05.mov

The server error is, as always with RxJS, incredibly usable and totally shows where the issue is coming from:

[2024-12-19T14:04:13.554+01:00][ERROR][plugins.observabilityAIAssistant.service] Error
    at _super (/kibana/node_modules/rxjs/dist/cjs/internal/util/createErrorClass.js:7:26)
    at new EmptyErrorImpl (/kibana/node_modules/rxjs/dist/cjs/internal/util/EmptyError.js:6:5)
    at /kibana/node_modules/rxjs/dist/cjs/internal/operators/last.js:13:271
    at /kibana/node_modules/rxjs/dist/cjs/internal/operators/throwIfEmpty.js:14:86
    at OperatorSubscriber._this._complete (/kibana/node_modules/rxjs/dist/cjs/internal/operators/OperatorSubscriber.js:56:21)
    at Subscriber.complete (/kibana/node_modules/rxjs/dist/cjs/internal/Subscriber.js:69:18)

After some debug logs, this is being thrown here:

return output$.pipe(
  instrumentAndCountTokens('complete'),
  withoutTokenCountEvents(),
  catchError((error) => {
    // the error above is logged here before being re-thrown to the caller
    this.dependencies.logger.error(error);
    return throwError(() => error);
  }),

That observable is massive and we're getting out of my area of ownership though, so I'll let the @elastic/obs-ai-assistant team take a look if they so want.

pgayvallet added a commit that referenced this issue Dec 23, 2024
…4934)

## Summary

Part of #204116

When the model is not present in the payload, use the default model as specified in the connector configuration.

We were already doing that for OpenAI-OpenAI, but not for
"Other"-OpenAI.

### Some section because I downloaded ollama just for that issue

<img width="950" alt="Screenshot 2024-12-19 at 13 53 48"
src="https://github.com/user-attachments/assets/4a6e4b35-a0c5-46e5-9372-677e99d070f8"
/>

<img width="769" alt="Screenshot 2024-12-19 at 13 54 54"
src="https://github.com/user-attachments/assets/a0a5a12a-ea1e-42b7-8fa1-6531bef5ae6c"
/>
kibanamachine pushed a commit to kibanamachine/kibana that referenced this issue Dec 23, 2024
stratoula pushed a commit to stratoula/kibana that referenced this issue Jan 2, 2025
benakansara pushed a commit to benakansara/kibana that referenced this issue Jan 2, 2025
CAWilson94 pushed a commit to CAWilson94/kibana that referenced this issue Jan 13, 2025
@arturoliduena arturoliduena self-assigned this Jan 14, 2025
kibanamachine pushed a commit to kibanamachine/kibana that referenced this issue Jan 18, 2025
)

Closes elastic#204116

## Summary

fix: o11y assistant error where, when using the model (llama 3.2), the stream gets closed in the middle and fails with an error related to title generation.

(cherry picked from commit d577177)
kibanamachine added a commit that referenced this issue Jan 18, 2025
…) (#207146)

# Backport

This will backport the following commits from `main` to `8.x`:
- [[Obs AI Assistant] Error when using ollama model locally
(#206739)](#206739)

### Questions ?
Please refer to the [Backport tool
documentation](https://github.com/sqren/backport)

Co-authored-by: Arturo Lidueña <arturo.liduena@elastic.co>
cqliu1 pushed a commit to cqliu1/kibana that referenced this issue Jan 21, 2025
viduni94 pushed a commit to viduni94/kibana that referenced this issue Jan 23, 2025
viduni94 pushed a commit to viduni94/kibana that referenced this issue Jan 23, 2025