-
Notifications
You must be signed in to change notification settings - Fork 21
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
feat(Docling): prefetch model artifacts (#964)
Because - Some EasyOCR models are needed by Docling to transform PDF to Markdown. Without them, the first execution of the document component fails because the output starts by a "Downloading detection model, please wait..." print. - This also prevented coverage for the Docling converter. - The `use-docling` parameter in the document operator is less open to changes that an enum `converter` selector. This commit - Adds the EasyOCR models to the Docker images. - Corrects the integration test in the CI after [the latest changes in instill-core ](instill-ai/instill-core#1172). - Replaces the `use-docling` parameter by `converter`. The following changes are made on the `Dockerfile`: - `nobody:nogroup` needs to have a $HOME where the EasyOCR models will be placed (internally, this engine looks for the models in `~/.EasyOCR/model`). - The workdir (`/pipeline-backend`) is owned by `nobody:nogroup` in the dev image so we can run the coverage action without the root user.
- Loading branch information
Showing
14 changed files
with
172 additions
and
211 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.