Skip to content
This repository has been archived by the owner on Jan 9, 2025. It is now read-only.

feat(document): repair pdf with libreoffic #374

Closed
wants to merge 3 commits into from

Conversation

chuang8511
Copy link
Contributor

Because

  • The file could be malformed

This commit

  • repair pdf with libreoffice without using function in pdfplumber because of licence issue

Copy link

linear bot commented Sep 26, 2024

import base64
import sys

# TODO: Deal with the import error when running the code in the docker container
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@chuang8511
I think this PR is not ready, right?

Copy link
Contributor Author

@chuang8511 chuang8511 Sep 27, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

TODO: Deal with the import error when running the code in the docker container

I will deal with this part with other PR.
But, this PR is not ready because there is a bug in the container for prod. I may need to modify this PR. So, I draft it first.

@chuang8511 chuang8511 marked this pull request as draft September 27, 2024 09:55
@jvallesm
Copy link
Collaborator

Ported to instill-ai/pipeline-backend#683

@jvallesm jvallesm closed this Sep 30, 2024
donch1989 pushed a commit to instill-ai/pipeline-backend that referenced this pull request Oct 7, 2024
This is a port of instill-ai/component#374

Because

- The file could be malformed

This commit

- repair pdf with qpdf without using function in pdfplumber because of
licence issue

Note
About the Python functions, now, the command runner is executing Python
code by string rather than files.
So, the import function in Python raise error when we execute Python in
Golang.
Current workaround is not using import but combine all string together.
We will need to find a better way to develop Python functions in
pipeline-backend & component

---------

Co-authored-by: chuang8511 <a0961192312@gmail.com>
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
No open projects
Status: No status
Development

Successfully merging this pull request may close these issues.

4 participants