You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is more of a cleaning scraped data than collecting data question, but one thing I struggled in the past is how to import and clean data from PDFs, and how to scale that up for large numbers of similar PDFs.
I tried different things with importing them with pdf_text or different OCR packages but I quite never found efficient ways to then import and clean data in bulk.
Thanks!
The text was updated successfully, but these errors were encountered:
This is more of a cleaning scraped data than collecting data question, but one thing I struggled in the past is how to import and clean data from PDFs, and how to scale that up for large numbers of similar PDFs.
I tried different things with importing them with pdf_text or different OCR packages but I quite never found efficient ways to then import and clean data in bulk.
Thanks!
The text was updated successfully, but these errors were encountered: