Replies: 1 comment
-
how can we run layout model on GPU? |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
This thread is to outline and discuss the details for two goals:
We are happy to receive your feedback on the items outlined below! The results of the evaluation outlined here will be reflected in an updated Technical Report for docling 2.x.
1. Conversion speed and resource footprint
There are several dimensions to establish when assessing the runtime characteristics of docling and alternatives:
Dataset
We are considering a benchmark dataset sourced from DocLayNet and a portion of random samples from ccpdf mixed in, to make it more diverse.
The target size is ~4000 pages (i.e. <100 docs), such that benchmark runs would typically complete in one to a few hours even with a low-resource hardware profile.
Hardware / OS
To keep the number of experiments we need to drive at a reasonable level, we plan to benchmark on two system configurations:
Conversion options
We plan to define several profiles for conversion options, to show the impact of different models in the pipeline on conversion speed, e.g. with or without OCR, with or without table structure, with different PDF backends. For docling, we intend to profile the pipeline to analyze the contribution of different models/stages to the overall runtime.
Software
Docling is not out here alone. We want to evaluate and compare with popular open-source alternatives. Currently, we considder comparing to the following projects:
2. Speed optimization
From the above evaluation, we want to learn which parts of the docling pipeline need to be targeted first to achieve even better speed.
One known issue to address in this context is accelerator support. Currently, docling can partially use GPU acceleration, and will do so by default when a CUDA device is detected by torch. Among other things, we plan to:
Beta Was this translation helpful? Give feedback.
All reactions