-
Notifications
You must be signed in to change notification settings - Fork 1.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Preprocessor output files are deleted when hbs renderer is called (from mdBook > 0.3.1) #1087
Comments
@Michael-F-Bryan can you take a look at this? |
I think a solution could be extending the Renderer trait with a prepare function, which can be called before running the preprocessor. |
I wouldn't expect a preprocessor to be touching anything within another backend's output directory. That could end up being quite brittle because you're making assumptions on how the backend will render things. Your idea of lifecycle methods (e.g. A second idea is to add a section to the [build]
delete-output-directory-before-build = ["html"] The latter would probably be easier to implement, but I'm not sure if it would be scalable/extensible or considered hacky. Otherwise, instead of generating image files, @sytsereitsma what are your thoughts? Do any of those solutions sound easier/better to you? I'm happy to make PRs for any |
@Michael-F-Bryan I'll change my preprocessor to use data URIs to avoid the issue altogether. You have a nice point where my preprocessor should be independent of the renderer. I do think the lifecycle methods can avoid the problem (e.g. for other preprocessors). It is very difficult to troubleshoot for a regular user because it just seems as if the preprocessor does not work properly. Didn't know about data URIs, thanks for the tip 👍 |
See sytsereitsma/mdbook-plantuml#7 (comment). You can write error messages to STDERR and return a non-zero exit code from the preprocessor to let users know about errors.
If we were to go down the lifecycle method path, can you think of any other lifecycle methods we may want to add to either the preprocessor or renderer? |
As far as my preprocessor is concerned everything is working fine. It's just that the output gets deleted once it is done :-) Regarding the lifecycle methods I do not see any other sensible method than before_preprocessor when looking at MDBook::execute_build_process (src/book/mod.rs). I have one more suggestion. Since deleting a directory is a significant operation, I'd add a log message when performing the deletion (maybe even an info message). Regarding fixing my preprocessor I think I will copy the files to the source dir of the book to be renderer agnostic. The maximum size of data URIs differs between browsers (according to MDN opera only allows 65k). Next to that it is likely that another renderer does not support data URIs. |
I ran into this issue as well with https://github.com/dylanowen/mdbook-graphviz I "solved" it by putting my output in the src directory which is pretty gross. As for suggestions of what we could do. I like the model we have now where we're always operating across stdin -> stdout so chaining / modifying markdown works nicely. What about extending I'm not sure how we'd serialize the resource files across the json boundary, and I'm not sure about backwards compatibility but it would give preprocessors more power to interact with the various files without needing to modify the src or the output directories. |
I like the idea of adding a "resource" section to the It wouldn't be backwards compatible if we were to add all non-markdown files as I agree that writing into the |
Something that came to mind.... Images can get rather large and plentiful, meaning the amount of data to transfer over stdio (and json) may get quite large. How about outputting to a 'preprocessor-resources' directory, which gets copied into the book output dir by the renderer? |
Hi Guys, I'd like to see this resolved. The serve command gets into an infinite regeneration loop because the files in the src dir are modified by the preprocessor (with the workaround I created). I'm more than happy to create a PR for this, but I'd like to have consensus on the path to follow. As discussed above we have the following options (so far):
Sytse |
That's a good point about transferring large amounts of data over stdio, I've been unable to find much info on the limits or performance hits for it though. My rationale for passing data through stdout/in was that it would give preprocessors more power to interact with each other. For example we could create a "Thumbnail Preprocessor" that would take in images in the src directory and generate a thumbnails page to reference them. In the current model this is possible, but it'd be great if the output of |
I'm not too worried about the stdio overhead compared to the whole sequence of events (i.e. generate a file, load it in memory, zip and encode it transfer it, extract and save it). My main concern is the unneeded data serialization/deserialization, even though that probably is a negligible overhead compared to the graphviz rendering. So time to put the assumptions to the test and generate some data to base the decisions on. I'll do some spikes on how serde json handles large string values and see if I can collect some metrics on different strategies. |
How about a slightly different approach, where in the first step of the build process, the |
Should solve the issue for this preprocesssor and create a better user-experience. Also enables other downstream renderers to work agnostically and not only the html renderer. - rust-lang/mdBook#1087
Should solve the issue for this preprocesssor and create a better user-experience. Also enables other downstream renderers to work agnostically and not only the html renderer. - rust-lang/mdBook#1087
Should solve the issue for this preprocesssor and create a better user-experience. Also enables other downstream renderers to work agnostically and not only the html renderer. Can convert text files to data-uri but will still defer to the original text embedding method to preserve historical functionality - rust-lang/mdBook#1087
* Creating DataURI for image inlining Should solve the issue for this preprocesssor and create a better user-experience. Also enables other downstream renderers to work agnostically and not only the html renderer. Can convert text files to data-uri but will still defer to the original text embedding method to preserve historical functionality - rust-lang/mdBook#1087 * Made the use of data URIs configurable * Cargo fmt and compiler warnings * Do not create data uri images in book src dir Create them in the .mdbook-plantuml-cache dir in the book root instead * Update e2e tests * Use markdown output instead of <img> tags and allow clicking Clicking appears to be blocked for data uris on most browsers Co-authored-by: Paul FREAKN Baker <paul-nelson-baker@users.noreply.github.com>
Hi,
I have a preprocessor (mdbook-plantuml) which creates files in the book/img directory. The changes in issue #985, remove these files again (the destination.exists() if clause in Renderer::render(...) in src/renderer/html_handlebars/hbs_renderer.rs).
This kind of breaks my preprocessor :-(
Any ideas on how this could/should be fixed?
Sytse
The text was updated successfully, but these errors were encountered: