You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
HTTrack appears to write the URL in new.txt escaped (e.g. spaces replaced with %20) but unescaped in new.zip. This causes cache lookup error when the two forms do not match:
Exception in thread "main" java.io.IOException: no cache entry: http://example.org/some%20file.jpg
at au.gov.nla.httrack2warc.httrack.HttrackCrawl.buildRecord(HttrackCrawl.java:148)
It appears in the new.txt entry context HTTrack is escaping the following characters:
spaces
double-quotes
character codes <= 31
character codes >= 127
Notably this does not include the % character. Therefore this transformation is not safely reversible.
The text was updated successfully, but these errors were encountered:
HTTrack appears to write the URL in new.txt escaped (e.g. spaces replaced with %20) but unescaped in new.zip. This causes cache lookup error when the two forms do not match:
It appears in the new.txt entry context HTTrack is escaping the following characters:
Notably this does not include the % character. Therefore this transformation is not safely reversible.
The text was updated successfully, but these errors were encountered: