ASCIIHexDecode should be vectorized #8

sambitdash · 2017-07-24T06:12:53Z

The conversion is relatively simple. Hence, should be made a vector operation and not byte by byte read.

wstaelens · 2017-10-20T10:01:31Z

@sambitdash can you explain more how you can "vectorize" this to make it a vector operation?

sambitdash · 2017-10-20T11:23:25Z

The vector operation code generation is to be carried out by the compiler not human optimized as the code needs to be run on varied disparate processors. Most processors support vector operations like SIMD if the code does not have unnecessary branching. However, if you are look at the byte operation code it's full of branching. A sample code for this can be seen in the comment:

JuliaLang/julia#23267 (comment)

However, the generated code was not reviewed for potential further optimizations. Hope this explains what is needed.

Also note that you may not need error handling in stream operations but can repair errors as streams may have corrupt data. So error recovery may be desirable than exit on failure.

sambitdash · 2018-05-25T10:48:13Z

This may not be needed as PDF spec does not mandate the streams to have even number of hexits. It can also have control characters like CR-LF. So the branching may be significantly higher. Closing now.

sambitdash · 2018-05-25T10:49:13Z

Current implementation is about 60ms for a 10Mb stream on i7 processor.

Julia Version 0.6.2
Commit d386e40c17 (2017-12-13 18:08 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
  CPU: Intel(R) Core(TM) i7-6500U CPU @ 2.50GHz
  WORD_SIZE: 64
  BLAS: libopenblas (USE64BITINT DYNAMIC_ARCH NO_AFFINITY Haswell)
  LAPACK: libopenblas64_
  LIBM: libopenlibm
  LLVM: libLLVM-3.9.1 (ORCJIT, skylake)

sambitdash added the enhancement label Oct 2, 2017

sambitdash closed this as completed May 25, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ASCIIHexDecode should be vectorized #8

ASCIIHexDecode should be vectorized #8

sambitdash commented Jul 24, 2017

wstaelens commented Oct 20, 2017

sambitdash commented Oct 20, 2017 •

edited

Loading

sambitdash commented May 25, 2018

sambitdash commented May 25, 2018 •

edited

Loading

ASCIIHexDecode should be vectorized #8

ASCIIHexDecode should be vectorized #8

Comments

sambitdash commented Jul 24, 2017

wstaelens commented Oct 20, 2017

sambitdash commented Oct 20, 2017 • edited Loading

sambitdash commented May 25, 2018

sambitdash commented May 25, 2018 • edited Loading

sambitdash commented Oct 20, 2017 •

edited

Loading

sambitdash commented May 25, 2018 •

edited

Loading