- Support mixed usage of CPU and GPU.
- Support ONNX format.
- Support ARM Linux development board.
- Support CPU quantization.
- Update DSP library.
- Add `Depthwise Deconvolution` of Caffe.
- Add documents about debug and benchmark.
- Fix bugs.
- Remove all APIs in mace_runtime.h
- Add GPUContext and GPUContextBuilder API.
- Add MaceEngineConfig API.
- Add MaceStatus API.
- MaceTensor supports data format.
Thanks to the following contributors, whose code makes MACE better.
ByronHsu, conansherry, jackwish, herbakamil, tomaszkaliciak, oneTaken, madhavajay, wayen820, idstein, newway1995.
- New work flow and documents.
- Separate the model library from MACE library.
- Reduce the size of static and dynamic library.
- Support `ArgMax` operations.
- Support `Deconvolution` of Caffe.
- Support NDK-17b.
- Use a file to store OpenCL tuned parameters and add the `SetOpenCLParameterPath` API.
- Add a new `MaceEngine::Init` API with model data file.
- Do not unmap the model data file when loading a model from files with the CPU runtime.
- 2D LWS tuning does not work.
- Winograd convolution on GPU failed when tuning was enabled.
- Incorrect dynamic library of host.
Thanks to the following contributors, whose code makes MACE better.
Zero King(@l2dy), James Bie(@JamesBie), Sun Aries(@SunAriesCN), Allen(@allen0125), conansherry(@conansherry), 黎明灰烬(@jackwish)
- Change build and run tools
- Handle runtime failure
- Change the interface that reports the error type
- Improve CPU performance
- Merge CPU/GPU engines into one
- Support `float` `data_type` when running on GPU
- Return a status instead of aborting when allocation fails
- Change MACE header interfaces to include only necessary methods.
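
The "return a status instead of aborting" change listed above can be illustrated with a minimal C++ sketch. The `Status` enum and the `AllocateBuffer` helper are hypothetical names made up for this illustration; they are not MACE's actual `MaceStatus` type or any MACE API, and the real implementation may differ.

```cpp
#include <cstddef>
#include <cstdint>
#include <memory>
#include <new>
#include <utility>

// Hypothetical status codes for illustration only (not MACE's MaceStatus).
enum class Status { kSuccess, kInvalidArgs, kOutOfResources };

// Tries to allocate `bytes` bytes and hand them to `*out`.
// On failure it returns an error status so the caller can decide what to do,
// instead of terminating the process.
Status AllocateBuffer(std::size_t bytes,
                      std::unique_ptr<std::uint8_t[]> *out) {
  if (out == nullptr || bytes == 0) {
    return Status::kInvalidArgs;
  }
  // std::nothrow makes operator new return nullptr rather than throw
  // when the allocation cannot be satisfied.
  std::unique_ptr<std::uint8_t[]> buffer(new (std::nothrow)
                                             std::uint8_t[bytes]);
  if (!buffer) {
    return Status::kOutOfResources;
  }
  *out = std::move(buffer);
  return Status::kSuccess;
}
```

A caller checks the returned value and can fall back (for example, to a smaller buffer or another device) rather than crash.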