Post-training Optimization Tool
Starting with the 2020.1 release, the OpenVINO™ toolkit delivers the Post-Training Optimization Tool, designed to accelerate inference of deep learning models by converting them into a more hardware-friendly representation using methods that do not require re-training, such as post-training quantization. For more details about the low-precision flow in OpenVINO™, refer to the Low Precision Optimization Guide.
The Post-Training Optimization Tool includes a standalone command-line tool and a Python* API that provide the following key features:
- Two supported post-training quantization algorithms: fast DefaultQuantization and precise AccuracyAwareQuantization, as well as multiple experimental methods.
- Global optimization of post-training quantization parameters using Tree-structured Parzen Estimator.
- Symmetric and asymmetric quantization schemes. For more details, see the Quantization section.
- Per-channel quantization for Convolutional and Fully-Connected layers.
- Multiple domains: Computer Vision, Recommendation Systems.
- Ability to implement custom calibration pipeline via supported API.
- Compression for different hardware targets such as CPU, GPU, and VPU.
- Post-training sparsity.
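To make the difference between the two quantization schemes listed above concrete, here is a minimal numeric sketch of 8-bit symmetric vs. asymmetric quantization. This is illustrative only and is not the tool's internal implementation: symmetric quantization maps [-max|x|, +max|x|] to signed [-127, 127] with a zero-point of 0, while asymmetric quantization maps [min, max] to unsigned [0, 255] with a non-zero shift.

```python
# Illustrative sketch (not POT's actual code) of the two supported schemes.

def quantize_symmetric(x, num_bits=8):
    # Scale chosen so the largest magnitude maps to the edge of the int range;
    # zero-point is implicitly 0, so real 0.0 quantizes exactly to 0.
    qmax = 2 ** (num_bits - 1) - 1            # 127 for 8 bits
    scale = max(abs(v) for v in x) / qmax
    return [round(v / scale) for v in x], scale

def quantize_asymmetric(x, num_bits=8):
    # The full [min, max] range is mapped onto [0, 2^bits - 1]; a zero-point
    # records where real 0.0 lands, which helps skewed activation ranges.
    qmax = 2 ** num_bits - 1                  # 255 for 8 bits
    lo, hi = min(x), max(x)
    scale = (hi - lo) / qmax
    zero_point = round(-lo / scale)
    return [round(v / scale) + zero_point for v in x], scale, zero_point

x = [-0.5, 0.0, 1.0]
print(quantize_symmetric(x)[0])   # [-64, 0, 127]
print(quantize_asymmetric(x)[0])  # [0, 85, 255]
```

Note how the asymmetric scheme spends its full 256 levels on the actual [-0.5, 1.0] range, while the symmetric scheme wastes the levels below -64 because the range is forced to be symmetric around zero.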
Usage
System requirements
- Ubuntu 18.04 or later (64-bit)
- Python 3.6 or later
- OpenVINO
Installation (Temporary)
- Clone the compression tool repo:
  git clone git@gitlab-icv.inn.intel.com:algo/post-training-compression-tool.git
- Download submodules:
  git submodule init
  git submodule update
- Clone the DLDT repo (not into the post-training-compression-tool directory):
  git clone https://github.com/openvinotoolkit/openvino
- Switch DLDT to the required branch:
  feature/low_precision/develop_fp_v10
- Build the Inference Engine (instructions can be found in the DLDT repo).
- Switch DLDT to the mkaglins/poc branch. (The Inference Engine is built from the feature/low_precision/develop_fp_v10 branch to support FakeQuantize layers, while the Model Optimizer is used from the mkaglins/poc branch. So stay on mkaglins/poc once you have built the Inference Engine and do not build it again from there.)
- Set the PYTHONPATH variable:
  export PYTHONPATH=<path to DLDT bins>/bin/intel64/Release/lib/python_api/python3.6:<path to DLDT>/dldt/model-optimizer
- Install requirements for the accuracy checker:
  - From the POT root:
    cd ./thirdparty/open_model_zoo/tools/accuracy_checker
  - Call the setup script:
    python3 setup.py install
  - Get back to the POT root dir:
    cd <PATH_TO_POT_DIR>
- Install requirements for the tool:
  - From the POT root, call the setup script:
    python3 setup.py install
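The PYTHONPATH export above works because Python prepends every PYTHONPATH entry to sys.path of each new interpreter process, which is how the tool finds the Inference Engine Python API and the Model Optimizer without installing them as packages. A small sketch of that mechanism (the directory below is a placeholder, not a real path):

```python
import os
import subprocess
import sys

# Placeholder standing in for
# "<path to DLDT bins>/bin/intel64/Release/lib/python_api/python3.6".
entry = "/opt/dldt/bin/intel64/Release/lib/python_api/python3.6"

# Launch a child interpreter with PYTHONPATH set, as the export does, and
# check that the entry is visible on the child's sys.path.
env = dict(os.environ, PYTHONPATH=entry)
out = subprocess.run(
    [sys.executable, "-c",
     "import sys, os; print(os.environ['PYTHONPATH'] in sys.path)"],
    env=env, capture_output=True, text=True,
).stdout.strip()
print(out)  # True
```

This is also why the export must be repeated (or put in your shell profile) in every new terminal session before running the tool.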
Run
- Prepare a configuration file for the tool based on the examples in the configs folder
- Navigate to the compression tool directory
- Launch the tool running the following command:
python3 main.py -c <path to config file> -e
To test the tool, you can use the PyTorch Mobilenet_v2 model from tests/data/models/mobilenetv2/mobilenetv2.onnx.
- If there are errors with imports in the Model Optimizer, first of all do the following:
  - Check out the mkaglins/poc branch in DLDT (this is important!)
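For reference, a configuration file like the one prepared in the Run steps typically has model, engine, and compression sections. The sketch below is an assumption based on typical POT configurations, not a schema guarantee for this (temporary) version; the authoritative examples are the ones in the configs folder, and all paths here are placeholders.

```json
{
    "model": {
        "model_name": "mobilenetv2",
        "model": "<path to the .xml IR file>",
        "weights": "<path to the .bin IR file>"
    },
    "engine": {
        "config": "<path to the accuracy checker config>"
    },
    "compression": {
        "target_device": "CPU",
        "algorithms": [
            {
                "name": "DefaultQuantization",
                "params": {
                    "preset": "performance",
                    "stat_subset_size": 300
                }
            }
        ]
    }
}
```

Swapping "DefaultQuantization" for "AccuracyAwareQuantization" would select the precise algorithm mentioned in the key features, at the cost of longer calibration.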