# Auto-Device Plugin

## Auto-Device Plugin Execution (C++)
@sphinxdirective

.. raw:: html

   <div id="switcher-cpp" class="switcher-anchor">C++</div>

@endsphinxdirective
The AUTO device is a new, special "virtual" or "proxy" device in the OpenVINO™ toolkit.
Use "AUTO" as the device name to delegate selection of an actual accelerator to OpenVINO. The Auto-device plugin internally recognizes and selects devices from among CPU, integrated GPU and discrete Intel GPUs (when available) depending on the device capabilities and the characteristics of CNN models (for example, precision). Then the Auto-device assigns inference requests to the selected device.
From the application's point of view, this is just another device that handles all accelerators in the full system.
With the 2021.4 release, Auto-device setup is done in three major steps:
1. Configure each device as usual (for example, via the conventional `SetConfig()` method).
2. Load a network to the Auto-device plugin. This is the only change needed in your application.
3. As with any other executable network resulting from `LoadNetwork()`, create as many requests as needed to saturate the devices.
These steps are covered below in detail.
### Defining and Configuring the Auto-Device Plugin
Following the OpenVINO convention for device names, the Auto-device uses the label "AUTO". The only configuration option for the Auto-device is a limited device list:
| Parameter name | Parameter values | Default | Description |
|---|---|---|---|
| "MULTI_DEVICE_PRIORITIES" | comma-separated device names with no spaces | N/A | Device candidate list to be selected |
You can use the configuration name directly as a string or use `InferenceEngine::MultiDeviceConfigParams::KEY_MULTI_DEVICE_PRIORITIES` from `multi-device/multi_device_config.hpp`, which defines the same string.
There are two ways to use the Auto-device:

1. Directly indicate the device with "AUTO" or an empty string:

@snippet snippets/AUTO0.cpp part0

2. Use the Auto-device configuration:

@snippet snippets/AUTO1.cpp part1
Both methods allow limiting the list of device candidates for the AUTO plugin.
> **NOTE**: The Inference Engine lets you use "GPU" as an alias for "GPU.0" in function calls.
The Auto-device plugin supports querying device optimization capabilities as a metric:
| Parameter name | Parameter values |
|---|---|
| "OPTIMIZATION_CAPABILITIES" | Auto-Device capabilities |
### Enumerating Devices and Selection Logic
The Inference Engine now features a dedicated API to enumerate devices and their capabilities. See the Hello Query Device C++ Sample. Below is example output from the sample (truncated to device names only):
```sh
./hello_query_device
Available devices:
    Device: CPU
...
    Device: GPU.0
...
    Device: GPU.1
```
#### Default Auto-Device Selection Logic
With the 2021.4 release, the Auto-Device selects the most suitable device using the following default logic:
1. Check if dGPU (discrete), iGPU (integrated) and CPU devices are available.
2. Get the precision of the input model, such as FP32.
3. According to the priority of dGPU, iGPU, and CPU (in this order), if the device supports the precision of the input network, select it as the most suitable device.
For example, CPU, dGPU and iGPU can support the following precision and optimization capabilities:
| Device | OPTIMIZATION_CAPABILITIES |
|---|---|
| CPU | WINOGRAD, FP32, FP16, INT8, BIN |
| dGPU | FP32, BIN, BATCHED_BLOB, FP16, INT8 |
| iGPU | FP32, BIN, BATCHED_BLOB, FP16, INT8 |
- When the application uses the Auto-device to run FP16 IR on a system with CPU, dGPU and iGPU, Auto-device will offload this workload to dGPU.
- When the application uses the Auto-device to run FP16 IR on a system with CPU and iGPU, Auto-device will offload this workload to iGPU.
- When the application uses the Auto-device to run WINOGRAD-enabled IR on a system with CPU, dGPU and iGPU, Auto-device will offload this workload to CPU.
If loading the network to the dGPU or iGPU fails, the CPU is used as the fallback choice.
According to the Auto-device selection logic from the previous section, tell the Inference Engine to use the most suitable device from available devices as follows:
@snippet snippets/AUTO2.cpp part2
You can also use the Auto-device plugin to choose a device from a limited choice of devices, in this example CPU and GPU:
@snippet snippets/AUTO3.cpp part3
### Configuring the Individual Devices and Creating the Auto-Device on Top
It is possible to configure each individual device as usual and create the "AUTO" device on top:
@snippet snippets/AUTO4.cpp part4
Alternatively, you can combine all the individual device settings into a single config and load it, allowing the Auto-device plugin to parse and apply the settings to the right devices. See the code example below:
@snippet snippets/AUTO5.cpp part5
### Using the Auto-Device with OpenVINO Samples and Benchmark App
Note that every OpenVINO sample or application that supports the "-d" (device) command-line option transparently accepts the Auto-device. The Benchmark App is the best example of optimal Auto-device usage: you do not need to set the number of requests or CPU threads, as the application provides optimal out-of-the-box performance. Below is an example command line for evaluating AUTO performance with it:
@sphinxdirective

.. tab:: Package, Docker, open-source installation

   .. code-block:: sh

      ./benchmark_app.py -d AUTO -m <model>

.. tab:: pip installation

   .. code-block:: sh

      benchmark_app -d AUTO -m <model>

@endsphinxdirective
You can also use the Auto-device with a limited device choice:
@sphinxdirective

.. tab:: Package, Docker, open-source installation

   .. code-block:: sh

      ./benchmark_app.py -d AUTO:CPU,GPU -m <model>

.. tab:: pip installation

   .. code-block:: sh

      benchmark_app -d AUTO:CPU,GPU -m <model>

@endsphinxdirective
> **NOTES**:
> - The default number of CPU streams is 1 if using `-d AUTO`.
> - You can use the FP16 IR to work with the Auto-device.
> - No demos are fully optimized for the Auto-device yet (that is, none select the most suitable device, use GPU streams/throttling, and so on).
## Auto-Device Plugin Execution (Python)
@sphinxdirective

.. raw:: html

   <div id="switcher-python" class="switcher-anchor">Python</div>

@endsphinxdirective
The AUTO device is a new, special "virtual" or "proxy" device in the OpenVINO™ toolkit.
Use "AUTO" as the device name to delegate selection of an actual accelerator to OpenVINO. The Auto-device plugin internally recognizes and selects devices from among CPU, integrated GPU and discrete Intel GPUs (when available) depending on the device capabilities and the characteristics of CNN models (for example, precision). Then the Auto-device assigns inference requests to the selected device.
From the application's point of view, this is just another device that handles all accelerators in the full system.
With the 2021.4 release, Auto-device setup is done in three major steps:
1. Configure each device as usual (for example, via the conventional `IECore.set_config` method).
2. Load a network to the Auto-device plugin. This is the only change needed in your application.
3. As with any other executable network resulting from `IECore.load_network`, create as many requests as needed to saturate the devices.
These steps are covered below in detail.
### Defining and Configuring the Auto-Device Plugin
Following the OpenVINO convention for device names, the Auto-device uses the label "AUTO". The only configuration option for the Auto-device is a limited device list:
| Parameter name | Parameter values | Default | Description |
|---|---|---|---|
| "AUTO_DEVICE_LIST" | comma-separated device names with no spaces | N/A | Device candidate list to be selected |
There are two ways to use the Auto-device plugin:
1. Directly indicate the device with "AUTO" or an empty string.
2. Use the Auto-device configuration.
Both methods allow limiting the list of device candidates for the AUTO plugin.
```python
from openvino.inference_engine import IECore

ie = IECore()
# Read a network in IR or ONNX format
net = ie.read_network(model=path_to_model)

# Load a network on the "AUTO" device
exec_net = ie.load_network(network=net, device_name="AUTO")

# Optionally specify the list of device candidates for the AUTO plugin
# The following two lines are equivalent
exec_net = ie.load_network(network=net, device_name="AUTO:CPU,GPU")
exec_net = ie.load_network(network=net, device_name="AUTO",
                           config={"AUTO_DEVICE_LIST": "CPU,GPU"})
```
The Auto-device plugin supports querying device optimization capabilities as a metric:
| Parameter name | Parameter values |
|---|---|
| "OPTIMIZATION_CAPABILITIES" | Auto-Device capabilities |
### Enumerating Devices and Selection Logic
The Inference Engine now features a dedicated API to enumerate devices and their capabilities. See the Hello Query Device Python Sample for code.
Below is example output from the sample (truncated to device names only):
```sh
./hello_query_device
Available devices:
    Device: CPU
...
    Device: GPU.0
...
    Device: GPU.1
```
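Besides running the sample, you can enumerate the same devices directly from Python. A minimal sketch using `IECore.available_devices` (the device list will vary by machine):

```python
from openvino.inference_engine import IECore

ie = IECore()
# Prints entries such as CPU, GPU.0, GPU.1, depending on the machine
for device in ie.available_devices:
    print(f"Device: {device}")
```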
#### Default Auto-Device Selection Logic
With the 2021.4 release, the Auto-Device selects the most suitable device using the following default logic:
1. Check if dGPU (discrete), iGPU (integrated) and CPU devices are available.
2. Get the precision of the input model, such as FP32.
3. According to the priority of dGPU, iGPU, and CPU (in this order), if the device supports the precision of the input network, select it as the most suitable device.
For example, CPU, dGPU and iGPU can support the following precision and optimization capabilities:
| Device | OPTIMIZATION_CAPABILITIES |
|---|---|
| CPU | WINOGRAD, FP32, FP16, INT8, BIN |
| dGPU | FP32, BIN, BATCHED_BLOB, FP16, INT8 |
| iGPU | FP32, BIN, BATCHED_BLOB, FP16, INT8 |
- When the application uses the Auto-device to run FP16 IR on a system with CPU, dGPU and iGPU, Auto-device will offload this workload to dGPU.
- When the application uses the Auto-device to run FP16 IR on a system with CPU and iGPU, Auto-device will offload this workload to iGPU.
- When the application uses the Auto-device to run WINOGRAD-enabled IR on a system with CPU, dGPU and iGPU, Auto-device will offload this workload to CPU.
If loading the network to the dGPU or iGPU fails, the CPU is used as the fallback choice.
To show the capabilities of a specific device, query the "OPTIMIZATION_CAPABILITIES" metric:

```python
from openvino.inference_engine import IECore

ie = IECore()
# Query the optimization capabilities of a device, for example the CPU
capabilities = ie.get_metric(device_name="CPU",
                             metric_name="OPTIMIZATION_CAPABILITIES")
```
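The selection logic itself runs inside the plugin, but the following sketch approximates the priority rules described above. It is illustrative only: the `pick_device_like_auto` helper is not a plugin API, and the assumption that GPU.1 is the discrete GPU and GPU.0 the integrated one holds only on some systems:

```python
from openvino.inference_engine import IECore

def pick_device_like_auto(ie, required_precision):
    """Approximate AUTO's default logic: prefer dGPU, then iGPU, then CPU,
    taking the first available device that supports the model precision."""
    available = ie.available_devices
    # Assumed mapping: GPU.1 = discrete GPU, GPU.0 = integrated GPU
    for device in ["GPU.1", "GPU.0", "CPU"]:
        if device not in available:
            continue
        capabilities = ie.get_metric(device_name=device,
                                     metric_name="OPTIMIZATION_CAPABILITIES")
        if required_precision in capabilities:
            return device
    return "CPU"  # fallback when no device in the list supports the precision

ie = IECore()
print(pick_device_like_auto(ie, "FP16"))
```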
### Configuring the Individual Devices and Creating the Auto-Device on Top
It is possible to configure each individual device as usual and create the "AUTO" device on top:
```python
from openvino.inference_engine import IECore

ie = IECore()
net = ie.read_network(model=path_to_model)

cpu_config = {}
gpu_config = {}
ie.set_config(config=cpu_config, device_name="CPU")
ie.set_config(config=gpu_config, device_name="GPU")

# Load the network to the AUTO device
exec_net = ie.load_network(network=net, device_name="AUTO")
```
Alternatively, you can combine all the individual device settings into a single config and load it, allowing the Auto-device plugin to parse and apply the settings to the right devices. See the code example below:
```python
from openvino.inference_engine import IECore

# Init the Inference Engine Core
ie = IECore()
# Read a network in IR or ONNX format
net = ie.read_network(model=path_to_model)

full_config = {}
# Load the network to the AUTO device
exec_net = ie.load_network(network=net, device_name="AUTO", config=full_config)
```
### Using the Auto-Device with OpenVINO Samples and Benchmark App
Note that every OpenVINO sample or application that supports the "-d" (device) command-line option transparently accepts the Auto-device. The Benchmark App is the best example of optimal Auto-device usage: you do not need to set the number of requests or CPU threads, as the application provides optimal out-of-the-box performance. Below is an example command line for evaluating AUTO performance with it:
@sphinxdirective

.. tab:: Package, Docker, open-source installation

   .. code-block:: sh

      ./benchmark_app.py -d AUTO -m <model>

.. tab:: pip installation

   .. code-block:: sh

      benchmark_app -d AUTO -m <model>

@endsphinxdirective
You can also use the Auto-device with a limited device choice:
@sphinxdirective

.. tab:: Package, Docker, open-source installation

   .. code-block:: sh

      ./benchmark_app.py -d AUTO:CPU,GPU -m <model>

.. tab:: pip installation

   .. code-block:: sh

      benchmark_app -d AUTO:CPU,GPU -m <model>

@endsphinxdirective
> **NOTE**: If you installed OpenVINO with pip, use `benchmark_app -d AUTO:CPU,GPU -m <model>`.