Feature/merge 2021 3 to master (#5307)
* Feature/azaytsev/cldnn doc fixes (#4600)
* Legal fixes, removed the Generating docs section
* Removed info regarding generating docs
Co-authored-by: Trawinski, Dariusz <dariusz.trawinski@intel.com>
* Feature/azaytsev/gna model link fixes (#4599)
* Added info on DockerHub CI Framework
* Feature/azaytsev/change layout (#3295)
* Changes according to feedback comments
* Replaced @ref's with html links
* Fixed links, added a title page for installing from repos and images, fixed formatting issues
* Added links
* minor fix
* Added DL Streamer to the list of components installed by default
* Link fixes
* Link fixes
* ovms doc fix (#2988)
* added OpenVINO Model Server
* ovms doc fixes
Co-authored-by: Trawinski, Dariusz <dariusz.trawinski@intel.com>
* Updated openvino_docs.xml
* Link Fixes
Co-authored-by: Trawinski, Dariusz <dariusz.trawinski@intel.com>
* Fix for broken CC in CPU plugin (#4595)
* Azure CI: Add "ref: releases/2021/3"
* Fixed clone rt info (#4597)
* [.ci/azure] Enable CC build (#4619)
* Formula fix (#4624)
* Fixed transformation to pull constants into Loop body (cherry-pick of PR 4591) (#4607)
* Cherry-pick of PR 4591
* Fixed typo
* Moved a check into the parameter_unchanged_after_iteration function
* Fixed KW hits (#4638)
* [CPU] Supported ANY layout for inputs in inferRequest (#4621)
* [.ci/azure] Add windows_conditional_compilation.yml (#4648) (#4655)
* Fix for MKLDNN constant layers execution (#4642)
* Fix for MKLDNN constant layers execution
* Single mkldnn::engine for all MKLDNN graphs
* Add workaround for control edges to support TF 2.4 RNN (#4634)
Signed-off-by: Roman Kazantsev <roman.kazantsev@intel.com>
* Corrected PyYAML dependency (#4598) (#4620)
5.4.2 is absent on PyPI
* [CPU] Statically analyzed issues. (#4637)
* Docs api (#4657)
* Updated API changes document
* Comment for CVS-49440
* Add documentation on how to convert QuartzNet model (#4664)
* Add documentation on how to convert QuartzNet model (#4422)
* Add documentation on how to convert QuartzNet model
* Apply review feedback
* Small fix
* Apply review feedback
* Apply suggestions from code review
Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
* Add reference to file
Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
* Fixed bug in assign elimination transformation. (#4644)
* [doc] Updated PyPI support OSes (#4643) (#4662)
* [doc] Updated PyPI support OSes (#4643)
* Updated PyPI support OSes
* Added python versions for win and mac
* Update pypi-openvino-dev.md
* Update pypi-openvino-dev.md
* Update pypi-openvino-rt.md
* Update pypi-openvino-dev.md
Co-authored-by: Andrey Zaytsev <andrey.zaytsev@intel.com>
* [IE][VPU]: Fix empty output of CTCGreedyDecoderSeqLen (#4653)
* Allow the second output of CTCGreedyDecoderSeqLen to be nullptr in cases when it is not used but calculated in the Myriad plugin. In this case, parse the second output as FakeData
* It is a cherry-pick of #4652
* Update the firmware to release version
* [VPU] WA for Segmentation fault on dlclose() issue (#4645)
* Document TensorFlow 2* Update: Layers Support and Remove Beta Status (#4474) (#4711)
* Document TensorFlow 2* Update: Layers Support and Remove Beta Status
Signed-off-by: Roman Kazantsev <roman.kazantsev@intel.com>
* Update documentation based on latest test results and feedback
Signed-off-by: Roman Kazantsev <roman.kazantsev@intel.com>
* Remove ConvLSTM2D from supported layers list
Signed-off-by: Roman Kazantsev <roman.kazantsev@intel.com>
* Document Dot layer without limitation
Signed-off-by: Roman Kazantsev <roman.kazantsev@intel.com>
* Address feedback upon DenseFeatures and RNN operations
Signed-off-by: Roman Kazantsev <roman.kazantsev@intel.com>
* Do a grammar correction
Signed-off-by: Roman Kazantsev <roman.kazantsev@intel.com>
* Do a grammar correction based on feedback
Signed-off-by: Roman Kazantsev <roman.kazantsev@intel.com>
* Updated nGraph custom op documentation (#4604)
* Updated nGraph custom op documentation
* Fixed comments
* [IE CLDNN] Fix missing variable initializations and types (#4669)
* Fix NormalizeL2 creation in QueryNetwork (cherry pick from master PR 4310) (#4651)
* Updated documentation about the supported YOLOv3 model from ONNX (#4722) (#4726)
* Restored folded Operations for QueryNetwork (#4685)
* Restored folded Operations for QueryNetwork
* Fixed comment
* Add unfolded constant operations to supported layers map
* Add STN to list of supported models (#4728)
* Fix python API for Loop/TensorIterator/Assign/ReadValue operations
* Catch std::except in fuzz tests (#4695)
Fuzz tests must catch all expected exceptions from IE. IE is using C++ std
library which may raise standard exceptions which IE pass through.
* Docs update (#4626)
* Updated latency case desc to cover multi-socket machines
* updated opt guide a bit
* avoiding '#' which is interpreted as ref
* Update CPU.md
* Update docs/optimization_guide/dldt_optimization_guide.md
Co-authored-by: Alina Alborova <alina.alborova@intel.com>
* Update docs/optimization_guide/dldt_optimization_guide.md
Co-authored-by: Alina Alborova <alina.alborova@intel.com>
* Update docs/optimization_guide/dldt_optimization_guide.md
Co-authored-by: Alina Alborova <alina.alborova@intel.com>
* Update docs/optimization_guide/dldt_optimization_guide.md
Co-authored-by: Alina Alborova <alina.alborova@intel.com>
* Update docs/optimization_guide/dldt_optimization_guide.md
Co-authored-by: Alina Alborova <alina.alborova@intel.com>
Co-authored-by: Alina Alborova <alina.alborova@intel.com>
* Blocked dims hwc 2021/3 (#4729)
* Fix for BlockedDims
* Added test for HWC layout
* [GNA] Update documentation regarding splits and concatenations support (#4740)
* Added mo.py to wheel packages (#4731)
* Inserted a disclaimer (#4760)
* Fixed some klockwork issues in C API samples (#4767)
* Feature/vpu doc fixes 2021 3 (#4635)
* Documentation fixes and updates for VPU
* minor correction
* minor correction
* Fixed links
* updated supported layers list for vpu
* [DOCS] added iname/oname (#4735)
* [VPU] Limit dlclose() WA to be used for Ubuntu only (#4806)
* Fixed wrong link (#4817)
* MKLDNN weights cache key calculation algorithm changed (#4790)
* Updated PIP install instructions (#4821)
* Document YOLACT support (#4749)
* Document YOLACT support
* Add preprocessing section
* Apply suggestions from code review
Co-authored-by: Tatiana Savina <tatiana.savina@intel.com>
Co-authored-by: Tatiana Savina <tatiana.savina@intel.com>
* Add documentation on how to convert F3Net model (#4863)
* Add instruction for F3Net model pytorch->onnx conversion
* Fix style
* Fixed dead lock in telemetry (#4873)
* Fixed dead lock in telemetry
* Refactored TelemetrySender.send function
* Refactored send function implementation to avoid deadlocks
* Unit tests for telemetry sender function
* Added legal header
* avladimi/cvs-31369: Documented packages content to YUM/APT IGs (#4839)
* Documented runtime/dev packages content
* Minor formatting fixes
* Implemented review comments
* Update installing-openvino-apt.md
Co-authored-by: Andrey Zaytsev <andrey.zaytsev@intel.com>
* [DOC] Low-Precision 8-bit Integer Inference (#4834)
* [DOC] Low-Precision 8-bit Integer Inference
* [DOC] Low-Precision 8-bit Integer Inference: comment fixes
* Update docs/IE_DG/Int8Inference.md
Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
* Update docs/IE_DG/Int8Inference.md
Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
* Update docs/IE_DG/Int8Inference.md
Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
* Update docs/IE_DG/Int8Inference.md
Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
* Update docs/IE_DG/Int8Inference.md
Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
* Update docs/IE_DG/Int8Inference.md
Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
* Update docs/IE_DG/Int8Inference.md
Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
* Update docs/IE_DG/Int8Inference.md
Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
* Update docs/IE_DG/Int8Inference.md
Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
* Update docs/IE_DG/Int8Inference.md
Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
* [DOC] LPT comments fix
* [DOC] LPT comments fix: absolute links are updated to relative
* Update Int8Inference.md
* Update Int8Inference.md
Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
Co-authored-by: Andrey Zaytsev <andrey.zaytsev@intel.com>
* Avladimi/cherry pick from master (#4892)
* Fixed CVS-48061
* Reviewed and edited the Customization instructions
* Fixed broken links in the TOC
* Fixed links
* Fixed formatting in the IG for Raspberry
* Feature/benchmarks 2021 3 (#4910)
* added new topics, changed the intro text
* updated
* Updates
* Updates
* Updates
* Updates
* Updates
* Added yolo-v4-tf and unet-camvid-onnx graphs
* Date for pricing is updated to March 15th
* Feature/omz link changes (#4911)
* Changed labels for demos and model downloader
* Changed links to models and tools
* Changed links to models and tools
* Changed links to demos
* [cherry-pick] Extensibility docs review (#4915)
* Feature/ovsa docs 2021 3 (#4914)
* Updated to 2021-3, fixed formatting issues
* Fixed formatting issues
* Fixed formatting issues
* Fixed formatting issues
* Update ovsa_get_started.md
* Clarification of Low Latency Transformation and State API documentation (#4877)
* Assign/ReadValue, LowLatency and StateAPI clarifications
* Apply suggestions from code review: spelling mistakes
Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
* fixed wording
* cherry-pick missing commit to release branch: low latency documentation
* Resolve review remarks
Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
Co-authored-by: Svetlana Dolinina <svetlana.a.dolinina@intel.com>
* DevCloud call outs (#4904)
* [README.md] change latest release to 2021.3
* [49342] Update recommended CMake version on install guide in documentation (#4763)
* Inserted a disclaimer
* Another disclaimer
* Update installing-openvino-windows.md
* Update installing-openvino-windows.md
* Update installing-openvino-windows.md
* Feature/doc fixes 2021 3 (#4971)
* Made changes for CVS-50424
* Changes for CVS-49349
* Minor change for CVS-49349
* Changes for CVS-49343
* Cherry-pick #PR4254
* Replaced /opt/intel/openvino/ with /opt/intel/openvino_2021/ as the default target directory
* (CVS-50786) Added a new section Reference IMplementations to keep Speech Library and Speech Recognition Demos
* Doc fixes
* Replaced links to inference_engine_intro.md with Deep_Learning_Inference_Engine_DevGuide.md, fixed links
* Fixed link
* Fixes
* Fixes
* Reemoved Intel® Xeon® processor E family
* fixes for graphs (#5057)
* compression.configs.hardware config to package_data (#5066)
* update OpenCV version to 4.5.2 (#5069)
* update OpenCV version to 4.5.2
* Enable mo.front.common.extractors module (#5038)
* Enable mo.front.common.extractors module (#5018)
* Enable mo.front.common.extractors module
* Update package_BOM.txt
* Test MO wheel content
* fix doc iframe issue - 2021.3 (#5090)
* wrap with htmlonly
* wrap with htmlonly
* Add specification for ExperimentalDetectron* oprations (#5128)
* Feature/benchmarks 2021 3 ehl (#5191)
* Added EHL config
* Updated graphs
* improve table formatting
* Wrap <iframe> tag with \htmlonly \endhtmlonly to avoid build errors
* Updated graphs
* Fixed links to TDP and Price for 8380
* Add PyTorch section to the documentation (#4972)
* Add PyTorch section to the documentation
* Apply review feedback
* Remove section about loop
* Apply review feedback
* Apply review feedback
* Apply review feedback
* doc: add Red Hat docker registry (#5184) (#5253)
* Incorporate changes in master
Co-authored-by: Trawinski, Dariusz <dariusz.trawinski@intel.com>
Co-authored-by: Vladislav Volkov <vladislav.volkov@intel.com>
Co-authored-by: azhogov <alexander.zhogov@intel.com>
Co-authored-by: Ilya Churaev <ilya.churaev@intel.com>
Co-authored-by: Alina Kladieva <alina.kladieva@intel.com>
Co-authored-by: Evgeny Lazarev <evgeny.lazarev@intel.com>
Co-authored-by: Gorokhov Dmitriy <dmitry.gorokhov@intel.com>
Co-authored-by: Roman Kazantsev <roman.kazantsev@intel.com>
Co-authored-by: Mikhail Ryzhov <mikhail.ryzhov@intel.com>
Co-authored-by: Nikolay Shchegolev <nikolay.shchegolev@intel.com>
Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>
Co-authored-by: Maxim Vafin <maxim.vafin@intel.com>
Co-authored-by: Anastasiya Ageeva <anastasiya.ageeva@intel.com>
Co-authored-by: Anastasia Popova <anastasia.popova@intel.com>
Co-authored-by: Maksim Doronin <maksim.doronin@intel.com>
Co-authored-by: Andrew Bakalin <andrew.bakalin@intel.com>
Co-authored-by: Mikhail Letavin <mikhail.letavin@intel.com>
Co-authored-by: Anton Chetverikov <Anton.Chetverikov@intel.com>
Co-authored-by: Ivan Tikhonov <ivan.tikhonov@intel.com>
Co-authored-by: Andrey Somsikov <andrey.somsikov@intel.com>
Co-authored-by: Maxim Shevtsov <maxim.y.shevtsov@intel.com>
Co-authored-by: Alina Alborova <alina.alborova@intel.com>
Co-authored-by: Elizaveta Lobanova <elizaveta.lobanova@intel.com>
Co-authored-by: Andrey Dmitriev <andrey.dmitriev@intel.com>
Co-authored-by: Helena Kloosterman <helena.kloosterman@intel.com>
Co-authored-by: Tatiana Savina <tatiana.savina@intel.com>
Co-authored-by: Edward Shogulin <edward.shogulin@intel.com>
Co-authored-by: Svetlana Dolinina <svetlana.a.dolinina@intel.com>
Co-authored-by: Alexey Suhov <alexey.suhov@intel.com>
Co-authored-by: Sergey Lyubimtsev <sergey.lyubimtsev@intel.com>
Co-authored-by: Dmitry Kurtaev <dmitry.kurtaev+github@gmail.com>
Co-authored-by: Nikolay Tyukaev <nikolay.tyukaev@intel.com>
Co-authored-by: Kate Generalova <kate.generalova@intel.com>
2021-04-19 20:19:17 +03:00
# Performance Benchmarks {#openvino_docs_performance_benchmarks}
2020-07-20 17:36:08 +03:00
2021-12-21 20:26:37 +03:00
@sphinxdirective
.. toctree::
:maxdepth: 1
:hidden:
2023-04-17 16:43:57 +02:00
openvino_docs_performance_benchmarks_faq
openvino_docs_performance_int8_vs_fp32
2023-05-18 17:12:34 +02:00
Performance Data Spreadsheet (download xlsx) < https: / / docs . openvino . ai / 2023 . 0 / _static / benchmarks_files / OV-2022 . 3-Performance-Data . xlsx >
2022-12-12 14:55:51 +01:00
openvino_docs_MO_DG_Getting_Performance_Numbers
2021-12-21 20:26:37 +03:00
2022-03-24 22:27:29 +03:00
2023-04-17 16:43:57 +02:00
This page presents benchmark results for `Intel® Distribution of OpenVINO™ toolkit <https://software.intel.com/content/www/us/en/develop/tools/openvino-toolkit.html>` __
and :doc:`OpenVINO Model Server <ovms_what_is_openvino_model_server>` , for a representative selection of public neural networks and Intel® devices.
The results may help you decide which hardware to use in your applications or plan AI workload for the hardware you have already implemented in your solutions.
Click the buttons below to see the chosen benchmark data.
2020-07-20 17:36:08 +03:00
2023-04-17 16:43:57 +02:00
.. grid:: 1 1 2 2
:gutter: 4
2020-07-20 17:36:08 +03:00
2023-04-17 16:43:57 +02:00
.. grid-item::
2020-07-20 17:36:08 +03:00
2023-04-17 16:43:57 +02:00
.. button-link:: #
:class: ov-toolkit-benchmark-results
:color: primary
:outline:
:expand:
:material-regular:`bar_chart;1.4em` OpenVINO Benchmark Graphs
.. grid-item::
.. button-link:: #
:class: ovms-toolkit-benchmark-results
:color: primary
:outline:
:expand:
:material-regular:`bar_chart;1.4em` OVMS Benchmark Graphs
For a successful deep learning inference application, the following four key metrics need to be considered:
.. tab:: :material-regular:`keyboard_double_arrow_right;1.4em` Throughput
Measures the number of inferences delivered within a latency threshold
(for example, number of Frames Per Second - FPS). When deploying a system with
deep learning inference, select the throughput that delivers the best trade-off
between latency and power for the price and performance that meets your requirements.
.. tab:: :material-regular:`attach_money;1.4em` Value
While throughput is important, what is more critical in edge AI deployments is
the performance efficiency or performance-per-cost. Application performance in
throughput per dollar of system cost is the best measure of value. The value KPI is
calculated as “Throughput measured as inferences per second / price of inference engine”.
This means for a 2 socket system 2x the price of a CPU is used. Prices are as per
date of benchmarking and sources can be found as links in the Hardware Platforms (PDF) description below.
.. tab:: :material-regular:`flash_on;1.4em` Efficiency
System power is a key consideration from the edge to the data center. When selecting
deep learning solutions, power efficiency (throughput/watt) is a critical factor to consider.
Intel designs provide excellent power efficiency for running deep learning workloads.
The efficiency KPI is calculated as “Throughput measured as inferences per second / TDP of
inference engine”. This means for a 2 socket system 2x the power dissipation (TDP) of a CPU is used.
TDP-values are as per date of benchmarking and sources can be found as links in the Hardware Platforms (PDF) description below.
.. tab:: :material-regular:`hourglass_empty;1.4em` Latency
This measures the synchronous execution of inference requests and is reported in milliseconds.
Each inference request (for example: preprocess, infer, postprocess) is allowed to complete before
the next is started. This performance metric is relevant in usage scenarios where a single image
input needs to be acted upon as soon as possible. An example would be the healthcare sector where
medical personnel only request analysis of a single ultra sound scanning image or in real-time or
near real-time applications for example an industrial robot's response to actions in its environment
or obstacle avoidance for autonomous vehicles.
Platforms, Configurations, Methodology
###########################################################
For a listing of all platforms and configurations used for testing, refer to the following:
.. grid:: 1 1 2 2
:gutter: 4
.. grid-item::
.. button-link:: _static/benchmarks_files/platform_list_22.3.pdf
:color: primary
:outline:
:expand:
:material-regular:`download;1.5em` Click for Hardware Platforms [PDF]
.. button-link:: _static/benchmarks_files/OV-2022.3-system-info-detailed.xlsx
:color: primary
:outline:
:expand:
:material-regular:`download;1.5em` Click for Configuration Details [XLSX]
.. the files above need to be updated with OVMS !!!
The OpenVINO benchmark setup includes a single system with OpenVINO™, as well as the benchmark application installed.
It measures the time spent on actual inference (excluding any pre or post processing) and then reports on the inferences
per second (or Frames Per Second).
OpenVINO™ Model Server (OVMS) employs the Intel® Distribution of OpenVINO™ toolkit runtime libraries and exposes a set of
models via a convenient inference API over gRPC or HTTP/REST. Its benchmark results are measured with the configuration of
multiple-clients-single-server, using two hardware platforms connected by ethernet. Network bandwidth depends on both, platforms
and models under investigation. It is set not to be a bottleneck for workload intensity. The connection is dedicated
only to measuring performance.
.. dropdown:: See more details about OVMS benchmark setup
The benchmark setup for OVMS consists of four main parts:
.. image:: _static/images/performance_benchmarks_ovms_02.png
:alt: OVMS Benchmark Setup Diagram
* **OpenVINO™ Model Server** is launched as a docker container on the server platform and it listens (and answers on)
requests from clients. OpenVINO™ Model Server is run on the same machine as the OpenVINO™ toolkit benchmark application
in corresponding benchmarking. Models served by OpenVINO™ Model Server are located in a local file system mounted into
the docker container. The OpenVINO™ Model Server instance communicates with other components via ports over a dedicated docker network.
* **Clients** are run in separated physical machine referred to as client platform. Clients are implemented in Python3
programming language based on TensorFlow* API and they work as parallel processes. Each client waits for a response from OpenVINO™
Model Server before it will send a new next request. The role played by the clients is also verification of responses.
* **Load balancer** works on the client platform in a docker container. HAProxy is used for this purpose. Its main role is
counting of requests forwarded from clients to OpenVINO™ Model Server, estimating its latency, and sharing this information by
Prometheus service. The reason of locating the load balancer on the client site is to simulate real life scenario that includes
impact of physical network on reported metrics.
* **Execution Controller** is launched on the client platform. It is responsible for synchronization of the whole measurement process,
downloading metrics from the load balancer, and presenting the final report of the execution.
Test performance yourself
####################################
2023-03-20 19:22:55 +01:00
You can also test performance for your system yourself, following the guide on :doc:`getting performance numbers <openvino_docs_MO_DG_Getting_Performance_Numbers>` .
2023-04-17 16:43:57 +02:00
2023-03-20 19:22:55 +01:00
Performance of a particular application can also be evaluated virtually using `Intel® DevCloud for the Edge <https://devcloud.intel.com/edge/>` __.
It is a remote development environment with access to Intel® hardware and the latest versions of the Intel® Distribution of the OpenVINO™ Toolkit.
To learn more about it, visit `the website <https://www.intel.com/content/www/us/en/developer/tools/devcloud/edge/overview.html>` __
or `create an account <https://www.intel.com/content/www/us/en/secure/forms/devcloud-enrollment/account-provisioning.html>` __.
2022-12-20 23:07:51 +01:00
2022-09-22 15:04:12 +02:00
2023-03-20 19:22:55 +01:00
2023-04-17 16:43:57 +02:00
Disclaimers
####################################
* Intel® Distribution of OpenVINO™ toolkit performance results are based on release 2022.3, as of December 13, 2022.
2023-03-20 19:22:55 +01:00
2023-04-17 16:43:57 +02:00
* OpenVINO Model Server performance results are based on release 2022.3, as of December 13, 2022.
The results may not reflect all publicly available updates. Intel technologies’ features and benefits depend on system configuration
and may require enabled hardware, software, or service activation. Learn more at intel.com, or from the OEM or retailer.
See configuration disclosure for details. No product can be absolutely secure.
Performance varies by use, configuration and other factors. Learn more at `www.intel.com/PerformanceIndex <https://www.intel.com/PerformanceIndex>` __.
Your costs and results may vary.
Intel optimizations, for Intel compilers or other products, may not optimize to the same degree for non-Intel products.
@endsphinxdirective
2022-12-12 14:55:51 +01:00