openvino

Go to file

Sergey Lyalin f0300a36eb Efficient FP32 -> FP16 conversion for convert_precision, save_model, ovc and mo (#18988 )

* WIP Postpone fp16 in CompressFloatConstantsImpl

* Apply suggestions from code review

Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>

* WIP: Compression to FP16 in Serialize

* Prepared for efficient fp32 to fp16 conversion

* Update src/core/reference/src/runtime/reference/convert.cpp

* Called real slow reference implementations in the place where the optimized versions are supposed to be implemented

* Code style

* Fixed 0 values in the fast f64 to f16 compression

* Optimized convert_from_f32_to_f16_with_clamp

* Added optimized f32->f16 instance of change_constant_precision

* compression transformation Python test

* use tmp dir, minor corrections

* Update src/bindings/python/tests/test_transformations/test_compression.py

* Update src/bindings/python/tests/test_transformations/test_compression.py

* style fix

* define rt_info for postponed_fp16_compression

* remove redundant class

* fix temp dir for Win in test_compression.py

* update definitions in convert.hpp

* Update implementation in convert.cpp

* Update serialize.cpp

* Update compress_float_constants.cpp

* added macros for ARM/non_x86 in convert.cpp

* fix macros in convert.cpp

* change fixme placement in serialize.cpp

* style_fix

* Update src/core/reference/src/runtime/reference/convert.cpp

* style_fix

* Optimized count_out_of_f16_range

* Code style

* Revert unused

* Update src/core/src/pass/serialize.cpp

Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>

* Update src/core/reference/src/runtime/reference/convert.cpp

Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>

* use optimized convert_from_f32_to_f16_with_clamp for non postponed

* minor corrections

* Update src/common/transformations/src/transformations/common_optimizations/compress_float_constants.cpp

* Update compress_float_constants.cpp

* Switched mo and ovc to save_model instead of serialize to leverage performance improvements in fp32->fp16

* Applied minor code imporvements to address review feedback

* Minor changes in code

* Update tools/ovc/openvino/tools/ovc/main.py

* Apply suggestions from code review

* Fixed failed test in case when both usual xml compression and fp16 compression are applied simultaneously (disabled for now)

* Added description for CompressFloatConstantImpl postponed parameter

* Description of postponed parameter for CompressFloatConstants

* Reverted switching to save_model in mo as the compression can be applied not only via CLI and old code should be kept for Python path (not applicable for ovc)

* Removed remining committed test artefacts and reverted remaining changes in mo

---------

Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>
Co-authored-by: dmitrygo <dmitry.gorokhov@intel.com>
Co-authored-by: Vladimir Paramuzov <vladimir.paramuzov@intel.com>
Co-authored-by: Pavel Esir <pavel.esir@intel.com>
Co-authored-by: Pavel Esir <pavel.esir@gmail.com>

2023-08-17 11:08:33 +00:00

.ci

Removed InputCutInfo, disabled input cut in OVC. (#18927 )

2023-08-10 23:47:27 +04:00

.github

Create build.yml (#18928 )

2023-08-10 13:37:51 +04:00

cmake

Added 'openvino_req_files' component for archives (#19174 )

2023-08-14 16:58:08 +04:00

docs

[DOCS] Add numpy to installation instructions for master (#19127 )

2023-08-16 10:20:49 +02:00

licensing

[docs] VPU licensing (#19222 )

2023-08-16 17:01:32 +04:00

samples

benhcmark_app: fix -api sync -i multiple images (#19142 )

2023-08-11 19:09:48 +04:00

scripts

Removed compile_tool and benchmark_app_legacy from OpenVINO repo (#18350 )

2023-07-04 19:35:51 +04:00

src

Efficient FP32 -> FP16 conversion for convert_precision, save_model, ovc and mo (#18988 )

2023-08-17 11:08:33 +00:00

tests

Set CF=False. (#19223 )

2023-08-17 11:22:26 +04:00

thirdparty

Conan: compilation with dynamic flatbuffers (#19238 )

2023-08-17 11:36:30 +04:00

tools

Efficient FP32 -> FP16 conversion for convert_precision, save_model, ovc and mo (#18988 )

2023-08-17 11:08:33 +00:00

.gitattributes

Added SVG files to lfs (#15227 )

2023-01-20 15:54:47 +04:00

.gitignore

Vcpkg conan fixes (#17765 )

2023-05-29 15:40:51 +04:00

.gitmodules

[CPU] MLAS backend integration (#17885 )

2023-07-26 07:40:34 +00:00

CMakeLists.txt

Properly handle python-package in cpack exclude components (#19089 )

2023-08-09 21:10:13 +04:00

conanfile.txt

Support of protobuf >= 21 (#18351 )

2023-07-04 17:08:29 +04:00

CONTRIBUTING.md

[Docs] Update docs with information about Contributions Welcome issue (#17503 )

2023-05-12 15:19:58 +02:00

cspell.json

Adds configuration file for cspell (#17355 )

2023-06-07 12:16:28 +02:00

install_build_dependencies.sh

Fixed Python API build for Ubuntu 22.04 with python3.11 (#17297 ) (#17298 )

2023-04-29 04:34:10 +04:00

Jenkinsfile

Beautify Jenkinsfile a little bit

2021-05-31 15:24:56 +03:00

LICENSE

Publishing R3

2018-10-16 13:45:03 +03:00

README.md

Changed the component for conda-forge downloads stat (#18755 )

2023-07-24 18:49:13 +04:00

SECURITY.md

Added SECURITY.md back (#3177 )

2020-11-17 16:44:44 +03:00

vcpkg.json

Build only release for vcpkg (#17990 )

2023-06-13 01:49:49 +04:00

README.md

What is OpenVINO toolkit?

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference.

Boost deep learning performance in computer vision, automatic speech recognition, natural language processing and other common tasks
Use models trained with popular frameworks like TensorFlow, PyTorch and more
Reduce resource demands and efficiently deploy on a range of Intel® platforms from edge to cloud

This open-source version includes several components: namely Model Optimizer, OpenVINO™ Runtime, Post-Training Optimization Tool, as well as CPU, GPU, GNA, multi device and heterogeneous plugins to accelerate deep learning inference on Intel® CPUs and Intel® Processor Graphics. It supports pre-trained models from Open Model Zoo, along with 100+ open source and public models in popular formats such as TensorFlow, ONNX, PaddlePaddle, MXNet, Caffe, Kaldi.

Components

OpenVINO™ Runtime - is a set of C++ libraries with C and Python bindings providing a common API to deliver inference solutions on the platform of your choice.
- core - provides the base API for model representation and modification.
- inference - provides an API to infer models on the device.
- transformations - contains the set of common transformations which are used in OpenVINO plugins.
- low precision transformations - contains the set of transformations that are used in low precision models
- bindings - contains all available OpenVINO bindings which are maintained by the OpenVINO team.
  - c - C API for OpenVINO™ Runtime
  - python - Python API for OpenVINO™ Runtime
Plugins - contains OpenVINO plugins which are maintained in open-source by the OpenVINO team. For more information, take a look at the list of supported devices.
Frontends - contains available OpenVINO frontends that allow reading models from the native framework format.
Model Optimizer - is a cross-platform command-line tool that facilitates the transition between training and deployment environments, performs static model analysis, and adjusts deep learning models for optimal execution on end-point target devices.
Post-Training Optimization Tool - is designed to accelerate the inference of deep learning models by applying special methods without model retraining or fine-tuning, for example, post-training 8-bit quantization.
Samples - applications in C, C++ and Python languages that show basic OpenVINO use cases.

Supported Hardware matrix

The OpenVINO™ Runtime can infer models on different hardware devices. This section provides the list of supported devices.

Device	Plugin	Library	ShortDescription
CPU	Intel CPU	*openvino_intel_cpu_plugin*	Intel Xeon with Intel® Advanced Vector Extensions 2 (Intel® AVX2), Intel® Advanced Vector Extensions 512 (Intel® AVX-512), and AVX512_BF16, Intel Core Processors with Intel AVX2, Intel Atom Processors with Intel® Streaming SIMD Extensions (Intel® SSE)
CPU	ARM CPU	*openvino_arm_cpu_plugin*	Raspberry Pi™ 4 Model B, Apple® Mac mini with M1 chip, NVIDIA® Jetson Nano™, Android™ devices
GPU	Intel GPU	*openvino_intel_gpu_plugin*	Intel Processor Graphics, including Intel HD Graphics and Intel Iris Graphics
GNA	Intel GNA	*openvino_intel_gna_plugin*	Intel Speech Enabling Developer Kit, Amazon Alexa* Premium Far-Field Developer Kit, Intel Pentium Silver J5005 Processor, Intel Pentium Silver N5000 Processor, Intel Celeron J4005 Processor, Intel Celeron J4105 Processor, Intel Celeron Processor N4100, Intel Celeron Processor N4000, Intel Core i3-8121U Processor, Intel Core i7-1065G7 Processor, Intel Core i7-1060G7 Processor, Intel Core i5-1035G4 Processor, Intel Core i5-1035G7 Processor, Intel Core i5-1035G1 Processor, Intel Core i5-1030G7 Processor, Intel Core i5-1030G4 Processor, Intel Core i3-1005G1 Processor, Intel Core i3-1000G1 Processor, Intel Core i3-1000G4 Processor

OpenVINO™ Toolkit also contains several plugins which simplify loading models on several hardware devices:

Plugin	Library	ShortDescription
Auto	*openvino_auto_plugin*	Auto plugin enables selecting Intel device for inference automatically
Auto Batch	*openvino_auto_batch_plugin*	Auto batch plugin performs on-the-fly automatic batching (i.e. grouping inference requests together) to improve device utilization, with no programming effort from the user
Hetero	*openvino_hetero_plugin*	Heterogeneous execution enables automatic inference splitting between several devices
Multi	*openvino_auto_plugin*	Multi plugin enables simultaneous inference of the same model on several devices in parallel

License

OpenVINO™ Toolkit is licensed under Apache License Version 2.0. By contributing to the project, you agree to the license and copyright terms therein and release your contribution under these terms.

Documentation

User documentation

The latest documentation for OpenVINO™ Toolkit is available here. This documentation contains detailed information about all OpenVINO components and provides all the important information you may need to create an application based on binary OpenVINO distribution or own OpenVINO version without source code modification.

Developer documentation

Developer documentation contains information about architectural decisions which are applied inside the OpenVINO components. This documentation has all necessary information which could be needed in order to contribute to OpenVINO.

Tutorials

The list of OpenVINO tutorials:

Jupyter notebooks

Products which use OpenVINO

System requirements

The system requirements vary depending on platform and are available on dedicated pages:

How to build

See How to build OpenVINO to get more information about the OpenVINO build process.

How to contribute

See Contributions Welcome for good first issues.

See CONTRIBUTING for contribution details. Thank you!

Get a support

Report questions, issues and suggestions, using:

GitHub* Issues
The openvino tag on StackOverflow*
Forum

Additional Resources

OpenVINO Wiki
OpenVINO Storage
Additional OpenVINO™ toolkit modules:
- openvino_contrib
Intel® Distribution of OpenVINO™ toolkit Product Page
Intel® Distribution of OpenVINO™ toolkit Release Notes
Neural Network Compression Framework (NNCF) - a suite of advanced algorithms for model inference optimization including quantization, filter pruning, binarization and sparsity
OpenVINO™ Training Extensions (OTE) - convenient environment to train Deep Learning models and convert them using OpenVINO for optimized inference.
OpenVINO™ Model Server (OVMS) - a scalable, high-performance solution for serving deep learning models optimized for Intel architectures
Computer Vision Annotation Tool (CVAT) - an online, interactive video and image annotation tool for computer vision purposes.
Dataset Management Framework (Datumaro) - a framework and CLI tool to build, transform, and analyze datasets.

* Other names and brands may be claimed as the property of others.

Languages

C++ 80.5%

Python 15.5%

C 2.8%

CMake 0.9%

Cython 0.1%

README.md

Contents:

What is OpenVINO toolkit?

Components

Supported Hardware matrix

License

Documentation

User documentation

Developer documentation

Tutorials

Products which use OpenVINO

System requirements

How to build

How to contribute

Get a support

Additional Resources