* Fix GPU NonMaxSuppression implementation
* Introduce Nms9 single layer tests
* Adapt internal NMS and GPU implementation for NMS9 implementation
* Adapt CPU implementation in GPU for NMS9
* Add blocked layouts support to NMS
* Add unit tests for blocked formats for NMS
* Fix boxes groups size for the small shapes
* Use ocl implementation for blocked layout input
* Fix templates typedefs to pass win build
* Fix second output to set data in correct format
Co-authored-by: Tetiana Gubanova <tgubanova@lohika.com>
* [TF FE] Add Transpose Sinking for additional unary-wise Operations
It helps to fix performance degradation for MobileNet models
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Add LogicalNot for Transpose sinking
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* [TF FE] Support dynamic rank support for Convolutional and Pooling operations (#12661)
* [TF FE] Add dynamic rank support for Convolutional and Pooling operations
Refactor DepthwiseConv2D, AvgPool, and FusedBatchNorm operations
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Fix build issue with rvalue
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Fix build issue with climit
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Skip duplication of Parameter nodes
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Revert changes in StridedSlice and add check for AvgPool operation type
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Revert the rest of changes for StridedSlice
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Fix translator for AvgPool: add pad mode
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Introduce helper default_op_checks
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* [TF FE] Refactor translators for Resize operations and correct Pooling (#12721)
* [TF FE] Refactor translators for Resize operations and correct Pooling
It allows to convert magenta_arbitrary-image-stylization model
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Align TF FE tranlator for Resize with legacy frontend
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Do minor fix for MaxPool
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Disable GroupedGatherElimination in case of scalar inputs containing indices
* clang format
Co-authored-by: Tomasz Dołbniak <tomasz.dolbniak@intel.com>
* Add overrided method to generating vector of strings
* Trim the value from the the left and right
* Add test to verify that output names are correctly read from IR
* Use spaces instead of tabs
* Add C++ tests for read model contains outputs with whitespaces
* Fix test for add output
* Remove python test
* Cherry-pick U22 adoption in github actions
* More fixes for shellcheck
* More fixes for shellcheck
* Update .github/workflows/py_checks.yml
Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>
* [TF FE] Handle optional attributes for Convolutional operations (#12230)
* [TF FE] Handle optional attributes for Convolutional operations
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Apply code-style rules
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* [TF FE] Implement LinSpace and BatchMatMul translators (#12271)
* [TF FE] Implement LinSpace and BatchMatMul translators
It helps to convert STN model (from e2e testing) using TensorFlow frontend
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Fix BatchMatMul translator
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Fix LinSpace operation translator
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Apply code-review feedback
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Apply code-style rules
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Apply code style rules
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* [TF FE] Fix conversion of NetVLAD model (#12328)
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* [TF FE] Implement translators for TensorFlow ConvBackpropInput operations (#12356)
* [TF FE] Implement ConvBackPropInput translators
Now the translators supports dynamic input_sizes attribute and different padding modes
including EXPLICIT mode
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Fix clang-style issue
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Fix code-style issue
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Fix code-style issue
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Apply code-review feedback and fix build issues
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Apply code-review feedback: check for input size
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Fix retrieving explicit_padding attribute
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Fix code style
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* [TF FE] Fix StridedSlice translator for new_axis vector size longer input rank (#12442)
* [TF FE] Fix StridedSlice translator for new_axis vector longer input rank
Currently, new_axis vector is cut by input rank that is correct and leads to the loss of new axes.
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Use int64 type in mask_to_vector function
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* [TF FE] Refactor translators for Conv2d and Conv3d (#12444)
It allows to convert CNN-Transformer model. Padding was previously incorrect.
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* [TF FE] Implement conversion for Attention OCR model (#12428)
* [TF FE] Implement conversion for Attention OCR model
The following scope of work is done to make Attention OCR convertable:
1. Refactored translators for BiasAdd, Slice, and ArgMax operations. Add translation for StopGradient operation.
2. The previous traversing algorithm to compute topological sorted nodes list was incorrect. Now it is implemented based on topologically_sorted function from core/graph_util.hpp.
3. The unsupported data types are now preliminary converted to undefined type for the purpose of to have them cut off.
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* [TF FE] Refactor MaxPool operation translator for xj_feature model (#12485)
* [TF FE] Refactor MaxPool operation translator for xj_feature model
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Correct MaxPoolV2 since it has three inputs
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
+ benchmark cache_dir option takes longer than cl_cache_dir env in loading network.
+ For clDNN execution, benchmark cache_dir created onednn_engine if just ONEDNN_ENABLE config is ON.
+ Creation of onednn_engine in ocl_engine is changed to on-demand.
Signed-off-by: Min, Byungil <byungil.min@intel.com>
Signed-off-by: Min, Byungil <byungil.min@intel.com>