* add static_output attribute to enable GPU implementation of GenerateProposals.
This change may be reverted once dynamic shape support is implemented
in the GPU plugin.
* - add GPU single layer test;
- add normalized attribute to SLT.
* add GPU primitive for GenerateProposals
* add kernel for GenerateProposals
* add unit test for GenerateProposals
* add blocked layouts support
* tidy up
* support blocked layouts also for 2nd and 3rd outputs
* Transformation to deal with dynamic output in GPU
- revert changes in GenerateProposals interface and shape inference;
- add transformation;
- custom compare method in single-layer test.
* address review comments
* fix after rebase
* fix after rebase
* review comment: added nms_eta to testcase name generation
* - added input types combination to functional tests;
- fix float16 comparison.
* fix after rebase
* use vector for input ids
* fix after rebase
* [TF FE] Support DynamicPartition operation
Add support for DynamicPartition operation and validate it with the layer tests
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Add implementation
* Rescale partition indices to provide stable sorting
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
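One way to read "rescale partition indices to provide stable sorting" is sketched below. This is an assumption about the idea, not the TF FE implementation: each partition index is multiplied by the element count and the element's position is added, so every sort key is unique and any sort keeps the original order within a partition. The function name `dynamic_partition` and its list-based interface are hypothetical.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def dynamic_partition(data: Sequence[T],
                      partitions: Sequence[int],
                      num_partitions: int) -> List[List[T]]:
    """Split data into num_partitions lists, keeping the original order."""
    n = len(data)
    if len(partitions) != n:
        raise ValueError("data and partitions must have the same length")
    if any(p < 0 or p >= num_partitions for p in partitions):
        raise ValueError("partition index out of range")

    # Rescaled key: partition * n + position. Keys are unique, so even an
    # unstable sort orders elements by partition first, then by position.
    keys = sorted(p * n + i for i, p in enumerate(partitions))

    outputs: List[List[T]] = [[] for _ in range(num_partitions)]
    for key in keys:
        partition, position = divmod(key, n)  # undo the rescaling
        outputs[partition].append(data[position])
    return outputs
```

For example, `dynamic_partition([10, 20, 30, 40, 50], [0, 1, 0, 1, 0], 2)` groups the elements at even positions into the first list and the rest into the second, in their original order.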
* Test interval shape propagation through transpose
* Test label propagation through transpose
* Add template transpose shape inference
* Fixes to transpose shape inference
* Update names for shapes:
input -> input_shape
order -> order_shape
* Do not fill output shape for dynamic range
* Add constexpr to SeqGen and Between comparator
* Correct StaticShape creation in test
* Tests check partial value propagation in arg input
* Add evaluate upper, lower, label to transpose
- add test
* Add common methods for inference and evaluate
* Move helpers to shape_inference
* Move transpose attribute to transpose op
* Fix include in transpose operator
* Correct label generation and type
* Fix null conversion
* Use uint64_t for labels tensor
* Fix compare labels
* Use order length as output rank
* Update transpose transformation test
* Move helpers to validation_util
* Correct test assertion for expected shape
* Transpose evaluate use common function
for output calculation
* Remove redundant helpers from transpose test
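The transpose shape rule touched by the commits above can be sketched as follows. This is a minimal illustration, not the template shape-inference code from this change: the function name `transpose_output_shape` and the use of `(min, max)` tuples for interval dimensions are assumptions made for the example. The output rank is taken from the length of `order`, and an empty order is treated as reversing the axes.

```python
from typing import List, Optional, Sequence, Tuple, Union

# A dimension is either a static int or a hypothetical (min, max) interval.
Dim = Union[int, Tuple[int, int]]


def transpose_output_shape(input_shape: Sequence[Dim],
                           order: Optional[Sequence[int]] = None) -> List[Dim]:
    """Permute input_shape according to order (empty order reverses axes)."""
    rank = len(input_shape)
    if not order:
        # No explicit order: reverse all axes.
        order = list(range(rank - 1, -1, -1))
    if sorted(order) != list(range(rank)):
        raise ValueError("order must be a permutation of the input axes")
    # Output rank equals len(order); dimension i comes from axis order[i].
    return [input_shape[axis] for axis in order]
```

Interval dimensions are carried through unchanged, which is the propagation behavior the tests in this series check for: `transpose_output_shape([2, (3, 5), 4], [2, 0, 1])` yields `[4, 2, (3, 5)]`.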
* fix performance inefficiencies caused by the previous PR.
Signed-off-by: Wang, Yang <yang4.wang@intel.com>
* Update.
Signed-off-by: Wang, Yang <yang4.wang@intel.com>
Co-authored-by: River Li <river.li@intel.com>
* marks tf2 keras tests as precommit for new FE
* adds tf2 layers tests for new FE in precommit
* removes redundant tabulations
* review changes
* fix field name
* fix syntax error
* removes failing tests
* removes failed test
* removes failed test
* Fix benchmark_app commandline parsing
Object 'str' does not have a contains() method. Replaced by
"searchstring in object" query.
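The idiom behind this fix, as a minimal sketch. The helper name `has_option` and the sample strings are hypothetical and not taken from benchmark_app.

```python
def has_option(search_string: str, text: str) -> bool:
    """Return True if search_string occurs anywhere in text."""
    # str has no contains() method; the `in` operator is the idiomatic test.
    return search_string in text


# Calling "abc".contains("a") would raise AttributeError, because the
# attribute does not exist on str objects.
assert not hasattr("abc", "contains")
```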
* Fix perf counter output and use consistent units
When ProfilingInfo was bound by pybind11, those entries turned into
Python timedelta objects. This caused two problems:
1. There was a division by 1000 to go to milliseconds. This caused a lot
of precision loss, because `timedelta(microseconds=2300)/1000.0 =
timedelta(microseconds=2)` and `timedelta(microseconds=33)/1000.0 = timedelta(0)`.
2. When converting those timedelta objects to str, the output is in the
form of (HH:MM:SS.XXXXXX). This is not very useful for microsecond-based
performance counters.
This change simply reverts everything to printing plain microsecond-based
integers.
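The two problems can be reproduced with the standard library alone. The helpers `to_microseconds` and `to_milliseconds` below are hypothetical names used for illustration, not functions from benchmark_app.

```python
from datetime import timedelta


def to_microseconds(duration: timedelta) -> int:
    """Convert a timedelta to a whole number of microseconds."""
    return duration // timedelta(microseconds=1)


def to_milliseconds(duration: timedelta) -> float:
    """Convert a timedelta to fractional milliseconds without rounding."""
    return duration / timedelta(milliseconds=1)


sample = timedelta(microseconds=2300)

# Dividing a timedelta by a number yields another timedelta, rounded to
# whole microseconds, so the fractional part is lost.
lossy = sample / 1000.0

# str() on a timedelta uses the H:MM:SS.ffffff layout.
as_text = str(sample)
```

Dividing by another `timedelta` instead of a plain number returns a number, which avoids both the rounding and the clock-style string formatting.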
* Align counter output to ms between Python/C++
* [GPU] mark constants as supported on QN. Fetch common part for CPU and GPU.
* Code style
* Apply review comments. Fix TEMPLATE plugin FP16, use common QN helper.
* Apply review comments: CNNNetwork -> Model inside GetSupportedNodes
* Apply review comments: move function from plugin interface
* parent 6e7016ccda
author Ilya Churaev <ilya.churaev@intel.com> 1664281499 +0400
committer Ilya Churaev <ilya.churaev@intel.com> 1664510018 +0400
Fixed warnings on local machine
* Added CMAKE_COMPILE_WARNING_AS_ERROR usage
* Fixed style
* Fixed merge conflicts
* Fixed typo
* Fixed myriad build for macOS
* Fixed warning
* Fixed tests
* Disabled incorrect test
* Try to fix linux tests
* Revert "Try to fix linux tests"
This reverts commit 29224c93ff.
* Fixed tests
* Revert logic with incorrect cast
* Fixed log softmax
* Disable warning as error for cuda
* Try to fix inference_engine_s
* Fixed cmake
* Revert "Fixed cmake"
This reverts commit 87e9e4e674.
* Revert "Try to fix inference_engine_s"
This reverts commit a1adca8b05.
* WA for static symbols in inference_engine_s test library
* Fixed code style
* Fixed static definition for master
* Revert "Fixed static definition for master"
This reverts commit 20d00d215a.
* Revert "Fixed code style"
This reverts commit 0eb2362543.
* Revert "WA for static symbols in inference_engine_s test library"
This reverts commit 75ef86a79d.
* Fixed linker issue for Windows
* Disable WaE by default
* Disable warning as error in the developer package
* Try to fix dev package
* Try to fix Windows Jenkins
* Revert old behavior for treat_warn_as_err variable
* [GPU] Split and VariadicSplit new shape infer (#13216)
GPU update test script
* [GPU] Added W/A for crop offset (#13216)
Co-authored-by: Ahn, Paul Y <paul.y.ahn@intel.com>
* [GPU] Fix crop gpu test failures (#13216)
- Fixed by review comments
* [GPU] Move input offsets calculation to crop_inst::calc_output_layouts (#13216)
Co-authored-by: Taylor Yeonbok Lee <taylor.lee@intel.com>
* [TF FE] Switch on fallback for TF FE even for use_new_frontend option
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Add unit-test for TF FE transformation config fallback
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* [TF FE] Support DeepSpeech model by TF FE
Add the final part to support BlockLSTM operation with sliced state cell
from the last time step.
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Apply code-review feedback: use get_pattern_map, no nullptr
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* [GPU] Implement (I)RDFT-9
* Added GPU implementation of RDFT operation
* Added basic GPU implementation of IRDFT operation, but still needs adjusting
* Added single layer tests for (I)RDFT
* Added unit tests for (I)RDFT
* Added blocked layout support for all DFT operations
* [GPU] Added changes after review.
* [GPU] Added single-layer test for single axis case for DFTs.
* [GPU] Simplified attach_dft_impl method.
* [GPU] Moved negative axes cases to separate test for DFTs.
* [GPU] Adjusted tests to cover all signal size cases (smaller than, equal to, and larger than the input size) for DFTs.
* [GPU] Fixed IRDFT implementation.
* [GPU] Used RDFTLayerTest.
* [GPU] Fixes after rebase.
* [GPU] Renamed kind enum to direction.
There is a build error on RHEL 9
openvino/src/plugins/intel_gna/backend/pwl_border_values_counter_identity.cpp:70:51: error: ‘numeric_limits’ is not a member of ‘std’
70 | const int16_t y_min = is_low_precision ? std::numeric_limits<int8_t>::min() : std::numeric_limits<int16_t>::min();
numeric_limits is declared in <limits>, so include it.
Signed-off-by: Tom Rix <trix@redhat.com>
* [GNA] Add extra segments to PWL only if it is fused with Conv2D layer
* added checking if extra segments are needed by checking if activation
functions are fused with Conv2D or Conv2D with Pooling applied
* added unit tests verifying implementation
* Update src/tests/unit/gna/gna_extra_pwl_segments_tests.cpp
fix license
Remove useless and confusing comments
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>