* Fix the deconv fused issue on AVX2 and AVX512 and enable deconv test
* Keep GroupDeconv BF16 test cases still disabled.
* Update to also excluding nightly
* Update onednn submodule.
* Update onednn submodule
* Update onednn submodule.
* Update the ONDENN submodule
* Update the ONEDNN commit.
* Update with merged onednn commit.
* Define new ppp API for nv12
* Add new ppp API function
* Add new ppp API unit test
* Add hello nv12 input classification ov
* Define new ppp API for nv12
* Add new ppp API function
* Add new ppp API unit test
* Add hello nv12 input classification ov
* Fix the clang -formate issue
* Modify the function called is_supported_image_size
* Update code as suggested
* Add hello_nv12_input_classification e2e test
* clang-format openvinotoolkit
* Fix the doc error in CI
Co-authored-by: River Li <river.li@intel.com>
Some compiler flags restrict the compiler from making arbitrary decisions while handling undefined C/C++ behaviors.
Therefore they can be used to fix some issues caused by undefined behavior.
Signed-off-by: Yan, Xiping <xiping.yan@intel.com>
Co-authored-by: Chen Peter <peter.chen@intel.com>
* FTZ_and_DAZ_set_for_cpu
* remove DAZ
* fix
* extract to utils
* ie core part changes to add do as property and benchmark_app enable do
* enable brgcov from Luocheng patch
* add debug info
* enable_brgemm_on_avx512
* add python binding
* dlb test
* FTZ_and_DAZ_set_for_cpu
* remove DAZ
* fix
* extract to utils
* ie core part changes to add do as property and benchmark_app enable do
* enable brgcov from Luocheng patch
* add debug info
* enable_brgemm_on_avx512
* add python binding
* dlb test
* revert test code
* revert test code
* Handle in-place failure cases in reshape node
* Disable inplace when non-const reshape connected to constant
* Add comment to reshape_inplace test
* move copy WA into execute() to cover more general in-place failure cases
* enable brgconv f32
* use config to enable brgconv f32
* when brg disabled not init bin-postops
* change prop name for extensive
* use more general field
* fix review comments.
* Add FORCE_TBB_TERMINATE to legacy API
* Put this config into proper place
* fix issue in property test
Co-authored-by: Shen, Wanglei <wanglei.shen@intel.com>
* [CPU] Optimize NonZero operation
# Conflicts:
# src/plugins/intel_cpu/src/nodes/non_zero.cpp
* [CPU] Rewrite NonZero implementation, so it will use generic ie_parallel API
* [CPU] NonZero operation: apply an additional optimization
* NonZero operation: add fallback code for inRank >= 6
* NonZero operation: apply review modifications
# Conflicts:
# src/plugins/intel_cpu/src/nodes/non_zero.cpp
* NonZero operation: inShape.getDims().size() -> inRank
* NonZero operation: eliminate input array index calculation by slight modification of ie_parallel API
* Adjust ie_parallel.hpp style for clang-format
* Try to unbreak the build
* Move to parallel_nt and add a cache for nd loops to optimize more
* Add minimal size threshold for threading and reduce warning count
* Try to workaround linter errors
* One more try to unbreak cpplint build
Co-authored-by: Michal Lukaszewski <michal.lukaszewski@intel.com>
* Remove vmaxps in store_vector.
This instruction is not needed for dst_prc int8.
And it may lead to wrong result with denormals optimization is on.
* Add vpmaxsd if dst_prc is u8 or u16.
* Enable hint to tput if no property is specified for both AUTO device and target device.
Signed-off-by: Wang, Yang <yang4.wang@intel.com>
* 1. Update logic.
2. Add test cases.
Signed-off-by: Wang, Yang <yang4.wang@intel.com>
* Update.
Signed-off-by: Wang, Yang <yang4.wang@intel.com>
* Update. Set hints to default for target device if no hints setting for AUTO plugin and no specific properties setting for target device.
Signed-off-by: Wang, Yang <yang4.wang@intel.com>
This extra semicolon creates an output as example below. The extra
'::' is equivalent to add '.' as part of the LD_LIBRARY_PATH. This
breaks glibc build, and very often creates weird issue when launch
commands from different path.
...inference_engine/external/tbb/lib::/opt/intel/openvino_2021/...
We also noticed that :${parameter:+:$parameter} is widely used in
this file. Please review the code and fix as needed.
* ExperimentalDetectronDetectionOutput: refine sorting criteria for NMS stage
This is to ensure the operation produces stable predictable results across
the possible sorting algorithm implementaions.
This property is useful for the operation testing.
* [GPU] Implement ExperimentalDetectronDetectionOutput operation
* [GPU] ExperimentalDetectronDetectionOutput: use vector types and operations in kernel
* Reformat changed files to make clang format checker happy
* [GPU] ExperimentalDetectronDetectionOutput: add another test case to the unit test
* [GPU] ExperimentalDetectronDetectionOutput: Add f16 test
* ExperimentalDetectronDetectionOutput: single-layer test: use all three outputs
* [GPU] ExperimentalDetectronDetectionOutput: increase single layer test coverage
More attribute permutations were added.
* add testcase for plugin properties should not be revised by compile_model
* rename smoke_cpuCompileModelBehaviorTests to smoke_gpuCompileModelBehaviorTests
* remove property EXCLUSIVE_ASYNC_REQUESTS in ov2.0 test
* add testcase for plugin properties should not be revised by loadNetwork