* initial version of implementation
* styles applied
* fixed and registration
* add more unit tests
* fixed and in legacy opset
* review remarks
* refactor of version name range
* [dGPU] Enable stable diffusion
+ Prevent to fuse swish into oneDNN reorder.
+ Makes concat explicitly if batch size is greater than 1 and the siblings are oneDNN impl.
* Small CoreImpl refactoring
* Removed cache_dirhandling from CPU plugin
* clang-format
* Fixed python tests
* Fix
* Fixed bugs in HETERO case
* Fixed clang-format and warnings in auto plugin
* Added import_export as capability for TEMPLATE plugin
* Commented throw exception from loaded_from_cache
* Fixed clang-formatof ro template plugin
This is a corner case because body graph nodes have named output ports.
This allows to support custom RetinaNet model.
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* remove ov::device::thermal
ov::device::thermal was only supported on myriad
* additional cleanup
* remove myriad from AUTO and MULTI
auto n multi n hetero
+ remove mentions of listing myriad devices
* two final fixes
* Update ov_auto.py
---------
Co-authored-by: Ilya Churaev <ilya.churaev@intel.com>
* Remove warning suppression: wd4018, wd4309
Signed-off-by: Yan, Xiping <xiping.yan@intel.com>
* Remove linux warning suppression no-sign-compare
Signed-off-by: Yan, Xiping <xiping.yan@intel.com>
* ov::intel_cpu::VectorDims base value type is size_t;
dnnl::memory::dims base value type is int64_t;
All compare data up to int64_t can fix warning and there is potential issue.
Signed-off-by: Yan, Xiping <xiping.yan@intel.com>
* channelAxis maybe == -1; means: no exist any more.
Signed-off-by: Yan, Xiping <xiping.yan@intel.com>
* Fix recursive macro: "one_of", "everyone_is" sign-compare warning.
Must pass same value type.
Signed-off-by: Yan, Xiping <xiping.yan@intel.com>
* Fix Windows sign unsign compare warning
Signed-off-by: Yan, Xiping <xiping.yan@intel.com>
* There are 2 instances:
using ov::Dimension::value_type = int64_t
using ov::intel_cpu::StaticDimension::value_type = size_t
All up to int64.
Signed-off-by: Yan, Xiping <xiping.yan@intel.com>
* linux have too many sign-compare issue.
Complete windows sign-compare firstly.
Signed-off-by: Yan, Xiping <xiping.yan@intel.com>
* Fix clang issues.
Signed-off-by: Yan, Xiping <xiping.yan@intel.com>
* Fix warning.
Because instantiate T1=unsigned int, T2=int
Signed-off-by: Yan, Xiping <xiping.yan@intel.com>
* Fix warning for tests unit reorder_node_test.cpp
Signed-off-by: Yan, Xiping <xiping.yan@intel.com>
* Fix warning : ASSERT_GE(step, 1u);
Signed-off-by: Yan, Xiping <xiping.yan@intel.com>
* Fix tests: warning C4018
Signed-off-by: Yan, Xiping <xiping.yan@intel.com>
* Remove auto, using int64_t is more reasonable.
Signed-off-by: Yan, Xiping <xiping.yan@intel.com>
---------
Signed-off-by: Yan, Xiping <xiping.yan@intel.com>
* [GPU] Add data broadcasting for OneDNN binary ops for Gemm primitive
* Based on https://github.com/openvinotoolkit/openvino/pull/15790 and enable onednn gemm from support multiple users and non constant input.
--------
Signed-off-by: hyunback <hyunback.kim@intel.com>
Co-authored-by: Sergey Shlyapnikov <sergey.shlyapnikov@intel.com>
* Support setting and getting element type, shape and value in PyTorch FE InputModel
* Fix code style
* Fix code style
* Fix rsub layer test
* Fix py style
* Apply review feedback
* Fix code style
* Fix initial values of input and output flags in Place
* Enable AUTO to support execution mode hint.
Signed-off-by: Wang, Yang <yang4.wang@intel.com>
* Add test case.
* Set default value "PERFORMANCE" for ov::hint::execution_mode.
* Update.
* Update.
* Correct default ov::hint::execution_mode value for the default value checking test case.
* Update.
* Delete obsolete config.hpp file.
---------
Signed-off-by: Wang, Yang <yang4.wang@intel.com>
* Init auto_batch plugin unit test
* Add more mock test
* Add to ci yml file
* Fix clang issue
* Resolve compilation issue
* Fix symbol multiple definition in static build
* Add test cases for AutoBatchInferRequest
* Add test cases for AutoBatchAsyncInferRequest
* qFixed build error after PR-15229
* Resolve blocked issue when call StartAsync test cases
* add more test for auto batch async inference
---------
Co-authored-by: Chen Peter <peter.chen@intel.com>
* AUTO cumulative throughput mode ignore candidate device that fail to load device
* Simplify the judgement logic of whether Auto set to Multi
* Add description about _AutoSetToMulti variable
* Update variable name to _AutoCallMulti
* Refine logic of AUTO execution_devices
* Add loading error massage
* Add test case
* Add filter to execution_devices of MULTI
* Add execution_devices test in load fail sitution
* Simplify the logic of execution_devices
* Update auto_executable_network.cpp
* Update src/plugins/auto/multi_executable_network.cpp
Co-authored-by: yanlan song <bell.song@intel.com>
* Update src/plugins/auto/auto_executable_network.cpp
Co-authored-by: yanlan song <bell.song@intel.com>
* Update test case
---------
Co-authored-by: Chen Peter <peter.chen@intel.com>
Co-authored-by: yanlan song <bell.song@intel.com>
* [GPU] Added shape agnostic optimized SoftMax kernel
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Update SoftmaxKernelBaseBF::Validate policy for shape agnostic kernel
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Add softmax_gpu_bf shape agnostic TC for ov_gpu_unit_tests
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Fix failed TCs for ie-tests-linux-ubuntu20-gpu
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Update to use stack array instead of global buffer
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Remove global buffer usage completely
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Add #undef directive
Signed-off-by: Andrew Park <andrew.park@intel.com>
---------
Signed-off-by: Andrew Park <andrew.park@intel.com>