* update linux CPU map parser and add unit test
* add one more test data
* fix clang issue
* update test case by using TEST_P
* fix code style issue
* add one more test data with hyper threading off
* remove duplicated test data
* fix issue for Windows build
* fix issue for Windows build
* add description for test data
* add core type table for each socket
* fix code style issue
* fix code style issue
* remove redundant content
* remove parse_processor_info_linux() from INFERENCE_ENGINE_API_CPP
* fix code style issue
* update example of core type table
* fix code style issue
* [TF FE] Support TensorList operations and RNN layers
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Remove TensorList operations from the fallback
* Fix computation of dummy tensor size
---------
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* serialization of read_value and assign primitives
* lines should be <= 160 characters long
* added unit tests for read_value and assign
* updated to store is_output_evnet in primitive_inst
* removing _is_output_event in typed_primitive_impl_ocl
* added comments for mem_allocated and is_output_null
* [C API] remote tensor support
Provide C interface for remote context and remote tensor:
1. OCL and VA context and buffer support
2. unite test for remote context and remote tensor
Change-Id: I2c449aef21cbe928ca470b4e3bcf1e03a1d1ca43
* Fix clang issue
Change-Id: I83c9592d21ff9cb8aeb85148277d96db74b455c7
* [CAPI] Add ocl nv12 input inference test case
1. Add fully nv12(2 ocl remote tensor) as input plus preprocess do csc+resize, then do inference
2. Add get_device_name for remote tensor
3. Add test case for preprocess to set mem type
Change-Id: Ieaab50c8de20e5c7258697030672e0b010627a81
* Update documentation
Change-Id: Ia7dbaea48d38f5534aba60fbb25cd0a1f2f9eab0
* Remove debug code
Change-Id: Ic5c5a24d3c40bb258b7007dcea44594af2d92344
* Fix issues brought by rebase
Change-Id: I2520c5ccf3620349e202ea40c08bb1c437d5af88
* Resolve document issue
Change-Id: Ia14500f8623147f481dda286a0afaa8ecfffa7c9
* Resolve some comments
1. Add specific header file for gpu plugin
2. clang-format issue
3. interface compatible issue
Change-Id: Icc4723af071af30f0422ac9a107e57ddeec94aac
* fix clang issue
Change-Id: I46e1fed3dd9a4e51260b695dc3fb194b9571ed58
* Add gpu header file directory
Change-Id: I8c15d9da58a46c070dcc68530cb2beea8cd4bba9
* Remove HAVE_OCL_SUPPORT macro
Change-Id: I10093a99c1858649f1c5502248729704fcec34ef
* Address some comments
Change-Id: I72830288d063623641e8946c8470631e81fdeb34
* Printov:AnyMmap with the help of ov::Any
Change-Id: I8abd3a8d94ba8116974c59a489cda2af15f225d7
* 1. Correct the device list by priority order from high to low.
2. Remove GNA, CUDA, HPU, HDDL, NVIDIA from device list supported by AUTO/MULTI.
Signed-off-by: Wang, Yang <yang4.wang@intel.com>
* Filter out supported device when not specify the candidate device for AUTO plugin.
* Add Debug MSG
* Update.
* Update AUTO mock test cases.
* Update.
* Update.
* Update code style.
---------
Signed-off-by: Wang, Yang <yang4.wang@intel.com>
Co-authored-by: Chen Peter <peter.chen@intel.com>
* [TF FE] Refactor LinSpace translator and add layer test
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Remove start_shape from test parameters
---------
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* [TF FE] Refactor LeakyRelu and add layer tests
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Support LeakyRELU for TensorFlow Lite
---------
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* [GPU] Shape agnositc optimized gemm kernel
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Fix CI failure
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Apply code review
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Fix dynamic shape accuracy drop on SQuAD v1.1
- F1: 91.81%, EM: 85.25% @bert-small-uncased-whole-word-masking-squad-0001
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Apply code review
Signed-off-by: Andrew Park <andrew.park@intel.com>
---------
Signed-off-by: Andrew Park <andrew.park@intel.com>
* [GPU] Fix the functional issue using fc:onednn in bert model.
* The issue had happened when input dims are 3 with post-po eltwise.
* oneDNN FC out supports 2-dims only, so OV need to update output and post-op too.
* Fix ACC issue in b16 onednn FC. cldnn updates yxfb format in b16 for opt kernel, but no need in onednn.
* Remove W.A code for running fc cldnn.
* Support gemm primtiive and multi types ForceImplTypes
* Change env name OV_GPU_ForceImplTypes
* Do not change elstwise post-op shape from original node: it caused the ACC issue when multiple users.
Signed-off-by: hyunback <hyunback.kim@intel.com>
* add support for aten::stack
* add new lines
* updated aten stack transformation
* add comments to the code
---------
Co-authored-by: Maxim Vafin <maxim.vafin@intel.com>
Co-authored-by: Andrei Kochin <andrei.kochin@intel.com>
* [TF FE] Refactor translators for Reduce ops and add layer tests
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Correct number of inputs
* Use already implemented logical operations tests
---------
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>