* [GPU] Apply multi-threads for async compilation context (#15683)
- Use CPUStreamExecutor in compilation context
- Use single compilation context, impl_cache and kernels_cache for multple streams
- Move compilation context to cldnn::program
- Move impl_cache to cldnn::program
- Create thread-safe impl_cache
- Create thread independent compilation function in kernels_cache
- Use kernels_cache in program and remove it from network
* [GPU] Fix segfault issue: ocl_engine and ocl_device are released during remained compilation context task are running (#15683)
- compilation context has own CPUStreamExecutor
* [GPU] Follow-up codereview (#15683)
- LruCacheThreadSafe inherit LruCache
- FuncRemoveItem has std::pair<Key,Value> as input
- Change prepare_tools to init_program
* [GPU] Create primitive_impl::build_kernels (#15683)
* [GPU] Fix unit test build error (#15683)
* [GPU] Remove redundant code (#15683)
- Remove try catch for debug
- Call compilation_context.cancel() in destructor of network
* [GPU] combine two atomic counter in kernels_cache (#15683)
* [GPU] Follow-up code review (#15683)
* [GPU] Fix nullptr exception in unit test (#15683)
* [GPU] Follow-up code review (#15683)
- Modify mutex lock in compilation context
* [GPU] Fix windows build issue (#15683)
* Solve test case failure issue for 32bits
1. ov_core_unit_test
2. ov_cpu_unit_test
Change-Id: I5e6afda0865fedc1de7fe84dd5f132e642263303
* Solve windows build issue
Change-Id: I1e6ea4d930c41322a73a701d566f0cdee2a4e098
* Disable several 64bit test cases in case of 32bit system
Change-Id: Ib8ef784953bf15cb42048dd905f17a85e52482b1
* Update a simple solution
Change-Id: Ie2e2cd369fe98bfcd26f3416bf36d4dfb0f24c25
* update for 64bits failure
Change-Id: I6571b7842a0fecc01fff169a21fa7aae9eb9da14
* Use OPENVINO_ARCH_64_BIT replace custom macro
Change-Id: I7e72b74aed8f0226513bc0e06ce2381322b42f71
* use kernel caching for dynamic models
* replaced cl_cache with blob
* updated to serialize dims info of input and output
* updated to skip unicode tests in Windows
* [TF FE] Support conversion of models with non-standard extensions in the path
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Update tools/mo/unit_tests/moc_tf_fe/conversion_basic_models.py
---------
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* C++ exception with description write lock_type thrown in the test body.
Use get_output_values_to_float()
* fusings_gpu/gemm_2in_act_scale_quantize_eltwise_i8.basic/2
* fusings_gpu/gemm_2in_act_scale_eltwise.basic/2
* Remove WA test code of [GPU][DG2] Fix fusings_gpu/gemm_2in_scale.basic/7 #15353
* Now non full-tensor post-ops are broadcasted
* Added some new tensor API
* Added tests on constructors
* Small changes
* Fixed tensor tests
* Fixed tests
* Added parametrized tests
* Extend tests and delete copy_to from remote tensor
* [GNA] Create ngraph implementation for relu_torch_pot model for further tests. Create legacy pass fusing FC-Eltwise-Const layers pattern into single FC layer with biases
* [GNA] Fix review comments, applied proper code style to changed code
* Add test for negative axes, preliminary solution to solve uncorrect
results
* Normalize axes in operation NormalizeL2
* Add test for negative axes
* Add EOF
* Update ov::hint::performance_hint UNDEFINED value from empty string to "UNDEFINED".
* Update benchmark Python version.
* Update.
* Update.
* Update.
* Update the description about hint setting within benchmark APP README and help message.
* Fix remote blob creation to use original shape
* Revert "Fix remote blob creation to use original shape"
This reverts commit 35c674aa97.
* Fix cldnn tensor adjusted blob to be reinterpreted with actual input layout
* gpu model caching unit tests
* added serialization unit tests
* added save and load for quantize primitive_inst
* reduced the range of inputs for Gemm tests
* updated the copyright year