* Enable crop shape agnostic kernel
* Added unit test
* Added new scalar argument for crop (eltwise) for being used as runtime input offset in shape agnostic kernel
* Fix eltwise to have runtime offset only for crop
* Fix unittest error
* Applied review comment
* [GPU] Fix output format not changing at runtime
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Add remove_redundant_reorders pass TC for ov_gpu_unit_tests
Signed-off-by: Andrew Park <andrew.park@intel.com>
---------
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Review label and interval shape propagation for:
- space to batch
- space to depth
- shuffle channels
- depth to space
- batch to space
* Review template implementation of shape_infer for:
- space to batch
- space to depth
- shuffle channels
- depth to space
- batch to space
* Apply clang-format
* Update src/core/shape_inference/include/batch_to_space_shape_inference.hpp
Co-authored-by: Tomasz Jankowski <tomasz1.jankowski@intel.com>
* Update src/core/shape_inference/include/space_to_batch_shape_inference.hpp
Co-authored-by: Tomasz Jankowski <tomasz1.jankowski@intel.com>
* Shuffle channels remove label from channel dim
---------
Co-authored-by: Tomasz Jankowski <tomasz1.jankowski@intel.com>
* Moved Task, Streams, CPUStreams Executors to new API
* Fixed some build issues
* Fixed new build issues
* Try to fix tests
* Fixed inference unit tests
* Small build fix
* Added more system headers
* Try to fix naming style
* Fixed namespace
* Fixed android build
* [GPU] Apply multi-threads for async compilation context (#15683)
- Use CPUStreamExecutor in compilation context
- Use single compilation context, impl_cache and kernels_cache for multple streams
- Move compilation context to cldnn::program
- Move impl_cache to cldnn::program
- Create thread-safe impl_cache
- Create thread independent compilation function in kernels_cache
- Use kernels_cache in program and remove it from network
* [GPU] Fix segfault issue: ocl_engine and ocl_device are released during remained compilation context task are running (#15683)
- compilation context has own CPUStreamExecutor
* [GPU] Follow-up codereview (#15683)
- LruCacheThreadSafe inherit LruCache
- FuncRemoveItem has std::pair<Key,Value> as input
- Change prepare_tools to init_program
* [GPU] Create primitive_impl::build_kernels (#15683)
* [GPU] Fix unit test build error (#15683)
* [GPU] Remove redundant code (#15683)
- Remove try catch for debug
- Call compilation_context.cancel() in destructor of network
* [GPU] combine two atomic counter in kernels_cache (#15683)
* [GPU] Follow-up code review (#15683)
* [GPU] Fix nullptr exception in unit test (#15683)
* [GPU] Follow-up code review (#15683)
- Modify mutex lock in compilation context
* [GPU] Fix windows build issue (#15683)
* Solve test case failure issue for 32bits
1. ov_core_unit_test
2. ov_cpu_unit_test
Change-Id: I5e6afda0865fedc1de7fe84dd5f132e642263303
* Solve windows build issue
Change-Id: I1e6ea4d930c41322a73a701d566f0cdee2a4e098
* Disable several 64bit test cases in case of 32bit system
Change-Id: Ib8ef784953bf15cb42048dd905f17a85e52482b1
* Update a simple solution
Change-Id: Ie2e2cd369fe98bfcd26f3416bf36d4dfb0f24c25
* update for 64bits failure
Change-Id: I6571b7842a0fecc01fff169a21fa7aae9eb9da14
* Use OPENVINO_ARCH_64_BIT replace custom macro
Change-Id: I7e72b74aed8f0226513bc0e06ce2381322b42f71
* use kernel caching for dynamic models
* replaced cl_cache with blob
* updated to serialize dims info of input and output
* updated to skip unicode tests in Windows
* [TF FE] Support conversion of models with non-standard extensions in the path
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Update tools/mo/unit_tests/moc_tf_fe/conversion_basic_models.py
---------
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>