* [GPU] Optimize stable_diffusion performance on iGPU.
Change the existing heuristic shape condition so that a transposed gemm is replaced with a permute followed by a non-transposed gemm.
Signed-off-by: hyunback <hyunback.kim@intel.com>
* add dynamic shape support for dgpu in prepare_buffer_fusing
* add unit test
* add space between test cases
* update condition of impl create() for concat dynamic shape
* update unit test
* add comment and update unit test
* add impl_param.is_type() function
* [GPU] Impl cldnn::condition to support dynamic shape (#18051)
* Impl CreateIfOp
* Update calc_output_layouts and execute_impl
* Enable gpu unit test
* Create gpu functional test
* [GPU] Follow-up code review (#18051)
* remove redundant code
* create custom execute method for condition_inst
* change name from update_loop_primitive_map to update_inner_program_io_map
* [GPU] Fix gpu func test failures for fp16
* Add more test-cases to support fp16 and nested if case
* [GPU] remove redundant code
* refactoring var names
* fix windows build error
* [GPU] Fix windows build issue
* [GPU] update calc_output_layouts
* [GPU] remove custom condition_inst::execute
* Remove virtual keyword from primitive_inst::execute()
* [GPU] Share single task executor between main program and inner program
* [GPU] Fix input rank issue for const inner network in condition op
* [GPU] apply calc_output_layouts for roi_align
Co-authored-by: Vladimir Paramuzov <vladimir.paramuzov@intel.com>
* [GPU] avoid checking allow_new_shape_infer for inner program
---------
Co-authored-by: Vladimir Paramuzov <vladimir.paramuzov@intel.com>
* Restrict scipy module version for POT
Latest release https://pypi.org/project/scipy/1.11.0 causes dependency conflicts
* Bump OMZ to include scipy restriction
---------
Co-authored-by: Roman Kazantsev <roman.kazantsev@intel.com>
* Fix get_partial_shape tensor API to access the correct index of dimensions
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Update the rule for specifying output_type to match the legacy one in calc_output_layout
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Add TCs that reproduce the related issues to ov_gpu_unit_tests
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Fix failed fc dynamic i8 TCs for ov_gpu_unit_tests
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Fix are_data_types_suitable_for_onednn so that it does not invalidate the output layout
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Apply comment
Signed-off-by: Andrew Park <andrew.park@intel.com>
---------
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Do not add sync if the node is within a shape_of subgraph
The dependency uses a CPU impl, so its execution has already finished.
* Address review comment: skip clFinish only when the runtime dependency, not the current node, is in a shape_of subgraph
* [IE TESTS] Add Global test config for Subgraph base test
* Replace using option by function redefinition
* fix build
* remove extra changes for gna/template
* code style
* add nvidia to devices
* Fix debian
* remove nvidia
* add relative path to the table
* add another column
* new table format
* fix build issue
* fix tab name
* remove old table
* format fixes
* change font
* change Windows path
* change tabset name
* add arm and x86_64 tables
* remove list dots
* ResolveNamesCollisions transformation refactoring; enable it in MOC
* fix the description
* call ResolveNamesCollisions transformation in the frontends; resolve review comments
* Resolve review comments
* Rename ngraph with ov in common_test_utils
* Fix comments
* Delete opset includes from commonTestUtils
* Fix CI and code style
* Apply comments
* Fix CI and apply comments
* Used pip wheel to build OpenVINO wheel
* Improvements
* Added python api properly into deb / rpm packages
* Use cmake -E variant from cmake 3.13
* Improvements to fix CI
* Some refactoring
* Don't use developer package for scripts projects
* Final fixes
* Fixed to use input shape rank when calculating output layout, added unit test case
* Fixed to use input shape rank when creating shape_of primitive, added functional tests
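For context on the last two items, here is a minimal Python sketch (not the plugin's C++ code) of why the input rank is what matters for a shape_of-style op: the output is a 1-D tensor with one element per input dimension, so its layout can be computed even when individual dimensions are still dynamic. The function name `shape_of_output_shape` and the use of `-1` for a dynamic dimension are assumptions made for illustration only.

```python
from typing import List, Sequence


def shape_of_output_shape(input_shape: Sequence[int]) -> List[int]:
    """Return the shape of the tensor produced by a shape_of-style op.

    The result is always 1-D and its length equals the input rank.
    Dynamic dimensions (written as -1 in this sketch) do not affect it,
    because only the number of dimensions is used, not their values.
    """
    return [len(input_shape)]


if __name__ == "__main__":
    # Rank-4 input with two dynamic dimensions: output shape is still [4].
    print(shape_of_output_shape([1, 3, -1, -1]))
    # Rank-2 input: output shape is [2].
    print(shape_of_output_shape([8, 16]))
```

The output shape depends only on `len(input_shape)`, which is why a layout calculation based on rank stays valid for dynamic inputs.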