1019 Commits

Author SHA1 Message Date
Vladimir Paramuzov
325d02b760 [GPU] Use stream executor and exceptions from new api (#18531)
* [Common] Handle ov::Exception too in CompiledModel wrapper

* [GPU] Use new threading and exception interfaces where possible
2023-07-14 20:51:40 +02:00
Sungeun Kim
181238bcfb [GPU] Move unit test file to the proper location (#18545) 2023-07-14 14:19:56 +04:00
Andrew Kwangwoong Park
38913f2184 [GPU] Add debug config for disabled async compilation (#18535) 2023-07-14 10:42:03 +04:00
Andrew Kwangwoong Park
3f67b3948d [GPU] Dynamism support for Proposal (#18489) 2023-07-14 10:28:56 +04:00
Sergey Shlyapnikov
b7935bb869 [GPU] Fix data races in RegisterFactory() and LogHelper::LogHelper() functions (#18521) 2023-07-14 09:06:10 +04:00
Sungeun Kim
6913a5effb [GPU] set fixed random seed (#18144)
* random_generator
* apply random_generator to all tests
2023-07-13 21:13:29 +09:00
yanlan song
77f065e4b1 refactor code (#18475)
Signed-off-by: fishbell <bell.song@intel.com>
2023-07-13 13:58:44 +04:00
Min, Byungil
884006a0cf [GPU] Optimizing primitives for only batch axis padding (#18163)
+ Optimized out Reshape if only batch axis has padding
+ Not to add Reorder before oneDNN if only batch axis has padding
+ Re-calculate output layout's padding if Reshape is optimized out
+ Not to apply optimization of stable diffusion iGPU perf to dGPU(#18200)

Signed-off-by: Min, Byungil <byungil.min@intel.com>
2023-07-13 18:19:04 +09:00
Sergey Shlyapnikov
feab0ab75e [GPU] Memory preallocation optimization (#18181)
* [GPU] Use real layout for cpu impls instead of memory's

* [GPU] Add memory tracking and pre allocation mechanism

* Tests and minor code refactoring

* Apply review comments

* Remove unused include
2023-07-13 07:26:17 +02:00
Andrew Kwangwoong Park
eba1b46719 [GPU] Update ROIPooling to use nGraph shape inference (#18500) 2023-07-13 09:04:13 +04:00
Nadezhda Ageeva
3330cd69a7 [GPU] Introduce ov::config_device_id internal property to the new API (#17820)
* [GPU] Introduce ov::config_device_id internal property to the new API. Fix setting property for specific device

* Introduce ov::internal::supported_properties

* Fix caching tests

* Fix win tests

* Fix Proxy tests

* Replace CONFIG_KEY_INTERNAL(CONFIG_DEVICE_ID) by ov::internal::config_device_id in PROXY plugin

* Review comments
2023-07-12 10:03:35 +00:00
Andrei Gorbachev
af6c2b0671 [Coverity] Replace find to count (#18493)
* fix CVS-112761

* fix CVS-112760

* fix CVS-112770

* fix CVS-112763, replace find to count
2023-07-12 11:57:41 +04:00
Paul Youngsoo Ahn
1e69152d9f [GPU] Set default format only for the dynamic model with static input shape (#18486) 2023-07-11 23:37:56 -07:00
Taylor Yeonbok Lee
8f513002b6 Disable redundant reset for internal buffer (#18447) 2023-07-12 02:00:11 +02:00
Pawel Raasz
56f51135d4 Grey preprocessing yuv nv12 i420 (#18239)
* Add two plane YUV to Grey conversion

* Add i420 to grey conversion

* Add yuv to grey conversion for GPU

* Fix cmakes

* Remove static from local function

* Remove opencv dependency from tests

* Put grey_from_yuv_single_plane into namespace
2023-07-11 12:41:24 +02:00
Paul Youngsoo Ahn
ff4ed54bba [GPU] Fix kernel selecting issue on dynamic model with static input shape (#18448)
* [GPU] disable blocked format for dynamic shape model(#18448)

* [GPU] Return default format for output layout rank when user node is reshape in get_preferred_format
- Rollback code to disable blocked format for dynamic shape

* [GPU] Add unit test checking has_reshape_user

* [GPU] remove redundant comments
2023-07-10 22:54:56 -07:00
Pavel Durandin
75696f4545 [GPU] Fix DG2 units tests with null users (#18455)
* [GPU] Fix DG2 units tests with null users

* [GPU] Fix DG2 units tests with null users
2023-07-10 19:32:38 +04:00
Andrei Gorbachev
8f529df763 [Coverity] Fix coverity (#18420)
* fix CVS-112761

* fix CVS-112760

* fix CVS-112770
2023-07-10 09:49:59 +04:00
Andrew Kwangwoong Park
eb3bb52a08 [GPU] Update DepthToSpace to use nGraph shape inference (#18422)
* Update DepthToSpace to use ngraph shape infer

* Remove legacy block_size limitation for static shape

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Add TCs for ov_gpu_func_tests and ov_gpu_unit_tests

Signed-off-by: Andrew Park <andrew.park@intel.com>

---------

Signed-off-by: Andrew Park <andrew.park@intel.com>
2023-07-07 15:13:48 -07:00
Paul Youngsoo Ahn
f940333a1c [GPU] Apply ngraph shape infer for ROIAlign (#18424) 2023-07-07 11:37:21 -07:00
Vladimir Paramuzov
2939818278 [GPU] Enable shape agnostic quantize_scale_shift kernel (#18188)
* [GPU] Enable shape agnostic quantize_scale_shift kernel

* Resolve conflicts in compile_graph.cpp

* Update compile_graph.cpp

---------

Co-authored-by: Pavel Durandin <pavel.durandin@intel.com>
2023-07-07 10:38:41 +00:00
Ilya Churaev
b69190c2c1 Proxy plugin 2.0 (#17416)
* Initial commit for proxy plugin

* Include proxy to openvino library

* Fixed some tests

* Added plugin properties

* Added readme

* Support Compile model for the proxy plugin

* Fixed python

* Remove gmock

* Fixed some tests

* Fixed load plugin tests

* Removed internal PROXY_FALLBACK_PRIORITIES initialization property

* Fixed code style

* Added new tests

* Create architecture.md

* Fixed some proxy tests

* Implement not implemented functions

* Fixed ICore mock

* Remove redundant code

* Added import/export tests

* Fixed hetero import/export tests

* Fixed ICore mock

* Fixed import tests

* Fixed build

* Remove redundant parse config

* Fixed some comments

* Try to fix Windows build

* Fixed incorrect logic

* Small fix in tests

* Fixed python API

* Fixed typo

* Try to fix python

* Switch GPU under proxy

* Fixed GPU name

* Revert GPU plugin under proxy

* Small changes in CMake files

* Temp commit

* Build without proxy

* Revert "Temp commit"

This reverts commit 1ac9824bdf.

* Fixed test linking

* Removed tests from ncc check

* Add option to disable proxy plugin

* Fixed minor comments

* Disable some proxy tests if IR frontend is disabled

* Enable Intel GPU under the proxy

* Fixed typo

* Fixed segfault in tests

* Small fix for case if only GPU is registered

* Fixed code style

* Added remote context tests

* Added proxy tests to CI

* Fixed mock engine

* Test change

* Revert "Test change"

This reverts commit 2d1d67766f.

* Add new tests

* Removed some tests

* Revert "Removed some tests"

This reverts commit 090398652f.

* Revert incorrect logic

* Removed unused variables

* Use original name for the GPU plugin

* Fixed CMake

* Do not show hidden devices

* Try to fix GPU remote context

* Fixed GPU plugin build

* Added interface for proxy remote context

* Remove local properties

* Remove redundant API

* Fixed typo

* Added remote tensors

* Fixed code style

* Fixed some comments

* Unwrap remote tensors before conversion to Blob

* Added cast for legacy API

* Fixed some cldnn tests

* Do not add preprocessing for proxy plugin

* Enabled more tests and wrap tensors in infer request

* Use hardware request inside conversion wrapper

* Fixed hang on cache calculation

* Try to fix some tests

* Support set tensor for remote tensors in proxy plugin

* Revert "Support set tensor for remote tensors in proxy plugin"

This reverts commit 5a927de590.

* Remove redundant friend from compiled model and fix lifetime for infer request

* Fixed code style

* Add additional so pointers to the tensor

* Rewrite approach for compile model and tensor

* Removed API from proxy

* Fixed is/as Blob for wrapped Blobs

* Wrap tensor when set tensor to plugin

* Fixed recursive call

* Don't unwrap tensors for AUTO plugin

* Fixed Some Multi tests with remote blob for proxy

* Align context name with tests

* Fixed code style

* Try to fix more tests

* Some minor changes

* Try to fix OVHoldersTests

* Try to save low level SO in high level wrappers

* Revert "Try to save low level SO in high level wrappers"

This reverts commit 430ff8a526.

* Revert "Try to fix OVHoldersTests"

This reverts commit 32604f0d3e.

* Disable some tests

* Fixed DynamicBatchShapeTests

* Fixed caching tests and skip OVHoldersTest

* Small refactoring

* Fixed import model

* Small fix

* Fix typo which causes fails of caching tests

* Disabled AUTO BATCH for proxy device

* Support Export in Auto batch

* Small changes

* Fixed initialization fallback to plugin with proxy name

* Added more values for tests

* Ask all devices and create context if no device id

* Support export in auto batch

* Fixed some comments

* Fixed some comments and removed auto batch

* Fixed some comments

* Fixed auto batch test and some comments

* Fixed build

* Removed proxy plugin class from dev api

* Fixed code style

* Fixed disabled tests
2023-07-07 13:45:20 +04:00
Paul Youngsoo Ahn
da84027b72 [GPU] Fix setting impl type / format issue on dGPU (#18345)
* [GPU] Add roi_align get_shape_infer_dependencies (#18345)

* [GPU] Fix concat cpu impl for buffer fusing case

* [GPU] Add roi_align shape_infer unit tests

* [GPU] Fix windows build issue

* [GPU] add unit test
2023-07-07 11:04:14 +02:00
Andrei Gorbachev
6e49b97dd6 [Coverity] fix type x_channel_idx (#18309)
* fix 112753

* fix 112753
2023-07-07 08:19:20 +04:00
Eddy Kim
42d44ddc84 updated not to propagate events when in-order queue is used (#18415) 2023-07-06 19:57:12 -07:00
Eddy Kim
58d1fc3c6b [GPU] updates to use a graph local net_id for model caching (#18372)
* newly added local_net_id

* updated calls to network ctors

* updated to use uint32_t for local_net_id

* added comments for _local_net_id
2023-07-06 18:34:08 -07:00
Tanmay
e0d7ce691b fix typo in primitive_inst.cpp (#18417) 2023-07-07 01:25:56 +04:00
Andrew Kwangwoong Park
f5baec14aa [GPU] Fix issue in runtime buffer fusing for oneDNN concat and auto padding support for conv (#18393)
* Add auto pad attribute support for conv

* Fix to let concat onednn impl check can_be_optimized in impl_param instead of that in node

* Apply auto padding to kernel param for conv ocl impl

* conv shape agnostic kernel is not selected if conv is not explicit padding

* Fix failed TCs for ov_gpu_unit_tests

Signed-off-by: Andrew Park <andrew.park@intel.com>

---------

Signed-off-by: Andrew Park <andrew.park@intel.com>
2023-07-06 11:20:34 -07:00
Kelvin Choi
f0ec940fc1 [GPU] Support FC 6d output by compressing as 4d in kernel for dynamic (#17996) 2023-07-06 11:07:28 -07:00
hyunback kim
2e8ef4c200 [GPU] oneDNN 3.2 integration (#18326)
Signed-off-by: hyunback <hyunback.kim@intel.com>
2023-07-06 16:34:47 +09:00
Jade Cho
ebf1199733 [GPU] Optimize conv_b_fs_zyx_fsv16_imad (#18356)
+ Make this kernel not support asymmetric weights.
2023-07-06 16:28:40 +09:00
Andrei Gorbachev
87ed81f51c [Coverity] fix _type declaration (#18322)
* _type declaration

* fix _type init
2023-07-06 09:32:44 +04:00
Andrew Kwangwoong Park
9069dab72d [GPU] Some fixes to enable dynamic validation model (#18340)
* skip fuse_quantize_f if input or output layout is dynamic

* Update condition of can_fuse_reorder_to_prev for concat in shapeof subgraph

* skip concat_in_place_optimization if concat is shape of subgraph

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Add reorder if eltwise is shapeof subgraph and data type between input and output is different

* Skip reorder optimization if reorder has dynamic shape on remove_redundant_reorders

* Add reproducible TCs for ov_gpu_unit_tests

---------

Signed-off-by: Andrew Park <andrew.park@intel.com>
2023-07-05 10:05:12 -07:00
Kelvin Choi
68e1f07ec9 [GPU] selected format adjusts to the required input rank at get_preferred_format (#18044) 2023-07-05 09:49:19 -07:00
Andrei Gorbachev
8a76f4e7fa fix coverity block_size < 1 (#18320) 2023-07-05 12:09:40 +04:00
Andrei Gorbachev
c58bf68bc8 [GPU] fix uninitialized scalar variable (#18199)
* fix uninitialized scalar variable

* fix get_default_params
2023-07-05 10:29:04 +04:00
Kelvin Choi
35e26906ed [GPU] Update pad output shape for dynamic (#17995) 2023-07-05 04:22:49 +00:00
Eddy Kim
4c072ac4b5 [GPU] memory reuse for dynamic models (#18228)
* enable memory reuse for dynamic models

* updated to return dependent events for the shape_of primitive

* fixed memory_pool.release_memory()

* fixed a lint error

* fixed missing default value

* updated to use reset flag for dynamic models

* changed to use is_dynamic_output_layout instead of is_dynamic

* updated to use get_internal_params instead of buffer_ptr

* added a memory reuse test for dynamic models
2023-07-04 18:12:49 -07:00
Ilya Churaev
7fc16c3295 Move OpenVINO core developers API to OpenVINO folder (#18360)
* Move openvino core developers API to OpenVINO folder

* Fixed GPU plugin

* Fixed code style
2023-07-04 22:14:53 +04:00
Steve Yoo
6be030be05 Fixed SpaceToBatch and BatchToSpace for 3d case (#18033)
* Added functional tests for SpaceToBatch and BatchToSpace

* Added functional tests for template plugin
2023-07-02 21:39:45 -07:00
Wilson Seok
504f1d8237 [GPU] fix pr18171 regression (#18272) 2023-06-29 09:44:38 +09:00
hyunback kim
2582f04c9c [GPU] Optimize stable diffusion perf igpu (#18200)
* [GPU] Optimize stable_diffusion performance in iGPU.

Change the existing heuristic shape condition to permute and no transpose gemm in case of transpose gemm.

Signed-off-by: hyunback <hyunback.kim@intel.com>
2023-06-28 13:57:10 +02:00
Wilson Seok
1efb9eafae [GPU] Add condition check for dynamic shape and onednn_impl in concat_in_place_optimization::match() (#18034)
* add dynamic shape support for dgpu in prepare_buffer_fusing

* add unit test

* add space between test cases

* update condition of impl create() for concat dynamic shape

* update unit test

* add comment and update unit test

* add impl_param.is_type() function
2023-06-27 23:39:00 -07:00
Paul Youngsoo Ahn
50897e86e6 [GPU] Impl cldnn::condition to support dynamic shape (#18051)
* [GPU] Impl cldnn::condition to support dynamic shape (#18051)
* Impl CreateIfOp
* Update calc_output_layouts and execute_impl
* Enable gpu unit test
* Create gpu functional test

* [GPU] Follow-up code review (#18051)
* remove redundant codes
* create custom execute method for condition_inst
* change name from update_loop_primitive_map to update_inner_program_io_map

* [GPU] Fix gpu func test failures for fp16

* Add more test-cases to support fp16 and nested if case

* [GPU] remove redundant codes
* refactoring var names
* fix windows build error

* [GPU] Fix windows build issue

* [GPU] update calc_output_layouts

* [GPU] remove custom condition_inst::execute

* Remove virtual keyword from primitive_inst::execute()

* [GPU] Share single task executor between main program and inner program

* [GPU] Fix input rank issue for const inner network in condition op

* [GPU] apply calc_output_layouts for roi_align

Co-authored-by: Vladimir Paramuzov <vladimir.paramuzov@intel.com>

* [GPU] avoid checking allow_new_shape_infer for inner program

---------

Co-authored-by: Vladimir Paramuzov <vladimir.paramuzov@intel.com>
2023-06-27 17:05:26 +02:00
Andrew Kwangwoong Park
1566567ca4 [GPU] Fix output layout calculation for crop and fc (#18207)
* Fix get_partial_shape tensor API to access the correct index of dimensions

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Update the rule specifying output_type to the legacy one by referring to calc_output_layout

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Add reproducible TCs related to issues for ov_gpu_unit_tests

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Fix failed fc dynamic i8 TCs for ov_gpu_unit_tests

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Fix are_data_types_sutable_for_onednn not to invalidate output layout

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Apply comment

Signed-off-by: Andrew Park <andrew.park@intel.com>

---------

Signed-off-by: Andrew Park <andrew.park@intel.com>
2023-06-27 11:30:30 +02:00
Mingyu Kim
61b15ce31a Revert "[GPU] Reorder weights refactoring (#17787)" (#18248)
This reverts commit d00c7d30f9.
2023-06-27 17:26:18 +09:00
Taylor Yeonbok Lee
bcf58344cc Fix crash for shape of subgraph due to missing mem_dep (#18246) 2023-06-26 16:48:10 -07:00
Wilson Seok
f306a11b82 [GPU] fix issues of MobileFaceNet for dynamic shape (#18171)
* fix issues of MobileFaceNet for dynamic shape

* update unit test
2023-06-26 17:22:15 +09:00
Taylor Yeonbok Lee
bf299c807e [GPU] Not to add sync if the node belongs to shape of subgraph (#18158)
* Not to add sync if the node is within shape of subgraph
Because the dependency is a cpu impl, its execution is already finished.

* Fixed per review comment: skip clFinish only when the runtime dep, not the current node, is in the shape of subgraph
2023-06-25 21:51:45 -07:00
Taylor Yeonbok Lee
22ef2f4e6a Fix bug in weight reorder. (#18224)
The original memory was overwritten unexpectedly because it was checking shared_ptr instead of the actual buffer address
2023-06-24 00:35:07 -07:00