Commit Graph

1099 Commits

Author SHA1 Message Date
Eddy Kim
9deef1480a updated to init onednn attr (#19055) 2023-08-10 07:38:32 +00:00
Sergey Shlyapnikov
32a6a31de2 [GPU] Prevent Conv's input data type changing at reorder_inputs pass (#19042) 2023-08-10 10:59:32 +04:00
Sergey Shlyapnikov
d91d72c89c [GPU] RandomUniform new shape inference for dynamism support (#19087) 2023-08-10 09:43:08 +04:00
hyunback kim
f683fabcbf [GPU] Add Disable Winograd convolution debug config. (#18701)
Signed-off-by: hyunback <hyunback.kim@intel.com>
2023-08-10 10:58:57 +09:00
Roman Lyamin
575f497c8b [GPU] Fix data race in CompilationContext (#19103) 2023-08-10 00:50:49 +00:00
Roman Lyamin
2d5a4e20ec [GPU] Fix premature termination of async compilation task in tests (#19035) 2023-08-09 12:47:06 +04:00
Roman Lyamin
7d89c93ded [GPU] Added gi_yxs_os_yxsv2_osv16 format to to_weights_layout map (#19074) 2023-08-09 12:03:48 +04:00
Andrew Kwangwoong Park
df659daf44 [GPU] Add reorder if the input format of eltwise is different from the output format (#19068)
* [GPU] Add reorder if the input format of eltwise is different from the output format

* Fix failed TCs for eltwise's mixed layout
2023-08-08 23:21:54 -07:00
Kelvin Choi
e60da26894 [GPU] Support dynamic BatchToSpace and SpaceToBatch (#18630) 2023-08-09 05:56:55 +00:00
David Nam
a8f4e114f3 [GPU] Check if padding is supported for static shape when buffer fusing (#18861)
* [GPU] Check if padding is supported even for static shape when buffer fusing

* Add unit test

* Remove unnecessary logic
2023-08-08 22:15:56 -07:00
hyunback kim
58bd8e7f0b [GPU] Add disable winograd convolution in execute_config. (#18964)
* [GPU] Add disable winograd convolution in execute_config.

Signed-off-by: hyunback <hyunback.kim@intel.com>
2023-08-09 13:32:36 +09:00
Sergey Shlyapnikov
e7565eda3e [GPU] Added paddings processing for dynamic convolutions primitives; enabled convolution osv16 optimized shape agnostic kernel (#19001) 2023-08-08 14:57:54 +02:00
Min, Byungil
70e88c6b23 [GPU] Add reorder to avoid perf drop by using mvn_ref kernel (#18871)
+ Added is_padded_spatial to program_node
+ Added reorder to remove padded input in spatial axis for mvn
+ case applied only for blocked formats of implemented mvn opt kernel

Signed-off-by: Min, Byungil <byungil.min@intel.com>
2023-08-08 16:04:54 +09:00
Kelvin Choi
b83389a557 [GPU] Support non constant input of split length for variadic split (#18898) 2023-08-07 21:26:54 -07:00
Jade Cho
e5e944ccc2 [dGPU] Copy data from gpu_buffer to gpu_usm (#19010)
* [dGPU] Copy data from gpu_buffer to gpu_usm

* Add a unit test.
2023-08-08 11:47:22 +09:00
Andrew Kwangwoong Park
a2807f1edb [GPU] Add debug config for disabled primitive fusing (#18938)
* Add debug config for disabled primitive fusing

* Apply comment

* Fix failed TCs on CI
2023-08-07 18:02:48 -07:00
Eddy Kim
d278ff5786 [GPU] updated to enqueue only fully_connected layers for async build (#18891)
* updated to enqueue only fc for async build

* updated use_async_compilation(), make_task_executor_config() and disabled gemm_onednn.impl_replacement_with_cldnn

* added _num_async_build_threads

* added gemm to the async compliation targets
2023-08-08 09:27:35 +09:00
Andrei Gorbachev
36309938d9 [Coverity] fix coverity (#18784)
* fix CVS-112758

* fix CVS-114053

* fix CVS-116203

* fix CVS-112785 part 0

* fix CVS-112785 part 1

* fix CVS-112785 part 2

* fix CVS-112785 part 2+

* Update src/plugins/intel_gpu/src/plugin/ops/gather.cpp

Co-authored-by: Roman Lyamin <Roman.Lyamin@intel.com>

* fix CVS-112785 OPENVINO_ASSERT

* add suggestion

* fix CVS-112785

---------

Co-authored-by: Roman Lyamin <Roman.Lyamin@intel.com>
2023-08-04 14:38:06 +00:00
Edward Shogulin
b76fc24824 [LPT] Unexpected quantisation level fix (#18888)
* [LPT] Unexpected quantisation level support

* [LPT] Unexpected quantisation level support - extension for weights

* [LPT] Unexpected quantisation level support - extension for weights + tests

* refactoring: compilation fix
2023-08-04 15:02:12 +01:00
Andrew Kwangwoong Park
a3bb0a8cc7 Restore disable_memory_reuse initialization (#18987) 2023-08-04 12:53:09 +00:00
Ivan Tikhonov
5f6aba5ec5 Update ResolveNameCollisions to handle all friendly names, not only auto-generated (#18948)
* Update ResolveNameCollisions transformation to handle all friendly names, not only autogenerated

* codestyle
2023-08-04 13:38:59 +04:00
Taylor Yeonbok Lee
08ab7f4ab5 Removing blocking enqueuememcpy (#18977) 2023-08-04 02:46:41 +00:00
Kelvin Choi
561f71d86c [GPU Enable dynamism for grid_sample (#18953) 2023-08-03 11:09:07 -07:00
Wilson Seok
bdab260131 [GPU] Fix priorbox constant propagation (#18842)
* removed priorbox in mark_if_constant

* fix priorbox operation for dynamic shape

* restore share test classes and disabled the test cases

* add exception throw for PriorBoxClustered
2023-08-03 10:39:56 -07:00
Wilson Seok
3a1f45e70e [GPU] fix normalize dynamic shape support (#18956)
* fix normalize dynamic shape support

* remove redundant code
2023-08-03 10:37:41 -07:00
Andrew Kwangwoong Park
1501e29cfe [GPU] Allow StridedSlice as predecessor for in place concat (#18836)
* Allow StridedSlice as predecessor for in place concat

* Enable padding support for strided slice

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Add prepare_buffer_fusing TC for ov_gpu_unit_tests

---------

Signed-off-by: Andrew Park <andrew.park@intel.com>
2023-08-03 05:50:26 +00:00
Taylor Yeonbok Lee
31e46ea255 [GPU] Skip reorder at runtime if data type and format are not changed (#18859)
* Skip reorder at runtime if data type and format are not changedt

* Update shape of reorder user at predecessor node so that we can allocate pred nodes' output to host mem if needed

* Reinterpret reorder memory at runtime if needed
(e.g., input is fake-aligned fc and reorder uses that memory)

* Add debug config

* Fix CI test failure

* Do not skip after optimized reshape

* Do not skip user reorder if the user reorder is output and current node is static, and the memory is allocated to device

* Disable skip reorder user if current node has fused node

* Update src/plugins/intel_gpu/src/graph/include/reorder_inst.h

Co-authored-by: Eddy Kim <eddy.kim@intel.com>

* Minor fix for compilation error

* Do not skip reorder if the reorder's user is optimizable concat

* Fix CI failures

* No need to wait for input_layout because the events is already resolved in dgpu

* Fixed corner case where only some of the multiple output layouts are static

---------

Co-authored-by: Eddy Kim <eddy.kim@intel.com>
2023-08-02 20:59:52 -07:00
Wilson Seok
d51fc7adad add num_classes=-1 support in cpu_impl of detection_output (#18924) 2023-08-02 11:37:19 -07:00
Eddy Kim
9e9cf72973 updated to reset variable memory when shape is changed (#18941) 2023-08-02 11:24:58 -07:00
Eddy Kim
1fd598c303 onednn-gpu update to fix caching issue (#18935) 2023-08-02 11:14:30 -07:00
Wilson Seok
86b8e0a930 [GPU] Add autopad option in CreateOp of groupconvolution (#18897)
* add autopad option in CreateOp of groupnormalization

* add autopad option in CreateOp of GroupConvolution
2023-08-01 20:41:05 -07:00
Taylor Yeonbok Lee
db8c29ee6f [GPU] Release unused internal memory from pool (#18917)
* * Not to reuse internal memory for dynamic shape because of the current inefficiency in the pool
* Added a new debug config for dump runtime memory pool

* Apply DisableMemoryReuse for all usages

* Resolved perf issue of memory reuse from pool : Previously original ibuf record was not released when we allocate new memory for that buf.
After releasing the memory, # of the memory pool record does not increase => no longer inefficient memory pool retireval.

* Added test
2023-08-01 20:25:05 -07:00
Pawel Raasz
9067a1fead Review NMS, MatrixNms shape inference (#18869)
* Review NMS shape inference

* Fix test issues

* Review MatrixNms shape inference

* Unregister NMS with dynamic output from CPU plugin
2023-08-01 19:44:55 +04:00
Taylor Yeonbok Lee
974ef62ce6 [GPU] Do weight reorder for fc shape agnostic kernels at build time (#18829)
* Do weight reorder at build time

* Add test
2023-08-01 03:40:06 +00:00
Oleg Pipikin
c52afe8032 Refactor func test utils (#18849) 2023-07-31 16:22:32 +02:00
Yaroslav Torziuk
86bd5a4e2b [GPU] Interpolate v11 GPU reference implementation (#18472)
* Extract axes normalization and validation in separate functions in Interpolate op

* Update resample primitive declaration

* Update output layout calculation for Interpolate v11

* Update Interpolate op builder

* Add a shared test instance for Interpolate from 11th opset

* Add basic tests for Interpolate from opset 11

* Add new resample types and appropriate flags in ParamsKey

* Replace map which holds axes and scales with two separate vectors in resample_params

* Add resample kernel implementation
2023-07-31 05:41:21 +02:00
Jade Cho
c0783f16ed [dGPU] Enable user scratchpad mode for onednn primitive. (#18699)
* [dGPU] Enable user scratchpad mode.
* Reuse intermediate buffer.
* Add own id to the memory dependencies at the c-tor of program_node
+ Allocate intermediate memory with memory_pool::get_memory() function.
+ Assign scratchpad memory desc in load() function for onednn primitive
serialization
* Allocate device mem for onednn scratchpad mem
2023-07-30 23:13:45 +09:00
Pavel Esir
5113900350 [GPU] Fuse MVN before ConvertPrecision (#18845) 2023-07-28 20:05:05 +02:00
Vladimir Paramuzov
3e0b4b337b [GPU] LUID property added (#18815) 2023-07-28 19:16:25 +04:00
Vladimir Paramuzov
9177809bd7 [GPU] Fixed removal of reorder connected to non-default out port (#18826) 2023-07-28 14:55:46 +04:00
David Nam
1fcdc90989 Modify the condition making batch interpretation true/false (#18283)
* Modify the condition making batch interpretation true/false

- When the user is Convert for Constant node, and tensor is 1d,
- Set needBatchInterpretation to true

* Narrow down the range of the condition

* Merge the condition

* Add additional condition not to check self node

* Fix incomplete condition

* Check if all inputs to binary eltwise is 1d

* Change code style
2023-07-27 22:42:07 -07:00
Andrew Kwangwoong Park
aba2770921 [GPU] Do not add post reorder for inputs if data type does not change (#18797)
* Remove reorders of inputs for KV cache

* Fix failed CI TC for ov_gpu_func_tests
2023-07-27 20:50:04 -07:00
Taylor Yeonbok Lee
acb7e870ce Add new debug config disable_memory_reuse (#18792) 2023-07-27 21:45:34 +00:00
Sergey Shlyapnikov
2bd49cc1f8 [GPU] Add extra flush() command for dynamic models and OOO queue (#18819) 2023-07-27 16:15:37 +04:00
Mingyu Kim
416fdddd34 [GPU] bugfix for copy_from (#17993) 2023-07-27 11:47:40 +09:00
Min, Byungil
83a78eb559 [GPU] Improvement for buffer dump (#18542)
* [GPU] Improvement for buffer dump

+ added OV_GPU_DumpLayersInput to support dump input layers
+ added OV_GPU_DumpLayersRawBinary to make binary dump
+ added OV_GPU_LoadDumpRawBinary to use binary dump as input
+ binary dump naming rule layername_datatype_tensor_format.bin

Signed-off-by: Min, Byungil <byungil.min@intel.com>
2023-07-27 11:11:34 +09:00
Taylor Yeonbok Lee
c145d8f1e9 [GPU] Allocate reorder & permute's output memory for bounded dynamic shape (#18793)
* Previously reorder / permute was not allocating its memory at build time thought the shape has upper bound

* Update src/plugins/intel_gpu/src/graph/permute.cpp

Co-authored-by: Sergey Shlyapnikov <Sergeishlyapnikov@gmail.com>

* Fix as review comment

---------

Co-authored-by: Sergey Shlyapnikov <Sergeishlyapnikov@gmail.com>
2023-07-27 02:08:58 +02:00
Oleg Pipikin
ff0b51f3ca Rename CommonTestUtils namespace (#18590)
* Rename CommonTestUtils namespace 1

* Rename CommonTestUtils namespace 2

* Rename CommonTestUtils namespace 3

* Rename CommonTestUtils namespace 4

* Fix1

* Fix2

* Fix3

* Apply comments

* Fix4

* Fix5
2023-07-26 21:06:50 +02:00
Sergey Shlyapnikov
7f917446c2 [GPU] Force ov::enable_profiling if OV_GPU_DumpProfilingData debug option is used (#18687) 2023-07-26 13:32:30 +04:00
Nadezhda Ageeva
97b4b13074 Adds ov::internal::exclusive_async_requests in ov::internal::supported_properties (#18746)
* Adds ov::internal::exclusive_async_requests into ov::internal::supported_properties

* Update HETERO tests for ov::internal::exclusive_async_requests
2023-07-26 09:43:21 +04:00