* [GPU] Change lws to avoid synchronization issue in nonzero_count (#16116)
* [GPU] Add unit test (#16116)
* [GPU] update count_nonzero_ref kernel (#16116)
- Support the case where the total data size exceeds the max work group size (see the sketch below)
- Add dynamic shape test case
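A minimal serial C++ sketch of the chunking idea, assumed from the notes above (the actual count_nonzero_ref OpenCL kernel is different): when the element count exceeds the maximum work group size, each "work group" counts the non-zeros in its own chunk and the per-group partial counts are reduced at the end.

```cpp
#include <algorithm>
#include <cstddef>
#include <numeric>
#include <vector>

// Emulates one "work group" per chunk of at most max_wg_size elements,
// followed by a final reduction over the partial counts.
size_t count_nonzero_chunked(const std::vector<float>& data, size_t max_wg_size) {
    const size_t num_groups = (data.size() + max_wg_size - 1) / max_wg_size;
    std::vector<size_t> partial(num_groups, 0);

    for (size_t g = 0; g < num_groups; ++g) {
        const size_t begin = g * max_wg_size;
        const size_t end = std::min(begin + max_wg_size, data.size());
        for (size_t i = begin; i < end; ++i)
            partial[g] += (data[i] != 0.0f) ? 1 : 0;   // per-group partial count
    }
    // Final reduction over the per-group partial counts.
    return std::accumulate(partial.begin(), partial.end(), size_t{0});
}
```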
* [GPU] Change input indexing calculation and add random input generator in unit test (#16116)
* [GPU] update random input generation function in nonzero_count (#16116)
* [GPU] update unit test (#16116)
* [GPU] cldnn unit test: update random generation function to fix another test failure (fusings_gpu/conv_fp32_multi_eltwise_quantization.basic/0) (#16116)
* [GPU] Enabled ComparisonLayerTest in single layer tests.
It seems these tests were previously disabled because of some failures. I can no longer see any errors, so I have enabled all of them.
* [GPU] Run clang format for comparison single layer tests.
* [GPU] Added handling of f16 type to IsInfLayerTest.
* [GPU] Added single-layer tests for IsFinite and IsNaN operations.
* [GPU] Added single-layer test for IsInf operation.
* [GPU] Implemented IsFinite, IsInf, and IsNaN operations as activation functions.
But note that currently the activation kernel only supports an output data type equal to the input data type, so an additional reorder is needed to convert to the correct output data type for these ops. Also worth noting: activation functions can be fused into the reorder kernel, but for now that does not work for these ops, because the reorder's activation call performs a hard conversion of the input data to the output data type before applying the activation. I don't know why that conversion was added, but it breaks the fusion, so this activation fusion either needs to be fixed or disabled for these ops.
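A simplified illustration of the order-of-operations problem described above (not the actual reorder kernel code; the saturating conversion helper is a stand-in for the hard type conversion): converting the input to the output type before applying an IsInf-style activation destroys the information the activation needs.

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <iostream>
#include <limits>

// Stand-in for the hard f32 -> u8 conversion done by the reorder before the fused activation.
uint8_t convert_u8_sat(float v) {
    if (std::isnan(v)) return 0;
    return static_cast<uint8_t>(std::min(std::max(v, 0.0f), 255.0f));
}

int main() {
    const float in = std::numeric_limits<float>::infinity();

    // Broken order (as fused in reorder): convert first, then apply IsInf.
    const bool fused = std::isinf(static_cast<float>(convert_u8_sat(in)));  // false

    // Intended order: apply IsInf on the original input, then convert the result.
    const bool expected = std::isinf(in);                                   // true

    std::cout << "fused=" << fused << " expected=" << expected << "\n";
}
```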
* Revert "[GPU] Implemented IsFinite, IsInf, and IsNaN operations as activation functions."
This reverts commit 3f9ffe617ecddce6dbbcdeab9584a7ddeb6d1845.
* [GPU] Implemented IsFinite, IsInf, and IsNaN operations as eltwise ops.
* [GPU] Changed CLDNN_ERROR_MESSAGE to OPENVINO_ASSERT in check_inputs_count method.
* [GPU] Minor fix for dynamic bert-base-uncased-qqp
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Fix to check full tensor only for static shapes when creating onednn gemm
Signed-off-by: Andrew Park <andrew.park@intel.com>
---------
Signed-off-by: Andrew Park <andrew.park@intel.com>
- Previously, PR15386 changed the memory allocation of primitives that are used as shape infer dependencies to host memory, for better shape inference performance.
- However, this causes a cache coherence issue on dGPU.
- Reverting this change so that the memory is allocated on the device
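A hypothetical before/after sketch of the policy being reverted; the enum and function names are illustrative, not the actual cldnn allocation API.

```cpp
enum class allocation_type { usm_host, usm_device };

// PR15386: primitives used as shape infer dependencies were placed in host memory
// so shape inference could read them faster on the host side.
allocation_type choose_allocation_pr15386(bool is_shape_infer_dependency) {
    return is_shape_infer_dependency ? allocation_type::usm_host
                                     : allocation_type::usm_device;
}

// This change: revert to device memory, because the host allocation caused a
// cache coherence issue on dGPU.
allocation_type choose_allocation_reverted(bool /*is_shape_infer_dependency*/) {
    return allocation_type::usm_device;
}
```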
* [dGPU] Enable stable diffusion
+ Prevent fusing swish into oneDNN reorder.
+ Make concat explicit if the batch size is greater than 1 and the siblings are oneDNN impls.
* [GPU] Added shape agnostic optimized SoftMax kernel
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Update SoftmaxKernelBaseBF::Validate policy for shape agnostic kernel
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Add softmax_gpu_bf shape agnostic TC for ov_gpu_unit_tests
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Fix failed TCs for ie-tests-linux-ubuntu20-gpu
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Update to use stack array instead of global buffer
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Remove global buffer usage completely
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Add #undef directive
Signed-off-by: Andrew Park <andrew.park@intel.com>
---------
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Enable crop shape agnostic kernel
* Added unit test
* Added a new scalar argument for crop (eltwise) to be used as the runtime input offset in the shape agnostic kernel (see the sketch below)
* Fix eltwise to use the runtime offset only for crop
* Fix unit test error
* Applied review comment
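A minimal sketch of the runtime input offset idea referenced above; plain C++ with illustrative names rather than the actual crop (eltwise) kernel. The offset arrives as a per-enqueue scalar argument, so one compiled shape agnostic kernel serves any crop position instead of baking the offset in at compile time.

```cpp
#include <cstddef>
#include <vector>

// 1D stand-in for crop: the input offset is a scalar argument set at enqueue time,
// not a compile-time constant, so the same kernel handles dynamically shaped inputs.
std::vector<float> crop_1d(const std::vector<float>& input,
                           size_t runtime_offset,   // scalar argument per enqueue
                           size_t output_size) {
    std::vector<float> output(output_size);
    for (size_t i = 0; i < output_size; ++i)
        output[i] = input[runtime_offset + i];
    return output;
}
```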
* [GPU] Fix output format not changing at runtime
Signed-off-by: Andrew Park <andrew.park@intel.com>
* Add remove_redundant_reorders pass TC for ov_gpu_unit_tests
Signed-off-by: Andrew Park <andrew.park@intel.com>
---------
Signed-off-by: Andrew Park <andrew.park@intel.com>
* [GPU] Apply multi-threads for async compilation context (#15683)
- Use CPUStreamExecutor in compilation context
- Use a single compilation context, impl_cache, and kernels_cache for multiple streams (see the sketch below)
- Move compilation context to cldnn::program
- Move impl_cache to cldnn::program
- Create thread-safe impl_cache
- Create thread independent compilation function in kernels_cache
- Use kernels_cache in program and remove it from network
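A minimal sketch of the async compilation idea from the bullets above, assuming a hand-rolled worker pool in place of the actual CPUStreamExecutor and cldnn classes: kernel builds are queued on one shared context so multiple streams reuse the same workers, and cancel() drops pending work on shutdown.

```cpp
#include <condition_variable>
#include <cstddef>
#include <functional>
#include <mutex>
#include <queue>
#include <thread>
#include <vector>

class compilation_context_sketch {
public:
    explicit compilation_context_sketch(size_t num_threads) {
        for (size_t i = 0; i < num_threads; ++i)
            workers_.emplace_back([this] { worker_loop(); });
    }

    ~compilation_context_sketch() {
        cancel();                       // stop accepting work and wake the workers
        for (auto& w : workers_)
            w.join();
    }

    // Called by a stream that hits a not-yet-compiled kernel: the build runs
    // asynchronously instead of blocking the inference stream.
    void push_task(std::function<void()> build_kernels_task) {
        {
            std::lock_guard<std::mutex> lock(mutex_);
            if (stopped_)
                return;
            tasks_.push(std::move(build_kernels_task));
        }
        cv_.notify_one();
    }

    void cancel() {
        {
            std::lock_guard<std::mutex> lock(mutex_);
            stopped_ = true;
        }
        cv_.notify_all();
    }

private:
    void worker_loop() {
        for (;;) {
            std::function<void()> task;
            {
                std::unique_lock<std::mutex> lock(mutex_);
                cv_.wait(lock, [this] { return stopped_ || !tasks_.empty(); });
                if (stopped_)
                    return;             // cancelled: drop any remaining queued builds
                task = std::move(tasks_.front());
                tasks_.pop();
            }
            task();                     // compile outside the lock
        }
    }

    std::vector<std::thread> workers_;
    std::queue<std::function<void()>> tasks_;
    std::mutex mutex_;
    std::condition_variable cv_;
    bool stopped_ = false;
};
```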
* [GPU] Fix segfault issue: ocl_engine and ocl_device are released while remaining compilation context tasks are still running (#15683)
- compilation context has own CPUStreamExecutor
* [GPU] Follow-up codereview (#15683)
- LruCacheThreadSafe inherits from LruCache (see the sketch below)
- FuncRemoveItem takes std::pair<Key,Value> as input
- Change prepare_tools to init_program
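A rough sketch of the thread-safe LRU wrapper shape described above; the real cldnn LruCache interface differs, this only shows the inheritance, the locking, and an eviction callback that receives a std::pair<Key, Value>.

```cpp
#include <functional>
#include <list>
#include <mutex>
#include <unordered_map>
#include <utility>

template <typename Key, typename Value>
class LruCache {
public:
    // Eviction callback takes the removed std::pair<Key, Value>.
    using FuncRemoveItem = std::function<void(std::pair<Key, Value>)>;

    explicit LruCache(size_t capacity) : capacity_(capacity) {}

    void set_remove_item_callback(FuncRemoveItem cb) { on_evict_ = std::move(cb); }

    void add(const Key& key, const Value& value) {
        auto it = index_.find(key);
        if (it != index_.end()) {                 // refresh an existing entry
            items_.erase(it->second);
            index_.erase(it);
        }
        items_.emplace_front(key, value);
        index_[key] = items_.begin();
        if (items_.size() > capacity_) {
            auto evicted = items_.back();         // least recently used entry
            index_.erase(evicted.first);
            items_.pop_back();
            if (on_evict_) on_evict_(evicted);
        }
    }

    bool get(const Key& key, Value& out) {
        auto it = index_.find(key);
        if (it == index_.end()) return false;
        items_.splice(items_.begin(), items_, it->second);  // mark as recently used
        out = it->second->second;
        return true;
    }

private:
    size_t capacity_;
    std::list<std::pair<Key, Value>> items_;
    std::unordered_map<Key, typename std::list<std::pair<Key, Value>>::iterator> index_;
    FuncRemoveItem on_evict_;
};

// Derived wrapper adds a mutex so one impl_cache can be shared by multiple streams.
template <typename Key, typename Value>
class LruCacheThreadSafe : public LruCache<Key, Value> {
public:
    using LruCache<Key, Value>::LruCache;

    void add(const Key& key, const Value& value) {
        std::lock_guard<std::mutex> lock(mutex_);
        LruCache<Key, Value>::add(key, value);
    }
    bool get(const Key& key, Value& out) {
        std::lock_guard<std::mutex> lock(mutex_);
        return LruCache<Key, Value>::get(key, out);
    }

private:
    std::mutex mutex_;
};
```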
* [GPU] Create primitive_impl::build_kernels (#15683)
* [GPU] Fix unit test build error (#15683)
* [GPU] Remove redundant code (#15683)
- Remove try catch for debug
- Call compilation_context.cancel() in destructor of network
* [GPU] Combine two atomic counters in kernels_cache (#15683)
* [GPU] Follow-up code review (#15683)
* [GPU] Fix nullptr exception in unit test (#15683)
* [GPU] Follow-up code review (#15683)
- Modify mutex lock in compilation context
* [GPU] Fix windows build issue (#15683)
* use kernel caching for dynamic models
* replaced cl_cache with blob
* updated to serialize dims info of input and output
* updated to skip unicode tests on Windows
* Fix "C++ exception with description 'write lock_type' thrown in the test body" failures:
Use get_output_values_to_float() for the following tests:
* fusings_gpu/gemm_2in_act_scale_quantize_eltwise_i8.basic/2
* fusings_gpu/gemm_2in_act_scale_eltwise.basic/2
* Remove WA test code from "[GPU][DG2] Fix fusings_gpu/gemm_2in_scale.basic/7" (#15353)
* Non full-tensor post-ops are now broadcast
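An illustrative C++ sketch of what broadcasting a non full-tensor post-op means here (assumed semantics, not the oneDNN post-op API): a per-channel scale, conceptually of shape [1, C, 1, 1], is applied across the full [N, C, H, W] output.

```cpp
#include <cstddef>
#include <vector>

// Multiplies an N*C*H*W output by a per-channel scale of C elements,
// i.e. the non full-tensor post-op is broadcast over the N, H and W dimensions.
void apply_per_channel_scale(std::vector<float>& out,
                             const std::vector<float>& scale,   // C elements
                             size_t N, size_t C, size_t H, size_t W) {
    for (size_t n = 0; n < N; ++n)
        for (size_t c = 0; c < C; ++c)
            for (size_t hw = 0; hw < H * W; ++hw)
                out[(n * C + c) * H * W + hw] *= scale[c];
}
```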