openvino

Author	SHA1	Message	Date
Vladimir Paramuzov	bb59672639	[GPU] Fixed shape agnostic scatter nd update kernel (#16319 )	2023-03-17 09:57:25 +04:00
Wilson Seok	c5ccb3e954	add condition for activation ceil so it works when data type is fp32 or fp16 only (#16334 )	2023-03-17 11:46:44 +09:00
hyunback kim	8d1139b61a	Fix unet3d mlperf dump (#16253 ) * Enable dump in unet3d_mlperf Signed-off-by: hyunback <hyunback.kim@intel.com>	2023-03-17 10:51:40 +09:00
Andrew Kwangwoong Park	e4500c7d61	[GPU] Fixes for dynamic model in dGPU (#16298 ) Signed-off-by: Andrew Park <andrew.park@intel.com>	2023-03-16 11:09:55 -07:00
Tomasz Dołbniak	6762fe692d	Interpolate-11 spec + core op (#16162 )	2023-03-16 14:37:57 +01:00
Wilson Seok	461cc2aee8	change activation position in reorder_data_bfyx_to_blocked_format kernel (#16307 )	2023-03-16 17:48:23 +09:00
Andrei Gorbachev	2f3ae4518e	[GPU] Fix warnings (#16196 ) * fix 1 * fix 2-10 * fixed code style * fixed win plugin * fixed linux plugin * fixed a part of tests * fixed test fot linux * fixed pooling_gpu_test fot linux * fixed pooling_gpu_test fot linux * fix after review and enable wd4267 in makefile * fix after review * errors of unit test are fixed	2023-03-16 09:29:16 +04:00
Vladimir Paramuzov	28d3e1087e	[GPU] Fix strided slice kernel with begin/end/stride as inputs (#16302 )	2023-03-15 16:25:45 +04:00
Vladimir Paramuzov	e44fd03d2a	[GPU] Shape agnostic concat kernel + refactoring (#16170 )	2023-03-15 09:47:31 +04:00
Eddy Kim	e348481849	[GPU] Transformed IR serialization for dynamic models (#16169 ) * IR serialization for dynamic models * added ShapeOf1To3 transformation pass * fixed input output type mismatch * removed unnecessary codes * moved ConvertShapeOf1To3 from common to GPU plugin * updated copyright year * fixed build errors	2023-03-14 11:03:02 -07:00
Mateusz Tabaka	8477bc8897	Reduce the number of validate and infer types in ConvertPrecision (#15277 ) * Reduce the number of validate and infer types in ConvertPrecision Currently, ConvertPrecision pass frequently runs validate and infer types. This is due to the fact that it iterates over every precision pair, then over the whole model followed by validate and infer types. The proposed solution is to iterate over the model: for each node iterate over precisions array, update the node if required followed by validate and infer types. Ticket: 81311 * use map * clang format * move enum hasher * fix gpu * revalidate * reinvalidate if node has changed * remove validate for input prec changes * fix gpu * review * find * fix pytorch case * revalidate --------- Co-authored-by: Michal Lukaszewski <michal.lukaszewski@intel.com>	2023-03-14 21:45:24 +04:00
hyunback kim	164db3def9	[GPU] Fix twin tranformer functional regression. (#16111 ) * [GPU] Fix twin tranformer functional regression. gemm/FC select_preferred_format select simple format depends on out rank size. Signed-off-by: hyunback <hyunback.kim@intel.com>	2023-03-14 17:34:41 +09:00
Taylor Yeonbok Lee	3a96e06d4c	Minor fixes (#16275 ) - Fix are_data_types_suitable_for_onednn not to invalidate output layout - Fix seg fault of printing resample node info	2023-03-14 08:09:54 +00:00
Taylor Yeonbok Lee	f0f1c47063	Fix concat to use ngraph shape infer (#16226 ) Fix crop to return shape of original rank	2023-03-13 20:25:23 +00:00
Ilya Lavrenov	f080a0d9cf	Added NCC style for frontends sources (#16200 ) * Ability to provide several source dirs for ncc-style checks * Fixed include headers; added NCC to TF common * Fixed NCC for frontends * Fixed NCC for frontends * Extra fixes * Fixest push --f * Clang-format * Apply comments * Add an option to specify required clang-format version * Update src/frontends/tensorflow/src/decoder_proto.cpp * Update src/frontends/tensorflow/src/decoder_proto.cpp	2023-03-13 14:54:00 +00:00
Paul Youngsoo Ahn	e8b108ac6b	[GPU] Change lws to avoid synchronization issue in nonzero_count (#16116 ) * [GPU] Change lws to avoid synchronization issue in nonzero_count (#16116) * [GPU] Add unit test (#16116) * [GPU] update count_nonzero_ref kernel(#16116) - Support the case total data size exceed max work group size - Add dynamic shape test case * [GPU] Change input indexing calculation and add random input generator in unit test (#16116) * [GPU] update random generation input funciton in nonzero_count (#16116) * [GPU] update unit test (#16116) * [GPU] cldnn unit test: update random generation function for other test failure (fusings_gpu/conv_fp32_multi_eltwise_quantization.basic/0) (#16116)	2023-03-12 23:32:20 -07:00
Ilya Churaev	75314c2c53	Rename OPENVINO_UNREACHABLE to OPENVINO_THROW (#16201 ) * Changed some exceptions to OPENVINO_THROW * Changed samples throw exception * Fixed some comments * Remove OPENVINO_UNREACHABLE	2023-03-10 20:23:13 +04:00
Maciej Smyk	5e406a80d3	[DOCS] OpenVINO Wiki links update - master (#16219 ) * wiki links	2023-03-10 16:16:14 +01:00
Roman Lyamin	b8e1dea345	[GPU] Fix binary_convolution non-constant weights (#15898 ) * [GPU] Fix binary_convolution non-constant weights * [GPU] Remove unused checks related to allowInputReordering	2023-03-10 14:36:12 +04:00
Ilya Churaev	45bdbf7486	Changed throw ov::Exception to macro (#16150 ) * Changed throw ov::Exception to macro * Fixed code style * Revert myriad headers * CPPlint fixes * Fixed typo	2023-03-10 11:14:50 +04:00
Mykhailo Hnap	d5e98cbdce	[GPU] IsFinite, IsInf, IsNaN operations (#15979 ) * [GPU] Enabled ComparisonLayerTest in single layer tests. It seems that before, these tests were disabled cause of some failures. Now I cannot see any errors, so I just enabled all of them. * [GPU] Run clang format for comparison single layer tests. * [GPU] Added handling of f16 type to IsInfLayerTest. * [GPU] Added single-layer tests for IsFinite and IsNaN operations. * [GPU] Added single-layer test for IsInf operation. * [GPU] Implemented IsFinite, IsInf, and IsNaN operations as activation functions. But notice that currently, the activation kernel support only the same output data type as the input data type. So an additional reorder would be needed to convert to the correct output data type for these ops. Also worth noting is that activation functions are fused in reorder kernel. But for now, it's not working for these ops because in reorder activation call, there is a hard conversion of input data to output data type before activation. I don't know why it's added there, but it breaks fusion. So need to fix this activation fusion or disable this fusion for these ops. * Revert "[GPU] Implemented IsFinite, IsInf, and IsNaN operations as activation functions." This reverts commit 3f9ffe617ecddce6dbbcdeab9584a7ddeb6d1845. * [GPU] Implemented IsFinite, IsInf, and IsNaN operations as eltwise op. * [GPU] Changed CLDNN_ERROR_MESSAGE to OPENVINO_ASSERT in check_inputs_count method.	2023-03-09 16:10:48 -08:00
Andrew Kwangwoong Park	3ec386a741	[GPU] Minor fixes for dynamic BERT models (#16158 ) * [GPU] Minor fix for dynamic bert-base-uncased-qqp Signed-off-by: Andrew Park <andrew.park@intel.com> * Fix to check full tensor only for static shape during creating onednn gemm Signed-off-by: Andrew Park <andrew.park@intel.com> --------- Signed-off-by: Andrew Park <andrew.park@intel.com>	2023-03-09 14:48:08 -08:00
Taylor Yeonbok Lee	dff7f2451b	Revert PR15386's change (#16172 ) - Previously, PR15386 changed allocation of memory of primitives which are to be used as shape infer dep to host memory, for better shape infer perf. - However this causes cache coherence issue in dGPU. - Reverting this change so that the memory will be allocated to devicet	2023-03-09 22:44:32 +00:00
Sungeun Kim	0365ebf5ad	disable test case: fusings_gpu/lrn_fp16_eltwise_activation.basic/7 (#16149 )	2023-03-09 08:38:33 +00:00
Jade Cho	aaeace9740	[GPU] Fix stable diffusion failure (#16052 ) * [dGPU] Enable stable diffusion + Prevent to fuse swish into oneDNN reorder. + Makes concat explicitly if batch size is greater than 1 and the siblings are oneDNN impl.	2023-03-09 14:35:31 +09:00
Andrew Kwangwoong Park	b7ff3a1d64	[GPU] Added shape agnostic Pad kernel implementation (#16160 ) Signed-off-by: Andrew Park <andrew.park@intel.com>	2023-03-08 15:36:43 -08:00
Vladimir Paramuzov	75b48f2153	[GPU] Changed impls cache key type to avoid hash collisions (#16130 )	2023-03-08 16:09:55 +04:00
hyunback kim	a9cbccd829	Broadcast for post ops enable enable onednngemm (#16074 ) * [GPU] Add data broadcasting for OneDNN binary ops for Gemm primitive * Based on https://github.com/openvinotoolkit/openvino/pull/15790 and enable onednn gemm from support multiple users and non constant input. -------- Signed-off-by: hyunback <hyunback.kim@intel.com> Co-authored-by: Sergey Shlyapnikov <sergey.shlyapnikov@intel.com>	2023-03-08 13:55:51 +09:00
Roman Lyamin	681faadce3	[GPU] Added shape agnostic kernels for GatherElements and Tile (#15798 ) * [GPU] Added shape agnostic kernel for GatherElements * [GPU] Added shape agnostic kernel for Tile	2023-03-08 08:34:24 +04:00
Vladimir Paramuzov	a1eb76ad06	[GPU] Move is_local_block_io_supported WA to kernel selector (#15235 )	2023-03-07 15:12:08 +04:00
Min, Byungil	87b18a21c1	[GPU] Optimize eltwise kernel for blocked format (#15717 ) * [GPU] Optimize eltwise kernel for blocked format + Optimize etlwise_blocked_opt + Replace deprecated kernels with eltwise_blocked_opt + Remove eltwise_b_fs_yx_fsv16, b_fs_yx_fsv4 kernels + Add test-cases in eltwise_gpu_test Signed-off-by: byungilm <byungil.min@intel.com>	2023-03-07 14:21:09 +09:00
Vladimir Paramuzov	eff0bce7e3	[GPU] Move some op parameters from node to primitive class (#16070 ) * [GPU] Move parameters of conv and quantize primitive from node to primitive --------- Co-authored-by: Eddy Kim <eddy.kim@intel.com>	2023-03-07 08:56:00 +04:00
Andrew Kwangwoong Park	7123e8879e	[GPU] Added shape agnostic optimized SoftMax kernel (#15834 ) * [GPU] Added shape agnostic optimized SoftMax kernel Signed-off-by: Andrew Park <andrew.park@intel.com> * Update SoftmaxKernelBaseBF::Validate policy for shape agnostic kernel Signed-off-by: Andrew Park <andrew.park@intel.com> * Add softmax_gpu_bf shape agnostic TC for ov_gpu_unit_tests Signed-off-by: Andrew Park <andrew.park@intel.com> * Fix failed TCs for ie-tests-linux-ubuntu20-gpu Signed-off-by: Andrew Park <andrew.park@intel.com> * Update to use stack array instead of global buffer Signed-off-by: Andrew Park <andrew.park@intel.com> * Remove global buffer usage completely Signed-off-by: Andrew Park <andrew.park@intel.com> * Add #undef directive Signed-off-by: Andrew Park <andrew.park@intel.com> --------- Signed-off-by: Andrew Park <andrew.park@intel.com>	2023-03-06 09:10:29 -08:00
Andrew Kwangwoong Park	4ce35fd851	[GPU] Minor fixes for dynamic model (#16075 ) Signed-off-by: Andrew Park <andrew.park@intel.com>	2023-03-06 15:50:38 +04:00
Xiping Yan	8b66b35bf7	[CPU]Remove C4250 warning suppress, and fix the corresponding warning. (#15966 )	2023-03-06 12:43:53 +04:00
Xuejun Zhai	9b97235902	Xuejun/remove api in ov any (#15667 ) * [Remove APIs] remove ov::any api & Signed-off-by: xuejun <Xuejun.Zhai@intel.com> * [Remove APIs] remove ov::any api Signed-off-by: xuejun <Xuejun.Zhai@intel.com> * [Remove APIs] remove interfaces in ov::any Base* operator->() & const Base* operator->() Signed-off-by: xuejun <Xuejun.Zhai@intel.com> * [Remove APIs] remove ov::any interfaces Base* get() & const Base* get() Signed-off-by: xuejun <Xuejun.Zhai@intel.com> * [Remove APIs] remove ov::any interfaces call(const Any& any) & dynamic_pointer_cast(const ::ov::Any& any) & static_pointer_cast(const ::ov::Any& any) Signed-off-by: xuejun <Xuejun.Zhai@intel.com> * [Remove APIs] fix code format issues in ov::any Signed-off-by: xuejun <Xuejun.Zhai@intel.com> * [Remove APIs] fix review issue Signed-off-by: xuejun <xuejun.zhai@intel.com> * [Remove APIs] clear code Signed-off-by: xuejun <xuejun.zhai@intel.com> * [Remove APIs] fix review issue Signed-off-by: xuejun <xuejun.zhai@intel.com> * [Remove APIs] fix compiler issue Signed-off-by: xuejun <xuejun.zhai@intel.com> * [Remove APIs] fix compiler issue Signed-off-by: xuejun <xuejun.zhai@intel.com> * [Remove APIs] fix compiler issue Signed-off-by: xuejun <xuejun.zhai@intel.com> * Fix variant error Signed-off-by: xuejun <xuejun.zhai@intel.com> --------- Signed-off-by: xuejun <Xuejun.Zhai@intel.com> Signed-off-by: xuejun <xuejun.zhai@intel.com>	2023-03-06 10:24:08 +04:00
Ilya Lavrenov	e1fbb7d768	Fixes for multi-config generators (#16097 )	2023-03-05 10:46:53 +04:00
Ilya Lavrenov	9c4c559909	Fixed compilation on Debian 11 with gcc 12.2 (#16096 )	2023-03-04 20:45:04 +04:00
Steve Yoo	a16f1923d7	Added recalculating processing order if it is not correct (#15987 )	2023-03-02 14:40:15 -08:00
Kelvin Choi	6979c06ca1	[GPU] Support non constant input for Pad (#15697 ) * [GPU] Support non constant input for Pad * Refactor by comments	2023-03-02 10:38:43 -08:00
Ilya Lavrenov	4d925e0a3d	Test GPU plugin arm64 build via Android precommit (#16055 )	2023-03-02 21:06:36 +04:00
hyunback kim	cb7eeadd62	[GPU] Integration oneDNN3.1 (#15804 ) * [GPU] Integration oneDNN3.1 * [GPU] Add os_iyx_osv8 format Signed-off-by: hyunback <hyunback.kim@intel.com>	2023-03-03 00:18:42 +09:00
Ilya Lavrenov	0d798b7431	Building GPU plugin for Linux ARM64 (#16008 ) * Building GPU plugin for ARM64 * changed order of headers * Fixed clang-format	2023-03-02 12:43:33 +04:00
Roman Lyamin	24b0baa0d1	[GPU] Added support mixed input formats for Select (#16009 )	2023-03-02 09:19:02 +04:00
Vladimir Paramuzov	27ac7d9092	[GPU] backend independent code for fuse params in program_node (#16028 )	2023-03-02 09:18:29 +04:00
Vladimir Paramuzov	c5c7e4ff65	[GPU] Cleanup tuning cache methods (#16000 )	2023-03-01 16:30:47 +04:00
Vladimir Paramuzov	3de00347f3	[GPU] Code cleanup (#16014 ) * [GPU] Improve exception message for program build * [GPU] Code cleanup	2023-03-01 14:05:59 +04:00
Roman Lyamin	1070a3b6c1	[GPU] Added fp16 support for GatherTree (#15983 )	2023-02-28 09:54:56 +04:00
Wilson Seok	93a1be3607	Skip set_selected_impl() of post_optimize_weight when target generic layer is already created (#15852 )	2023-02-27 11:24:53 -08:00
Eddy Kim	d2a5be0ab8	enabled exec_graph and pc in deserialized model (#15975 )	2023-02-27 10:14:04 -08:00

1 2 3 4 5 ...

805 Commits