openvino

Author	SHA1	Message	Date
Maksim Kutakov	194f1d7d6e	[CPU] Make static TI run a dynamic subgraph (#20527 ) * Make static TI run a dynamic subgraph * Dedicated SL test * Change condition to respect stat shapes * Adjust test to cover the code path properly	2023-11-23 14:43:27 +01:00
Pawel Raasz	75edfc27d6	[shape_infer]Add preserve partial values on inputs for Mod operator (#20169 ) * Preserve partial values on mod inputs - static values full range of integers - intervals only if not negatives * Fix bounds evaluate when inputs are scalars	2023-11-23 14:43:26 +01:00
Mateusz Tabaka	39522fa34b	CompressQuantizeWeights - fix zero point calculation (#20541 ) Current implementation tries to leverage branchless approach, but it's not correct if scale is 0. In that case - zero point can can become inf or nan and multiplication by 0 doesn't change its value. That causes another issue - infinite or NaN zero point cannot be optimized out later. Ticket: CVS-122931 Co-authored-by: Ivan Tikhonov <ivan.tikhonov@intel.com>	2023-11-23 14:43:10 +01:00
Andrei Gorbachev	698bacda5b	[GPU] Refactor ConvertColorNV12, ConvolutionBackprop, Convolution (#20376 ) * ConvertColorNV12 * ConvolutionBackprop * Convolution * fix after review	2023-11-23 14:43:09 +01:00
Vladimir Paramuzov	ee89b41a84	Fixed ngraph reference impl for Range operation (#20631 ) * Fixed ngraph reference impl for Range operation * Truncate range for integer output type * explicit static cast	2023-11-23 14:43:09 +01:00
Ilya Churaev	9f09f071d2	Add AlignedBuffer to OpenVINO developer API (#20532 ) * Add AlignedBuffer to OpenVINO developer API * Fixed build * Fixed code style and remove opset deprecation * Fixed Windows build * Fixed GNA * Fixed comment	2023-11-23 14:43:09 +01:00
Yuan Hu	0a5a8a9c23	[core] fix memory leak issue imported by #18868 (#19832 ) * try to fix memory leak issue cpustreamer is released, but there are still thread id in t_stream_count_map * fix threadlocal affect all threads Signed-off-by: HU Yuan2 <yuan2.hu@intel.com> * add comment for local() function to avoid mistaken modification in the future Signed-off-by: HU Yuan2 <yuan2.hu@intel.com> * use custom stread id Signed-off-by: HU Yuan2 <yuan2.hu@intel.com> * fix review comments Signed-off-by: HU Yuan2 <yuan2.hu@intel.com> * fix format issue Signed-off-by: HU Yuan2 <yuan2.hu@intel.com> * create shared_ptr before assert Signed-off-by: HU Yuan2 <yuan2.hu@intel.com> --------- Signed-off-by: HU Yuan2 <yuan2.hu@intel.com>	2023-11-23 14:43:09 +01:00
Vladislav Golubev	18955647f7	[Transformations] FuseU4WeightsAndZeroPoint transformation (#20503 )	2023-11-23 14:43:08 +01:00
Zhang Yi	10b186560d	[CPU][MLAS] FC uses L2 cache for thread partition (#20436 )	2023-11-23 14:43:08 +01:00
Ivan Novoselov	09105ccc71	[Snippets] Dynamic pipeline reorganization (#18563 )	2023-11-23 14:43:08 +01:00
Roman Lyamin	56cd4c8477	[GPU] Fix for reorder fusing in fuse_constant_transposes pass (#20651 )	2023-11-23 14:43:07 +01:00
Andrew Kwangwoong Park	dc697d5b55	[GPU] Add RMS internal op, related transformation, primitive, and kernels for RMS decomposition fusion (#20355 ) * Initial implementation of primitive, kernel selector, dummy kernel for RMS Norm Signed-off-by: Andrew Park <andrew.park@intel.com> * RMS ref kernel implementation with single WI Signed-off-by: Andrew Park <andrew.park@intel.com> * Add TC and reference func for ov_gpu_unit_tests Signed-off-by: Andrew Park <andrew.park@intel.com> * Add internal RMS norm op Signed-off-by: Andrew Park <andrew.park@intel.com> * Add transformation which fuse RMS decompsition pattern to RMS internal op Signed-off-by: Andrew Park <andrew.park@intel.com> * Fix pattern for RMS fusion transformation * Update rms ref kernel for optimization and additional planar format suuport * Initial impl for optimized rms kernel excluding leftovers handling and case smaller than vector size * Update the initial version to handle leftovers and case smaller than vector size * Fuse pre decom and post comp reorders additionally * Enable dynamic impl for rms again * Revert fuse pre decomp and post comp reorders additionally * Add subgraph TC for ov_gpu_func_tests * decrease error margin for f32 data type * update description Signed-off-by: Andrew Park <andrew.park@intel.com> * update test param for input shapes * Apply comments * Fix failed TC for invalid gamma element type * Apply comments Signed-off-by: Andrew Park <andrew.park@intel.com> * Update pattern that fuse post reorder together * Apply comments --------- Signed-off-by: Andrew Park <andrew.park@intel.com>	2023-11-23 14:43:07 +01:00
Maxim Vafin	7c43ef30ee	[PT FE] Implement custom op for types alignment (#20431 ) * [PT FE] Implement custom op for types alignment * Fix code style * Fix inplace ops * Fix layer tests * Remove no longer needed change * Fix ovc tests * Fix fe tests	2023-11-23 14:43:07 +01:00
Roman Kazantsev	6291446152	[TF FE] Provide full support of TF1 Control flow and TensorArray* ops (#20270 ) * [TF FE] Provide full support of TF1 Control flow and TensorArray ops Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com> * Add missed header for TensorArrayV3 op * Temporarily disable GRU cell fusion * Update src/common/transformations/src/transformations/common_optimizations/moc_transformations.cpp * Fix a case when element_shape for TensorArrayV3 * Fix translator for TensorArrayCloseV3 * Update summarize graph with TensorArrayCloseV3 * Add layer tests for TensorArrayScatterV3, Close, Size, Array * Fix output shape for Merge node * Remove unused variable * Fix translator for TensorArrayConcatV3 * Fix translator for TensorArrayConcatV3 * Add layer tests for TensorArrayWriteV3, Gather, and Concat Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com> * Add translator for GatherTree * Fix TF FE unit-test for GatherTree * Fix GatherTree translator * Fix GatherTree translator to handle 1d end_token * Fix undeclared parameter issue * Fix GatherTree unit-test * Add TensorArrayV3Replacer transformation * Temporarily disable dangling transformation * Recover RemoveMultiSubGraphOpDanglingParamsResults transformation * Recover GRUCellFusion transformation * Simplify check for GRUCellFusion transformation * Use proper name for unit-tests * Simplify translator for TensorArrayWriteV3 Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com> * Fix RemoveMultiSubgraphOpDanglingParamsResults transformation Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com> * Additional fix for remove_multi_subgraph_op_dangling_params * Make static TI run a dynamic subgraph * Dedicated SL test * Change condition to respect stat shapes * Adjust test to cover the code path properly * Recover fallback for still failing case GNMT --------- Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com> Co-authored-by: Maksim Kutakov <maksim.kutakov@intel.com>	2023-11-23 14:43:07 +01:00
Roman Kazantsev	2d836757ab	[TF FE] Document full list of TF operations and their support by TF FE (#20640 ) * [TF FE] Document full list of TF operations and their support by TF FE Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com> * Update src/frontends/tensorflow/docs/supported_ops.md Co-authored-by: Karol Blaszczak <karol.blaszczak@intel.com> --------- Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com> Co-authored-by: Karol Blaszczak <karol.blaszczak@intel.com>	2023-11-23 14:43:06 +01:00
Fang Xu	fe7b373031	fix compilation issue for openmp on windows (#20312 ) * fix compilation issue for openmp on windows * update based on suggestions	2023-11-23 14:43:06 +01:00
Andrey Kashchikhin	a1f27cc9b8	[CI] [GHA] Introduce macOS ARM64 as a matrix parameter in the macOS pipeline (#20363 ) * add m1 mac pipelines as a matrix parameter * Update mac.yml disable java_api because of macos arm64 - Java is not available on macOS arm64 runners * Update mac.yml added always condition for all tests * Update mac.yml * Update mac.yml * Update mac.yml * Update setup.py temp commit * Update tools/openvino_dev/setup.py * use matrix for var * add mxnet to extras only for x86_64 * skip failing tests * use xfail for Python tests; add missing filter for transformations tests * skip CPU func tests on x86_64 mac; skip some tests from CPU func tests on arm mac * Update mac.yml * skip tests on mac arm * skip tests on darwin; apply review * add more skips for python and c++ tests * skip tf tests * skip more tf tests; skip more Python UT stages * rm alwayses, rm triggers, add nightly trigger --------- Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>	2023-11-23 14:43:06 +01:00
Irina Efode	78caf92977	Fix OpImplCheck in GHA (#20650 ) * Update mac.yml * Update mac.yml	2023-11-23 14:43:06 +01:00
Ilya Churaev	bfc019524f	Introduce WA to improve performance of find_port() method (#20573 ) * Introduce WA to improve performance of find_port() method * Add mutex * Remove redindant lock * Reduce the number of get_tensor_ptr calls * Fixed typo * Removed WAs from Hetero plugin	2023-11-23 14:43:05 +01:00
Irina Efode	b27580e0c8	[GHA][CONFORMANCE] Enable and conformance for Template in GHA (#20604 ) * [GHA][CONFORMANCE] Enable and conformance for Template in GHA * apply review * Update linux.yml	2023-11-23 14:43:05 +01:00
Karan Jakhar	9dc516daea	fixing type, suppored -> supported (#20639 )	2023-11-23 14:43:05 +01:00
Evgenya Nugmanova	e8240204e7	Clear error message in the Constant::cast_vector (#20629 )	2023-11-23 14:43:05 +01:00
Ilya Lavrenov	32969e41ef	Moved all tensorflow proto files to a folder with ov prefix to avoid ODR (#20636 ) conflicts	2023-11-23 14:43:04 +01:00
Maxim Vafin	940459f7d8	Update model list for hf hub tests (#20485 ) * Update model list and install cpu torch * Move to hub tests * Update tests/model_hub_tests/torch_tests/requirements.txt * Make pytorch mainteiners owners of torch tests	2023-11-23 14:43:04 +01:00
Lucas Alava Peña	1e6cafba31	fix typo in opset3 shuffle_channels docstring (#20158 ) Co-authored-by: Jan Iwaszkiewicz <jan.iwaszkiewicz@intel.com>	2023-11-23 14:43:04 +01:00
Mateusz Mikolajczyk	68f1c828d3	[PT FE] Add aten::as_strided (#19482 ) * Add aten::as_strided * rm commented code * Update src/frontends/pytorch/src/op/as_strided.cpp Co-authored-by: Maxim Vafin <maxim.vafin@intel.com> * Update src/frontends/pytorch/src/op/as_strided.cpp Co-authored-by: Maxim Vafin <maxim.vafin@intel.com> * Fix CI error * Fix CI issues * mark_node for remaining constants * Add test reproducing issue * Use strides from torchscript * Add led model to test suite * Add sugested changes --------- Co-authored-by: Maxim Vafin <maxim.vafin@intel.com>	2023-11-23 14:43:03 +01:00
Maxim Vafin	6cbcdd5ead	[PT FE] Add readme for pytorch frontend (#20610 ) * Add readme for pytorch frontend * Mension fx decoder * Apply suggestions from code review * Update src/frontends/pytorch/README.md * Apply suggestions from code review Co-authored-by: Tatiana Savina <tatiana.savina@intel.com> * Apply suggestions from code review Co-authored-by: Tatiana Savina <tatiana.savina@intel.com> * Apply suggestions from code review Co-authored-by: Tatiana Savina <tatiana.savina@intel.com> * Update src/frontends/pytorch/README.md * Update src/frontends/pytorch/README.md Co-authored-by: Tatiana Savina <tatiana.savina@intel.com> --------- Co-authored-by: Tatiana Savina <tatiana.savina@intel.com> Co-authored-by: Andrei Kochin <andrei.kochin@intel.com>	2023-11-23 14:43:03 +01:00
rsato10	b3c848b9b4	[TF FE] Support ToBool operation (#20511 ) * [TF FE][TF Hub] Support ToBool operations * [TF FE][TF Hub] Support ToBool operations * fixing select operation Support ToBool operations for TF Hub models * added false and true const for tobool operations * added reduction axes * Apply suggestions from code review * Update tests/layer_tests/tensorflow_tests/test_tf_ToBool.py * Update tests/layer_tests/tensorflow_tests/test_tf_ToBool.py * Update tests/layer_tests/tensorflow_tests/test_tf_ToBool.py * Update src/frontends/tensorflow_common/src/op/tobool.cpp * added second zero constant * added correct types src\frontends\tensorflow_common\src\op\tobool.cpp * added includes src\frontends\tensorflow_common\src\op\tobool.cpp * Update src/frontends/tensorflow_common/src/op/tobool.cpp * remove select and not_equal src/frontends/tensorflow_common/src/op/tobool.cpp * Apply suggestions from code review * Update src/frontends/tensorflow_common/src/op/tobool.cpp * Apply suggestions from code review * Update src/frontends/tensorflow_common/src/op/tobool.cpp --------- Co-authored-by: Roman Kazantsev <roman.kazantsev@intel.com>	2023-11-23 14:43:03 +01:00
Ilya Lavrenov	c89b53d98b	Updated urllib3 to resolve CVE (#20620 )	2023-11-23 14:43:03 +01:00
Ilya Lavrenov	7736af3296	WA issues with dynamic protobuf usage in Fes (#20612 )	2023-11-23 14:43:02 +01:00
Siddhant Chauhan	da8ed3e861	[TF FE][TF Hub] Support TruncateDiv operation (#20615 ) * [TF FE][TF Hub] Support TruncateDiv operation * [TF FE][TF Hub] Support TruncateDiv operation * Update src/frontends/tensorflow_common/src/op/truncate_div.cpp --------- Co-authored-by: Roman Kazantsev <roman.kazantsev@intel.com>	2023-11-23 14:43:02 +01:00
Anastasiia Pnevskaia	b5b5cbba53	Updated OMZ submodule. (#20611 )	2023-11-23 14:43:02 +01:00
Siddhant Chauhan	fd2c20b64d	[TF FE][TF Hub] Support TruncateMod operation (#20468 ) * [TF FE][TF Hub] Support TruncateMod operation * Update truncate_mod.cpp * fix	2023-11-23 14:43:02 +01:00
Mustafa Cavus	f134d8bfb9	Llm and sd additional ops (#20435 ) * TorchFX: New ops added (baddbbmm, leaky_relu_) * TorchFX: Initial scaled_dot_product_flash_attention * Code Formatting: scaled_fot_product_attention translation * TorchFX unit test enabled for SDPA * Typo fix in comment line Co-authored-by: Maxim Vafin <maxim.vafin@intel.com> --------- Co-authored-by: Maxim Vafin <maxim.vafin@intel.com>	2023-11-23 14:43:01 +01:00
Mustafa Cavus	934e8ddaf8	TorchFX: Constant value pass without copy (#20380 ) * TorchFX: Constant value pass optimization * Replace op.Constant with make_constant in fx_decoder * Using shared memory for constant value passing Co-authored-by: Jan Iwaszkiewicz <jan.iwaszkiewicz@intel.com> --------- Co-authored-by: Jan Iwaszkiewicz <jan.iwaszkiewicz@intel.com>	2023-11-23 14:43:01 +01:00
Andrey Kashchikhin	a75e994e9f	fix yml (#20614 )	2023-11-23 14:43:01 +01:00
Andrey Kashchikhin	b35d14a147	[CI] [GHA] Extend Win CC pipeline with tests, make it static (#20579 ) * start with transferring * add CPU fun tests stage; complete CC stage * start tests in comd * uncomment * fix step * rm restore key * align cmake * add tbb path setting, continue on error * add logs; add missing dir; add extension for ov cpu test bin * add missing slash for path; explicitely set BUILD_TYPE * correct paths for layer tests summary files; add tbb as target * correct path to tbb * rm triggers; add nightly trigger; rm debug conditions * rm comment on cache	2023-11-23 14:43:01 +01:00
Sebastian Golebiewski	3719791f66	[DOCS] Fix command for Building with Ninja (#20605 ) * Fix command for Building with Ninja Removing current directory from the command. * Update docs/dev/build_windows.md --------- Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>	2023-11-23 14:43:00 +01:00
Anastasiia Pnevskaia	ca4b54dc87	Switching telemetry to opt-out and disabling telemetry in CI jobs (#20391 ) * Added disabling telemetry script to linux azure job, added debug checks to convert_model. * Telemetry disabling. * Disabling telemetry. * Config corrections. * Config corrections. * Update .github/workflows/mac.yml Co-authored-by: Andrey Kashchikhin <andrey.kashchikhin@intel.com> * Debug output. * Win config correction. * Win config correction. * Debug output. * Debug output. * Added turning off telemetry to onnx azure tests config. * Corrected config. * Removed debug output. * Switch telemetry to opt-out. * Removed not needed blank lines. * Text correction. * Debug output. * Remove changes from CI configs, add CI var to dockers. * Config correction. * Debug output. * Config corrected. * Readme corrected. * Config changed. * Config changed. * Debug output. * Required version changed. * Remove debug output. --------- Co-authored-by: Andrey Kashchikhin <andrey.kashchikhin@intel.com>	2023-11-23 14:43:00 +01:00
Ilya Lavrenov	395f1814ff	Add prefixes to proto files to avoid ODR conflicts (#20588 )	2023-11-23 14:43:00 +01:00
Fang Xu	ba7f31999a	build onetbb with hwloc2.9.3 (#20481 )	2023-11-23 14:43:00 +01:00
Ilya Lavrenov	323726c9b5	Enabled LC_RPATH for brew (#20587 )	2023-11-23 14:42:59 +01:00
Irina Efode	56e38abc03	[OP CONFORMANCE] Fix op version operations in conformance report (#20502 ) * [OP CONFORMANCE] Fix op version operations in conformance report * Code style + SD test	2023-11-23 14:42:59 +01:00
Ilya Lavrenov	6a54c9acde	Enabled FrontendLibCloseTest FE test suite (#20592 )	2023-11-23 14:42:59 +01:00
Katarzyna Mitrus	76dccd2abf	Fix MO multinomial shape array (#20548 )	2023-11-23 14:42:59 +01:00
Sergey Lyalin	72de5dd36f	Partial GPTQ int4 models conversion support + Swap nibbles in u4/i4 element type (#20371 ) * Reference implementation for u4 constant compression from pytorch model based on bitwise ops pattern * Fixed order of 4-bit halfs in byte * Switched PyTorch FE to dev mode: in case if model cannot be fully converted, give partially converted model with PTFrameworkNode's with a printed warning (normally would raise an exception in case). * Moved u4 compression to utils_quantize. Implemented not-interleaved version of u4 compression * Removed debug output * Added aten::matmul to the list of exceptions in may_produce_alias as a workaround for gptq models * Added patching for gptq models applied automatically in convert_model * WA for an inssue with u4 with earlier convert to fp16 * U4 blocked repacking for gptq patched model layout * Deleted obsolete u4 re-packing based on aten::cat. Fixed the resulting u4 constant shape. Removed debug output. * Revert "Switched PyTorch FE to dev mode: in case if model cannot be fully converted, give partially converted model with PTFrameworkNode's with a printed warning (normally would raise an exception in case)." This reverts commit `0ef1455e70`. * Update src/frontends/pytorch/src/op/cat.cpp * Check mask and shift values in u4 pattern. deque -> OutputVector for u4_compression_stack * Convert to a given floating type instead of half in gptq patching. Better structured code. * Code style fix * Removed deque include * Code style fixes * Trailing space removed * Fixed patched_forward and ts_decoder after unvalidated commits. * Swap nibbles in u4/i4 * Better exception handling around jit.trace and gptq.patch_model * Update src/bindings/python/src/openvino/frontend/pytorch/gptq.py Co-authored-by: Alexander Kozlov <alexander.kozlov@intel.com> * Update src/bindings/python/src/openvino/frontend/pytorch/gptq.py Co-authored-by: Alexander Kozlov <alexander.kozlov@intel.com> * Code style * Revers int4 byte order * Fixed core tests * Fixed unguarded dynamic_cast result Co-authored-by: Evgenya Nugmanova <eva.my.link@gmail.com> * Fixed transformation tests * Update src/bindings/python/src/openvino/frontend/pytorch/gptq.py Co-authored-by: Maxim Vafin <maxim.vafin@intel.com> * Prevent patching of non-gptq models * Removed extra calling of quantized weights decompression patterns * Better detection of supported AutoGPTQ models + more diagnostics * Accurate diagnostics in case when aten::stack has multiple axes --------- Co-authored-by: Alexander Kozlov <alexander.kozlov@intel.com> Co-authored-by: Ilya Churaev <ilyachur@gmail.com> Co-authored-by: Evgenya Nugmanova <eva.my.link@gmail.com> Co-authored-by: Maxim Vafin <maxim.vafin@intel.com>	2023-11-23 14:42:58 +01:00
Mikhail Ryzhov	968e3c21ce	[GHA] Temporary disable win workflows (#20568 ) * moved win workflow to nightly * reset redundant files * removed redundant files	2023-11-23 14:42:58 +01:00
Anastasia Kuporosova	431d1c295e	[Docs] fix failure of python snippet (#20516 ) * [Docs] fix failure of python snippet * trigger python snippets * fix ga * Update .github/workflows/linux.yml Co-authored-by: Mikhail Ryzhov <mikhail.ryzhov@intel.com> * Update .github/workflows/linux.yml Co-authored-by: Mikhail Ryzhov <mikhail.ryzhov@intel.com> * return back fix --------- Co-authored-by: Chen Peter <peter.chen@intel.com> Co-authored-by: Mikhail Ryzhov <mikhail.ryzhov@intel.com>	2023-11-23 14:42:58 +01:00
Pawel Raasz	0c32adc633	[core]Migrate VariadicSplit op to new API (#20416 ) * Migrate VariadicSlice to new API - refactor to reduce bin size * Move `get_tensors_partial_shapes` to dev API * Use get_tensors_partial_shapes in VariadicSplit * Remove `visit_attributes` is same as base	2023-11-23 14:42:58 +01:00
Sergey Shlyapnikov	057174ed7f	[GPU] Deferred events deallocation and tuning for FC bf_tiled kernel (#20519 )	2023-11-23 14:42:57 +01:00


13151 Commits