Commit Graph

13151 Commits

Author SHA1 Message Date
Maksim Kutakov
194f1d7d6e [CPU] Make static TI run a dynamic subgraph (#20527)
* Make static TI run a dynamic subgraph

* Dedicated SL test

* Change condition to respect stat shapes

* Adjust test to cover the code path properly
2023-11-23 14:43:27 +01:00
Pawel Raasz
75edfc27d6 [shape_infer]Add preserve partial values on inputs for Mod operator (#20169)
* Preserve partial values on mod inputs
- static values full range of integers
- intervals only if not negatives

* Fix bounds evaluate when inputs are scalars
2023-11-23 14:43:26 +01:00
Mateusz Tabaka
39522fa34b CompressQuantizeWeights - fix zero point calculation (#20541)
Current implementation tries to leverage branchless approach, but it's not correct
if scale is 0. In that case - zero point can can become inf or nan and multiplication
by 0 doesn't change its value. That causes another issue - infinite or NaN zero point
cannot be optimized out later.

Ticket: CVS-122931

Co-authored-by: Ivan Tikhonov <ivan.tikhonov@intel.com>
2023-11-23 14:43:10 +01:00
Andrei Gorbachev
698bacda5b [GPU] Refactor ConvertColorNV12, ConvolutionBackprop, Convolution (#20376)
* ConvertColorNV12

* ConvolutionBackprop

* Convolution

* fix after review
2023-11-23 14:43:09 +01:00
Vladimir Paramuzov
ee89b41a84 Fixed ngraph reference impl for Range operation (#20631)
* Fixed ngraph reference impl for Range operation

* Truncate range for integer output type

* explicit static cast
2023-11-23 14:43:09 +01:00
Ilya Churaev
9f09f071d2 Add AlignedBuffer to OpenVINO developer API (#20532)
* Add AlignedBuffer to OpenVINO developer API

* Fixed build

* Fixed code style and remove opset deprecation

* Fixed Windows build

* Fixed GNA

* Fixed comment
2023-11-23 14:43:09 +01:00
Yuan Hu
0a5a8a9c23 [core] fix memory leak issue imported by #18868 (#19832)
* try to fix memory leak issue

cpustreamer is released, but there are still thread id in t_stream_count_map

* fix threadlocal affect all threads

Signed-off-by: HU Yuan2 <yuan2.hu@intel.com>

* add comment for local() function to avoid mistaken modification
in the future

Signed-off-by: HU Yuan2 <yuan2.hu@intel.com>

* use custom stread id

Signed-off-by: HU Yuan2 <yuan2.hu@intel.com>

* fix review comments

Signed-off-by: HU Yuan2 <yuan2.hu@intel.com>

* fix format issue

Signed-off-by: HU Yuan2 <yuan2.hu@intel.com>

* create shared_ptr before assert

Signed-off-by: HU Yuan2 <yuan2.hu@intel.com>

---------

Signed-off-by: HU Yuan2 <yuan2.hu@intel.com>
2023-11-23 14:43:09 +01:00
Vladislav Golubev
18955647f7 [Transformations] FuseU4WeightsAndZeroPoint transformation (#20503) 2023-11-23 14:43:08 +01:00
Zhang Yi
10b186560d [CPU][MLAS] FC uses L2 cache for thread partition (#20436) 2023-11-23 14:43:08 +01:00
Ivan Novoselov
09105ccc71 [Snippets] Dynamic pipeline reorganization (#18563) 2023-11-23 14:43:08 +01:00
Roman Lyamin
56cd4c8477 [GPU] Fix for reorder fusing in fuse_constant_transposes pass (#20651) 2023-11-23 14:43:07 +01:00
Andrew Kwangwoong Park
dc697d5b55 [GPU] Add RMS internal op, related transformation, primitive, and kernels for RMS decomposition fusion (#20355)
* Initial implementation of primitive, kernel selector, dummy kernel for RMS Norm

Signed-off-by: Andrew Park <andrew.park@intel.com>

* RMS ref kernel implementation with single WI

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Add TC and reference func for ov_gpu_unit_tests

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Add internal RMS norm op

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Add transformation which fuse RMS decompsition pattern to RMS internal op

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Fix pattern for RMS fusion transformation

* Update rms ref kernel for optimization and additional planar format suuport

* Initial impl for optimized rms kernel excluding leftovers handling and case smaller than vector size

* Update the initial version to handle leftovers and case smaller than vector size

* Fuse pre decom and post comp reorders additionally

* Enable dynamic impl for rms again

* Revert fuse pre decomp and post comp reorders additionally

* Add subgraph TC for ov_gpu_func_tests

* decrease error margin for f32 data type

* update description

Signed-off-by: Andrew Park <andrew.park@intel.com>

* update test param for input shapes

* Apply comments

* Fix failed TC for invalid gamma element type

* Apply comments

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Update pattern that fuse post reorder together

* Apply comments

---------

Signed-off-by: Andrew Park <andrew.park@intel.com>
2023-11-23 14:43:07 +01:00
Maxim Vafin
7c43ef30ee [PT FE] Implement custom op for types alignment (#20431)
* [PT FE] Implement custom op for types alignment

* Fix code style

* Fix inplace ops

* Fix layer tests

* Remove no longer needed change

* Fix ovc tests

* Fix fe tests
2023-11-23 14:43:07 +01:00
Roman Kazantsev
6291446152 [TF FE] Provide full support of TF1 Control flow and TensorArray* ops (#20270)
* [TF FE] Provide full support of TF1 Control flow and TensorArray ops

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>

* Add missed header for TensorArrayV3 op

* Temporarily disable GRU cell fusion

* Update src/common/transformations/src/transformations/common_optimizations/moc_transformations.cpp

* Fix a case when element_shape for TensorArrayV3

* Fix translator for TensorArrayCloseV3

* Update summarize graph with TensorArrayCloseV3

* Add layer tests for TensorArrayScatterV3, Close, Size, Array

* Fix output shape for Merge node

* Remove unused variable

* Fix translator for TensorArrayConcatV3

* Fix translator for TensorArrayConcatV3

* Add layer tests for TensorArrayWriteV3, Gather, and Concat

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>

* Add translator for GatherTree

* Fix TF FE unit-test for GatherTree

* Fix GatherTree translator

* Fix GatherTree translator to handle 1d end_token

* Fix undeclared parameter issue

* Fix GatherTree unit-test

* Add TensorArrayV3Replacer transformation

* Temporarily disable dangling transformation

* Recover RemoveMultiSubGraphOpDanglingParamsResults transformation

* Recover GRUCellFusion transformation

* Simplify check for GRUCellFusion transformation

* Use proper name for unit-tests

* Simplify translator for TensorArrayWriteV3

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>

* Fix RemoveMultiSubgraphOpDanglingParamsResults transformation

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>

* Additional fix for remove_multi_subgraph_op_dangling_params

* Make static TI run a dynamic subgraph

* Dedicated SL test

* Change condition to respect stat shapes

* Adjust test to cover the code path properly

* Recover fallback for still failing case GNMT

---------

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
Co-authored-by: Maksim Kutakov <maksim.kutakov@intel.com>
2023-11-23 14:43:07 +01:00
Roman Kazantsev
2d836757ab [TF FE] Document full list of TF operations and their support by TF FE (#20640)
* [TF FE] Document full list of TF operations and their support by TF FE

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>

* Update src/frontends/tensorflow/docs/supported_ops.md

Co-authored-by: Karol Blaszczak <karol.blaszczak@intel.com>

---------

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
Co-authored-by: Karol Blaszczak <karol.blaszczak@intel.com>
2023-11-23 14:43:06 +01:00
Fang Xu
fe7b373031 fix compilation issue for openmp on windows (#20312)
* fix compilation issue for openmp on windows

* update based on suggestions
2023-11-23 14:43:06 +01:00
Andrey Kashchikhin
a1f27cc9b8 [CI] [GHA] Introduce macOS ARM64 as a matrix parameter in the macOS pipeline (#20363)
* add m1 mac pipelines as a matrix parameter

* Update mac.yml

disable java_api because of macos arm64 - Java is not available on macOS arm64 runners

* Update mac.yml

added always condition for all tests

* Update mac.yml

* Update mac.yml

* Update mac.yml

* Update setup.py

temp commit

* Update tools/openvino_dev/setup.py

* use matrix for var

* add mxnet to extras only for x86_64

* skip failing tests

* use xfail for Python tests; add missing filter for transformations tests

* skip CPU func tests on x86_64 mac; skip some tests from CPU func tests on arm mac

* Update mac.yml

* skip tests on mac arm

* skip tests on darwin; apply review

* add more skips for python and c++ tests

* skip tf tests

* skip more tf tests; skip more Python UT stages

* rm alwayses, rm triggers, add nightly trigger

---------

Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>
2023-11-23 14:43:06 +01:00
Irina Efode
78caf92977 Fix OpImplCheck in GHA (#20650)
* Update mac.yml

* Update mac.yml
2023-11-23 14:43:06 +01:00
Ilya Churaev
bfc019524f Introduce WA to improve performance of find_port() method (#20573)
* Introduce WA to improve performance of find_port() method

* Add mutex

* Remove redindant lock

* Reduce the number of get_tensor_ptr calls

* Fixed typo

* Removed WAs from Hetero plugin
2023-11-23 14:43:05 +01:00
Irina Efode
b27580e0c8 [GHA][CONFORMANCE] Enable and conformance for Template in GHA (#20604)
* [GHA][CONFORMANCE] Enable  and conformance  for Template in GHA

* apply review

* Update linux.yml
2023-11-23 14:43:05 +01:00
Karan Jakhar
9dc516daea fixing type, suppored -> supported (#20639) 2023-11-23 14:43:05 +01:00
Evgenya Nugmanova
e8240204e7 Clear error message in the Constant::cast_vector (#20629) 2023-11-23 14:43:05 +01:00
Ilya Lavrenov
32969e41ef Moved all tensorflow proto files to a folder with ov prefix to avoid ODR (#20636)
conflicts
2023-11-23 14:43:04 +01:00
Maxim Vafin
940459f7d8 Update model list for hf hub tests (#20485)
* Update model list and install cpu torch

* Move to hub tests

* Update tests/model_hub_tests/torch_tests/requirements.txt

* Make pytorch mainteiners owners of torch tests
2023-11-23 14:43:04 +01:00
Lucas Alava Peña
1e6cafba31 fix typo in opset3 shuffle_channels docstring (#20158)
Co-authored-by: Jan Iwaszkiewicz <jan.iwaszkiewicz@intel.com>
2023-11-23 14:43:04 +01:00
Mateusz Mikolajczyk
68f1c828d3 [PT FE] Add aten::as_strided (#19482)
* Add aten::as_strided

* rm commented code

* Update src/frontends/pytorch/src/op/as_strided.cpp

Co-authored-by: Maxim Vafin <maxim.vafin@intel.com>

* Update src/frontends/pytorch/src/op/as_strided.cpp

Co-authored-by: Maxim Vafin <maxim.vafin@intel.com>

* Fix CI error

* Fix CI issues

* mark_node for remaining constants

* Add test reproducing issue

* Use strides from torchscript

* Add led model to test suite

* Add sugested changes

---------

Co-authored-by: Maxim Vafin <maxim.vafin@intel.com>
2023-11-23 14:43:03 +01:00
Maxim Vafin
6cbcdd5ead [PT FE] Add readme for pytorch frontend (#20610)
* Add readme for pytorch frontend

* Mension fx decoder

* Apply suggestions from code review

* Update src/frontends/pytorch/README.md

* Apply suggestions from code review

Co-authored-by: Tatiana Savina <tatiana.savina@intel.com>

* Apply suggestions from code review

Co-authored-by: Tatiana Savina <tatiana.savina@intel.com>

* Apply suggestions from code review

Co-authored-by: Tatiana Savina <tatiana.savina@intel.com>

* Update src/frontends/pytorch/README.md

* Update src/frontends/pytorch/README.md

Co-authored-by: Tatiana Savina <tatiana.savina@intel.com>

---------

Co-authored-by: Tatiana Savina <tatiana.savina@intel.com>
Co-authored-by: Andrei Kochin <andrei.kochin@intel.com>
2023-11-23 14:43:03 +01:00
rsato10
b3c848b9b4 [TF FE] Support ToBool operation (#20511)
* [TF FE][TF Hub] Support ToBool operations

* [TF FE][TF Hub] Support ToBool operations

* fixing select operation Support ToBool operations for TF Hub models

* added false and true const for tobool operations

* added reduction axes

* Apply suggestions from code review

* Update tests/layer_tests/tensorflow_tests/test_tf_ToBool.py

* Update tests/layer_tests/tensorflow_tests/test_tf_ToBool.py

* Update tests/layer_tests/tensorflow_tests/test_tf_ToBool.py

* Update src/frontends/tensorflow_common/src/op/tobool.cpp

* added second zero constant

* added correct types  src\frontends\tensorflow_common\src\op\tobool.cpp

* added includes src\frontends\tensorflow_common\src\op\tobool.cpp

* Update src/frontends/tensorflow_common/src/op/tobool.cpp

* remove select and not_equal src/frontends/tensorflow_common/src/op/tobool.cpp

* Apply suggestions from code review

* Update src/frontends/tensorflow_common/src/op/tobool.cpp

* Apply suggestions from code review

* Update src/frontends/tensorflow_common/src/op/tobool.cpp

---------

Co-authored-by: Roman Kazantsev <roman.kazantsev@intel.com>
2023-11-23 14:43:03 +01:00
Ilya Lavrenov
c89b53d98b Updated urllib3 to resolve CVE (#20620) 2023-11-23 14:43:03 +01:00
Ilya Lavrenov
7736af3296 WA issues with dynamic protobuf usage in Fes (#20612) 2023-11-23 14:43:02 +01:00
Siddhant Chauhan
da8ed3e861 [TF FE][TF Hub] Support TruncateDiv operation (#20615)
* [TF FE][TF Hub] Support TruncateDiv operation

* [TF FE][TF Hub] Support TruncateDiv operation

* Update src/frontends/tensorflow_common/src/op/truncate_div.cpp

---------

Co-authored-by: Roman Kazantsev <roman.kazantsev@intel.com>
2023-11-23 14:43:02 +01:00
Anastasiia Pnevskaia
b5b5cbba53 Updated OMZ submodule. (#20611) 2023-11-23 14:43:02 +01:00
Siddhant Chauhan
fd2c20b64d [TF FE][TF Hub] Support TruncateMod operation (#20468)
* [TF FE][TF Hub] Support TruncateMod operation

* Update truncate_mod.cpp

* fix
2023-11-23 14:43:02 +01:00
Mustafa Cavus
f134d8bfb9 Llm and sd additional ops (#20435)
* TorchFX: New ops added (baddbbmm, leaky_relu_)

* TorchFX: Initial scaled_dot_product_flash_attention

* Code Formatting: scaled_fot_product_attention translation

* TorchFX unit test enabled for SDPA

* Typo fix in comment line

Co-authored-by: Maxim Vafin <maxim.vafin@intel.com>

---------

Co-authored-by: Maxim Vafin <maxim.vafin@intel.com>
2023-11-23 14:43:01 +01:00
Mustafa Cavus
934e8ddaf8 TorchFX: Constant value pass without copy (#20380)
* TorchFX: Constant value pass optimization

* Replace op.Constant with make_constant in fx_decoder

* Using shared memory for constant value passing

Co-authored-by: Jan Iwaszkiewicz <jan.iwaszkiewicz@intel.com>

---------

Co-authored-by: Jan Iwaszkiewicz <jan.iwaszkiewicz@intel.com>
2023-11-23 14:43:01 +01:00
Andrey Kashchikhin
a75e994e9f fix yml (#20614) 2023-11-23 14:43:01 +01:00
Andrey Kashchikhin
b35d14a147 [CI] [GHA] Extend Win CC pipeline with tests, make it static (#20579)
* start with transferring

* add CPU fun tests stage; complete CC stage

* start tests in comd

* uncomment

* fix step

* rm restore key

* align cmake

* add tbb path setting, continue on error

* add logs; add missing dir; add extension for ov cpu test bin

* add missing slash for path; explicitely set BUILD_TYPE

* correct paths for layer tests summary files; add tbb as target

* correct path to tbb

* rm triggers; add nightly trigger; rm debug conditions

* rm comment on cache
2023-11-23 14:43:01 +01:00
Sebastian Golebiewski
3719791f66 [DOCS] Fix command for Building with Ninja (#20605)
* Fix command for Building with Ninja

Removing current directory from the command.

* Update docs/dev/build_windows.md

---------

Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>
2023-11-23 14:43:00 +01:00
Anastasiia Pnevskaia
ca4b54dc87 Switching telemetry to opt-out and disabling telemetry in CI jobs (#20391)
* Added disabling telemetry script to linux azure job, added debug checks to convert_model.

* Telemetry disabling.

* Disabling telemetry.

* Config corrections.

* Config corrections.

* Update .github/workflows/mac.yml

Co-authored-by: Andrey Kashchikhin <andrey.kashchikhin@intel.com>

* Debug output.

* Win config correction.

* Win config correction.

* Debug output.

* Debug output.

* Added turning off telemetry to onnx azure tests config.

* Corrected config.

* Removed debug output.

* Switch telemetry to opt-out.

* Removed not needed blank lines.

* Text correction.

* Debug output.

* Remove changes from CI configs, add CI var to dockers.

* Config correction.

* Debug output.

* Config corrected.

* Readme corrected.

* Config changed.

* Config changed.

* Debug output.

* Required version changed.

* Remove debug output.

---------

Co-authored-by: Andrey Kashchikhin <andrey.kashchikhin@intel.com>
2023-11-23 14:43:00 +01:00
Ilya Lavrenov
395f1814ff Add prefixes to proto files to avoid ODR conflicts (#20588) 2023-11-23 14:43:00 +01:00
Fang Xu
ba7f31999a build onetbb with hwloc2.9.3 (#20481) 2023-11-23 14:43:00 +01:00
Ilya Lavrenov
323726c9b5 Enabled LC_RPATH for brew (#20587) 2023-11-23 14:42:59 +01:00
Irina Efode
56e38abc03 [OP CONFORMANCE] Fix op version operations in conformance report (#20502)
* [OP CONFORMANCE] Fix op version operations in conformance report

* Code style + SD test
2023-11-23 14:42:59 +01:00
Ilya Lavrenov
6a54c9acde Enabled FrontendLibCloseTest FE test suite (#20592) 2023-11-23 14:42:59 +01:00
Katarzyna Mitrus
76dccd2abf Fix MO multinomial shape array (#20548) 2023-11-23 14:42:59 +01:00
Sergey Lyalin
72de5dd36f Partial GPTQ int4 models conversion support + Swap nibbles in u4/i4 element type (#20371)
* Reference implementation for u4 constant compression from pytorch model based on bitwise ops pattern

* Fixed order of 4-bit halfs in byte

* Switched PyTorch FE to dev mode: in case if model cannot be fully converted, give partially converted model with PTFrameworkNode's with a printed warning (normally would raise an exception in case).

* Moved u4 compression to utils_quantize. Implemented not-interleaved version of u4 compression

* Removed debug output

* Added aten::matmul to the list of exceptions in may_produce_alias as a workaround for gptq models

* Added patching for gptq models applied automatically in convert_model

* WA for an inssue with u4 with earlier convert to fp16

* U4 blocked repacking for gptq patched model layout

* Deleted obsolete u4 re-packing based on aten::cat. Fixed the resulting u4 constant shape. Removed debug output.

* Revert "Switched PyTorch FE to dev mode: in case if model cannot be fully converted, give partially converted model with PTFrameworkNode's with a printed warning (normally would raise an exception in case)."

This reverts commit 0ef1455e70.

* Update src/frontends/pytorch/src/op/cat.cpp

* Check mask and shift values in u4 pattern. deque -> OutputVector for u4_compression_stack

* Convert to a given floating type instead of half in gptq patching. Better structured code.

* Code style fix

* Removed deque include

* Code style fixes

* Trailing space removed

* Fixed patched_forward and ts_decoder after unvalidated commits.

* Swap nibbles in u4/i4

* Better exception handling around jit.trace and gptq.patch_model

* Update src/bindings/python/src/openvino/frontend/pytorch/gptq.py

Co-authored-by: Alexander Kozlov <alexander.kozlov@intel.com>

* Update src/bindings/python/src/openvino/frontend/pytorch/gptq.py

Co-authored-by: Alexander Kozlov <alexander.kozlov@intel.com>

* Code style

* Revers int4 byte order

* Fixed core tests

* Fixed unguarded dynamic_cast result

Co-authored-by: Evgenya Nugmanova <eva.my.link@gmail.com>

* Fixed transformation tests

* Update src/bindings/python/src/openvino/frontend/pytorch/gptq.py

Co-authored-by: Maxim Vafin <maxim.vafin@intel.com>

* Prevent patching of non-gptq models

* Removed extra calling of quantized weights decompression patterns

* Better detection of supported AutoGPTQ models + more diagnostics

* Accurate diagnostics in case when aten::stack has multiple axes

---------

Co-authored-by: Alexander Kozlov <alexander.kozlov@intel.com>
Co-authored-by: Ilya Churaev <ilyachur@gmail.com>
Co-authored-by: Evgenya Nugmanova <eva.my.link@gmail.com>
Co-authored-by: Maxim Vafin <maxim.vafin@intel.com>
2023-11-23 14:42:58 +01:00
Mikhail Ryzhov
968e3c21ce [GHA] Temporary disable win workflows (#20568)
* moved win workflow to nightly

* reset redundant files

* removed redundant files
2023-11-23 14:42:58 +01:00
Anastasia Kuporosova
431d1c295e [Docs] fix failure of python snippet (#20516)
* [Docs] fix failure of python snippet

* trigger python snippets

* fix ga

* Update .github/workflows/linux.yml

Co-authored-by: Mikhail Ryzhov <mikhail.ryzhov@intel.com>

* Update .github/workflows/linux.yml

Co-authored-by: Mikhail Ryzhov <mikhail.ryzhov@intel.com>

* return back fix

---------

Co-authored-by: Chen Peter <peter.chen@intel.com>
Co-authored-by: Mikhail Ryzhov <mikhail.ryzhov@intel.com>
2023-11-23 14:42:58 +01:00
Pawel Raasz
0c32adc633 [core]Migrate VariadicSplit op to new API (#20416)
* Migrate VariadicSlice to new API
- refactor to reduce bin size

* Move `get_tensors_partial_shapes` to dev API

* Use get_tensors_partial_shapes in VariadicSplit

* Remove `visit_attributes` is same as base
2023-11-23 14:42:58 +01:00
Sergey Shlyapnikov
057174ed7f [GPU] Deferred events deallocation and tuning for FC bf_tiled kernel (#20519) 2023-11-23 14:42:57 +01:00