Commit Graph

13380 Commits

Author SHA1 Message Date
Pawel Raasz
e1a33f10d5
[core]Migrate Slice to new API (#20417)
* Migrate slice to new API

* Remove visit_attributes, is same as base class

* Move shape checks to shape_infer
- minor refactor Slice op

* Move `get_tensors_partial_shapes` to dev API

* Correct comment

Co-authored-by: Tomasz Jankowski <tomasz1.jankowski@intel.com>

---------

Co-authored-by: Tomasz Jankowski <tomasz1.jankowski@intel.com>
2023-10-25 08:19:14 +00:00
Evgenya Nugmanova
7874adb58e
[Symbolic SI] Refactor Table of Equivalence (#20627) 2023-10-25 09:54:47 +02:00
Nikolay Shchegolev
dc4240bc61
[CPU] Removed custom ShapeInference impl for RandomUniform (#20599) 2023-10-25 09:15:03 +02:00
Maciej Smyk
706d657637
[DOCS] Install Guide Update for master (#20677)
* missing info

* System Requirements

* Update installing-openvino-from-archive-macos.md

* system requirements update
2023-10-25 09:12:41 +02:00
Wilson Seok
c70f0ca45d
[GPU] skip excessive mem alloc request in build (#20399)
* skip excessive mem alloc request in build

* update mem check function

* fix os behavior

* update mem size check location

* only dynamic shape case takes check_allocatable

* update check condition
2023-10-25 09:09:11 +02:00
Wilson Seok
9d56c31581
[GPU] add shape infer in scatter elements update (#20250)
* add shape infer in scatter elements update

* output shape is same with input shape in dynamic case
2023-10-25 09:01:52 +02:00
Vitaliy Urusovskij
a71283ea94
RNNSequence, Result, Reshape, ReorgYolo, RegionYolo layer tests to API2.0 (#20644)
* `RNNSequenceTest` to API2.0

* `Result` to API2.0

* `Reshape` to API2.0

* `ReorgYolo` to API2.0

* `RegionYolo` to API2.0

* Alignment fixes

* Skip more `RNNSequenceTest` cases
2023-10-25 08:16:28 +02:00
Maxim Vafin
46f46c6cc6
[PT FE] Fix xmod model test (#20682) 2023-10-25 08:00:34 +02:00
Oleg Pipikin
30260e3c7c
Refaсtor ActivationLayerTest (#20180)
* Refator ActivationLayerTest
2023-10-25 09:37:29 +04:00
Tomasz Jankowski
973b194776
[core] Migrate DepthToSpace operator to new API (#20515)
* Move into ov namespace

* Use ov::Tensor in place of HostTensor

---------

Co-authored-by: Michal Lukaszewski <michal.lukaszewski@intel.com>
2023-10-25 07:10:54 +02:00
Pawel Raasz
84732515b4
Fix check repeats in values in Tile (#20654)
- no action if any of repeats is zero

Co-authored-by: Michal Lukaszewski <michal.lukaszewski@intel.com>
2023-10-25 07:27:21 +04:00
Eddy Kim
6fa4f9fd78
[GPU] fixed to create Graphs with different stream_ids (#20626)
* fixed to create Graphs with different stream_ids

* added num_streams config for auto batch test
2023-10-24 20:26:36 -07:00
Pawel Raasz
5fee2ef67e
Migrate LessEqual and GreaterEqual to new API (#20645)
Co-authored-by: Michal Lukaszewski <michal.lukaszewski@intel.com>
2023-10-25 07:26:31 +04:00
Pawel Raasz
2b65855793
[core]Migrate Less and Greater operators to new API (#20628)
* Migrate Less operator to new API

* Migrate Greater operator to new API
- use less implementation in greater to reduce bin size

---------

Co-authored-by: Michal Lukaszewski <michal.lukaszewski@intel.com>
2023-10-25 07:25:05 +04:00
Andrey Babushkin
ee6263a141
[workflows/linux] Switch to sccache and Azure Blob Storage (#20484)
* [workflows/linux] Switch to sccache and Azure Blob Storage

* Install curl

* Remove --show-config

* Add sccache to other Linux workflows

* sccache to  Android, curl to riscv and CC

* Use sccache action instead of manual install

* Oops, missed sccache manual installation in two places

* Use env vars instead of hardcoded CMAKE_C(XX)_COMPILER_LAUNCHER

* Forgot one more stage in Linux CC pipeline

* Temporarily disable Blob Storage for RISC-V

For some reason sccache has no effect on build time and show 0 hits
and 0 compilation requests despite being in CMake calls

* forgot to add sccache installation to Linux CC

* Revert "Temporarily disable Blob Storage for RISC-V"

This reverts commit b528f41dad583a38b9ef93121e38044b9dccb71b.

* Missing container option for CC build

* Remove curl installation

* Remove CCACHE* variables which have no effect on sccache

* Revert sccache changes for Linux RISC-V workflow
2023-10-24 20:52:56 +01:00
Oleksii Khovan
984e4dbf35
[GPU] NMSRotated-13 (#20411)
* Add Rotation support to primitive and kernel

* Add unit tests

* Add transformation for NMSRotated

* add single-layer tests

* Fix: angle value for the same box may have its sign changed several times passing through iterations of batch and class loops.

* fix review comments
2023-10-24 21:36:02 +02:00
Siddhant Chauhan
eb55360f10
[ONNX] Extend ONNX Frontend with BlackmanWindow, HammingWindow and HannWindow operators (#19428)
* ONNX BlackManWindow enabled

* added a test periodic

* Add the license statement

* ONNX HammingWindow, HannWindow enabled

also added basic tests for each

* minor tests added

* made reviewed changes

* made reviewed changes

used output_datatype directly, returned y_values directly

* fixed clang-format

* add OPENVINO_SUPPRESS_DEPRECATED_START

* include math.h

* float fix

* fix

* fix namespace to set_1

* test fixes

* fix cast to output_datatype

* fix, replace cast with ov::convert

* fix, use element::f32

* major fixes

* fixes

* Update onnx_import.in.cpp

* Update onnx_import.in.cpp

---------

Co-authored-by: Przemyslaw Wysocki <przemyslaw.wysocki@intel.com>
2023-10-24 21:45:50 +04:00
Evgeny Kotov
22184c32f4
fix random layer names and count (#20323)
* add sorting for fix sporadic failure in SharedOpOptimization shared_node_optimization

* fix Output and Input comparison

* remove unneed sorting from transformation

* add unit test

* code review fixes

* code review fixes

* code review fixes

* code review fixes

---------

Co-authored-by: Ivan Tikhonov <ivan.tikhonov@intel.com>
2023-10-24 18:18:50 +02:00
Ilya Churaev
124f2bc5c9
Remove NF4 Convert from public API (#20666)
* Remove NF4 Convert from public API

* Fixed build
2023-10-24 19:19:16 +04:00
Ilya Lavrenov
337e225dbd
Removed linux.yml AZure pipeline (#20341) 2023-10-24 16:59:17 +02:00
Sergey Shlyapnikov
26cab79988
[GPU] Improve OOOQ synchronization between ShapeOf subgraph's CPU impls and GPU kernels (#20595)
* [GPU] Add ITT counters for primitive inst

* [GPU] Improve OOOQ synchronization between ShapeOf subgraph's CPU impls and GPU kernels
2023-10-24 18:49:41 +04:00
Andrei Gorbachev
20bd58759a
[GPU] Refactor grn, gru_cell, group_convolution, group_convolution_backprop_data, is_inf, log_softmax, logical, mat_mul, lrn, lstm_cell (#20520)
* grn

* gru_cell

* group_convolution

* group_convolution_backprop_data

* is_inf

* log_softmax

* logical

* mat_mul

* lrn

* lstm_cell

* fix after review
2023-10-24 18:48:29 +04:00
Aleksandr Voron
c6707aab86
[CPU] Add input type check into in-place condition (#20529) 2023-10-24 18:45:48 +04:00
Irina Efode
63fff9d270
[IE TESTS][CONFORMANCE] Move Calculate reference over TEMPLATE plugin instead of interpreter (#12532)
* [CONFORMANCE] Enable template calculation reference

* Move convert init to cpu

* skip gpu
2023-10-24 16:34:37 +02:00
Evgeny Kotov
251602636c
Fuse GeLU (#20428)
* add transformation + test

* move transformation to GeluFusion

* add comments

* code review fixes

* fix

* code review fixes

* code style fix

* fix windows build warning
2023-10-24 18:23:14 +04:00
Pawel Raasz
5ffde7d8d6
[core]Migrate Minimum operator to new API (#20597)
* Migrate Minimum op to new API

* Refactor evaluates to reduce binary size
- add infer_broadcast_shape, get shapes from tensors reduce OV_ASSERT
- refactor Evaluate structures to reduce binary size

---------

Co-authored-by: Michal Lukaszewski <michal.lukaszewski@intel.com>
2023-10-24 14:48:09 +02:00
Vitaliy Urusovskij
e977a6ed9b
Fix COPY_INSTEAD_OF_MOVE issue in Hetero (#20667) 2023-10-24 12:44:01 +00:00
Mateusz Tabaka
b7406247ff
Add TransposeMatMul transformation to MOC (#20460)
Transformation fuses Transpose on first or second MatMul's input
and sets MatMul's transpose_a/transpose_b accordingly.

TransposeMatMul is already part of SmartReshape, but it can be added
to MOCTransformations as well so native models that are don't use reshape
can benefit from that.

Ticket: CVS-118908
2023-10-24 14:40:36 +02:00
Maksim Kutakov
6395fc672d
[CPU] Make static TI run a dynamic subgraph (#20527)
* Make static TI run a dynamic subgraph

* Dedicated SL test

* Change condition to respect stat shapes

* Adjust test to cover the code path properly
2023-10-24 14:28:00 +02:00
Pawel Raasz
750f62fd04
[shape_infer]Add preserve partial values on inputs for Mod operator (#20169)
* Preserve partial values on mod inputs
- static values full range of integers
- intervals only if not negatives

* Fix bounds evaluate when inputs are scalars
2023-10-24 14:53:54 +04:00
Vitaliy Urusovskij
1daa4b9e5e
Skip smoke_TestsROIPooling f16 test on ARM (#20552) 2023-10-24 09:09:37 +00:00
Mateusz Tabaka
2668f68816
CompressQuantizeWeights - fix zero point calculation (#20541)
Current implementation tries to leverage branchless approach, but it's not correct
if scale is 0. In that case - zero point can can become inf or nan and multiplication
by 0 doesn't change its value. That causes another issue - infinite or NaN zero point
cannot be optimized out later.

Ticket: CVS-122931

Co-authored-by: Ivan Tikhonov <ivan.tikhonov@intel.com>
2023-10-24 10:21:06 +02:00
Andrei Gorbachev
ea6922386e
[GPU] Refactor ConvertColorNV12, ConvolutionBackprop, Convolution (#20376)
* ConvertColorNV12

* ConvolutionBackprop

* Convolution

* fix after review
2023-10-24 09:38:23 +02:00
Vladimir Paramuzov
5853509b3c
Fixed ngraph reference impl for Range operation (#20631)
* Fixed ngraph reference impl for Range operation

* Truncate range for integer output type

* explicit static cast
2023-10-24 10:18:04 +04:00
Ilya Churaev
7ceff55b71
Add AlignedBuffer to OpenVINO developer API (#20532)
* Add AlignedBuffer to OpenVINO developer API

* Fixed build

* Fixed code style and remove opset deprecation

* Fixed Windows build

* Fixed GNA

* Fixed comment
2023-10-24 06:13:23 +00:00
Yuan Hu
84a0994ec5
[core] fix memory leak issue imported by #18868 (#19832)
* try to fix memory leak issue

cpustreamer is released, but there are still thread id in t_stream_count_map

* fix threadlocal affect all threads

Signed-off-by: HU Yuan2 <yuan2.hu@intel.com>

* add comment for local() function to avoid mistaken modification
in the future

Signed-off-by: HU Yuan2 <yuan2.hu@intel.com>

* use custom stread id

Signed-off-by: HU Yuan2 <yuan2.hu@intel.com>

* fix review comments

Signed-off-by: HU Yuan2 <yuan2.hu@intel.com>

* fix format issue

Signed-off-by: HU Yuan2 <yuan2.hu@intel.com>

* create shared_ptr before assert

Signed-off-by: HU Yuan2 <yuan2.hu@intel.com>

---------

Signed-off-by: HU Yuan2 <yuan2.hu@intel.com>
2023-10-24 13:59:08 +08:00
Vladislav Golubev
afda7ad70f
[Transformations] FuseU4WeightsAndZeroPoint transformation (#20503) 2023-10-24 09:44:26 +04:00
Zhang Yi
d490ab68d1
[CPU][MLAS] FC uses L2 cache for thread partition (#20436) 2023-10-24 09:31:03 +04:00
Ivan Novoselov
bc82ba4419
[Snippets] Dynamic pipeline reorganization (#18563) 2023-10-24 09:23:10 +04:00
Roman Lyamin
59fe0a05a4
[GPU] Fix for reorder fusing in fuse_constant_transposes pass (#20651) 2023-10-24 09:21:30 +04:00
Andrew Kwangwoong Park
894221ea38
[GPU] Add RMS internal op, related transformation, primitive, and kernels for RMS decomposition fusion (#20355)
* Initial implementation of primitive, kernel selector, dummy kernel for RMS Norm

Signed-off-by: Andrew Park <andrew.park@intel.com>

* RMS ref kernel implementation with single WI

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Add TC and reference func for ov_gpu_unit_tests

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Add internal RMS norm op

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Add transformation which fuse RMS decompsition pattern to RMS internal op

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Fix pattern for RMS fusion transformation

* Update rms ref kernel for optimization and additional planar format suuport

* Initial impl for optimized rms kernel excluding leftovers handling and case smaller than vector size

* Update the initial version to handle leftovers and case smaller than vector size

* Fuse pre decom and post comp reorders additionally

* Enable dynamic impl for rms again

* Revert fuse pre decomp and post comp reorders additionally

* Add subgraph TC for ov_gpu_func_tests

* decrease error margin for f32 data type

* update description

Signed-off-by: Andrew Park <andrew.park@intel.com>

* update test param for input shapes

* Apply comments

* Fix failed TC for invalid gamma element type

* Apply comments

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Update pattern that fuse post reorder together

* Apply comments

---------

Signed-off-by: Andrew Park <andrew.park@intel.com>
2023-10-24 02:52:02 +00:00
Maxim Vafin
8d0381b0fe
[PT FE] Implement custom op for types alignment (#20431)
* [PT FE] Implement custom op for types alignment

* Fix code style

* Fix inplace ops

* Fix layer tests

* Remove no longer needed change

* Fix ovc tests

* Fix fe tests
2023-10-23 22:54:08 +02:00
Roman Kazantsev
009ef5657c
[TF FE] Provide full support of TF1 Control flow and TensorArray* ops (#20270)
* [TF FE] Provide full support of TF1 Control flow and TensorArray ops

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>

* Add missed header for TensorArrayV3 op

* Temporarily disable GRU cell fusion

* Update src/common/transformations/src/transformations/common_optimizations/moc_transformations.cpp

* Fix a case when element_shape for TensorArrayV3

* Fix translator for TensorArrayCloseV3

* Update summarize graph with TensorArrayCloseV3

* Add layer tests for TensorArrayScatterV3, Close, Size, Array

* Fix output shape for Merge node

* Remove unused variable

* Fix translator for TensorArrayConcatV3

* Fix translator for TensorArrayConcatV3

* Add layer tests for TensorArrayWriteV3, Gather, and Concat

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>

* Add translator for GatherTree

* Fix TF FE unit-test for GatherTree

* Fix GatherTree translator

* Fix GatherTree translator to handle 1d end_token

* Fix undeclared parameter issue

* Fix GatherTree unit-test

* Add TensorArrayV3Replacer transformation

* Temporarily disable dangling transformation

* Recover RemoveMultiSubGraphOpDanglingParamsResults transformation

* Recover GRUCellFusion transformation

* Simplify check for GRUCellFusion transformation

* Use proper name for unit-tests

* Simplify translator for TensorArrayWriteV3

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>

* Fix RemoveMultiSubgraphOpDanglingParamsResults transformation

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>

* Additional fix for remove_multi_subgraph_op_dangling_params

* Make static TI run a dynamic subgraph

* Dedicated SL test

* Change condition to respect stat shapes

* Adjust test to cover the code path properly

* Recover fallback for still failing case GNMT

---------

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
Co-authored-by: Maksim Kutakov <maksim.kutakov@intel.com>
2023-10-23 22:50:26 +02:00
Roman Kazantsev
99dfbb400a
[TF FE] Document full list of TF operations and their support by TF FE (#20640)
* [TF FE] Document full list of TF operations and their support by TF FE

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>

* Update src/frontends/tensorflow/docs/supported_ops.md

Co-authored-by: Karol Blaszczak <karol.blaszczak@intel.com>

---------

Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
Co-authored-by: Karol Blaszczak <karol.blaszczak@intel.com>
2023-10-23 15:24:35 +04:00
Fang Xu
5e017dc5d2
fix compilation issue for openmp on windows (#20312)
* fix compilation issue for openmp on windows

* update based on suggestions
2023-10-23 15:18:51 +04:00
Andrey Kashchikhin
b67cff7cd5
[CI] [GHA] Introduce macOS ARM64 as a matrix parameter in the macOS pipeline (#20363)
* add m1 mac pipelines as a matrix parameter

* Update mac.yml

disable java_api because of macos arm64 - Java is not available on macOS arm64 runners

* Update mac.yml

added always condition for all tests

* Update mac.yml

* Update mac.yml

* Update mac.yml

* Update setup.py

temp commit

* Update tools/openvino_dev/setup.py

* use matrix for var

* add mxnet to extras only for x86_64

* skip failing tests

* use xfail for Python tests; add missing filter for transformations tests

* skip CPU func tests on x86_64 mac; skip some tests from CPU func tests on arm mac

* Update mac.yml

* skip tests on mac arm

* skip tests on darwin; apply review

* add more skips for python and c++ tests

* skip tf tests

* skip more tf tests; skip more Python UT stages

* rm alwayses, rm triggers, add nightly trigger

---------

Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>
2023-10-23 15:06:22 +04:00
Irina Efode
4fe6d5ec54
Fix OpImplCheck in GHA (#20650)
* Update mac.yml

* Update mac.yml
2023-10-23 15:04:05 +04:00
Ilya Churaev
865b21ecd4
Introduce WA to improve performance of find_port() method (#20573)
* Introduce WA to improve performance of find_port() method

* Add mutex

* Remove redindant lock

* Reduce the number of get_tensor_ptr calls

* Fixed typo

* Removed WAs from Hetero plugin
2023-10-23 13:44:58 +04:00
Irina Efode
6e4ec88db8
[GHA][CONFORMANCE] Enable and conformance for Template in GHA (#20604)
* [GHA][CONFORMANCE] Enable  and conformance  for Template in GHA

* apply review

* Update linux.yml
2023-10-23 10:47:15 +02:00
Karan Jakhar
5dafee4ac1
fixing type, suppored -> supported (#20639) 2023-10-22 17:25:59 +04:00