Commit Graph

13409 Commits

Author SHA1 Message Date
Irina Efode
137c8e896f
[OP CONFORMANCE] Fix op version operations in conformance report (#20502)
* [OP CONFORMANCE] Fix op version operations in conformance report

* Code style + SD test
2023-10-19 11:22:44 +04:00
Ilya Lavrenov
5fb6785f9e
Enabled FrontendLibCloseTest FE test suite (#20592) 2023-10-19 11:18:42 +04:00
Katarzyna Mitrus
dc9ec1ee9d
Fix MO multinomial shape array (#20548) 2023-10-19 04:53:38 +04:00
Sergey Lyalin
46935e0a00
Partial GPTQ int4 models conversion support + Swap nibbles in u4/i4 element type (#20371)
* Reference implementation for u4 constant compression from pytorch model based on bitwise ops pattern

* Fixed order of 4-bit halves in byte

* Switched PyTorch FE to dev mode: in case if model cannot be fully converted, give partially converted model with PTFrameworkNode's with a printed warning (normally would raise an exception in case).

* Moved u4 compression to utils_quantize. Implemented not-interleaved version of u4 compression

* Removed debug output

* Added aten::matmul to the list of exceptions in may_produce_alias as a workaround for gptq models

* Added patching for gptq models applied automatically in convert_model

* WA for an issue with u4 with earlier convert to fp16

* U4 blocked repacking for gptq patched model layout

* Deleted obsolete u4 re-packing based on aten::cat. Fixed the resulting u4 constant shape. Removed debug output.

* Revert "Switched PyTorch FE to dev mode: in case if model cannot be fully converted, give partially converted model with PTFrameworkNode's with a printed warning (normally would raise an exception in case)."

This reverts commit 0ef1455e70.

* Update src/frontends/pytorch/src/op/cat.cpp

* Check mask and shift values in u4 pattern. deque -> OutputVector for u4_compression_stack

* Convert to a given floating type instead of half in gptq patching. Better structured code.

* Code style fix

* Removed deque include

* Code style fixes

* Trailing space removed

* Fixed patched_forward and ts_decoder after unvalidated commits.

* Swap nibbles in u4/i4

* Better exception handling around jit.trace and gptq.patch_model

* Update src/bindings/python/src/openvino/frontend/pytorch/gptq.py

Co-authored-by: Alexander Kozlov <alexander.kozlov@intel.com>

* Update src/bindings/python/src/openvino/frontend/pytorch/gptq.py

Co-authored-by: Alexander Kozlov <alexander.kozlov@intel.com>

* Code style

* Reverse int4 byte order

* Fixed core tests

* Fixed unguarded dynamic_cast result

Co-authored-by: Evgenya Nugmanova <eva.my.link@gmail.com>

* Fixed transformation tests

* Update src/bindings/python/src/openvino/frontend/pytorch/gptq.py

Co-authored-by: Maxim Vafin <maxim.vafin@intel.com>

* Prevent patching of non-gptq models

* Removed extra calling of quantized weights decompression patterns

* Better detection of supported AutoGPTQ models + more diagnostics

* Accurate diagnostics in case when aten::stack has multiple axes

---------

Co-authored-by: Alexander Kozlov <alexander.kozlov@intel.com>
Co-authored-by: Ilya Churaev <ilyachur@gmail.com>
Co-authored-by: Evgenya Nugmanova <eva.my.link@gmail.com>
Co-authored-by: Maxim Vafin <maxim.vafin@intel.com>
2023-10-18 21:06:17 +04:00
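The "Swap nibbles in u4/i4" step in the commit above concerns the order in which two 4-bit values are packed into one byte. A minimal sketch of what such a swap looks like, assuming a plain byte buffer; the helper names `swap_nibbles` and `unpack_u4_low_first` are hypothetical and are not taken from the OpenVINO sources:

```python
from typing import List


def swap_nibbles(packed: bytes) -> bytes:
    """Exchange the high and low 4-bit halves of every byte (illustrative helper)."""
    return bytes(((b & 0x0F) << 4) | (b >> 4) for b in packed)


def unpack_u4_low_first(packed: bytes) -> List[int]:
    """Unpack unsigned 4-bit values, assuming the low nibble holds the first element."""
    values = []
    for b in packed:
        values.append(b & 0x0F)  # first element: low nibble
        values.append(b >> 4)    # second element: high nibble
    return values


# Byte 0x12 stores nibbles (high=1, low=2); after the swap it stores (high=2, low=1).
swapped = swap_nibbles(b"\x12\xab")
elements = unpack_u4_low_first(b"\x21")
```

Applying the swap twice restores the original buffer, so the same routine converts between the two packing orders in either direction.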
Mikhail Ryzhov
cf9791e3e0
[GHA] Temporarily disable win workflows (#20568)
* moved win workflow to nightly

* reset redundant files

* removed redundant files
2023-10-18 20:43:46 +04:00
Anastasia Kuporosova
491454103e
[Docs] fix failure of python snippet (#20516)
* [Docs] fix failure of python snippet

* trigger python snippets

* fix ga

* Update .github/workflows/linux.yml

Co-authored-by: Mikhail Ryzhov <mikhail.ryzhov@intel.com>

* Update .github/workflows/linux.yml

Co-authored-by: Mikhail Ryzhov <mikhail.ryzhov@intel.com>

* return back fix

---------

Co-authored-by: Chen Peter <peter.chen@intel.com>
Co-authored-by: Mikhail Ryzhov <mikhail.ryzhov@intel.com>
2023-10-18 14:23:52 +00:00
Pawel Raasz
f2549f2d59
[core]Migrate VariadicSplit op to new API (#20416)
* Migrate VariadicSplit to new API
- refactor to reduce bin size

* Move `get_tensors_partial_shapes` to dev API

* Use get_tensors_partial_shapes in VariadicSplit

* Remove `visit_attributes` as it is the same as base
2023-10-18 15:02:18 +02:00
Sergey Shlyapnikov
6c396fac9d
[GPU] Deferred events deallocation and tuning for FC bf_tiled kernel (#20519) 2023-10-18 16:28:27 +04:00
Sergey Shlyapnikov
6e97b91a77
[GPU] UsmHostTensor implementation (#20518)
Co-authored-by: Vladimir Paramuzov <vladimir.paramuzov@intel.com>
2023-10-18 16:12:15 +04:00
Vladimir Paramuzov
90ad4c618d
[GPU] Grouped decompression scale/zp support (#20491) 2023-10-18 15:56:35 +04:00
Tatiana Savina
4574fb112c
change snippet name (#20538) 2023-10-18 13:27:10 +04:00
Zhang Yi
d82cd839ce
[PaddleFT]Simplify slice converter by opset8::slice (#20508) 2023-10-18 13:21:53 +04:00
Pawel Raasz
9fb40b0007
[core]Migrate LogicalNot to new API (#20438)
* Migrate LogicalNot to new API

* Rename file not.hpp -> logical_not.hpp

* Apply code style
2023-10-18 09:16:12 +00:00
Pawel Raasz
2415f0c7cf
[core]Migrate BatchToSpace to new API (#20450)
* Migrate BatchToSpace to new API

* Remove not required suppression macro
2023-10-18 09:09:27 +00:00
Ekaterina Aidova
222fbb1aec
[PT FE]: support aten::fill_diagonal_, aten::fill (#20395)
* [PT FE]: support aten::fill_diagonal_, aten::fill

* remove xfail

* Update src/frontends/pytorch/src/op/full.cpp

Co-authored-by: Maxim Vafin <maxim.vafin@intel.com>

* Update tests/model_hub_tests/torch_tests/test_hf_transformers.py

---------

Co-authored-by: Maxim Vafin <maxim.vafin@intel.com>
2023-10-18 10:58:54 +02:00
Nikolay Shchegolev
4eab5b4635
[CPU] RandomUniform-8 implementation. (#20171) 2023-10-18 12:50:46 +04:00
hyunback kim
5f336a6870
[GPU] Support o_is_yx_isv2 (#20521)
216-attention-center model in notebooks needs o_is_yx_isv2 with onednn3.3

Signed-off-by: hyunback <hyunback.kim@intel.com>
2023-10-18 07:54:09 +00:00
Sungeun Kim
30a038c776
[GPU] Gather needs to keep the original input/output rank (#20042)
* Gather needs to keep the original input/output rank
  - because the parameters as indices, batch_dims and axis depend on the rank.
  - add input_rank to gather primitive.
* don't query on set_preferred_formats pass
  - when the force_implementations is set.
  - when forcing_impl is not onednn.
2023-10-18 16:28:22 +09:00
Andrei Gorbachev
c8b64f6a95
[GPU] Refactor BroadcastLayer, Bucketize, Concat (#20369)
* BroadcastLayer

* Bucketize

* Concat

* fix incorrect fill_random_unique_sequence for f16

* Update src/plugins/intel_gpu/tests/functional/shared_tests_instances/single_layer_tests/broadcast.cpp

Co-authored-by: Roman Lyamin <Roman.Lyamin@intel.com>

* fix after review

---------

Co-authored-by: Roman Lyamin <Roman.Lyamin@intel.com>
2023-10-18 10:02:18 +04:00
Siddhant Chauhan
a30e25c725
[TF FE][TF Hub] Support BatchMatMulV3 operation (#20528)
* [TF FE][TF Hub] Support BatchMatMulV3 operation

* Update src/frontends/tensorflow_common/src/op/matmul.cpp

* Update src/frontends/tensorflow_common/src/op/matmul.cpp

---------

Co-authored-by: Roman Kazantsev <roman.kazantsev@intel.com>
2023-10-18 09:49:33 +04:00
Nesterov Alexander
3b2ad48d79
[ARM CPU] Common fixes for fp16 (#20504) 2023-10-18 07:04:39 +02:00
Wang, Yang
f723f90100
fixing the issue of importing property from openvino. (#20531) 2023-10-18 10:36:20 +08:00
Maxim Vafin
f3b61d77e3
[GHA] Filter timm scope for model hub tests (#20525)
* Filter timm scope

* Apply suggestions from code review
2023-10-17 23:01:50 +04:00
Pawel Raasz
7515b042e8
[core]Migrate squeeze operator to new API (#20282)
* Migrate Squeeze to new API

* Remove visit_attributes as it is the same as the base class
2023-10-17 19:17:49 +02:00
Katarzyna Mitrus
33a97740b2
[MO][Opset13] Multinomial-13 support in MO IR Reader (#20446)
* Add Multinomial-13 to MO

* Add Multinomial tests for MO IR reader

* Move convert_type check

* Imports clean up

* Update package BOM file

* Avoid files collision in tests
2023-10-17 17:50:42 +02:00
Tatiana Savina
f78e206ecb
[DOCS] Update components in readme (#20524)
* update readme

* add space

* add space
2023-10-17 17:49:07 +02:00
Tatiana Savina
b270e36590
add AMX (#20523) 2023-10-17 17:48:20 +02:00
Oleg Pipikin
a5fb3823f6
Refactor EinsumLayerTest, EyeLayerTest (#20288)
* Refactor EinsumLayerTest

* Refactor EyeLayerTest

* Apply comments

* Fix

---------

Co-authored-by: Vitaliy Urusovskij <vitaliy.urusovskij@intel.com>
2023-10-17 17:35:24 +02:00
Georgy Krivoruchko
1e7977332b
Fixed naming and comments in ReduceProd (#20512) 2023-10-17 17:30:22 +02:00
Vitaliy Urusovskij
8fff47caf9
ROIPooling, ROIAlign layer tests to API2.0 (#20086)
* `ROIPoolingLayerTest` to API2.0

* `ROIAlignLayerTest` to API2.0
2023-10-17 14:35:19 +00:00
Aleksandr Voron
e87d147f4c
[CPU] [ARM] Enable Pooling SLT tests on ARM (#18013) 2023-10-17 18:09:44 +04:00
Katz Sasaki
404f4e29b7
fix typo of config check error message (#20517) 2023-10-17 17:26:37 +04:00
Oleg Pipikin
a1daedc2e1
Disable NormalizeL2 and NonZero f16 tests on MacOs Arm64 (#20451) 2023-10-17 14:43:55 +04:00
Tomasz Jankowski
e1a29ae96e
[core] Migrate MaxPool operator to new API (#20424)
* Use API 2.0 in MaxPool v1

* Use API 2.0 in MaxPool v8

* Fix type selection

* Fix type selection

* Avoid redundant local copies
2023-10-17 13:30:18 +04:00
Vladislav Golubev
93a70497bd
[CPU] fp16 weights decompression works with conversion to f32 (#20427) 2023-10-17 12:47:15 +04:00
Georgy Krivoruchko
05297a5572
Added coverage of "0" in bounds to ReduceProd (#18583)
* Check for ReduceProd + SoftMax fix

* Check for ReduceProd + SoftMax fix

* Fix after moving on get_constant_max_of_type

* Extended tests and added coverage for other types

* Code optimization
2023-10-17 12:38:24 +04:00
Maciej Smyk
07831c9dce
[DOCS] Supported formats update for Benchmark C++ Tool for master (#20449)
* Update cpp_benchmark_tool.md

* Update cpp_benchmark_tool.md

* Update cpp_benchmark_tool.md
2023-10-17 09:59:14 +02:00
Ilya Churaev
3ced4a23e7
Remove TREAT_WARNING option from the doc (#20507) 2023-10-17 11:36:48 +04:00
Siddhant Chauhan
07a29f80b4
[TF FE][TF Hub] Support Xlog1py operation (#20500)
* [TF FE][TF Hub] Support Xlog1py operation

* Update test_tf_Xlog1py.py
2023-10-17 11:36:13 +04:00
Roman Lyamin
fe9d5607ed
[GPU] Fixed data generation for f16 fusion tests (#20247)
* [GPU] Fixed data generation for f16 fusion tests

* [GPU] Temporary tolerance increase for failed tests on iGPU

* [GPU] Temporary skip or tolerance increase for failed tests on dGPU
2023-10-17 11:17:07 +04:00
Wang, Yang
86000bb8a2
[GPU] Reserve CPU resource for GPU inference (#19214)
* Update.

* Update.

* Update.

* add test case.

* Update.

* Update test cases.

* Update.

* Update.

* Updated.

* Updated.

* Updated.

---------

Co-authored-by: Chen Peter <peter.chen@intel.com>
2023-10-17 05:42:56 +00:00
Wang, Yang
494f3d4bc8
[AUTO] Update python example and selection logic notice (#20294)
* Enable device properties setting via AUTO plugin.

* Update.

* Update.

* Update.

* Update.

* Update.

* Update.

* Update docs/OV_Runtime_UG/auto_device_selection.md

Co-authored-by: yanlan song <bell.song@intel.com>

* Add for stateful model

* Fix the link

---------

Co-authored-by: yanlan song <bell.song@intel.com>
Co-authored-by: Chen Peter <peter.chen@intel.com>
2023-10-17 11:28:59 +08:00
Pawel Raasz
baee150678
Restore f16 precision due to GPU performance drop (#20443) 2023-10-17 01:22:32 +02:00
Aleksandr Voron
c1816af864
[CPU] add_compile_definitions (OV_CPU_ARM_ENABLE_FP16) (#20498) 2023-10-16 20:06:59 +02:00
Yaroslav Torziuk
5d33a68598
[GPU] GroupNormalization-12 GPU reference implementation (#19155)
* Add group_normalization_kernel_selector

* Define group_normalization GPU primitive and its instantiation

* Add GroupNormalization operation builder

* Add test class for GroupNormalization operator

* Add instantiation of GroupNormalization test for GPU Plugin

* Disable GroupNormalizationDecomposition transformation in GPU Plugin

* Add GroupNormalizationKernelRef implementation

* Add GroupNormalization unit tests which cover blocked layout support
2023-10-16 10:54:56 -07:00
Vitaliy Urusovskij
d617952b0a
Pooling to API2.0 (#20495)
* Refactor PoolingLayerTest

* Additional refactoring

---------

Co-authored-by: Oleg Pipikin <oleg.pipikin@intel.com>
2023-10-16 19:40:24 +02:00
Karol Blaszczak
e0e7687143
[DOCS] restructuring the conversion section (#20487) 2023-10-16 19:06:55 +02:00
Paul Youngsoo Ahn
89bb8745a5
[GPU] Enable dynamic loop (#19818)
* [GPU] enable dynamic loop
- support multiple outputs
- support dynamic loop memory allocation
- support negative num_iterations
- implement calc_output_layouts
- add dynamic loop functional / unit tests

* Fix fail to check memory to set when original 1d data
- follow up code reviews

* Fix unit test failures

* Follow up code review
* Modify concat memory map creation process
* Check whether or not first input of loop is num_iteration_id

* Follow up code review
- refactoring preprocess_backedge_memory

* Fix ci failures
* Clear custom_outputs_vec for condition
* Add num_outputs for condition and loop

* Fix constant and param of body network have mismatched layouts

* Set consts.needsBatchInterpretation for const

* refactoring is_dynamic in loop_inst::execute

* remove wait_for_events in body_network execution loop

* Remove redundant events

* follow-up code review - modify OPENVINO_ASSERT

* Remove redundant codes in loop_inst::execute

* add current iteration update nodes into the ov::Model

* rollback some codes for the performance degradation
2023-10-16 09:41:51 -07:00
Andrey Kashchikhin
755651cd34
disable pr trigger (#20497) 2023-10-16 19:21:48 +04:00
Sergey E. Losev
bee7559faa
Check output shape in OVInferRequestDynamicTests (#20398)
Co-authored-by: Ilya Churaev <ilya.churaev@intel.com>
2023-10-16 17:46:49 +04:00