Commit Graph

13806 Commits

Author SHA1 Message Date
Vitaliy Urusovskij
e6ab01c681
Shared onnx tests to API2.0 (#21726) 2023-12-20 01:03:33 +04:00
Sofya Balandina
5b776e9f1f
[apiConformance] Fix checks for finding logs in analyze_confomance_fails (#21767) 2023-12-20 00:52:08 +04:00
Ilya Lavrenov
662aa57a92
Downgrade GHA for upload / download back to @v3 (#21770) 2023-12-20 00:48:31 +04:00
Ilya Lavrenov
dc79c41fb8
Updated xbyak_aarch64 in onednn (#21762) 2023-12-20 00:13:17 +04:00
Jan Iwaszkiewicz
712e374519
[PyOV] Allow replacing of invalid bytes in string-based data (#21761)
* [PyOV] Allow replacing of invalid bytes in string-based data

* Fix linter
2023-12-19 23:25:52 +04:00
Georgy Krivoruchko
27c9ca749d
[ONNX] Switched to ONNX 1.15.0 (#20929)
* [ONNX] Switched to ONNX 1.15.0

* Updated usage of ONNX serialization API

* Updated xfailed lists

* Updated xfailed lists

* Added xfailed ML-operators

* Added xfailed float8 cases

* Fixing mo tests

* Reverted legacy tests

* Sets default opset version in layer tests

* Skipped test which breaks moving on 1.15

* Updated xfailed list

* Fixed typo and added more tests to skipped

* Added xfailed tests

* Reduced opset due to need to work on onnx 1.13

* Removed unnecessary xfails

* Skipped misaligned tests

* WA for old compiler build
2023-12-19 18:55:32 +00:00
Alina Kladieva
224eb79ab6
[GHA] Prepare new required stages for precommit (#21766)
* Add Smart CI support to webassembly workflow

* Add ONNX_Models to required checks list
2023-12-19 22:52:12 +04:00
River Li
7b1074b275
Generalize model cache reusing (#21492)
* Generalize model cache reusing

* Use .as<std::AnyMap>() to convert std::string to std::AnyMap

* Store driver version for different devices

* Use driver version as gpu compiled model runtime properties
2023-12-19 16:50:05 +01:00
Vladimir Paramuzov
294cc6668c
[GPU] KVCache custom op and fusion pass (#21689) 2023-12-19 17:56:25 +04:00
Yaroslav Torziuk
6719133af2
[GPU] Fix accuracy problems brought by permute_f_y kernel (#20241)
* Add test cases from Levit model

* Enable PermuteKernel_f_y_axes kernel

* Fix permute of y and b for planar layout where x is not 1

* Enable fusing in permute_kernel_f_y_axes

* Add tests for fusing in permute_kernel_f_y_axes

---------

Co-authored-by: Wilson Seok <wilson.seok@intel.com>
2023-12-19 13:50:32 +01:00
Nikolay Shchegolev
6246fb0c23
[CPU] StringMemory for the output tensor in the InferRequest (#21746) 2023-12-19 15:20:15 +04:00
Roman Lyamin
6060683f4c
[GPU] Fix for is_user_cpu() function (#21744) 2023-12-19 10:48:13 +01:00
Sofya Balandina
8f13219728
[op conformance] Made fixes to align with accuracy validation (#21347) 2023-12-19 13:43:40 +04:00
Xiping Yan
a3a3f0b55c
[CPU] Fix chatglmv2 accuracy issue and crash with input 1024 token in SPR (#21360) 2023-12-19 13:13:35 +04:00
Pawel Raasz
63cb89fe1b
Use move instead copy (#21732) 2023-12-19 12:43:28 +04:00
Maciej Smyk
e94feea5c7
[DOCS] Update of graphs in repo (#21737)
* csv files

* Update GNA.rst

* csv files

* csv files

* csv

* csv

* removal of duplicates

* duplicates removal

* Delete small_IR_graph_demonstration.png

* moving graph files to static folder

* Update LowPrecisionModelRepresentation.rst

* update
2023-12-19 09:20:24 +01:00
Anastasiia Pnevskaia
3c157b2a1b
[TF FE] [Telemetry] Added error_info event to TF FE. (#21736)
* Added error_info event to TF FE.
2023-12-19 12:14:36 +04:00
Pawel Raasz
499a9694e5
Fix Coverity CID 1529136, 1529124 (#21733)
* CID 1529136, fix auto makes a copy of PartialShape
- minor updates to reduce redundant Rank object creation

* CID 1529124 fix copy caused by auto
2023-12-19 07:06:33 +00:00
rghvsh
a4e8f9d98c
[ONNX] Extend ONNX Frontend with BitwiseAnd-18 operator (#21585) 2023-12-19 09:54:55 +04:00
Vladimir Paramuzov
63b23a1ebb
[GPU] Enable state precision conversion to infer precision (#21729) 2023-12-19 09:36:03 +04:00
Min, Byungil
0e92469330
[GPU] bugfix slice mem calculation for batch iteration (#21716)
Signed-off-by: Min, Byungil <byungil.min@intel.com>
2023-12-19 14:20:38 +09:00
Taylor Yeonbok Lee
b770780b89
[GPU] Skip redundant gather in stateful model (#21681)
* Skip redundant gather in stateful model

* Fix memory reuse issue for node skipped at runtime.
If the node is not marked as can_be_optimized at build time, memory dep is not properly applied
=> So it can cause the wrong memory reuse
2023-12-19 00:00:39 +00:00
Jan Iwaszkiewicz
c59498be9c
[PyOV] Allow single inputs in form of lists of simple types (#21734) 2023-12-18 22:14:31 +01:00
Sergey Shlyapnikov
0e496fa0b5
[GPU] Include Select node to shape_of sub-graph and add CPU impl to avoid runtime synchronizations (#21738) 2023-12-18 19:52:11 +00:00
Wanglei Shen
d5b80062d9
support E3950 in MT 2.0 on both Linux and Windows (#21735) 2023-12-18 20:42:22 +01:00
Gorokhov Dmitriy
058d488083
[CPU] FullyConnected acceleration with 4bit weights decompression on SPR (#21723) 2023-12-18 23:16:45 +04:00
Maksim Kutakov
18a9c772b4
[CPU] Plugin optimizations for LLM beam search via states (#21642) 2023-12-18 23:16:32 +04:00
Eddy Kim
01967bdbff
[GPU] Adding a switch to change the caching policy (#21443)
* added a new property to enable full model caching

* added a new public property cache_mode

* updated according to clang-format guide

* changed default caching policy to optimize_speed
2023-12-18 17:50:33 +00:00
Eddy Kim
63e08f9965
[GPU] adding support for weights compression to gather (#21711)
* 1st version

* fixed style check error

* added unit tests

* removed unnecessary comments

* updated hash to include decompression type

* applied code reviews

* applied code reviews

* fixed unit tests
2023-12-18 17:46:11 +00:00
Mateusz Tabaka
d03dc4fa2a
Add LSTMCellFusion transformation (#21594)
* Add LSTMCellFusion transformation

Partially fixes: CVS-125605

* code style

* fix accuracy issue

* add headers
2023-12-18 18:35:25 +04:00
Mateusz Mikolajczyk
ec8765f950
Fix L to not broadcast it (#21724) 2023-12-18 17:31:56 +04:00
Sergey Shlyapnikov
032ac898e2
[GPU] Apply is_non_decompression_multiply() callback only for compressed models (#21719) 2023-12-18 12:20:20 +00:00
Mateusz Mikolajczyk
98e8caad79
[PyAPI][Opset13] ScaledDotProductAttention-13 PyAPI improvements + tests (#21422)
* Improve ScaledDotProductAttention PyAPI

* Add tmp tests

* Improve PyAPI & add tests

* Fix test issue

* Fix pytest type

* Update src/bindings/python/tests/test_graph/test_scaled_dot_product_attention.py

Co-authored-by: Anastasia Kuporosova <anastasia.kuporosova@intel.com>

* Improve const creation

---------

Co-authored-by: Anastasia Kuporosova <anastasia.kuporosova@intel.com>
2023-12-18 12:35:14 +01:00
Piotr Krzemiński
8ff25cf4a2
[SPEC] Update ScaledDotProductAttention specification for broadcasting of batch dimensions (#21714)
* [SPEC] Update spec for broadcasting of batch dimensions, improve clarity

* [SPEC] Update code block style, comments in xml, attention_mask details
2023-12-18 12:19:06 +01:00
Pawel Raasz
a6f3c7d1b2
Use dynamic dim in pattern if only one valid bound (#21694) 2023-12-18 15:02:01 +04:00
Mateusz Mikolajczyk
a1e296eaa8
[TF FE][Opset13] Enable Multinomial operator in TF frontend (#20646)
* Enable Multinomial operator in TF frontend

* Implement requested changes

* Update tests/layer_tests/tensorflow_tests/test_tf_Multinomial.py

Co-authored-by: Roman Kazantsev <roman.kazantsev@intel.com>

* Align with CPU implementation

---------

Co-authored-by: Roman Kazantsev <roman.kazantsev@intel.com>
2023-12-18 12:00:04 +01:00
Tomasz Jankowski
a29843013d
[Ref] Fix ScatterElementsUpdate lowest value bounding (#21669)
* Use lowest for value bounding

* Add test for lowest on float
2023-12-18 11:57:01 +01:00
Nikolay Shchegolev
554e7d6730
[CPU] Dynamically shaped model with string tensor output fails (#21713) 2023-12-18 14:44:36 +04:00
Mateusz Mikolajczyk
be476519fb
[PT FE] ScaledDotProductAttention fix issue with other float dtypes (#21638)
* Fix scale input

* Improve torch scaled dot tests
2023-12-18 14:32:54 +04:00
Ivan Tikhonov
e224e4f4f0
Introduce OriginalPrecision attribute (#21696)
* Introduce OriginalPrecision attribute

* resolve review comments

---------

Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>
2023-12-18 14:10:46 +04:00
Georgy Krivoruchko
62c9469f52
[ONNX] Reflected Shell-scripts to run on Windows (#18672)
* Reflected Shell-scripts to run on Windows

* Fixes

* Removed script from tests_compatibility
2023-12-18 13:51:20 +04:00
Eddy Kim
35dafce513
[GPU] move convert after gather to reduce memory usage (#21616) 2023-12-18 13:34:46 +04:00
Mateusz Mikolajczyk
610e0fab5c
[Opset13] ScaledDotProductAttention-13 input validation improvements (#21281)
* Improve test naming to match op

* Initial draft for scaleddot shape infer

* Fix formatting

* Improve shape & type infer

* Add shape tests

* Fix issues in shape infer

* Fix type infer

* Improve type_prop tests

* Ignore attention when causal

* Fix template type shape infer

* Fix issues with scalar inputs + test improvement

* Fix `get_node_input_partial_shapes`

* Improve shape/type validation and tests

* Allow for broadcastable inputs

* Improve tests

* Add CPU shape infer test

* Add broadcast shape infer test

* Use const

* Use const &

* Improve query input handling

* Fix test issues

* Change broadcast rules

---------

Co-authored-by: Michal Lukaszewski <michal.lukaszewski@intel.com>
2023-12-18 10:30:54 +01:00
Wang Wangwang
3a2958b360
[GPU] Fix convert fc to compressed when the input is SubGraphOp (#21687) 2023-12-18 13:29:16 +04:00
Vitaliy Urusovskij
e71707b7d6
Remove extra includes (#21699)
Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>
2023-12-18 13:20:54 +04:00
Irina Efode
d613bb0e65
[CPU][ARM] Fix build CPU ARM plugin (#21722) 2023-12-18 13:20:33 +04:00
Piotr Krzemiński
acb3f5ef28
[Core][Tests] Add attribute tests and decomposition tests for Scaled Dot Product Attention (#21356)
* [CORE][TESTS] Add attribute tests and decomposition tests for scaled dot product attention

* Update batch_norm_decomposition_test.cpp

* Clang fix

* Update scaled_dot_product_decomposition_test.cpp

* [TESTS] Fix CIs with rename and clang fix

* [TESTS] Fix decomposition for CIs

* [TESTS] Remove constant mode

* [TESTS] Fix visitor test for CI

* [Tests] Add suggested broadcast tests
2023-12-18 10:18:48 +01:00
Jan Iwaszkiewicz
57b1916f8d
[PyOV] Remove restrictions on opset13.constant to align with opset1-12 (#21674) 2023-12-18 10:02:43 +01:00
Sebastian Golebiewski
a9a935ee6e
[DOCS] Fix notebooks sections in menu (#21695)
* Fix notebooks sections in menu

* prepend commands in cmake
2023-12-18 13:01:22 +04:00
Mateusz Mikolajczyk
1783aad48d
[PT FE][Opset13] Add translation for torchscript aten::multinomial operator (#20397)
* Enable aten::multinomial in PT FE

* Add aten::multinomial test

* Fix typo

* Update tests/layer_tests/pytorch_tests/test_multinomial.py

Co-authored-by: Roman Kazantsev <roman.kazantsev@intel.com>

* Update multinomial to changes in specification

* rm ;

* Add temporary convert for multinomial i64 support

* Revert convert

* Fix torch test

---------

Co-authored-by: Roman Kazantsev <roman.kazantsev@intel.com>
2023-12-18 09:54:06 +01:00