13769 Commits

Author SHA1 Message Date
River Li
dc64268564 Remove ov::hint::PerformanceMode::UNDEFINED (#21592)
* Remove ov::hint::PerformanceMode::UNDEFINED

* Update for reviewer comments and build issue

* Fix build error - may be used uninitialized

* Update

---------

Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>
2023-12-20 21:15:26 +04:00
Alina Kladieva
3d3bb51de9 [GHA] Fix smart ci for docs changes in linux arm64 (#21798)
Update linux_arm64.yml
2023-12-20 21:13:17 +04:00
Alina Kladieva
2b950a65b3 [GHA] Add links to Smart CI doc to GHA overview. Align folder structure (#21794)
Update custom_actions.md
2023-12-20 19:53:48 +04:00
yanlan song
15e43c6f5b support user dynamism output buffer (#21647)
* test gpu user output buffer dynamism

Signed-off-by: fishbell <bell.song@intel.com>

* enable test

Signed-off-by: fishbell <bell.song@intel.com>

* check valid pointer

Signed-off-by: fishbell <bell.song@intel.com>

* update case

Signed-off-by: fishbell <bell.song@intel.com>

* cpplint

Signed-off-by: fishbell <bell.song@intel.com>

---------

Signed-off-by: fishbell <bell.song@intel.com>
2023-12-20 19:40:16 +04:00
Vitaliy Urusovskij
8384279b2e Preprocessing tests to new utils (#21752) 2023-12-20 15:28:06 +04:00
Sebastian Golebiewski
c95a34ea15 Updating thumbnails for notebooks (#21781) 2023-12-20 11:16:42 +01:00
Ilya Lavrenov
139242d47d Delete old Azure pipelines (#21771) 2023-12-20 12:28:08 +04:00
Ilya Lavrenov
119dc95de6 Revert "[op conformance] Made fixes to allign with accuracy validation (#21347)" (#21778)
This reverts commit 8f13219728.
2023-12-20 11:41:27 +04:00
Ilya Lavrenov
f6cd2150c2 Disable JS for on all Linux platforms with gcc < 7 (#21777) 2023-12-20 11:39:46 +04:00
Ilya Lavrenov
3a6a62558a Disable system TBB on Linux arm64 (#21769)
* Disable system TBB on Linux arm64

* Update cmake/features.cmake
2023-12-20 10:53:49 +04:00
Sergey Shlyapnikov
51eab3c931 [GPU] Revert transformation changes (#21749)
* Revert "[GPU] Apply is_non_decompression_multiply() callback only for compressed models (#21719)"

This reverts commit 032ac898e2.

* [GPU] Apply MarkDequantizationSubgraph callback function only to compressed models
2023-12-20 03:46:09 +00:00
Vishniakov Nikolai
8d704f6400 [JS OV] Node.js API (#19848)
Co-authored-by: almilosz <alicja.miloszewska@intel.com>
Co-authored-by: Andrei Kashchikhin <andrey.kashchikhin@intel.com>
Co-authored-by: yatarkan <yaroslav.tarkan@intel.com>
Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>
2023-12-20 00:19:34 +00:00
Georgy Krivoruchko
2779df5800 [ONNX] Refactoring tests on API 2.0 (#21748)
* Test in onnx_import_dyn_shapes.in.cpp moved to API 2.0

* Test in onnx_import_exception.cpp moved to API 2.0

* Tests in onnx_import_org_openvino.in.cpp moved to API 2.0

* Tests in onnx_import_org_pytorch.in.cpp moved to API 2.0

* Fixed code style
2023-12-19 22:49:44 +01:00
Vitaliy Urusovskij
e6ab01c681 Shared onnx tests to API2.0 (#21726) 2023-12-20 01:03:33 +04:00
Sofya Balandina
5b776e9f1f [apiConformance] Fix checks for finding logs in analyze_confomance_fails (#21767) 2023-12-20 00:52:08 +04:00
Ilya Lavrenov
662aa57a92 Downgrade GHA for upload / download back to @v3 (#21770) 2023-12-20 00:48:31 +04:00
Ilya Lavrenov
dc79c41fb8 Updated xbyak_aarch64 in onednn (#21762) 2023-12-20 00:13:17 +04:00
Jan Iwaszkiewicz
712e374519 [PyOV] Allow replacing of invalid bytes in string-based data (#21761)
* [PyOV] Allow replacing of invalid bytes in string-based data

* Fix linter
2023-12-19 23:25:52 +04:00
Georgy Krivoruchko
27c9ca749d [ONNX] Switched to ONNX 1.15.0 (#20929)
* [ONNX] Switched to ONNX 1.15.0

* Updated usage of ONNX serialization API

* Updated xfailed lists

* Updated xfailed lists

* Added xfailed ML-operators

* Added xfailed float8 cases

* Fixing mo tests

* Reverted legacy tests

* Sets default opset version in layer tests

* Skipped test which breaks moving on 1.15

* Updated xfailed list

* Fixed typo and added more tests to skipped

* Added xfailed tests

* Reduced opset due to need to work on onnx 1.13

* Removed unnecessary xfails

* Skipped misaligned tests

* WA for old compiler build
2023-12-19 18:55:32 +00:00
Alina Kladieva
224eb79ab6 [GHA] Prepare new required stages for precommit (#21766)
* Add Smart CI support to webassembly workflow

* Add ONNX_Models to required checks list
2023-12-19 22:52:12 +04:00
River Li
7b1074b275 Generalize model cache reusing (#21492)
* Generalize model cache reusing

* Use .as<std::AnyMap>() to convert std::string to std::AnyMap

* Store driver version for different devices

* Use driver version as gpu compiled model runtime properties
2023-12-19 16:50:05 +01:00
Vladimir Paramuzov
294cc6668c [GPU] KVCache custom op and fusion pass (#21689) 2023-12-19 17:56:25 +04:00
Yaroslav Torziuk
6719133af2 [GPU] Fix accuracy problems brought by permute_f_y kernel (#20241)
* Add test cases from Levit model

* Enable PermuteKernel_f_y_axes kernel

* Fix permute of y and b for planar layout where x is not 1

* Enable fusing in permute_kernel_f_y_axes

* Add tests for fusing in permute_kernel_f_y_axes

---------

Co-authored-by: Wilson Seok <wilson.seok@intel.com>
2023-12-19 13:50:32 +01:00
Nikolay Shchegolev
6246fb0c23 [CPU] StringMemory for the output tensor in the InferRequest (#21746) 2023-12-19 15:20:15 +04:00
Roman Lyamin
6060683f4c [GPU] Fix for is_user_cpu() function (#21744) 2023-12-19 10:48:13 +01:00
Sofya Balandina
8f13219728 [op conformance] Made fixes to allign with accuracy validation (#21347) 2023-12-19 13:43:40 +04:00
Xiping Yan
a3a3f0b55c [CPU] Fix chatglmv2 accuracy issue and crash with input 1024 token in SPR (#21360) 2023-12-19 13:13:35 +04:00
Pawel Raasz
63cb89fe1b Use move instead copy (#21732) 2023-12-19 12:43:28 +04:00
Maciej Smyk
e94feea5c7 [DOCS] Update of graphs in repo (#21737)
* csv files

* Update GNA.rst

* csv files

* csv files

* csv

* csv

* removal of duplicates

* duplicates removal

* Delete small_IR_graph_demonstration.png

* moving graph files to static folder

* Update LowPrecisionModelRepresentation.rst

* update
2023-12-19 09:20:24 +01:00
Anastasiia Pnevskaia
3c157b2a1b [TF FE] [Telemetry] Added error_info event to TF FE. (#21736)
* Added error_info event to TF FE.
2023-12-19 12:14:36 +04:00
Pawel Raasz
499a9694e5 Fix Coverity CID 1529136, 1529124 (#21733)
* CID 1529136, fix auto makes a copy of PartialShape
- minor updates to reduce redundant Rank object creation

* CID 1529124 fix copy caused by auto
2023-12-19 07:06:33 +00:00
rghvsh
a4e8f9d98c [ONNX] Extend ONNX Frontend with BitwiseAnd-18 operator (#21585) 2023-12-19 09:54:55 +04:00
Vladimir Paramuzov
63b23a1ebb [GPU] Enable state precision conversion to infer precision (#21729) 2023-12-19 09:36:03 +04:00
Min, Byungil
0e92469330 [GPU] bugfix slice mem calculation for batch iteration (#21716)
Signed-off-by: Min, Byungil <byungil.min@intel.com>
2023-12-19 14:20:38 +09:00
Taylor Yeonbok Lee
b770780b89 [GPU] Skip redundant gather in stateful model (#21681)
* Skip redundant gather in stateful model

* Fix memory reuse issue for node skipped at runtime.
If the node is not marked as can_be_optimized at build time, memory dep is not properly applied
=> So it can cause the wrong memory reuse
2023-12-19 00:00:39 +00:00
Jan Iwaszkiewicz
c59498be9c [PyOV] Allow single inputs in form of lists of simple types (#21734) 2023-12-18 22:14:31 +01:00
Sergey Shlyapnikov
0e496fa0b5 [GPU] Include Select node to shape_of sub-graph and add CPU impl to avoid runtime synchronizations (#21738) 2023-12-18 19:52:11 +00:00
Wanglei Shen
d5b80062d9 support E3950 in MT 2.0 on both Linux and Windows (#21735) 2023-12-18 20:42:22 +01:00
Gorokhov Dmitriy
058d488083 [CPU] FullyConnected acceleration with 4bit weights decompression on SPR (#21723) 2023-12-18 23:16:45 +04:00
Maksim Kutakov
18a9c772b4 [CPU] Plugin optimizations for LLM beam search via states (#21642) 2023-12-18 23:16:32 +04:00
Eddy Kim
01967bdbff [GPU] Adding a switch to change the caching policy (#21443)
* added a new property to enable full model caching

* added a new public property cache_mode

* updated according to clang-format guide

* changed default caching policy to optimize_speed
2023-12-18 17:50:33 +00:00
Eddy Kim
63e08f9965 [GPU] adding support for weights compression to gather (#21711)
* 1st version

* fixed style check error

* added unit tests

* removed unnecessary comments

* updated hash to include decompression type

* applied code reviews

* applied code reviews

* fixed unit tests
2023-12-18 17:46:11 +00:00
Mateusz Tabaka
d03dc4fa2a Add LSTMCellFusion transformation (#21594)
* Add LSTMCellFusion transformation

Partially fixes: CVS-125605

* code style

* fix accuracy issue

* add headers
2023-12-18 18:35:25 +04:00
Mateusz Mikolajczyk
ec8765f950 Fix L to not broadcast it (#21724) 2023-12-18 17:31:56 +04:00
Sergey Shlyapnikov
032ac898e2 [GPU] Apply is_non_decompression_multiply() callback only for compressed models (#21719) 2023-12-18 12:20:20 +00:00
Mateusz Mikolajczyk
98e8caad79 [PyAPI][Opset13] ScaledDotProductAttention-13 PyAPI improvements + tests (#21422)
* Improve ScaledDotProductAttention PyAPI

* Add tmp tests

* Improve PyAPI & add tests

* Fix test issue

* Fix pytest type

* Update src/bindings/python/tests/test_graph/test_scaled_dot_product_attention.py

Co-authored-by: Anastasia Kuporosova <anastasia.kuporosova@intel.com>

* Improve const creation

---------

Co-authored-by: Anastasia Kuporosova <anastasia.kuporosova@intel.com>
2023-12-18 12:35:14 +01:00
Piotr Krzemiński
8ff25cf4a2 [SPEC] Update ScaledDotProductAttention specification for broadcasting of batch dimensions (#21714)
* [SPEC] Update spec for broadcasting of batch dimensions, improve clarity

* [SPEC] Update code block style, comments in xml, attention_mask details
2023-12-18 12:19:06 +01:00
Pawel Raasz
a6f3c7d1b2 Us dynamic dim in pattern if only one valid bound (#21694) 2023-12-18 15:02:01 +04:00
Mateusz Mikolajczyk
a1e296eaa8 [TF FE][Opset13] Enable Multinomial operator in TF frontend (#20646)
* Enable Multinomial operator in TF frontend

* Implement requested changes

* Update tests/layer_tests/tensorflow_tests/test_tf_Multinomial.py

Co-authored-by: Roman Kazantsev <roman.kazantsev@intel.com>

* Align with CPU implementation

---------

Co-authored-by: Roman Kazantsev <roman.kazantsev@intel.com>
2023-12-18 12:00:04 +01:00
Tomasz Jankowski
a29843013d [Ref] Fix ScatterElementsUpdate lowest value bounding (#21669)
* Use lowest for value bounding

* Add test for lowest on float
2023-12-18 11:57:01 +01:00