River Li
dc64268564
Remove ov::hint::PerformanceMode::UNDEFINED ( #21592 )
...
* Remove ov::hint::PerformanceMode::UNDEFINED
* Update for reviewer comments and build issue
* Fix build error - may be used uninitialized
* Update
---------
Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com >
2023-12-20 21:15:26 +04:00
Alina Kladieva
3d3bb51de9
[GHA] Fix smart ci for docs changes in linux arm64 ( #21798 )
...
Update linux_arm64.yml
2023-12-20 21:13:17 +04:00
Alina Kladieva
2b950a65b3
[GHA] Add links to Smart CI doc to GHA overview. Align folder structure ( #21794 )
...
Update custom_actions.md
2023-12-20 19:53:48 +04:00
yanlan song
15e43c6f5b
support user dynamism output buffer ( #21647 )
...
* test gpu user output buffer dynamism
Signed-off-by: fishbell <bell.song@intel.com >
* enable test
Signed-off-by: fishbell <bell.song@intel.com >
* check valid pointer
Signed-off-by: fishbell <bell.song@intel.com >
* update case
Signed-off-by: fishbell <bell.song@intel.com >
* cpplint
Signed-off-by: fishbell <bell.song@intel.com >
---------
Signed-off-by: fishbell <bell.song@intel.com >
2023-12-20 19:40:16 +04:00
Vitaliy Urusovskij
8384279b2e
Preprocessing tests to new utils ( #21752 )
2023-12-20 15:28:06 +04:00
Sebastian Golebiewski
c95a34ea15
Updating thumbnails for notebooks ( #21781 )
2023-12-20 11:16:42 +01:00
Ilya Lavrenov
139242d47d
Delete old Azure pipelines ( #21771 )
2023-12-20 12:28:08 +04:00
Ilya Lavrenov
119dc95de6
Revert "[op conformance] Made fixes to allign with accuracy validation ( #21347 )" ( #21778 )
...
This reverts commit 8f13219728 .
2023-12-20 11:41:27 +04:00
Ilya Lavrenov
f6cd2150c2
Disable JS for on all Linux platforms with gcc < 7 ( #21777 )
2023-12-20 11:39:46 +04:00
Ilya Lavrenov
3a6a62558a
Disable system TBB on Linux arm64 ( #21769 )
...
* Disable system TBB on Linux arm64
* Update cmake/features.cmake
2023-12-20 10:53:49 +04:00
Sergey Shlyapnikov
51eab3c931
[GPU] Revert transformation changes ( #21749 )
...
* Revert "[GPU] Apply is_non_decompression_multiply() callback only for compressed models (#21719 )"
This reverts commit 032ac898e2 .
* [GPU] Apply MarkDequantizationSubgraph callback function only to compressed models
2023-12-20 03:46:09 +00:00
Vishniakov Nikolai
8d704f6400
[JS OV] Node.js API ( #19848 )
...
Co-authored-by: almilosz <alicja.miloszewska@intel.com >
Co-authored-by: Andrei Kashchikhin <andrey.kashchikhin@intel.com >
Co-authored-by: yatarkan <yaroslav.tarkan@intel.com >
Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com >
2023-12-20 00:19:34 +00:00
Georgy Krivoruchko
2779df5800
[ONNX] Refactoring tests on API 2.0 ( #21748 )
...
* Test in onnx_import_dyn_shapes.in.cpp moved to API 2.0
* Test in onnx_import_exception.cpp moved to API 2.0
* Tests in onnx_import_org_openvino.in.cpp moved to API 2.0
* Tests in onnx_import_org_pytorch.in.cpp moved to API 2.0
* Fixed code style
2023-12-19 22:49:44 +01:00
Vitaliy Urusovskij
e6ab01c681
Shared onnx tests to API2.0 ( #21726 )
2023-12-20 01:03:33 +04:00
Sofya Balandina
5b776e9f1f
[apiConformance] Fix checks for finding logs in analyze_confomance_fails ( #21767 )
2023-12-20 00:52:08 +04:00
Ilya Lavrenov
662aa57a92
Downgrade GHA for upload / download back to @v3 ( #21770 )
2023-12-20 00:48:31 +04:00
Ilya Lavrenov
dc79c41fb8
Updated xbyak_aarch64 in onednn ( #21762 )
2023-12-20 00:13:17 +04:00
Jan Iwaszkiewicz
712e374519
[PyOV] Allow replacing of invalid bytes in string-based data ( #21761 )
...
* [PyOV] Allow replacing of invalid bytes in string-based data
* Fix linter
2023-12-19 23:25:52 +04:00
Georgy Krivoruchko
27c9ca749d
[ONNX] Switched to ONNX 1.15.0 ( #20929 )
...
* [ONNX] Switched to ONNX 1.15.0
* Updated usage of ONNX serialization API
* Updated xfailed lists
* Updated xfailed lists
* Added xfailed ML-operators
* Added xfailed float8 cases
* Fixing mo tests
* Reverted legacy tests
* Sets default opset version in layer tests
* Skipped test which breaks moving on 1.15
* Updated xfailed list
* Fixed typo and added more tests to skipped
* Added xfailed tests
* Reduced opset due to need to work on onnx 1.13
* Removed unnecessary xfails
* Skipped misaligned tests
* WA for old compiler build
2023-12-19 18:55:32 +00:00
Alina Kladieva
224eb79ab6
[GHA] Prepare new required stages for precommit ( #21766 )
...
* Add Smart CI support to webassembly workflow
* Add ONNX_Models to required checks list
2023-12-19 22:52:12 +04:00
River Li
7b1074b275
Generalize model cache reusing ( #21492 )
...
* Generalize model cache reusing
* Use .as<std::AnyMap>() to convert std::string to std::AnyMap
* Store driver version for different devices
* Use driver version as gpu compiled model runtime properties
2023-12-19 16:50:05 +01:00
Vladimir Paramuzov
294cc6668c
[GPU] KVCache custom op and fusion pass ( #21689 )
2023-12-19 17:56:25 +04:00
Yaroslav Torziuk
6719133af2
[GPU] Fix accuracy problems brought by permute_f_y kernel ( #20241 )
...
* Add test cases from Levit model
* Enable PermuteKernel_f_y_axes kernel
* Fix permute of y and b for planar layout where x is not 1
* Enable fusing in permute_kernel_f_y_axes
* Add tests for fusing in permute_kernel_f_y_axes
---------
Co-authored-by: Wilson Seok <wilson.seok@intel.com >
2023-12-19 13:50:32 +01:00
Nikolay Shchegolev
6246fb0c23
[CPU] StringMemory for the output tensor in the InferRequest ( #21746 )
2023-12-19 15:20:15 +04:00
Roman Lyamin
6060683f4c
[GPU] Fix for is_user_cpu() function ( #21744 )
2023-12-19 10:48:13 +01:00
Sofya Balandina
8f13219728
[op conformance] Made fixes to allign with accuracy validation ( #21347 )
2023-12-19 13:43:40 +04:00
Xiping Yan
a3a3f0b55c
[CPU] Fix chatglmv2 accuracy issue and crash with input 1024 token in SPR ( #21360 )
2023-12-19 13:13:35 +04:00
Pawel Raasz
63cb89fe1b
Use move instead copy ( #21732 )
2023-12-19 12:43:28 +04:00
Maciej Smyk
e94feea5c7
[DOCS] Update of graphs in repo ( #21737 )
...
* csv files
* Update GNA.rst
* csv files
* csv files
* csv
* csv
* removal of duplicates
* duplicates removal
* Delete small_IR_graph_demonstration.png
* moving graph files to static folder
* Update LowPrecisionModelRepresentation.rst
* update
2023-12-19 09:20:24 +01:00
Anastasiia Pnevskaia
3c157b2a1b
[TF FE] [Telemetry] Added error_info event to TF FE. ( #21736 )
...
* Added error_info event to TF FE.
2023-12-19 12:14:36 +04:00
Pawel Raasz
499a9694e5
Fix Coverity CID 1529136, 1529124 ( #21733 )
...
* CID 1529136, fix auto makes a copy of PartialShape
- minor updates to reduce redundant Rank object creation
* CID 1529124 fix copy caused by auto
2023-12-19 07:06:33 +00:00
rghvsh
a4e8f9d98c
[ONNX] Extend ONNX Frontend with BitwiseAnd-18 operator ( #21585 )
2023-12-19 09:54:55 +04:00
Vladimir Paramuzov
63b23a1ebb
[GPU] Enable state precision conversion to infer precision ( #21729 )
2023-12-19 09:36:03 +04:00
Min, Byungil
0e92469330
[GPU] bugfix slice mem calculation for batch iteration ( #21716 )
...
Signed-off-by: Min, Byungil <byungil.min@intel.com >
2023-12-19 14:20:38 +09:00
Taylor Yeonbok Lee
b770780b89
[GPU] Skip redundant gather in stateful model ( #21681 )
...
* Skip redundant gather in stateful model
* Fix memory reuse issue for node skipped at runtime.
If the node is not marked as can_be_optimized at build time, memory dep is not properly applied
=> So it can cause the wrong memory reuse
2023-12-19 00:00:39 +00:00
Jan Iwaszkiewicz
c59498be9c
[PyOV] Allow single inputs in form of lists of simple types ( #21734 )
2023-12-18 22:14:31 +01:00
Sergey Shlyapnikov
0e496fa0b5
[GPU] Include Select node to shape_of sub-graph and add CPU impl to avoid runtime synchronizations ( #21738 )
2023-12-18 19:52:11 +00:00
Wanglei Shen
d5b80062d9
support E3950 in MT 2.0 on both Linux and Windows ( #21735 )
2023-12-18 20:42:22 +01:00
Gorokhov Dmitriy
058d488083
[CPU] FullyConnected acceleration with 4bit weights decompression on SPR ( #21723 )
2023-12-18 23:16:45 +04:00
Maksim Kutakov
18a9c772b4
[CPU] Plugin optimizations for LLM beam search via states ( #21642 )
2023-12-18 23:16:32 +04:00
Eddy Kim
01967bdbff
[GPU] Adding a switch to change the caching policy ( #21443 )
...
* added a new property to enable full model caching
* added a new public property cache_mode
* updated according to clang-format guide
* changed default caching policy to optimize_speed
2023-12-18 17:50:33 +00:00
Eddy Kim
63e08f9965
[GPU] adding support for weights compression to gather ( #21711 )
...
* 1st version
* fixed style check error
* added unit tests
* removed unnecessary comments
* updated hash to include decompression type
* applied code reviews
* applied code reviews
* fixed unit tests
2023-12-18 17:46:11 +00:00
Mateusz Tabaka
d03dc4fa2a
Add LSTMCellFusion transformation ( #21594 )
...
* Add LSTMCellFusion transformation
Partially fixes: CVS-125605
* code style
* fix accuracy issue
* add headers
2023-12-18 18:35:25 +04:00
Mateusz Mikolajczyk
ec8765f950
Fix L to not broadcast it ( #21724 )
2023-12-18 17:31:56 +04:00
Sergey Shlyapnikov
032ac898e2
[GPU] Apply is_non_decompression_multiply() callback only for compressed models ( #21719 )
2023-12-18 12:20:20 +00:00
Mateusz Mikolajczyk
98e8caad79
[PyAPI][Opset13] ScaledDotProductAttention-13 PyAPI improvements + tests ( #21422 )
...
* Improve ScaledDotProductAttention PyAPI
* Add tmp tests
* Improve PyAPI & add tests
* Fix test issue
* Fix pytest type
* Update src/bindings/python/tests/test_graph/test_scaled_dot_product_attention.py
Co-authored-by: Anastasia Kuporosova <anastasia.kuporosova@intel.com >
* Improve const creation
---------
Co-authored-by: Anastasia Kuporosova <anastasia.kuporosova@intel.com >
2023-12-18 12:35:14 +01:00
Piotr Krzemiński
8ff25cf4a2
[SPEC] Update ScaledDotProductAttention specification for broadcasting of batch dimensions ( #21714 )
...
* [SPEC] Update spec for broadcasting of batch dimensions, improve clarity
* [SPEC] Update code block style, comments in xml, attention_mask details
2023-12-18 12:19:06 +01:00
Pawel Raasz
a6f3c7d1b2
Us dynamic dim in pattern if only one valid bound ( #21694 )
2023-12-18 15:02:01 +04:00
Mateusz Mikolajczyk
a1e296eaa8
[TF FE][Opset13] Enable Multinomial operator in TF frontend ( #20646 )
...
* Enable Multinomial operator in TF frontend
* Implement requested changes
* Update tests/layer_tests/tensorflow_tests/test_tf_Multinomial.py
Co-authored-by: Roman Kazantsev <roman.kazantsev@intel.com >
* Align with CPU implementation
---------
Co-authored-by: Roman Kazantsev <roman.kazantsev@intel.com >
2023-12-18 12:00:04 +01:00
Tomasz Jankowski
a29843013d
[Ref] Fix ScatterElementsUpdate lowest value bounding ( #21669 )
...
* Use lowest for value bounding
* Add test for lowest on float
2023-12-18 11:57:01 +01:00