Commit Graph

906 Commits

Author SHA1 Message Date
Ilya Churaev
2a344b66cb Update to the latest cmake for Windows (#13604)
* Update to the latest cmake for Windows

* Skip new warnings

* Added compiler flags for old cmake

* Disabled some warnings

* Suppress new warnings

* Fixed some errors

* Disable some warning for lpt

* Fix some warnings

* Disable new warnings

* Fix some warnings

* Disabled Werror for arm

* Fixed some errors

* Fixed some new warnings

* Fixed auto plugin

* Fixed Windows warning

* Fixed Windows

* Disable some warnings

* Some changes

* Small test fixes

* Fixed python build

* Added skip warnings for legacy python API
2022-10-25 18:11:46 +04:00
Oleksii Khovan
1960746c44 [GPU] ScatterElementsUpdate blocked layout support (#12465)
* add parameterized test
* add blocked layouts support
* support for mixed input formats
* fix RHEL 8.2 build
* add scatter_elements_update to whitelist for blocked formats
* Added bs_fs_yx_bsv16_fsv32 format
2022-10-25 08:22:27 +03:00
Mingyu Kim
0e242b3244 [GPU] update onednn to latest 2.7 (#13518) 2022-10-25 12:21:13 +09:00
Sun Xiaoxia
ec14dd3523 Hybrid-aware P/E Core optimization (#13435)
* add HybridAware stream setting and core binding

* fix clang format issue

* unified code style, add parameter check

* correct input affinity skip NUMA, modify function name

* remove unnecessary floor

* fix ci compile issue on Mac/windows platform

* modify smoke_SetConfigAffinity test

* modify ov_core_set_and_get_property_enum test, affinity HYBRID_AWARE is changed to NUMA

* remove affinity correcting on this pr

* revert ov_core_test.cpp

* merge function by comments

* fix code style issue

Co-authored-by: Shen, Wanglei <wanglei.shen@intel.com>
2022-10-24 22:26:22 +03:00
Gorokhov Dmitriy
086bc00d4c [CPU] Fixed EmbeddingSegmentsSum initialization with dynamic num_segments (#13372) 2022-10-24 18:34:25 +03:00
Gorokhov Dmitriy
9f40eb7196 [CPU] Fixed integer compute for arithmetic operations (#13556) 2022-10-24 15:54:52 +04:00
Min, Byungil
4188f1f181 [GPU] Update for layout query (#13346)
+ support multiple input and output target formats
+ implement generic logic for selection formats and find_data_format
+ Add TCs for select_preferred_formats

Signed-off-by: Min, Byungil <byungil.min@intel.com>
2022-10-24 15:27:09 +09:00
Felix Dohyun Kim
385d87edaf [GPU][DG2] Fix output scale post-op condition (#13567)
* fix bug in oscale post-op condition
2022-10-24 10:17:00 +09:00
Roman Lyamin
478939ea9e [GPU] GEMM dynamic (#13248) 2022-10-21 16:19:35 +04:00
Ilya Churaev
85d78a3ef0 Build code snippets together with doc (#13541)
* Build code snippets together with doc

* Added separate step for code snippets

* Try to enable CI step

* Disable myriad

* Try to fix Windows

* Detect number of cores

* Reduced build time of code snippets
2022-10-21 06:29:19 +03:00
Oleksii Khovan
2f982b9490 [GPU] reorg_yolo blocked layouts support (#12463)
* add unit test for reorg_yolo
* add validation to reorg_yolo kernel
* add blocked formats support
* remove non-working yxfb optimization
* add reorg_yolo to whitelist for blocked formats
2022-10-20 19:35:43 +03:00
Pavel Durandin
dfd5b461f1 [GPU] Fix gpu unit and functional tests on windows (#13560) 2022-10-20 09:56:29 +04:00
OlehKravchyshyn
3500d75187 [GPU] Add supporting blocked format for reverse operation (#12032) 2022-10-20 14:18:00 +09:00
hyunback kim
71f3c8c786 [GPU] Fix brain-tumor-segmentation-002/INT8/onnx running failed (#13274)
reorder_input has an issue when using the generated reorder case.
handle_reshape handled only 4 dims (bfyx), so add support for 5 and 6 dims.

Signed-off-by: hyunback <hyunback.kim@intel.com>
2022-10-20 11:29:53 +09:00
Nadezhda Ageeva
a4655bb6b3 [GNA] Insert identity using ngraph transformation (#13317)
* [GNA] Insert identity using ngraph transformation

* Update src/plugins/intel_gna/src/ops/identity.hpp

Co-authored-by: Szymon Irzabek <szymon.jakub.irzabek@intel.com>

* Update src/plugins/intel_gna/src/transformations/rt_info/gna_precision_change_flag.hpp

Co-authored-by: Szymon Irzabek <szymon.jakub.irzabek@intel.com>

* Update src/plugins/intel_gna/src/transformations/rt_info/gna_precision_change_flag.cpp

Co-authored-by: Szymon Irzabek <szymon.jakub.irzabek@intel.com>

* Update src/plugins/intel_gna/src/transformations/insert_identity_layer.hpp

Co-authored-by: Szymon Irzabek <szymon.jakub.irzabek@intel.com>

* Rewrites pass with Identity insertion using recursive function. Adds test for Split. Adds comments

* Change namespace for element type

Co-authored-by: Szymon Irzabek <szymon.jakub.irzabek@intel.com>
2022-10-19 20:27:55 +03:00
Chen Xu
98dbb91af6 [CPU] Support topk sort for int32 directly (#13448) 2022-10-19 13:38:33 +03:00
Vladislav Golubev
a25c2ba665 [LPT] Precision restrictions extending (#13172)
* [LPT] Precision restrictions extending

* Compilation error fixed

* Markup for SubgraphOps fixed

* cleanup

* Review fixes: removed vector<size_t> wrapper in PrecisionRestriction class

* Review fixes: added comments

* docs compilation fix
2022-10-19 12:29:32 +02:00
Ryszard Jezierski
17e702776b Update GNA library version (#13188)
* Changed GNA library version from 1455.2 to 1815

* fixed new URL to GNA in cmake

* fixed gna lib version in linux CI

* use Gna2ExportTlv from new library

* Updated GNA to 1815.1
2022-10-19 13:40:21 +04:00
Katarzyna Mitrus
3c0b5c7f9b BinaryElementwiseArithmetic - common shape_infer (#13421)
* Use eltwise_shape_infer from shape_inference in validate_and_infer_elementwise_args

* Align Unary ops, remove redundant validate_and_infer_elementwise_args usage

* Add test with default constructor for BinaryElementwiseArithmetic ops

* Style apply

* Fix expected error message

* Add common shape_infer tests for BinaryElementwiseArithmetic ops

* Remove old Add test

* Update NGRAPH_CHECK to OV ASSERT

* Removal of redundant autob param to the validate function

* Tests update
2022-10-19 11:57:16 +03:00
Vladimir Paramuzov
ef12d3976f [GPU] Fixes for infer request impl in dynamic cases (#13050)
* [GPU] Fixes for infer request impl in dynamic cases

* [GPU] Fixed inconsistent output shapes for LSTMSequence op

* [GPU] Update network::get_output_layout method

* [GPU] WA for USM memory allocations with 0 bytes
2022-10-19 09:35:03 +04:00
yanlan song
752ff23582 Bell/remote tensor fix in multi (#13483)
* unify device representations

Signed-off-by: fishbell <bell.song@intel.com>

* fix case failure

Signed-off-by: fishbell <bell.song@intel.com>

* add test cases

Signed-off-by: fishbell <bell.song@intel.com>

* indenting

Signed-off-by: fishbell <bell.song@intel.com>

* fix build error on macos

Signed-off-by: fishbell <bell.song@intel.com>

* keep the original logic: if no device is available, multi will throw an exception

Signed-off-by: fishbell <bell.song@intel.com>

Signed-off-by: fishbell <bell.song@intel.com>
2022-10-19 10:15:36 +08:00
yanlan song
5e2869cd14 support cache dir in multi/auto/batch (#13507)
* support cache dir in multi/auto

Signed-off-by: fishbell <bell.song@intel.com>

* support cache_dir in batch

Signed-off-by: fishbell <bell.song@intel.com>

* fix case failure

Signed-off-by: fishbell <bell.song@intel.com>

* clang format

Signed-off-by: fishbell <bell.song@intel.com>

Signed-off-by: fishbell <bell.song@intel.com>
2022-10-19 10:15:03 +08:00
Vladislav Golubev
2d33f6f603 [CPU] LSTM Sequence test enabled (#13323) 2022-10-18 18:43:03 +04:00
Vladimir Paramuzov
5453965566 [GPU] Refactor kernel selector to make file path a bit shorter (#13419) 2022-10-18 17:41:09 +04:00
Pawel Raasz
4bc36f29f4 Review opset1 concat for shape inference aspects (#13455)
* Add interval dim test to Concat op
- use common test fixture for PartialShape OK scenario

* Use get_shape_labels in Concat tests

* Add label_evaluate test for Concat op

* Add validation util to check no labels in tensor

* Add template shape inference for Concat op

* Move Concat evaluate_label test to evaluate bounds

* Fix cpplint issue
2022-10-18 13:17:21 +04:00
Felix Dohyun Kim
f2bdffb04f [GPU][DG2] Fix some bugs (#13517)
* Bugfix: deconv 2 conv optimization
2022-10-18 15:34:50 +09:00
Sungeun Kim
b21510f9f6 Optimize b16 (#13205)
* optimize models on batch16
* add is_os_yx_isv16_osv8 to format_traits_map
* choose onednn deconv
  - remove checking deconv cross-precision
  - implicit concat could not support byxf.

Co-authored-by: Jade Cho <jade.cho@intel.com>
2022-10-18 13:35:45 +09:00
Ilya Lavrenov
5519fff5c2 Fixed RDFT under Rosetta (#13495) 2022-10-17 15:02:28 +02:00
Andrew Kwangwoong Park
550e59070e [GPU] Baseline primitive, primitive_inst, program_node change for multiple outputs support (#13295)
* Update primitive, primitive_inst, and program_node for multiple outputs support

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Update arg_max_min kernel for multiple outputs support

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Fix failed unittests TCs

- remove duplicate output memory

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Clean up code

Signed-off-by: Andrew Park <andrew.park@intel.com>

* Apply code review

Signed-off-by: Andrew Park <andrew.park@intel.com>

Signed-off-by: Andrew Park <andrew.park@intel.com>
2022-10-17 15:13:19 +09:00
Min, Byungil
f1d816f6aa [GPU] Add planar format to onednn reduction (#13445)
Signed-off-by: Min, Byungil <byungil.min@intel.com>
2022-10-17 15:04:41 +09:00
Ilya Lavrenov
2e336e4bb8 Fixes for brew support on OSX (#13476)
* Fixes for brew support on OSX

* Added new policy

* added quotes

* Used OpenVINODeveloperPackage

* Extra fixes

* Use standard verbose variable

* Fixed pkg-config generation
2022-10-15 00:15:03 +03:00
Pavel Durandin
0a174e6788 [GPU] Fix for windows memory fail (#13489) 2022-10-14 17:31:48 +03:00
Sergey Shlyapnikov
5fea4c3fc3 [GPU] Convert activation's slope buffer data type for OneDNN fusions (#13444) 2022-10-14 13:58:35 +04:00
Sergey Shlyapnikov
0b8f1f8c00 [GPU] Optimize reorder between permute and first convolution (#13459) 2022-10-14 13:57:58 +04:00
Tomasz Jankowski
b424ee2568 [Core] Move op_eval tests into Template plugin (#13001)
* Remove redundant tests

* Remove MatMul op_eval tests

* Remove RoiAlign, RoiPooling op_eval tests

* Move VariadicSplit op_eval tests to TemplatePlugin

* Move Transpose op_eval tests to TemplatePlugin

* Move Interpolate op_eval tests to Template Plugin

* Redo Interpolate_v4 tests

* Adapt to new exception messages
2022-10-14 07:45:28 +03:00
Irina Efode
dafa67cf27 [IE TESTS] Move Set_Device_Name to correct place (#13473) 2022-10-14 06:17:03 +03:00
Mykhailo Hnap
30774036ab [GPU] Implement CTCLoss-4 (#13122) 2022-10-14 11:13:32 +09:00
Irina Efode
78d6f68de4 Infra for device suffix (#13339)
* [IE TESTS] Using Gflags instead of parsing command-line options

* Apply infra for suffix

* remove extra

* Squashed commit of the following:

commit c249f5d581
Author: iefode <irina.efode@intel.com>
Date:   Thu Oct 6 11:56:11 2022 +0400

    Fix default values for command-line argument

commit b2dbb4418c
Author: iefode <irina.efode@intel.com>
Date:   Wed Oct 5 18:09:22 2022 +0400

    rename gflags config

* Fix errors

* Remove gflag using

* Remove gflags

* Update main.cpp

* Update main.cpp
2022-10-13 15:45:03 +04:00
Szymon Irzabek
296b621577 [GNA] Replace fixed quantization code with a more flexible approach (#13237) 2022-10-13 10:21:59 +01:00
Zhang Yi
7d5e0abcaa [CPU]add interaction op with FP32 & BF16 support (#12351) 2022-10-13 11:59:48 +04:00
Luwei Zhou
61acc4a08b [CPU] Add conv per-tensor input zero point. (#12270)
- Enable per-tensor zp on AVX512_VNNI and AVX512_AMX ISA to use BRGCONV kernel.
- Fix brgconv should not be disabled correctly in some INT8 model which has binary postops
- Merge the convolution attribute optimization PR to reduce network load latency
- Only enable conv/group conv S8 input on AMX. Non-AMX keeps the same LPT input precision config as 2022.1.
2022-10-13 10:07:11 +04:00
hyunback kim
383ec09d85 [GPU] Fix endless loop issue in onednn optimization post-opt (#13413)
* [GPU] Fix endless loop issue in onednn optimization post-opt
* Reproduced in yolo_v5s.int8
* Add OV_GPU_DisableOnednnOptPostOps in debug_configuration
2022-10-13 14:47:03 +09:00
Ilya Lavrenov
c2911adf94 Conda-forge fixes (#13392) 2022-10-12 12:16:13 +04:00
Mikhail Ryzhov
ea3ac0d335 [GNA] Rename and move to new place gnaFuncTests, gnaUnitTests (#13280)
* [GNA] Move tests to plugin dir

* reverted temporary fix

* Added linux specific compiler flags

Co-authored-by: Adam Tumialis <adam.tumialis@intel.com>
2022-10-12 09:33:42 +02:00
Katarzyna Mitrus
844b0a2682 [shape_infer] Shape inference function for GRU/AUGRUSequence operations (#12982)
* shape_infer function for gru_sequence

* Reuse gru sequence shape infer for augru shape infer

* link ov_shape_inference in transformations

* Move attention input validation to shape_infer

* Update gru_sequence type prop tests

* type prop tests update

* gru_sequence shape_infer tests

* Update merged dimensions init

* Enable GRUSequence shape_infer in cpu

* Comments update

* Align error messages

* Change constexpr int to size_t in tests

* Apply types refactor

* Remove set_input_is_relevant_to_shape from gru/augru_sequence

* Style apply

* revert constexpr for result_et

* Uninitialize valid_num_directions variable
2022-10-12 10:35:54 +04:00
Wang, Yang
6bd099917a [AUTO plugin] Enable benchmark setting performance hint with none value to hardware device through AUTO/MULTI plugin (#13063)
* Set 'UNDEFINED' as the performancehint default value. Update benchmark app to pass performance hint with none value to MULTI/AUTO plugin.

Signed-off-by: Wang, Yang <yang4.wang@intel.com>

* Using 'UNDEFINED' as the default value of hint in AUTO/MULTI plugin.

Signed-off-by: Wang, Yang <yang4.wang@intel.com>

* Adding a flag instead of adding a default value of hint to check if user configures the performance hint to AUTO/MULTI plugin.

Signed-off-by: Wang, Yang <yang4.wang@intel.com>

* Update benchmark python version.

Signed-off-by: Wang, Yang <yang4.wang@intel.com>

Signed-off-by: Wang, Yang <yang4.wang@intel.com>
2022-10-12 10:03:17 +08:00
Roman Lyamin
9c6ad77852 [GPU] FullyConnected dynamic (#13015)
* [GPU] FullyConnected dynamic

* [GPU] Fix FC OneDNN usage
2022-10-11 16:51:18 +04:00
Oleksandr Zhydkov
2d4d80a444 [GPU] Add support class agnostic box regression for experimental detectron … (#13003)
* add support class agnostic box regression for experimental detectron detection output

* add test
2022-10-11 21:31:47 +09:00
Oleksii Khovan
66b5f9acf2 [GPU] GenerateProposals-9 (#11994)
* add static_output attribute to enable GPU implementation of GenerateProposals.
This change may be reverted once dynamic shapes support is implemented
in the GPU plugin.

* - add GPU single layer test;
- add normalized attribute to SLT.

* add GPU primitive for GenerateProposals

* add kernel for GenerateProposals

* add unit test for GenerateProposals

* add blocked layouts support

* tidy up

* support blocked layouts also for 2nd and 3rd outputs

* Transformation to deal with dynamic output in GPU
  - revert changes in GenerateProposals interface and shape inference;
  - add transformation;
  - custom compare method in single-layer test.

* address review comments

* fix after rebase

* fix after rebase

* review comment: added nms_eta to testcase name generation

* - added input types combination to functional tests;
- fix float16 comparison.

* fix after rebase

* use vector for input ids

* fix after rebase
2022-10-11 21:10:32 +09:00
Pawel Raasz
b4ad7033c9 Opset 1 transpose shape inference review (#12937)
* Test to interval shape propagated by transpose

* Test to propagate labels by transpose

* Add template transpose shape inference

* Fixes to transpose shape inference

* Update names for shapes:
input -> input_shape
order -> order_shape

* Not fill output shape for dynamic range

* Add constexpr to SeqGen and Between comparator

* Correct StaticShape creation in test

* Tests check partial value propagate in arg input

* Add evaluate upper, lower, label to transpose
- add test

* Add common methods for inference and evaluate

* Move helpers to shape_inference

* Move transpose attribute to transpose op

* Fix include in transpose operator

* Correct label generation and type

* Fix null conversion

* Use uint64_t for labels tensor

* Fix compare labels

* Use order length as output rank

* Update transpose transformation test

* Move helpers to validation_util

* Correct test assertion for expected shape

* Transpose evaluate use common function
for output calculation

* Remove redundant helpers from transpose test
2022-10-10 16:18:52 +04:00