* [FIX][CPU] Convert CF: added convert childs number check
* code style fix
* WIP: [CPU][TESTS] Convert CF: Added subgraph test with two outputs for Convert node
* MatMulDecompressConvertTest refactoring + instances fixes
* removed KeepConstAndDecompressionForMatMul pass and added cpu callback for KeepConstAndDecompression
* MatMulDecompressConvertTest2: added graphs and small problem description
* small review fix
* try to fix memory issue
Signed-off-by: HU Yuan2 <yuan2.hu@intel.com>
* save code
Signed-off-by: Hu Yuan2 <yuan2.hu@intel.com>
* fix life cycle issue
Signed-off-by: Hu Yuan2 <yuan2.hu@intel.com>
* update comment and save stream of master thread in tbb ThreadLocal
Signed-off-by: Hu Yuan2 <yuan2.hu@intel.com>
* update
Signed-off-by: Hu Yuan2 <yuan2.hu@intel.com>
* not save the main stream in tbb
Signed-off-by: Hu Yuan2 <yuan2.hu@intel.com>
* remote test code and update comment
Signed-off-by: Hu Yuan2 <yuan2.hu@intel.com>
* fix mistaken modify
Signed-off-by: Hu Yuan2 <yuan2.hu@intel.com>
* fix format issue
Signed-off-by: Hu Yuan2 <yuan2.hu@intel.com>
* add test
Signed-off-by: HU Yuan2 <yuan2.hu@intel.com>
* improve test
Signed-off-by: HU Yuan2 <yuan2.hu@intel.com>
* fix the test
Signed-off-by: Hu Yuan2 <yuan2.hu@intel.com>
* remote unused code
Signed-off-by: Hu Yuan2 <yuan2.hu@intel.com>
* update the comment of the code
Signed-off-by: Hu Yuan2 <yuan2.hu@intel.com>
* fix format issue
Signed-off-by: Hu Yuan2 <yuan2.hu@intel.com>
* revert test case
Signed-off-by: Hu Yuan2 <yuan2.hu@intel.com>
---------
Signed-off-by: HU Yuan2 <yuan2.hu@intel.com>
Signed-off-by: Hu Yuan2 <yuan2.hu@intel.com>
* Fixed output_model logic.
* Removed InputCutInfo, disabled input cut in ovc.
* Disabled output cut, added tests for setting shapes or types for not all inputs.
* Returned support of numpy type.
* Separated MO and OVC python API tests.
* Small corrections.
* Added output dir test, exceptions test.
* Tests fixed.
* Corrected extension param description.
* Corrected input description, minor code corrections.
* [PT FE] Use weight share switch in frontend
* Return static for function
* Update src/bindings/python/src/openvino/frontend/pytorch/ts_decoder.py
* Fix issue with quantized constants
* Add tests for shared
* Change fused_names algo -> cut subgraphs
* Added extractor name to serialization dir + meta_info
* Uncomment log
* Add this_op_cnt to model_info, fix in_info for model
* Replace clone node second time to relace input node
* fix small problem
* small fixes
* Switch off repeat extractor
* remove model serialization
* fused_names
* change default device in fused_names extractor
* fused_names
* Small speed up
* Move replace of const by param to cache
* Move alignament of in_info to extractorManager
* Sort model by size (check mem fragmentation)
* Fix problem with opset12
* Update manager.cpp
* Serialize cache in case of long
* Add test
* Update graph_cache.cpp
* Update graph_cache.cpp
* Graph cache size
* test other approach
* remove extra
* Fix issue with replae
* try with 1gb limitatiom
* to merge
* revert
* Change `VPUX`/`VPU` occurrences to `NPU`
* Switch `HARDWARE_AWARE_IGNORED_PATTERNS` VPU to NPU
* Rename `MYRIAD plugin`
* Rename vpu_patterns to npu_patterns in tools/pot
* Rename vpu.json to npu.json in tools/pot
* Rename restrict_for_vpu to restrict_for_npu in tools/pot
* Change keembayOptimalBatchNum to npuOptimalBatchNum
---------
Co-authored-by: Dan <mircea-aurelian.dan@intel.com>
* Fix issue with kwargs in signature
* Update src/bindings/python/src/openvino/frontend/pytorch/ts_decoder.py
* Fix problem with some ops in detectron2
* Use debug name for extra input signature
* [TF FE] Support MaxPoolWithArgmax operation
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Add ticket number for TS crash
* Correct error message
* Skip crashing tests
* Set additional tensor name for MaxPool
---------
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* [HETERO] Add ConstantFolding in compile modelto avoid unexpected dynamism after model split. Add new property, which shows number of subgraphs
* Remove check for dynamic subgraph
* Removed legacy API from core_impl
* Revert old extension in old API
* Fixed unit tests
* Wrap new extensions in old API
* Wrap extensions in all legacy API
* Fixed legacy exceptions
* Fixed ONNX tests
* Try to fix LTO
Decompression attribute (that is present in models with FP16 precision)
prevents the weights to be constantfolded. Weights constantfolding is
required by CompressQuantizeWeights to compress the weights to low
precision format.
Ticket: CVS-117310