- Implement static TileScheduler to handle compile params processing. Now compile params are accessed only here
- TileScheduler should emit code only for necessary scalar/vector Tiles
- Perform abstract-to-physical register mapping in one place (currently KernelEmitter constructor)
- Implement more precise register mapping, so larger subgraphs could be created (now up to 12 i/o regs instead of 7)
Co-authored-by: Ivan Novoselov <ivan.novoselov@intel.com>
* tf_fe lib renaming to switch to new fe
* Apply suggestions from code review
* 1. implements renaming for win os
2. --api_2 key is equal to True by default
* --ir_version key is equal to '11' by default
* resolves merge conflicts
* check if file to rename exists
Co-authored-by: Evgenya Stepyreva <eva.my.link@gmail.com>
* fix unicode issue that ie.ReadNetwork fails if the exe's folder contains Chinese characters
* fix clang format issue
* fix clang format issue
* modify '/' to file_separator in get_directory
* fix ci issue
* fix test failure on windows
* fix test issue
Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>
* [SAMPLES] Add GNA library version log to speech_sample
* [GNA] Add OV version to TLV export file
Co-authored-by: Szymon Irzabek <szymon.jakub.irzabek@intel.com>
* Revert "Fix experimental detectron do ref impl (#10621)"
This reverts commit d87233863d.
* Disabled Experimental Detectron per agreement with GPU team. Ticket to fix it: 90209
when cpuFuncTest and libopenvino.so in sampe directory
on redhat8, cpuFuncTest can't find front library path
Signed-off-by: Hu, Yuan2 <yuan2.hu@intel.com>
Signed-off-by: Hu, Yuan2 <yuan2.hu@intel.com>
* Add eye decomposition transformation
* Fix EyeLike generation when diagonal shift
outside dimensions
* Add batch shape to eye decomposition
* Eye decomposition clean-up
* Remove reference part if no eye decompose in
decomposition tests
* Eye-Like use eye operator
* Disable eye decomposition for CPU plugin
* Use opset9 instead of ops in eye decomposition
* Apply transformations header style
to eye_decomposition.hpp
* Add model reference in eye decomposition tests
- use opset9 instead of ov::op:vX namespace
* Refactor eye decomposition:
- match style of other transformations
- add NodeRegister class to make and collect created nodes
- use `NodeRegister` in transformation for copy runtime info
- use `NodeRegister` in `MatcherPass` to replace new `register_new_node`
* Fix CI LTO optimization issue for Ubuntu 20
* update logic of setting TPUT by default for MULTI and AUTO.
Signed-off-by: Wang, Yang <yang4.wang@intel.com>
* Set config to MULTI instaed of target device when using -d MULTI:target_device in benchmark app.
Signed-off-by: Wang, Yang <yang4.wang@intel.com>
* Update.
Signed-off-by: Wang, Yang <yang4.wang@intel.com>
* Update logit to check if perfomance hint is set.
Signed-off-by: Wang, Yang <yang4.wang@intel.com>
* Update test cases.
Signed-off-by: Wang, Yang <yang4.wang@intel.com>
* Set the correct _so from passthrough executable network when batch plugin is disabel to fix lifecycle coredump.
Signed-off-by: Wang, Yang <yang4.wang@intel.com>
* Same update for AUTO plugin
Signed-off-by: Wang, Yang <yang4.wang@intel.com>
* Update
Signed-off-by: Wang, Yang <yang4.wang@intel.com>
* Update
Signed-off-by: Wang, Yang <yang4.wang@intel.com>
* Update.
Signed-off-by: Wang, Yang <yang4.wang@intel.com>
Signed-off-by: Wang, Yang <yang4.wang@intel.com>
Co-authored-by: Chen Peter <peter.chen@intel.com>
* CVS-89672 Make model reshape and track batch
* Minor refactoring
* Changed mechanism of constant replacement to more mature
* Update src/common/transformations/include/transformations/smart_reshape/lstm_states_broadcast.hpp
* Update src/common/transformations/src/transformations/smart_reshape/lstm_states_broadcast.cpp
* [GPU] Integrate oneDNNv2.7
- Support bsv16_fsv2, bsv16_fsv4
- Update qunatize LWS opt and common GetOptimalLocalWorkGroupSizes.
- Enable bs_fs_zyx_bsv16_fsv2 in eltwise.
- Update to use binary_mul in case NOT int8 conv with oscale.
- Update oneDNN v2.7pc2 branch in onednn submmodule
- Apply shallow conv in/out condition, using acdb,acdeb
mixed precision b32, f32 case, newly add weight reorder format
- Update to fix remains ref_conv selelction issue and running failure issues.
- Update to fix shallow out and normal in case creates unnecessary reorder.
* Fix different in/out format issue in concat
- Concat supports in/out different format, but it has issue calculating concat axis.
After moving cpuFuncTest to cpu_plugin folder, there is more stricter grammar check.
ngPrc is declared twice, but for the first declaration, the ngPrc is not used.
Signed-off-by: Yan, Xiping <xiping.yan@intel.com>
Signed-off-by: Yan, Xiping <xiping.yan@intel.com>
* Disable python for configuration with libraries suffix
* Fixed python detection on arm
* More fixes
* Used find host package
* Don't use PYTHON_VERSION which is used by pybind
* [TF FE] Correct Deconvolution for NCHW layout
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Revert Deconvolution implementation and work around -1 for SS
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>