* Use prebuilt cmake in install-dependencies steps
* Removed building of custom python to save Azure CI time
* Adjusted conditions in cmake
* Download VPU custom kernels compiler only for x86
* Removed -pyver argument
* Improvements
* Detect glibc version
* Usage of new glibc variable
* Fixed typo
* Increased glibc version for MYRIAD case
* Made OpenVINO cmake Linux flavors agnostic
* Detect glibc version on RedHat
* Use clcompiler only since ubuntu 18.04
* Universal way to detect glibc version
* Revert some changes
* Apply suggestions from code review
fixes issue with GNA
* Apply suggestions from code review
fixed typo
* Improvements in rpm / debian build
* Fixed several debian warnings
* Supported old gflags from CentOS 7
* Reverted back OpenCV version
* Fixed clang-format
* Fixes for brew support on OSX
* Added new policy
* added quotes
* Used OpenVINODeveloperPackage
* Extra fixes
* Use standard verbose variable
* Fixed pkg-config generation
* Building python frontend tests
* Enable mock tests in new python API:
- Fix mock_py frontend library name
- Set same destination for mock_py frontend as others frontends
- Fix importing mock_py frontend
- Minor changes in mock_py tests
* Fix flake8 issue
* Fix linking issue on windows
* Remove DISABLE_PYBIND11 option
* Build mock_py always as shared library
- put mock_py into ov::frontend namespace
- minor re-factor of mock_py
* Add mock c API fix warning on windows
* Fix installation of mock_py frontend
* Install mock_py frontend library in tests
* More detailed error message in python test
when cannot import frontend
* Restore IMPLEMENT_OPENVINO_API for windows build
of pybind_mock_frontend
* Set env paths for python API test on Azure CI
- skip tensorflow conversion ext test when not installed
* Correct skip tensorflow test decorator
* Add C API macro for mock1 frontend
* Ignore E999 error reported by flake8
* Fix conversion extension name for tf imports
* Fix flake8 issues
* Remove python3.8 path from LD_LIBRARY_PATH
on debian CI
Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>
* Added support of pugixml search via pkg-config
* Updated old InferenceEngineDeveloperPackage.cmake.in
* Removed debug print
* Fixed cmake stage for U18
* Added WA for pugixml on U18 for DevPackage as well
* Disable pkg-config generation for OpenVINO for U18
* Added option for pkg-config
* Now triplet is auto-detected
* Fixed U18 case
* OpenCV build switched off by default
* Removal of OCV copy from the "install tests" stage
* Switch off OCV dependent tests
* cmake code repositioning
* More OCV-dependent tests disabled
* OCV removed entirely from the C-API tests
* Final cleanup
* Fixes for cases when TBB_DIR env var is set
* Don't use make in build_samples.sh script
* First version of Windows installer
* Windows NSIS installer
* Improved version of debian packages
* Improvements
* Improvements
* Debian packages now look good
* Library versioning
* Fixed tests to run against debian packages
* Fixed frontend tests
* Fixed code style
* Fixed Windows
* Fixed python tests
* Fixed paths in tests
* fdvfdv
* Fixes
* Use versioning only for debian packages
* Relocatable tests
* Fixed
* Fixed all tests
* Fixed clang-format
* Fixed more tests
* Fixed some tests
* Absolute paths in .ci
* Fixes
* Added support of OpenCV 3.4
* Trying to fix gnaUnitTests
* Build with system TBB
* Fixes
* Check whether system TBB is available
* Try to fix ONNX Runtime build with system TBB
* Test
* Fixed compilation of threading.cpp
* Fixed unset of cache dirs
* Limit search paths of TBB
* Try to enable pip packages with custom TBB
* Fix for TBB 2021.2
* Install only needed TBB libraries
* Install TBB from system to pip package
* Reverted usage of TBBROOT
* Fixed oneTBB case
* Try to fix Android
* Escape some paths
* Added samples path
* Fixed TBBBind usage for case of system TBB
* auto-batching POC squashed (all commits from auto-batch-2021.3 branch)
(cherry picked from commit d7742f2c747bc514a126cc9a4d5b99f0ff5cbbc7)
* applying/accommodating the API changes after rebase to the master
* replaying modified version of actual batch selection
* early experiments with model mem footprint
* changes from rebasing to the latest master
* experimenting with DG1 on the batch size selection, also collecting the mem footprint
* WIP: moving the auto-batching to the icore to let the MULTI/AUTO support that, ALLOW_AUTO_BATCHING as a conventional config key. still fails hot device swap
* quick-n-dirty batch footprint vs device total mem
* code style
* testing which models perform badly due to kernels and NOT (batched) footprint
* stub pipeline task to communicate the readiness rather than promise/future
* quick-n-dirty timeout impl
* explicit _completionTasks,reverting BA to use the timeout
* inputs outputs copies, works with AUTO and demo now
* accommodate the config per device-id, after rebase to the latest master
* allowing the auto-batching only with tput hint to let more conventional tests pass
* fix the premature timeout restarting via waiting for batch1 requests completion
* moved the batched request starting (along with input copies) to the dedicated thread
* [IE CLDNN] Disable bs_fs_yx_bsv16_fsv16 format for int8 convolution
* code style
* increasing the timeout to test the ssd_* models perf (timeout?) issues
* reducing number of output stuff in BA to avoid bloating the logs in experiments
* more aggressive batching for experiments, not limited to 32 and also 4 as a min
* more accurate timeout debugging info
* getting the reqs limitation from the plugin SetConfig as well
* refactor the reshape logic a bit to accommodate CPU for batching, also added remote context
* let the benchmark_app consume specific batch values for the auto-batching such as BATCH:GPU(4)
* auto-batching functional test (with results check vs ref) and GPU instance for that
* fixed arithmetic on blobs ptrs
* clang
* handling possible batched network failure
* BATCH as the constants device name in test
* ENABLE_BATCH
* func tests for CPU, also DetectionOutput hetero tests (CPU and GPU)
* DetectionOutput hetero test for the CPU
* reenabling the Auto-Batching in the AUTO
* auto-batching device enabled in the test
* fixed the DO test
* improve the loading loop logic
* brushed the config keys
* allow hetero code-path for explicit device name like BATCH:GPU(4), used in the hetero code-path tests
* fix the test after refactoring
* clang
* moving ThreadSafeQueue to the ie_parallel, as it is re-used in the AUTO/MULTI and BATCH now
* auto-batching hetero test (subgraph with DetectionOutput)
* fixed minor changes that were result of experiments with impl
* code-style
* brushing, disabling CPU's HETERO tests until planned activity for 22.2
* removing home-baked MAX_BATCH_SIZE and switching to the official impl by GPU team
* remote blobs tests for the auto-batching (old API)
* brushed names a bit
* CreateContext and LoadNetwork with context for the Auto-Batching plus remote-blobs tests
* fixed the ieUnitTests with adding CreateContext stub to the MockICore
* clang
* improved remote-blobs tests
* revert the BA back from experiments with AB + device_use_mem
* conformance tests for BATCH, also batch size 1 is default for BATCH:DEVICE
* remote blobs 2.0 tests, issue with context having the orig device name
* debugging DG1 perf drop (presumably due to non-fitting the device-mem)
* disabling WA with batch/=2 for excessive mem footprint, leaving only streams 2
* remote blobs 2.0 tests for different tensor sharing types
* converting assert to throw to accommodate legacy API where the lock() could be called
* revert the timeout back to avoid mixing the studies, fixed the footprint calc
* reverting to estimating the max batch by extrapolating from batch1 size
* more conservative footprint estimation (with batch1), graceful batch 1 handling without duplication
* even more graceful batch 1 handling without duplication
* WA for MAX_BATCH_SIZE failure, removing batch4 as a min for the auto-batching
* AutoBatchPlugin -> ov_auto_batch_plugin
* WA for gcc 4.8
* clang
* fix misprint
* fixed errors resulted from recent OV's Variant to Any transition
* skip auto-batching for already-batched networks
* AUTO_BATCH_TIMEOUT and tests
* GPU-specific L3
* switched to pure config, also improved ALLOW_AUTO_BATCHING config key handling logic
* debugging device info
* enabling the config tests for the GPU and fixing the Auto-batching tests to pass
* making the default (when the driver is not recognized) cache size more aggressive, to accommodate recent HW with old drivers
* skip auto-batching for RNNs and alikes (e.g. single CHW input)
* fixed fallback to the batch1 and moved HETERO path under condition to avoid bloating
* brushing
* Auto plugin GetMetric support gpu auto-batch
Signed-off-by: Hu, Yuan2 <yuan2.hu@intel.com>
* add test case
Signed-off-by: Hu, Yuan2 <yuan2.hu@intel.com>
* add comments on test
Signed-off-by: Hu, Yuan2 <yuan2.hu@intel.com>
* brushing the vars names, also adding the exception handling
* disabling the auto-batching for the networks with non-batched outputs and faster-rcnn and alikes (CVS-74085) to minimize the # of failures
* add try catch
Signed-off-by: Hu, Yuan2 <yuan2.hu@intel.com>
* brushing the code changed in the GPU plugin
* Auto-Batch requests tests
* brushed variables a bit (ref)
* cleaned debug output from the ie_core
* cleaned cmake for the Auto-Batch
* removed batchN estimation from batch1
* cleaned from debug printf
* comments, cleanup
* WA the mock test errors introduced with merging the https://github.com/myshevts/openvino/pull/13
* Adding back removed batchN estimation from batch1 to debug degradations on DG1 (resulted from too optimistic MAX_BATCH_SIZE?). This partially reverts commit e8f1738ac1.
* brushing ie_core.cpp
* fix 32bit compilation
* Code review: ENABLE_AUTO_BATCH
* consolidate the auto-batching logic in ie_core.cpp into single ApplyAutoBatching
* renamed and brushed the OPTIMAL_BATCH (now with _SIZE), which mimics the MAX_BATCH_SIZE wrt MODEL_PTR
* default value for the OPTIMAL_BATCH_SIZE
* clang
* accommodate new func tests location
* fix shuffle of headers after clang + copyrights
* fixed misprint made during code refactoring
* moving the common thread-safe containers (like ThreadSafeQueue) to the dedicated dev_api header
* switch from the device name to the OPTIMAL_BATCH_SIZE metric presence as a condition to consider Auto-Batching
* switching from the unsafe size() and minimizing time under lock
* code style
* brushed the ApplyAutoBatching
* brushed the metric/config names and descriptions
* completed the core integration tests for the auto-batching
* ExecGraphInfo and check for incorrect cfg
* removed explicit dependencies from cmake file of the plugin
* disabling Auto-Batching through the tput hint (to preserve current product default), only explicit like BATCH:GPU used in the tests
Co-authored-by: Roman Lyamin <roman.lyamin@intel.com>
Co-authored-by: Hu, Yuan2 <yuan2.hu@intel.com>
* Renamed plugin
* Changed plugin source dir
* Fixed tests
* Renamed ENABLE_GNA to ENABLE_INTEL_GNA
* Fixed centos compile error
* One more renaming place
* HF for centos 7
* renamed the name in version api
* Fixed CI configurations
* Disabled GNA plugin for old gcc (<=5.4)
* Fixed Azure Win job
* Enable LTO for oneDNN GPU
* Use importer target for onednn_gpu
* Fixed install of OpenCL, ittapi
* Try to enable samples build
* Fixed ie_wheel build
* Disabled samples
* Fixed incorrect install of pugixml in dynamic build
* Fixed clDNN_unit_tests64 link
* Minimized a number of hardcoded paths
* Switched off ITT
* Disable TBBBind for static Windows
* Fixed link on Windows
* Fixes
* Static build for Windows
* Disable unit-tests
* Disabled TF
* Fixed add dependency for PDPD unit tests
* Disabled GNA
* Enabled ngraph test utils
* Fixes
* Fixes
* Disable clDNN for static build on Windows
* Dynamic MSVC runtime for OpenCL
* Disable ITT traces for static libs
* Ability to switch TBBBind_2_5 OFF
* Disabled GPU on Windows build
* Fixed libGNA link
* Disabled some GNA tests
* Fixed multiple definitions issue
* Hide FrontEnd::load in DLL
* Install libGNAStubs
* Trying to fix Windows
* Trying to fix frontends
* Passed compilation on Windows
* Updated install rules for ngraph backends
* Try CPU, TEMPLATE tests
* Enabled unit-tests
* Enabled ngraph tests
* Try to fix unit-test
* Fixed unit-test on Linux
* Fixed compilation with ENABLE_IR_V7_READER=OFF
* Fixed tests on Linux
* Enable cpuFuncTests only in static build
* Try to fix testMockPluginFrontEnd test
* Try to enable TF
* Fixed python tests
* Partial revert
* Removed OV_FRONTEND_PATH