* zero-copy (assuming deterministic app-level scheduling) for the multi-device, via "borrowing" the corresponding device-specific blobs and letting the app implicitly use these
* Optimized Infer Request Scheduling
* remoteblob checks in the conventional SetBlob
* correctly (with status) reporting NOT_IMPLEMENTED
* SetBlob to accommodate the RemoteBlobs
* Tests for remote blobs support via MULTI: creating the shared_test in case the other (closed source) plugins would want to use that (in the private shared_tests instantiations).
Also instantiating the remote blobs tests for some basic combinations to test that the MULTI supports them
* macos compilation (and general plugin platform support) fix
* shuffled files, so that the MULTI tests are now part of the ieFuncTests (and need no separate target). Also brushed the macro that handles the NOT_IMPLEMENTED a bit
* further shuffled files, so that the initial MULTI tests are now part of the IE tests, yet specific instances do need separate targets
* Fixed misprint
* Brushing the code and comments a bit
* further brushing of the ScheduleToWorkerRequest: moving the task execution directly into the loop over devices (avoids pointers and 'else' clause)
* 1) zero-copy (assuming deterministic app-level scheduling) for the multi-device, via "borrowing" the corresponding device-specific blobs and letting the app implicitly use these
2) Initial MULTI section in the opt guide (primarily to document a tip on helping the MULTI to keep the zero-copy path)
* [MULTI] remote context support and associated scheduling (respecting the remote data affinity)
* fix CentOS (old) gcc issue: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81880
since the introduced thread_local string is a template, the bug manifests itself (and the string is not allocated/initialized).
the workaround is to wrap the std::string into a function
* further fix for the old gcc versions issue, now with non-trivial thread_local destruction segfault: switching from the std::string to the plain const char*
* additional tests for the MULTI and remote blobs (no remote context and multi GPUs cases)
* fix for the tests (that now can check for the more specific NotImplemented exception).
Also a couple of line endings
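
The old-gcc workaround described above (keeping the thread-local value inside a function, and finally storing a plain `const char*` instead of a `std::string`) can be sketched roughly as follows. This is only an illustration: the names `ThreadDeviceNameSlot`, `SetThreadDeviceName` and `GetThreadDeviceName` are hypothetical and are not taken from the plugin sources.

```cpp
#include <cassert>
#include <string>

// Hypothetical helper (not the plugin's actual identifier).
// The thread_local lives inside a function, so it is initialized on first use
// in each thread, and its type (const char*) is trivially destructible, so no
// thread_local destructor runs at thread exit.
inline const char*& ThreadDeviceNameSlot() {
    thread_local const char* name = "";
    return name;
}

// The pointer is stored as-is: the caller must pass a string literal or other
// storage that outlives every later read on this thread.
inline void SetThreadDeviceName(const char* name) {
    assert(name != nullptr);
    ThreadDeviceNameSlot() = name;
}

// Copies the current thread's value into an owning std::string for the caller.
inline std::string GetThreadDeviceName() {
    return ThreadDeviceNameSlot();
}
```

The trade-off of this shape is that ownership moves to the caller, which is acceptable only when the stored names have static or otherwise long-lived storage.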
* added check so that sample only supports networks with one input
* moved ngraph-related operations to related segment of the sample
* fix for output image not being saved correctly due
* Range: Align operator with spec and add unit tests
* Range: Remove output shape from range ref impl signature
* Range: Exclude backend unit tests for CPU and GPU due to unsupported dynamic ops
* Range: Add single layer test class for Range-4
* Range: Add unit test for shape inference
* Range: Add unit tests for i32 and f32
* Range: Refactor Range v0 backend test and add test for f32 type
* Range: Add floating point tolerance in unit tests to avoid failures due to precision
* Range: Add subgraph tests for Range add element-wise
* Range: Refactor Range class for single layer tests and add range add element-wise test with truncated inputs
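
For context on the Range items above (shape inference, f32 tests, floating point tolerance), here is a minimal reference-style sketch. It assumes the element count is max(ceil((stop - start) / step), 0); `RangeRef` and `AlmostEqual` are illustrative names, and the tolerance value is an arbitrary choice, not the one used in the actual tests.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Illustrative reference: generates start, start + step, ... up to (not including) stop.
inline std::vector<float> RangeRef(float start, float stop, float step) {
    assert(step != 0.0f);
    // Element count computed in double, clamped at zero for empty ranges.
    const double count = std::ceil((static_cast<double>(stop) - start) / step);
    const std::size_t n = count > 0.0 ? static_cast<std::size_t>(count) : 0;
    std::vector<float> out(n);
    for (std::size_t i = 0; i < n; ++i) {
        out[i] = start + static_cast<float>(i) * step;
    }
    return out;
}

// Absolute-tolerance comparison, so that rounding in f32 does not fail a test.
inline bool AlmostEqual(float a, float b, float tol = 1e-5f) {
    return std::fabs(a - b) <= tol;
}
```

Computing each element as `start + i * step` rather than by repeated addition keeps the accumulated rounding error small, which is one reason a tolerance-based comparison is usually enough.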
* [VPU] Fix dynamic networks import
* [IE][GNA][TESTS] Move ImportExport tests from GNA to shared part
* [VPU][Tests] Add ExportImport test for dynamic network
* [VPU] Review fixes
* [VPU][Tests] Review and test fixes
* [VPU][Tests] Move TEST_P to shared part
* remove convert op from layer creator
* remove depthtospace op from layer creator
* remove mvn op from layer creator
* remove normalizel2 op from layer creator
* remove notequal op from layer creator
* remove subtract op from layer creator
* correct mvn op behavior when copied with new input
* fix trying to get precision from empty output of normalize layer
* fix normalize layer not setting output type
* remove trailing whitespace
* add fp64 to possible convert op precision types
* use a function to translate bool string representation
* merge emergency opset changes for mvn and roipooling ops
* Add reference implementation for PSROIPooling operator
* fix test_roi_pooling
* use std::roundf
* remove unnecessary copies in single layer tests
* Fixes after review
* fixes after review
* use element::Type_t instead of element::
* apply code format
* add PSROIPooling to evaluates_map
* apply code format
it is easy to capture when there are 2 app-level inference requests, but only single worker (MULTI) request
main thread                                         | callback thread
___________________________________________________________________________
                                                    | <in the callback, the worker request>
                                                    | <the request returns itself to the "idle" queue>
                                                    | 1) idleGuard.Release()->try_push(workerRequestPtr)
2) <notified on vacant worker arrived via callback> |
3) starts another request with StartAsync           | ...
4) <in the ThisRequestExecutor::run()>              |
workerInferRequest->_task = std::move(task);        | if (_inferPipelineTasks.try_pop(workerRequestPtr->task))
the last line introduces DATA RACE (sporadically manifested in the bad_function_call exception), the fix is in this commit
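
One way to avoid this kind of race is sketched below. The commit message does not spell out the actual fix, so this is an assumption: once a worker has been returned to the idle queue, the callback no longer writes into its task slot and instead pops the pending task into a local and re-schedules it through the regular path. `LockedQueue`, `Worker` and `Scheduler` are simplified stand-ins, not the plugin's classes, and tasks run synchronously here to keep the sketch short.

```cpp
#include <cassert>
#include <deque>
#include <functional>
#include <mutex>
#include <utility>

using Task = std::function<void()>;

// Minimal mutex-guarded queue standing in for the plugin's concurrent queues.
template <typename T>
class LockedQueue {
public:
    void push(T value) {
        std::lock_guard<std::mutex> lock(mutex_);
        items_.push_back(std::move(value));
    }
    bool try_pop(T& out) {
        std::lock_guard<std::mutex> lock(mutex_);
        if (items_.empty()) return false;
        out = std::move(items_.front());
        items_.pop_front();
        return true;
    }
private:
    std::mutex mutex_;
    std::deque<T> items_;
};

struct Worker {
    Task task;  // written only by the thread that popped this worker from `idle`
};

struct Scheduler {
    LockedQueue<Worker*> idle;
    LockedQueue<Task> pending;

    // Regular path: claim an idle worker first, then fill its task slot.
    // Returns false (and queues the task) when no worker is idle.
    bool Schedule(Task task) {
        Worker* worker = nullptr;
        if (!idle.try_pop(worker)) {
            pending.push(std::move(task));
            return false;
        }
        assert(worker != nullptr);
        worker->task = std::move(task);
        worker->task();  // a real plugin would start an asynchronous request here
        return true;
    }

    // Completion callback: after the push, the worker may already belong to
    // another thread, so the pending task goes into a local variable and is
    // re-scheduled instead of being written into worker->task directly.
    void OnWorkerDone(Worker* worker) {
        idle.push(worker);
        Task next;
        if (pending.try_pop(next)) Schedule(std::move(next));
    }
};
```

The sketch only addresses who may write the task slot; it does not try to reproduce the plugin's device affinity handling or shutdown logic.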
* remove power op from layer creator
* remove prelu op from layer creator
* remove tile op from layer creator
* remove relu op from layer creator
* remove selu op from layer creator
* remove softmax op from layer creator
* remove tanh op from layer creator
* remove split op from layer creator
* remove reshape op from layer creator
* remove reverse sequence op from layer creator
* remove proposal op from layer creator
* remove priorbox op from layer creator
* remove roipooling op from layer creator
* remove priorboxclustered op from layer creator
* style fix
* utility function to parse bool-containing strings
* align priorbox scale_all_sizes parameter to specification
* change location of getBoolStrParamAsIntStr function
* align prelu creator to new constant op changes
* adjust priorbox tests to align with scale_all_sizes default value
* adjust priorbox python tests to align with scale_all_sizes default value
* align priorboxclustered attributes initialization to specification
* fix checking wrong container's end iterator for opset name search
* improve comment on roipooling parameters
* Apply review suggestion 1
Co-authored-by: Ilya Churaev <ilyachur@gmail.com>
* Apply review suggestion 2
Co-authored-by: Ilya Churaev <ilyachur@gmail.com>
* align priorbox step initial value to specification
* align roipooling method attribute to specification
* remove roipooling specific creator
* align with review comments
Co-authored-by: Ilya Churaev <ilyachur@gmail.com>
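
The bool-string utility mentioned in the items above could look roughly like the sketch below. The exact behavior of `getBoolStrParamAsIntStr` is not described in this log, so the semantics here (case-insensitive "true"/"false" mapped to "1"/"0", anything else passed through) and the name `BoolStringToIntString` are assumptions made for illustration.

```cpp
#include <algorithm>
#include <cctype>
#include <string>

// Hypothetical helper: normalizes a textual boolean attribute to "1" or "0".
inline std::string BoolStringToIntString(std::string value) {
    // Lower-case a copy so "True", "TRUE" and "true" are treated alike.
    std::transform(value.begin(), value.end(), value.begin(),
                   [](unsigned char c) { return static_cast<char>(std::tolower(c)); });
    if (value == "true") return "1";
    if (value == "false") return "0";
    return value;  // already numeric or unrecognized: returned as received (lower-cased)
}
```

Centralizing the translation in one function avoids repeating the string comparison in every layer creator that reads a boolean attribute.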
* create MaxPoolLayer test
* Create single layer tests for MaxPool for cpu plugin
* create max_pool_2d_ceil unit test
* Update MaxPool spec
* add comments describing AUTO and NOTSET types
* create unit test for MaxPool
* add type_prop test for default values
* add MaxPool unit tests to CMakeList
* Remove second constructor and change the first one so it has default values for rounding_type and auto_pad
* style-apply
* Update the spec
* add max pool single layer test instances for different pad type
* add 5D input max pool single layer test instance for cpu plugin
* Remove max pool single layer tests files
* add more test instances for max pool single layer tests for cpu plugin
* add newline characters
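
To illustrate why the MaxPool items above test the ceil rounding type separately, here is a small sketch of the usual pooled output size formula for one spatial dimension. `PooledDim` and `RoundingType` are illustrative names, the formula ignores dilation, and the default of FLOOR simply mirrors the idea of a constructor with a default rounding_type; none of this is copied from the operator implementation.

```cpp
#include <cassert>
#include <cstddef>

enum class RoundingType { FLOOR, CEIL };

// Output size of one spatial dimension: round((in + pads - kernel) / stride) + 1.
inline std::size_t PooledDim(std::size_t in, std::size_t kernel, std::size_t stride,
                             std::size_t pad_begin, std::size_t pad_end,
                             RoundingType rounding = RoundingType::FLOOR) {
    assert(stride > 0);
    const std::size_t padded = in + pad_begin + pad_end;
    assert(padded >= kernel);
    const std::size_t span = padded - kernel;
    // Integer ceil division for CEIL, plain truncating division for FLOOR.
    const std::size_t steps =
        (rounding == RoundingType::CEIL) ? (span + stride - 1) / stride : span / stride;
    return steps + 1;
}
```

For an input of 5 with kernel 2 and stride 2, FLOOR gives 2 outputs while CEIL gives 3, because the last partial window is kept.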
* [MO] Add CMake install for Model Optimizer
* [MO] Update test for version.py
* - fix install paths for onnx_reader and ir_reader
- remove static lib installation for plugins on plugins
- 97-myriad-usbboot.rules is installed only for Linux
* added new line
* export TBB_DIR in setupvars