* Commits from PR11373:
Fixed handling of transpose after convolution
[GNA] Fixed calculation of dimensions for ConvolutionFilter and PWL primitives
[GNA] Fixed coverity error and failed tests
* Apply comments
* Update src/plugins/intel_gna/gna_graph_compiler.cpp
Co-authored-by: Marcin Kusmierski <marcin.kusmierski@intel.com>
* Update src/plugins/intel_gna/gna_graph_compiler.cpp
Co-authored-by: Marcin Kusmierski <marcin.kusmierski@intel.com>
* Rollback names
* Separate test data
* Move coverity issue to separate request
Co-authored-by: Elizaveta Lobanova <elizaveta.lobanova@intel.com>
Co-authored-by: Marcin Kusmierski <marcin.kusmierski@intel.com>
* Add some tip description about enable OpenCL for benchmark_app.
Signed-off-by: xipingya <xiping.yan@intel.com>
* Export doesn't work, we need to add -Dopencl_root_hints=[PATH]/OpenCL-CLHPP/include to cmake command.
Signed-off-by: xipingya <xiping.yan@intel.com>
Co-authored-by: Chen Peter <peter.chen@intel.com>
* change memory access pattern of fsv layout for permute
* Fix permute_ref to process F first only when (bf...) => (b...f)
* Refactor
Co-authored-by: si-eun-kim <sieun.kim@intel.com>
* fix references
* update links
* update the wording to be more clear
* add the error message about Visual studio back
* update links to static html links of 2022.2
* Fix GPU NonMaxSuppression implementation
* Introduce Nms9 single layer tests
* Adapt internal NMS and GPU implementation for NMS9 implementation
* Adapt CPU implementation in GPU for NMS9
* Add blocked layouts support to NMS
* Add unit tests for blocked formats for NMS
* Fix boxes groups size for the small shapes
* Use ocl implementation for blocked layout input
* Fix templates typedefs to pass win build
* Fix second output to set data in correct format
* Add methods for access to varables information in Program class
* add ReadValue and Assign primitives
* ReadValue and Assign implementations
* Implementation of memory states allocation
* Add output existance check in primitive_inst to avoid crashes if output is set during execution
* Add memory states management functionality in network component
* Integration of memory states feature in inference request component
* Exclude constant path for read_value and assign nodes in cldnn transformations
* Improve memory states test to run on a single inference request
* unit tests for ReadValue and Assign
* single-layer test for ReadValue and Assign
* Add QueryState API implementation
* Add memory state test which covers dynamic batch case
Co-authored-by: Oleksii Khovan <okhovan@lohika.com>