* [GPU] Reorder weights refactoring (#17787)
* [GPU] Fix DG2 with weights optimization
* [GPU] Fix DG2 with weights optimization
* [GPU] Fix DG2 with weights optimization
* [GPU] Fix DG2 with weights optimization
* [GPU] Fix inner order description for some of formats
* [GPU] Fix expected number of primitives in test
---------
Co-authored-by: Roman Lyamin <Roman.Lyamin@intel.com>
Co-authored-by: Sergey Shlyapnikov <sergey.shlyapnikov@intel.com>
* [GPU] Permute f and y axes
It is supported cases where y and f (and X if it is not equal 1) axes size divisible by 4,8 or 16.
added kernel to switch f and y axes in 4d model blocked and planar formats
added test fot tests
* Added subgroup read/write to THREE_DIM_TRANSPOSE kernel case.
* Better checking of is SIMD size supported.
* Added support for long type to subgroup read/write.
* Added subgroup read/write support to 2d permute.
* Fixed win build issue.
* Changed f and y indexes in iteration.
* Added vector read/write.
* Fixed j_times calculation.
* Better naming.
* Rollback test logic.
* Fixed fusion logic.
* Accept only supported blocked layouts and SIMD sizes.
---------
Co-authored-by: Mykhailo Hnap <mykhailo.hnap@capgemini.com>
Co-authored-by: Wilson Seok <wilson.seok@intel.com>
* [GPU] Add oneDNN primitives profiling support
* [GPU] Add stream.wait() method to prevent caches flushing and other possible impacts of finish() call
* Add comment for wait() usage
* Provided visualization of partial values and labels. Adopted DimensionTracker for better equivalence tracking
* Addressed comments and fixed one test
* Removed copy of consts in translator, added test.
* Fixed memory loss for tf.Const.
* Added test, minor corrections.
* Update src/bindings/python/src/openvino/frontend/tensorflow/node_decoder.py
Co-authored-by: Roman Kazantsev <roman.kazantsev@intel.com>
* Test corrections.
* Added comment.
---------
Co-authored-by: Roman Kazantsev <roman.kazantsev@intel.com>
* Enable `LoadedTensor.*HETERO` test
* Fix use of `ICompiledModel::outputs()`
* Remove extra `loaded_from_cache` argument
* Misprint
* Small refactoring
* Remove extra `model` from `CompiledModelDesc`
Use `get_runtime_model()` instead
* ClangFormat
* [PyOV] Extend Tensor API
* one more ctor
* apply comments
* support constoutput
* add checks for shape
* checks for type and shape
* apply comments
* is_continuos
* codestyle
* Remove unsupported plugin from proxy in order to avoid exception
catching
* Remove from alias for
* Globally remove unavailable devices
* Do not load proxy if no available devices
* Try to fix CI
* Add debug messages
* Avoid two times the same plugin
* Revert "Add debug messages"
This reverts commit 562e36c633.
* Cache hidden devices
* Update if fallback order was changed
* Try to fix CI
* Fixed CoreThreading tests
* Fixed typo
* Try to fix plugin initialization
* Fixed initialization
* Fixed lock
* Do not return default context from the compiled model
* Try to fix segfault
* Added test
* Remove unnecessary code
* Remove code duplication
* Migrate to new API
* Fixed comment
* Revert renamed variable
* Remove cast
* Hetero throws OV Exception
* PostgreSQL interface for reporting from test apps
* Fixes for Linux and MacOS builds
* Fixed segmentation fault and linux build issue
* Removed unnecessary debug messages and std::endl
* Improved logging
* Added loading libpq.so.5 on Linux and fixed typos
* Removed explicit definition for conformance runner
* Added reporting levels with less details, but faster
* Added escaping strings due to unexpected tests naming
* Added env-var defined Run ID (external grouping)
* Added transaction-based queries
* Stored amount of tests to run
* Added handling of serialization errors for transactions
* Cleanup unnecessary code due to latest changes in env
* Added customizable test results
* Enabled conformanceTests.exe for PostgreSQL Reporting
* Changed behavior of Run ID. Changed behavior of Suite Results.
Fixed void in declarations.
* Refactored code regarding control of custom fields in runtime
* Updating targetDevice and case type in Conformance Test
* Introduced wasting test results in case of skipping
Fixed behavior of missing data in case of Fast reporting
Refactored Initialize function by moving loading part separately
* Updated queries naming
* Added runtime resolution of an opset version for operation
* Added capturing a PostgreSQL logs for diagnostic purposes
* Remove unused header
* Fix for fresh master
* Try to detect run on ARM, fix for latest IR hash format
* Updated status values for easier queries
* Added a version info to session information
* Added simple DGPU detection, removed unused prints
* Reduced serialization errors
* Replaced usleep by nanosleep
* Improving robustness
* Fixed behavior of OpImplCheck
* Added IR Weight
* Added a customizable "device architecture" field
* Fix for CI
* Added missing device detection in OpImplCheck
* Introduced Manual Start feature for delayed posting start
information to a tables
* Added app_id to test_results
* Changed architecture detection as requested by owner
* Moved CMake option to src/tests
* Added TODO
* Added full device name as a part of target device
* Added MacOS libpq loading
* Trying alternate way to load on MacOS
* Adding correct executable name detection for MacOS