* Gather needs to keep the original input/output rank
- because the parameters as indices, batch_dims and axis depend on the rank.
- add input_rank to gather primitive.
* don't query on set_preferred_formats pass
-when the force_implementations is set.
-when forcing_impl is not onednn.
* Add Multinomial-13 to MO
* Add Multinomial tests for MO IR reader
* Move convert_type check
* Imports clean up
* Update pacgage BOM file
* Avoid files collision in tests
* Check for ReduceProd + SoftMax fix
* Check for ReduceProd + SoftMax fix
* Fix after moving on get_constant_max_of_type
* Extended tests and added coverage for other types
* Code optimization
* [GPU] Fixed data generation for f16 fusion tests
* [GPU] Temporary tolerance increase for failed tests on iGPU
* [GPU] Temporary skip or tolerance increase for failed tests on dGPU
* Enable device properties setting via AUTO plugin.
* Update.
* Update.
* Update.
* Update.
* Update.
* Update.
* Update docs/OV_Runtime_UG/auto_device_selection.md
Co-authored-by: yanlan song <bell.song@intel.com>
* Add for stateful model
* Fix the link
---------
Co-authored-by: yanlan song <bell.song@intel.com>
Co-authored-by: Chen Peter <peter.chen@intel.com>
* Add group_normalization_kernel_selector
* Define group_normalization GPU primitive and its instantiation
* Add GroupNormalization operation builder
* Add test class for GroupNormalization operator
* Add instantiation of GroupNormalization test for GPU Plugin
* Disable GroupNormalizationDecomposition transformation in GPU Plugin
* Add GroupNormalizationKernelRef implementation
* Add GroupNormalization unit tests which cover blocked layout support
* [GPU] enable dynamic loop
- support multiple outputs
- support dynamic loop memory allocation
- support negative num_iterations
- implement calc_output_layouts
- add dynamic loop functional / unit tests
* Fix fail to check memory to set when original 1d data
- follow up code reviews
* Fix unit test failures
* Follow up code review
* Modify concat memory map creation process
* Check whether or not first input of loop is num_iteration_id
* Follow up code review
- refactoring preprocess_backedge_memory
* * Fix ci failures
* Clear custom_outputs_vec for condition
* Add num_outputs for condition and loop
* *Fix constant and param of body network have mismatched layouts
* Set consts.needsBatchInterpretation for const
* * refactoring is_dynamic in loop_inst::execute
* * remove wait_for_events in body_network execution loop
* * Remove redundant events
* * follow-up code review - modify OPENVNO_ASSERT
* * Remove redundant codes in loop_inst::execute
* * add current iteration update nodes into the ov::Model
* * rollback some codes for the performance degradation