[Docs][PyOV] update python snippets (#19367)

* [Docs][PyOV] update python snippets

* first snippet

* Fix samples debug

* Fix linter

* part1

* Fix speech sample

* update model state snippet

* add serialize

* add temp dir

* CPU snippets update (#134)

* snippets CPU 1/6

* snippets CPU 2/6

* snippets CPU 3/6

* snippets CPU 4/6

* snippets CPU 5/6

* snippets CPU 6/6

* make module; TODO: remember about exporting PYTHONPATH on CIs etc.

* Add static model creation in snippets for CPU

* export_comp_model done

* leftovers

* apply comments

* apply comments -- properties

* small fixes

* remove debug info

* return IENetwork instead of Function

* apply comments

* revert precision change in common snippets

* update opset

* [PyOV] Edit docs for the rest of plugins (#136)

* modify main.py

* GNA snippets

* GPU snippets

* AUTO snippets

* MULTI snippets

* HETERO snippets

* Added properties

* update gna

* more samples

* Update docs/OV_Runtime_UG/model_state_intro.md

* Update docs/OV_Runtime_UG/model_state_intro.md

* attempt1 fix ci

* new approach to test

* temporary remove some files from run

* revert cmake changes

* fix ci

* fix snippet

* fix py_exclusive snippet

* fix preprocessing snippet

* clean-up main

* remove numpy installation in gha

* check for GPU

* add logger

* exclude main

* main update

* temp

* Temp2

* Temp2

* temp

* Revert temp

* add property execution devices

* hide output from samples

---------

Co-authored-by: p-wysocki <przemyslaw.wysocki@intel.com>
Co-authored-by: Jan Iwaszkiewicz <jan.iwaszkiewicz@intel.com>
Co-authored-by: Karol Blaszczak <karol.blaszczak@intel.com>
Anastasia Kuporosova 2023-09-13 21:05:24 +02:00 committed by GitHub
parent 4f92676c85
commit 2bf8d910f6
68 changed files with 1223 additions and 892 deletions


@ -600,6 +600,14 @@ jobs:
--ignore=${{ env.INSTALL_TEST_DIR }}/pyopenvino/tests/test_onnx/test_zoo_models.py \ --ignore=${{ env.INSTALL_TEST_DIR }}/pyopenvino/tests/test_onnx/test_zoo_models.py \
--ignore=${{ env.INSTALL_TEST_DIR }}/pyopenvino/tests/test_onnx/test_backend.py --ignore=${{ env.INSTALL_TEST_DIR }}/pyopenvino/tests/test_onnx/test_backend.py
- name: Python API snippets
run: |
source ${{ env.INSTALL_DIR }}/setupvars.sh
export PYTHONPATH=${{ env.INSTALL_TEST_DIR }}:${{ github.workspace }}/openvino/docs/:$PYTHONPATH
export LD_LIBRARY_PATH=${{ env.INSTALL_TEST_DIR }}:$LD_LIBRARY_PATH
python3 ${{ github.workspace }}/openvino/docs/snippets/main.py
- name: Model Optimizer UT - name: Model Optimizer UT
run: | run: |


@ -39,7 +39,6 @@ function(ie_shellcheck_process)
continue() continue()
endif() endif()
get_filename_component(dir_name "${script}" DIRECTORY)
string(REPLACE "${IE_SHELLCHECK_DIRECTORY}" "${CMAKE_BINARY_DIR}/shellcheck" output_file ${script}) string(REPLACE "${IE_SHELLCHECK_DIRECTORY}" "${CMAKE_BINARY_DIR}/shellcheck" output_file ${script})
set(output_file "${output_file}.txt") set(output_file "${output_file}.txt")
get_filename_component(script_name "${script}" NAME) get_filename_component(script_name "${script}" NAME)


@ -20,7 +20,7 @@ nGraph API
.. tab-item:: Python .. tab-item:: Python
:sync: py :sync: py
.. doxygensnippet:: docs/snippets/ngraph.py .. doxygensnippet:: docs/snippets/ngraph_snippet.py
:language: Python :language: Python
:fragment: ngraph:graph :fragment: ngraph:graph


@ -163,10 +163,22 @@ Example of Creating Model OpenVINO API
In the following example, the ``SinkVector`` is used to create the ``ov::Model``. For a model with states, in addition to inputs and outputs, the ``Assign`` nodes should also point to the ``Model`` to avoid their removal during graph transformations. You can do this with the constructor, as shown in the example, or with the special ``add_sinks(const SinkVector& sinks)`` method. A sink can also be removed from ``ov::Model`` with the ``delete_sink()`` method after the node is deleted from the graph.
.. doxygensnippet:: docs/snippets/ov_model_with_state_infer.cpp .. tab-set::
.. tab-item:: C++
:sync: cpp
.. doxygensnippet:: docs/snippets/ov_model_with_state_infer.cpp
:language: cpp :language: cpp
:fragment: [model_create] :fragment: [model_create]
.. tab-item:: Python
:sync: py
.. doxygensnippet:: docs/snippets/ov_model_with_state_infer.py
:language: python
:fragment: ov:model_create
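The ``ov_model_with_state_infer.py`` snippet referenced above is added elsewhere in this commit and is not reproduced in this hunk. A minimal sketch of what its model-creation part might look like, assuming the ``read_value``/``assign`` factory functions and the ``Model(results, sinks, parameters)`` constructor overload (the variable id ``variable0`` and all other names are illustrative)::

    import numpy as np
    import openvino as ov
    import openvino.runtime.opset12 as ops

    # A single input both initializes and feeds the state.
    input_data = ops.parameter([1, 1], dtype=np.float32, name="input")
    read = ops.read_value(input_data, "variable0")   # read the stored state
    add = ops.add(read, input_data)
    save = ops.assign(add, "variable0")              # write the updated value back
    # Pass the Assign node as a sink so graph transformations do not drop it.
    model = ov.Model(results=[ops.result(add)], sinks=[save],
                     parameters=[input_data], name="model_with_state")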
.. _openvino-state-api: .. _openvino-state-api:
OpenVINO State API OpenVINO State API
@ -189,11 +201,23 @@ Based on the IR from the previous section, the example below demonstrates infere
One infer request and one thread will be used in this example. Using several threads is possible if there are several independent sequences. Then, each sequence can be processed in its own infer request. Inference of one sequence in several infer requests is not recommended. In one infer request, a state will be saved automatically between inferences, but if the first step is done in one infer request and the second in another, a state should be set in a new infer request manually (using the ``ov::IVariableState::set_state`` method).
.. tab-set::
.. doxygensnippet:: docs/snippets/ov_model_with_state_infer.cpp .. tab-item:: C++
:sync: cpp
.. doxygensnippet:: docs/snippets/ov_model_with_state_infer.cpp
:language: cpp :language: cpp
:fragment: [part1] :fragment: [part1]
.. tab-item:: Python
:sync: py
.. doxygensnippet:: docs/snippets/ov_model_with_state_infer.py
:language: python
:fragment: ov:part1
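Again, the referenced Python snippet is not reproduced in this hunk. A minimal sketch of the inference part under the same assumptions, showing how a state is kept between ``infer()`` calls on one request and reset via ``query_state()`` before a new sequence::

    import numpy as np
    import openvino as ov
    import openvino.runtime.opset12 as ops

    # The same tiny stateful model as in the sketch above.
    input_data = ops.parameter([1, 1], dtype=np.float32, name="input")
    add = ops.add(ops.read_value(input_data, "variable0"), input_data)
    model = ov.Model(results=[ops.result(add)], sinks=[ops.assign(add, "variable0")],
                     parameters=[input_data], name="model_with_state")

    core = ov.Core()
    compiled_model = core.compile_model(model, "CPU")
    infer_request = compiled_model.create_infer_request()

    # Process one sequence frame by frame; the state is kept automatically
    # between infer() calls issued on the same infer request.
    for frame in np.ones((5, 1, 1), dtype=np.float32):
        results = infer_request.infer({0: frame})

    # Reset all states before starting an independent sequence.
    for state in infer_request.query_state():
        state.reset()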
For more elaborate examples demonstrating how to work with models with states,
refer to the speech sample and a demo in the :doc:`Samples Overview <openvino_docs_OV_UG_Samples_Overview>`.


@ -206,7 +206,7 @@ benchmarking process.
.. code:: ipython3 .. code:: ipython3
import openvino.runtime as ov import openvino as ov
# initialize OpenVINO # initialize OpenVINO
core = ov.Core() core = ov.Core()


@ -193,7 +193,7 @@ benchmarking process.
.. code:: ipython3 .. code:: ipython3
import openvino.runtime as ov import openvino as ov
# initialize OpenVINO # initialize OpenVINO
core = ov.Core() core = ov.Core()


@ -54,7 +54,7 @@ Imports `⇑ <#top>`__
import time import time
import numpy as np import numpy as np
from openvino.runtime import Core, AsyncInferQueue from openvino.runtime import Core, AsyncInferQueue
import openvino.runtime as ov import openvino as ov
from IPython import display from IPython import display
import matplotlib.pyplot as plt import matplotlib.pyplot as plt


@ -54,7 +54,7 @@ Import `⇑ <#top>`__
import cv2 import cv2
import tarfile import tarfile
import matplotlib.pyplot as plt import matplotlib.pyplot as plt
import openvino.runtime as ov import openvino as ov
sys.path.append("../utils") sys.path.append("../utils")
from notebook_utils import download_file, segmentation_map_to_image from notebook_utils import download_file, segmentation_map_to_image


@ -21,7 +21,7 @@ import torch
import openvino import openvino
from sklearn.metrics import accuracy_score from sklearn.metrics import accuracy_score
def validate(model: openvino.runtime.CompiledModel, def validate(model: openvino.CompiledModel,
validation_loader: torch.utils.data.DataLoader) -> float: validation_loader: torch.utils.data.DataLoader) -> float:
predictions = [] predictions = []
references = [] references = []
@ -39,7 +39,7 @@ def validate(model: openvino.runtime.CompiledModel,
#! [validation] #! [validation]
#! [quantization] #! [quantization]
model = ... # openvino.runtime.Model object model = ... # openvino.Model object
quantized_model = nncf.quantize_with_accuracy_control(model, quantized_model = nncf.quantize_with_accuracy_control(model,
calibration_dataset=calibration_dataset, calibration_dataset=calibration_dataset,
@ -50,7 +50,7 @@ quantized_model = nncf.quantize_with_accuracy_control(model,
#! [quantization] #! [quantization]
#! [inference] #! [inference]
import openvino.runtime as ov import openvino as ov
# compile the model to transform quantized operations to int8 # compile the model to transform quantized operations to int8
model_int8 = ov.compile_model(quantized_model) model_int8 = ov.compile_model(quantized_model)


@ -23,7 +23,7 @@ quantized_model = nncf.quantize(model, calibration_dataset)
#! [quantization] #! [quantization]
#! [inference] #! [inference]
import openvino.runtime as ov import openvino as ov
from openvino.tools.mo import convert_model from openvino.tools.mo import convert_model
# convert ONNX model to OpenVINO model # convert ONNX model to OpenVINO model


@ -15,7 +15,7 @@ calibration_dataset = nncf.Dataset(calibration_loader, transform_fn)
#! [dataset] #! [dataset]
#! [quantization] #! [quantization]
import openvino.runtime as ov import openvino as ov
model = ov.Core().read_model("model_path") model = ov.Core().read_model("model_path")
quantized_model = nncf.quantize(model, calibration_dataset) quantized_model = nncf.quantize(model, calibration_dataset)


@ -22,7 +22,7 @@ quantized_model = nncf.quantize(model, calibration_dataset)
#! [quantization] #! [quantization]
#! [inference] #! [inference]
import openvino.runtime as ov import openvino as ov
from openvino.tools.mo import convert_model from openvino.tools.mo import convert_model
# convert TensorFlow model to OpenVINO model # convert TensorFlow model to OpenVINO model


@ -22,7 +22,7 @@ quantized_model = nncf.quantize(model, calibration_dataset)
#! [quantization] #! [quantization]
#! [inference] #! [inference]
import openvino.runtime as ov import openvino as ov
from openvino.tools.mo import convert_model from openvino.tools.mo import convert_model
input_fp32 = ... # FP32 model input input_fp32 = ... # FP32 model input


@ -1,23 +1,23 @@
# Copyright (C) 2018-2023 Intel Corporation # Copyright (C) 2018-2023 Intel Corporation
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
from openvino.runtime import Core, Layout, set_batch import openvino as ov
ov = Core() from utils import get_model, get_image
model = ov.read_model("path/to/model")
model = get_model()
#! [picture_snippet] #! [picture_snippet]
model.reshape([8, 3, 448, 448]) model.reshape([8, 3, 448, 448])
#! [picture_snippet] #! [picture_snippet]
#! [set_batch] #! [set_batch]
model.get_parameters()[0].set_layout(Layout("N...")) model.get_parameters()[0].set_layout(ov.Layout("N..."))
set_batch(model, 5) ov.set_batch(model, 5)
#! [set_batch] #! [set_batch]
#! [simple_spatials_change] #! [simple_spatials_change]
from cv2 import imread image = get_image()
image = imread("path/to/image") model.reshape([1, 3, image.shape[0], image.shape[1]])
model.reshape({1, 3, image.shape[0], image.shape[1]})
#! [simple_spatials_change] #! [simple_spatials_change]
#! [obj_to_shape] #! [obj_to_shape]


@ -0,0 +1,7 @@
# Copyright (C) 2018-2023 Intel Corporation
# SPDX-License-Identifier: Apache-2.0
from .utils import get_image
from .utils import get_model
from .utils import get_ngraph_model
from .utils import get_path_to_model


@ -2,22 +2,24 @@
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
from openvino.runtime import Core import openvino as ov
from snippets import get_model
model = get_model()
#! [part0] #! [part0]
core = Core() core = ov.Core()
cpu_optimization_capabilities = core.get_property("CPU", "OPTIMIZATION_CAPABILITIES") cpu_optimization_capabilities = core.get_property("CPU", ov.properties.device.capabilities())
#! [part0] #! [part0]
# TODO: enable part1 when property api will be supported in python
#! [part1] #! [part1]
core = Core() core = ov.Core()
model = core.read_model("model.xml")
compiled_model = core.compile_model(model, "CPU") compiled_model = core.compile_model(model, "CPU")
inference_precision = core.get_property("CPU", "INFERENCE_PRECISION_HINT") inference_precision = core.get_property("CPU", ov.properties.hint.inference_precision())
#! [part1] #! [part1]
#! [part2] #! [part2]
core = Core() core = ov.Core()
core.set_property("CPU", {"INFERENCE_PRECISION_HINT": "f32"}) core.set_property("CPU", {ov.properties.hint.inference_precision(): ov.Type.f32})
#! [part2] #! [part2]


@ -0,0 +1,2 @@
# Copyright (C) 2018-2023 Intel Corporation
# SPDX-License-Identifier: Apache-2.0


@ -1,17 +1,23 @@
# Copyright (C) 2022 Intel Corporation # Copyright (C) 2018-2023 Intel Corporation
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
from snippets import get_model
#! [compile_model_default]
from openvino.runtime import Core
core = Core() def main():
model = core.read_model("model.xml") model = get_model()
compiled_model = core.compile_model(model, "CPU")
#! [compile_model_default]
#! [compile_model_multi] #! [compile_model_default]
core = Core() import openvino as ov
model = core.read_model("model.xml")
compiled_model = core.compile_model(model, "MULTI:CPU,GPU.0") core = ov.Core()
#! [compile_model_multi] compiled_model = core.compile_model(model, "CPU")
#! [compile_model_default]
if "GPU" not in core.available_devices:
return 0
#! [compile_model_multi]
core = ov.Core()
compiled_model = core.compile_model(model, "MULTI:CPU,GPU.0")
#! [compile_model_multi]


@ -1,11 +1,13 @@
# Copyright (C) 2022 Intel Corporation # Copyright (C) 2018-2023 Intel Corporation
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
from snippets import get_model
from openvino.runtime import Core model = get_model()
#! [static_shape] #! [static_shape]
core = Core() import openvino as ov
model = core.read_model("model.xml")
core = ov.Core()
model.reshape([10, 20, 30, 40]) model.reshape([10, 20, 30, 40])
#! [static_shape] #! [static_shape]


@ -2,28 +2,47 @@
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
# #
import openvino.runtime as ov from openvino import Core, properties
from openvino.runtime import Core, Type, OVAny, properties from snippets import get_model
model = get_model()
device_name = "CPU"
core = Core()
core.set_property("CPU", properties.intel_cpu.sparse_weights_decompression_rate(0.8))
device_name = 'CPU'
xml_path = 'model.xml'
core = ov.Core()
core.set_property("CPU", ov.properties.intel_cpu.sparse_weights_decompression_rate(0.8))
model = core.read_model(model=xml_path)
# ! [ov:intel_cpu:multi_threading:part0] # ! [ov:intel_cpu:multi_threading:part0]
# Use one logical processor for inference # Use one logical processor for inference
compiled_model_1 = core.compile_model(model=model, device_name=device_name, config={properties.inference_num_threads(1)}) compiled_model_1 = core.compile_model(
model=model,
device_name=device_name,
config={properties.inference_num_threads(): 1},
)
# Use logical processors of Efficient-cores for inference on hybrid platform # Use logical processors of Efficient-cores for inference on hybrid platform
compiled_model_2 = core.compile_model(model=model, device_name=device_name, config={properties.hint.scheduling_core_type(properties.hint.SchedulingCoreType.ECORE_ONLY)}) compiled_model_2 = core.compile_model(
model=model,
device_name=device_name,
config={
properties.hint.scheduling_core_type(): properties.hint.SchedulingCoreType.ECORE_ONLY,
},
)
# Use one logical processor per CPU core for inference when hyper threading is on # Use one logical processor per CPU core for inference when hyper threading is on
compiled_model_3 = core.compile_model(model=model, device_name=device_name, config={properties.hint.enable_hyper_threading(False)}) compiled_model_3 = core.compile_model(
model=model,
device_name=device_name,
config={properties.hint.enable_hyper_threading(): False},
)
# ! [ov:intel_cpu:multi_threading:part0] # ! [ov:intel_cpu:multi_threading:part0]
# ! [ov:intel_cpu:multi_threading:part1] # ! [ov:intel_cpu:multi_threading:part1]
# Disable CPU threads pinning for inference when system supoprt it # Disable CPU threads pinning for inference when system supoprt it
compiled_model_4 = core.compile_model(model=model, device_name=device_name, config={properties.hint.enable_cpu_pinning(False)}) compiled_model_4 = core.compile_model(
model=model,
device_name=device_name,
config={properties.hint.enable_cpu_pinning(): False},
)
# ! [ov:intel_cpu:multi_threading:part1] # ! [ov:intel_cpu:multi_threading:part1]
assert compiled_model_1 assert compiled_model_1
assert compiled_model_2 assert compiled_model_2


@ -1,12 +1,18 @@
# Copyright (C) 2023 Intel Corporation # Copyright (C) 2023 Intel Corporation
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
from openvino.runtime import Core #! [ov:execution_mode:part0]
import openvino as ov
#! [ov:execution_mode:part0] core = ov.Core()
core = Core()
# in case of Accuracy # in case of Accuracy
core.set_property("CPU", {"EXECUTION_MODE_HINT": "ACCURACY"}) core.set_property(
"CPU",
{ov.properties.hint.execution_mode(): ov.properties.hint.ExecutionMode.ACCURACY},
)
# in case of Performance # in case of Performance
core.set_property("CPU", {"EXECUTION_MODE_HINT": "PERFORMANCE"}) core.set_property(
"CPU",
{ov.properties.hint.execution_mode(): ov.properties.hint.ExecutionMode.PERFORMANCE},
)
#! [ov:execution_mode:part0] #! [ov:execution_mode:part0]


@ -2,14 +2,16 @@
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
# #
import openvino.runtime as ov import openvino as ov
from snippets import get_model
device_name = 'CPU' model = get_model()
xml_path = 'model.xml'
device_name = "CPU"
xml_path = "model.xml"
# ! [ov:intel_cpu:sparse_weights_decompression:part0] # ! [ov:intel_cpu:sparse_weights_decompression:part0]
core = ov.Core() core = ov.Core()
core.set_property("CPU", ov.properties.intel_cpu.sparse_weights_decompression_rate(0.8)) core.set_property("CPU", ov.properties.intel_cpu.sparse_weights_decompression_rate(0.8))
model = core.read_model(model=xml_path)
compiled_model = core.compile_model(model=model, device_name=device_name) compiled_model = core.compile_model(model=model, device_name=device_name)
# ! [ov:intel_cpu:sparse_weights_decompression:part0] # ! [ov:intel_cpu:sparse_weights_decompression:part0]
assert compiled_model assert compiled_model


@ -1,7 +1,19 @@
#! [export_compiled_model] # Copyright (C) 2018-2023 Intel Corporation
# SPDX-License-Identifier: Apache-2.0
#
from openvino.runtime import Core from utils import get_path_to_model
ov.Core().compile_model(device, modelPath, properties).export_model(compiled_blob) device = "CPU"
model_path = get_path_to_model()
properties = {}
#! [export_compiled_model] #! [export_compiled_model]
import openvino as ov
core = ov.Core()
compiled_model = core.compile_model(model_path, device, properties)
output_stream = compiled_model.export_model()
#! [export_compiled_model]


@ -0,0 +1,2 @@
# Copyright (C) 2023 Intel Corporation
# SPDX-License-Identifier: Apache-2.0


@ -2,14 +2,17 @@
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
#! [import] #! [import]
from openvino.runtime import Core import openvino as ov
#! [import] #! [import]
model_path = "model.xml" from snippets import get_model
model = get_model()
# TODO: no GNA properties to replace strings
#! [ov_gna_exec_mode_hw_with_sw_fback] #! [ov_gna_exec_mode_hw_with_sw_fback]
core = Core() core = ov.Core()
model = core.read_model(model=model_path) compiled_model = core.compile_model(
compiled_model = core.compile_model(model, device_name="GNA", model, device_name="GNA", config={"GNA_DEVICE_MODE": "GNA_HW_WITH_SW_FBACK"}
config={ 'GNA_DEVICE_MODE' : 'GNA_HW_WITH_SW_FBACK'}) )
#! [ov_gna_exec_mode_hw_with_sw_fback] #! [ov_gna_exec_mode_hw_with_sw_fback]


@ -2,25 +2,26 @@
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
#! [import] #! [import]
from openvino.runtime import Core import openvino as ov
from io import BytesIO from io import BytesIO
#! [import] #! [import]
model_path = "model.xml" from snippets import get_model
model = get_model()
blob_path = "compiled_model.blob" blob_path = "compiled_model.blob"
core = Core() core = ov.Core()
model = core.read_model(model=model_path)
compiled_model = core.compile_model(model, device_name="GNA") compiled_model = core.compile_model(model, device_name="GNA")
#! [ov_gna_export] #! [ov_gna_export]
user_stream = compiled_model.export_model() user_stream = compiled_model.export_model()
with open(blob_path, 'wb') as f: with open(blob_path, "wb") as f:
f.write(user_stream) f.write(user_stream)
#! [ov_gna_export] #! [ov_gna_export]
# [ov_gna_import] # [ov_gna_import]
with open(blob_path, 'rb') as f: with open(blob_path, "rb") as f:
buf = BytesIO(f.read()) buf = BytesIO(f.read())
compiled_model = core.import_model(buf, device_name="GNA") compiled_model = core.import_model(buf, device_name="GNA")
# [ov_gna_import] # [ov_gna_import]


@ -2,20 +2,21 @@
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
#! [import] #! [import]
from openvino.runtime import Core, set_batch import openvino as ov
from openvino.preprocess import PrePostProcessor
#! [import] #! [import]
model_path = "model.xml" from snippets import get_path_to_model
batch_size = 8 batch_size = 8
model_path = get_path_to_model([1, 32])
#! [ov_gna_read_model] #! [ov_gna_read_model]
core = Core() core = ov.Core()
model = core.read_model(model=model_path) model = core.read_model(model=model_path)
#! [ov_gna_read_model] #! [ov_gna_read_model]
#! [ov_gna_set_nc_layout] #! [ov_gna_set_nc_layout]
ppp = PrePostProcessor(model) ppp = ov.preprocess.PrePostProcessor(model)
for i in range(len(model.inputs)): for i in range(len(model.inputs)):
input_name = model.input(i).get_any_name() input_name = model.input(i).get_any_name()
ppp.input(i).model().set_layout("N?") ppp.input(i).model().set_layout("N?")
@ -23,5 +24,5 @@ model = ppp.build()
#! [ov_gna_set_nc_layout] #! [ov_gna_set_nc_layout]
#! [ov_gna_set_batch_size] #! [ov_gna_set_batch_size]
set_batch(model, batch_size) ov.set_batch(model, batch_size)
#! [ov_gna_set_batch_size] #! [ov_gna_set_batch_size]


@ -0,0 +1,2 @@
# Copyright (C) 2023 Intel Corporation
# SPDX-License-Identifier: Apache-2.0


@ -2,40 +2,49 @@
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
from openvino.runtime import Core import openvino as ov
from snippets import get_model
#! [compile_model_default_gpu]
core = Core()
model = core.read_model("model.xml")
compiled_model = core.compile_model(model, "GPU")
#! [compile_model_default_gpu]
#! [compile_model_gpu_with_id] def main():
core = Core() model = get_model()
model = core.read_model("model.xml")
compiled_model = core.compile_model(model, "GPU.1")
#! [compile_model_gpu_with_id]
#! [compile_model_gpu_with_id_and_tile] core = ov.Core()
core = Core() if "GPU" not in core.available_devices:
model = core.read_model("model.xml") return 0
compiled_model = core.compile_model(model, "GPU.1.0")
#! [compile_model_gpu_with_id_and_tile]
#! [compile_model_multi] #! [compile_model_default_gpu]
core = Core() core = ov.Core()
model = core.read_model("model.xml") compiled_model = core.compile_model(model, "GPU")
compiled_model = core.compile_model(model, "MULTI:GPU.1,GPU.0") #! [compile_model_default_gpu]
#! [compile_model_multi]
#! [compile_model_batch_plugin] #! [compile_model_gpu_with_id]
core = Core() core = ov.Core()
model = core.read_model("model.xml") compiled_model = core.compile_model(model, "GPU.1")
compiled_model = core.compile_model(model, "BATCH:GPU") #! [compile_model_gpu_with_id]
#! [compile_model_batch_plugin]
#! [compile_model_auto_batch] #! [compile_model_gpu_with_id_and_tile]
core = Core() core = ov.Core()
model = core.read_model("model.xml") compiled_model = core.compile_model(model, "GPU.1.0")
compiled_model = core.compile_model(model, "GPU", {"PERFORMANCE_HINT": "THROUGHPUT"}) #! [compile_model_gpu_with_id_and_tile]
#! [compile_model_auto_batch]
#! [compile_model_multi]
core = ov.Core()
compiled_model = core.compile_model(model, "MULTI:GPU.1,GPU.0")
#! [compile_model_multi]
#! [compile_model_batch_plugin]
core = ov.Core()
compiled_model = core.compile_model(model, "BATCH:GPU")
#! [compile_model_batch_plugin]
#! [compile_model_auto_batch]
core = ov.Core()
compiled_model = core.compile_model(
model,
"GPU",
{
ov.properties.hint.performance_mode(): ov.properties.hint.PerformanceMode.THROUGHPUT,
},
)
#! [compile_model_auto_batch]


@ -1,10 +1,17 @@
# Copyright (C) 2022 Intel Corporation # Copyright (C) 2022 Intel Corporation
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
import openvino as ov
from openvino.runtime import Core
#! [part0] def main():
core = Core() core = ov.Core()
core.set_property("GPU", {"CONFIG_FILE": "<path_to_the_xml_file>"})
#! [part0] if "GPU" not in core.available_devices:
return 0
# TODO: missing property to replace string
#! [part0]
core = ov.Core()
core.set_property("GPU", {"CONFIG_FILE": "<path_to_the_xml_file>"})
#! [part0]


@ -2,27 +2,38 @@
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
import openvino.runtime as ov import openvino as ov
from snippets import get_model
#! [dynamic_batch]
core = ov.Core()
C = 3 def main():
H = 224 model = get_model()
W = 224
model = core.read_model("model.xml") core = ov.Core()
model.reshape([(1, 10), C, H, W]) if "GPU" not in core.available_devices:
return 0
# compile model and create infer request #! [dynamic_batch]
compiled_model = core.compile_model(model, "GPU") core = ov.Core()
infer_request = compiled_model.create_infer_request()
# create input tensor with specific batch size C = 3
input_tensor = ov.Tensor(model.input().element_type, [2, C, H, W]) H = 224
W = 224
# ... model.reshape([(1, 10), C, H, W])
infer_request.infer([input_tensor]) # compile model and create infer request
compiled_model = core.compile_model(model, "GPU")
infer_request = compiled_model.create_infer_request()
#! [dynamic_batch] # create input tensor with specific batch size
input_tensor = ov.Tensor(model.input().element_type, [2, C, H, W])
# ...
results = infer_request.infer([input_tensor])
#! [dynamic_batch]
assert list(results.keys())[0].partial_shape == ov.PartialShape([(1, 10), 3, 224, 224])
assert list(results.values())[0].shape == tuple(ov.Shape([2, 3, 224, 224]))


@ -1,18 +1,31 @@
# Copyright (C) 2022 Intel Corporation # Copyright (C) 2022 Intel Corporation
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
#! [init_preproc] from snippets import get_model
from openvino.runtime import Core, Type, Layout import openvino as ov
from openvino.preprocess import PrePostProcessor, ColorFormat
core = Core()
model = core.read_model("model.xml")
p = PrePostProcessor(model) def init_preproc():
p.input().tensor().set_element_type(Type.u8) \ model = get_model()
.set_color_format(ColorFormat.NV12_TWO_PLANES, ["y", "uv"]) \ #! [init_preproc]
.set_memory_type("GPU_SURFACE") import openvino as ov
p.input().preprocess().convert_color(ColorFormat.BGR) from openvino.preprocess import PrePostProcessor, ColorFormat
p.input().model().set_layout(Layout("NCHW"))
model_with_preproc = p.build() core = ov.Core()
#! [init_preproc]
p = PrePostProcessor(model)
p.input().tensor().set_element_type(ov.Type.u8).set_color_format(
ColorFormat.NV12_TWO_PLANES, ["y", "uv"]
).set_memory_type("GPU_SURFACE")
p.input().preprocess().convert_color(ColorFormat.BGR)
p.input().model().set_layout(ov.Layout("NCHW"))
model_with_preproc = p.build()
#! [init_preproc]
def main():
core = ov.Core()
if "GPU" not in core.available_devices:
return 0
init_preproc()


@ -2,6 +2,7 @@
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
# #
from utils import get_path_to_model, get_image, get_path_to_extension_library
#! [ie:create_core] #! [ie:create_core]
import numpy as np import numpy as np
@ -9,8 +10,10 @@ import openvino.inference_engine as ie
core = ie.IECore() core = ie.IECore()
#! [ie:create_core] #! [ie:create_core]
model_path = get_path_to_model(True)
#! [ie:read_model] #! [ie:read_model]
network = core.read_network("model.xml") network = core.read_network(model_path)
#! [ie:read_model] #! [ie:read_model]
#! [ie:compile_model] #! [ie:compile_model]
@ -26,7 +29,7 @@ exec_network = core.load_network(network, "CPU", num_requests=4)
infer_request = exec_network.requests[0] infer_request = exec_network.requests[0]
# Get input blobs mapped to input layers names # Get input blobs mapped to input layers names
input_blobs = infer_request.input_blobs input_blobs = infer_request.input_blobs
data = input_blobs["data1"].buffer data = input_blobs["data"].buffer
# Original I64 precision was converted to I32 # Original I64 precision was converted to I32
assert data.dtype == np.int32 assert data.dtype == np.int32
# Fill the first blob ... # Fill the first blob ...
@ -36,7 +39,7 @@ assert data.dtype == np.int32
results = infer_request.infer() results = infer_request.infer()
#! [ie:inference] #! [ie:inference]
input_data = iter(list()) input_data = get_image()
def process_results(results, frame_id): def process_results(results, frame_id):
pass pass
@ -67,11 +70,11 @@ for infer_request in exec_network.requests:
total_frames = 100 total_frames = 100
for _ in range(total_frames): for _ in range(total_frames):
# Wait for at least one free request # Wait for at least one free request
exec_network.wait(num_request=1) exec_network.wait(num_requests=1)
# Get idle id # Get idle id
idle_id = exec_network.get_idle_request_id() idle_id = exec_network.get_idle_request_id()
# Start asynchronous inference on idle request # Start asynchronous inference on idle request
exec_network.start_async(request_id=idle_id, inputs=next(input_data)) exec_network.start_async(request_id=idle_id, inputs={"data": input_data})
# Wait for all requests to complete # Wait for all requests to complete
exec_network.wait() exec_network.wait()
#! [ie:start_async_and_wait] #! [ie:start_async_and_wait]
@ -79,12 +82,13 @@ exec_network.wait()
#! [ie:get_output_tensor] #! [ie:get_output_tensor]
# Get output blobs mapped to output layers names # Get output blobs mapped to output layers names
output_blobs = infer_request.output_blobs output_blobs = infer_request.output_blobs
data = output_blobs["out1"].buffer data = output_blobs["relu"].buffer
# Original I64 precision was converted to I32 # Original I64 precision was converted to I32
assert data.dtype == np.int32 assert data.dtype == np.int32
# Process output data # Process output data
#! [ie:get_output_tensor] #! [ie:get_output_tensor]
path_to_extension_library = get_path_to_extension_library()
#! [ie:load_old_extension] #! [ie:load_old_extension]
core.add_extension("path_to_extension_library.so", "CPU") core.add_extension(path_to_extension_library, "CPU")
#! [ie:load_old_extension] #! [ie:load_old_extension]

docs/snippets/main.py (new file, 44 lines)

@ -0,0 +1,44 @@
# Copyright (C) 2018-2023 Intel Corporation
# SPDX-License-Identifier: Apache-2.0
#
import sys
import os
import io
import importlib
from contextlib import redirect_stdout, redirect_stderr
skip_snippets = ["main.py", "__init__.py", "utils.py", "ie_common.py", "ov_common.py"]
def import_python_modules(directory, subdirectory=""):
for item in os.listdir(directory):
if item.endswith('.py') and item not in skip_snippets:
imported_item = item[:-3]
if subdirectory != "":
imported_item=subdirectory + "." + imported_item
print(f"Snippet {item} is executing...")
with redirect_stdout(io.StringIO()), redirect_stderr(io.StringIO()):
mod = importlib.import_module(imported_item)
try:
with redirect_stdout(io.StringIO()), redirect_stderr(io.StringIO()):
mod.main()
except AttributeError as e:
pass
print(f"Snippet {item} successfully executed.")
if os.path.isdir(os.path.join(directory, item)):
dir_path = os.path.join(directory, item)
import_python_modules(dir_path, item)
def main():
dir_path = os.path.dirname(os.path.realpath(__file__))
import_python_modules(dir_path)
if __name__ == "__main__":
sys.exit(main() or 0)


@ -1,15 +1,15 @@
import sys import sys
from openvino.runtime import Core import openvino as ov
from openvino.inference_engine import IECore from openvino.inference_engine import IECore
model_path = "/openvino_CI_CD/result/install_pkg/tests/test_model_zoo/core/models/ir/add_abc.xml" from utils import get_model, get_ngraph_model
path_to_model = "/openvino_CI_CD/result/install_pkg/tests/test_model_zoo/core/models/ir/add_abc.xml"
model = get_model()
net = get_ngraph_model()
def part0(): def part0():
#! [part0] #! [part0]
core = Core() core = ov.Core()
# Read a network in IR, PaddlePaddle, or ONNX format:
model = core.read_model(model_path)
# compile a model on AUTO using the default list of device candidates. # compile a model on AUTO using the default list of device candidates.
# The following lines are equivalent: # The following lines are equivalent:
@ -20,21 +20,25 @@ def part0():
# You can also specify the devices to be used by AUTO. # You can also specify the devices to be used by AUTO.
# The following lines are equivalent: # The following lines are equivalent:
compiled_model = core.compile_model(model=model, device_name="AUTO:GPU,CPU") compiled_model = core.compile_model(model=model, device_name="AUTO:GPU,CPU")
compiled_model = core.compile_model(model=model, device_name="AUTO", config={"MULTI_DEVICE_PRIORITIES": "GPU,CPU"}) compiled_model = core.compile_model(
model=model,
device_name="AUTO",
config={ov.properties.device.priorities(): "GPU,CPU"},
)
# Optional # Optional
# the AUTO plugin is pre-configured (globally) with the explicit option: # the AUTO plugin is pre-configured (globally) with the explicit option:
core.set_property(device_name="AUTO", properties={"MULTI_DEVICE_PRIORITIES":"GPU,CPU"}) core.set_property(
#! [part0] device_name="AUTO", properties={ov.properties.device.priorities(): "GPU,CPU"}
)
#! [part0]
def part1(): def part1():
#! [part1] #! [part1]
### IE API ### ### IE API ###
ie = IECore() ie = IECore()
# Read a network in IR, PaddlePaddle, or ONNX format:
net = ie.read_network(model=path_to_model)
# Load a network to AUTO using the default list of device candidates. # Load a network to AUTO using the default list of device candidates.
# The following lines are equivalent: # The following lines are equivalent:
exec_net = ie.load_network(network=net) exec_net = ie.load_network(network=net)
@ -45,87 +49,148 @@ def part1():
# You can also specify the devices to be used by AUTO in its selection process. # You can also specify the devices to be used by AUTO in its selection process.
# The following lines are equivalent: # The following lines are equivalent:
exec_net = ie.load_network(network=net, device_name="AUTO:GPU,CPU") exec_net = ie.load_network(network=net, device_name="AUTO:GPU,CPU")
exec_net = ie.load_network(network=net, device_name="AUTO", config={"MULTI_DEVICE_PRIORITIES": "GPU,CPU"}) exec_net = ie.load_network(
network=net,
device_name="AUTO",
config={ov.properties.device.priorities(): "GPU,CPU"},
)
# Optional # Optional
# the AUTO plugin is pre-configured (globally) with the explicit option: # the AUTO plugin is pre-configured (globally) with the explicit option:
ie.set_config(config={"MULTI_DEVICE_PRIORITIES":"GPU,CPU"}, device_name="AUTO") ie.set_config(
#! [part1] config={ov.properties.device.priorities(): "GPU,CPU"}, device_name="AUTO"
)
#! [part1]
def part3(): def part3():
#! [part3] #! [part3]
core = Core() core = ov.Core()
# Read a network in IR, PaddlePaddle, or ONNX format:
model = core.read_model(model_path)
# Compile a model on AUTO with Performance Hints enabled: # Compile a model on AUTO with Performance Hints enabled:
# To use the “THROUGHPUT” mode: # To use the “THROUGHPUT” mode:
compiled_model = core.compile_model(model=model, device_name="AUTO", config={"PERFORMANCE_HINT":"THROUGHPUT"}) compiled_model = core.compile_model(
model=model,
device_name="AUTO",
config={
ov.properties.hint.performance_mode(): ov.properties.hint.PerformanceMode.THROUGHPUT
},
)
# To use the “LATENCY” mode: # To use the “LATENCY” mode:
compiled_model = core.compile_model(model=model, device_name="AUTO", config={"PERFORMANCE_HINT":"LATENCY"}) compiled_model = core.compile_model(
model=model,
device_name="AUTO",
config={
ov.properties.hint.performance_mode(): ov.properties.hint.PerformanceMode.LATENCY
},
)
# To use the “CUMULATIVE_THROUGHPUT” mode: # To use the “CUMULATIVE_THROUGHPUT” mode:
compiled_model = core.compile_model(model=model, device_name="AUTO", config={"PERFORMANCE_HINT":"CUMULATIVE_THROUGHPUT"}) compiled_model = core.compile_model(
#! [part3] model=model,
device_name="AUTO",
config={
ov.properties.hint.performance_mode(): ov.properties.hint.PerformanceMode.CUMULATIVE_THROUGHPUT
},
)
#! [part3]
def part4(): def part4():
#! [part4] #! [part4]
core = Core() core = ov.Core()
model = core.read_model(model_path)
# Example 1 # Example 1
compiled_model0 = core.compile_model(model=model, device_name="AUTO", config={"MODEL_PRIORITY":"HIGH"}) compiled_model0 = core.compile_model(
compiled_model1 = core.compile_model(model=model, device_name="AUTO", config={"MODEL_PRIORITY":"MEDIUM"}) model=model,
compiled_model2 = core.compile_model(model=model, device_name="AUTO", config={"MODEL_PRIORITY":"LOW"}) device_name="AUTO",
config={ov.properties.hint.model_priority(): ov.properties.hint.Priority.HIGH},
)
compiled_model1 = core.compile_model(
model=model,
device_name="AUTO",
config={
ov.properties.hint.model_priority(): ov.properties.hint.Priority.MEDIUM
},
)
compiled_model2 = core.compile_model(
model=model,
device_name="AUTO",
config={ov.properties.hint.model_priority(): ov.properties.hint.Priority.LOW},
)
# Assume that all the devices (CPU and GPUs) can support all the networks. # Assume that all the devices (CPU and GPUs) can support all the networks.
# Result: compiled_model0 will use GPU.1, compiled_model1 will use GPU.0, compiled_model2 will use CPU. # Result: compiled_model0 will use GPU.1, compiled_model1 will use GPU.0, compiled_model2 will use CPU.
# Example 2 # Example 2
compiled_model3 = core.compile_model(model=model, device_name="AUTO", config={"MODEL_PRIORITY":"HIGH"}) compiled_model3 = core.compile_model(
compiled_model4 = core.compile_model(model=model, device_name="AUTO", config={"MODEL_PRIORITY":"MEDIUM"}) model=model,
compiled_model5 = core.compile_model(model=model, device_name="AUTO", config={"MODEL_PRIORITY":"LOW"}) device_name="AUTO",
config={ov.properties.hint.model_priority(): ov.properties.hint.Priority.HIGH},
)
compiled_model4 = core.compile_model(
model=model,
device_name="AUTO",
config={
ov.properties.hint.model_priority(): ov.properties.hint.Priority.MEDIUM
},
)
compiled_model5 = core.compile_model(
model=model,
device_name="AUTO",
config={ov.properties.hint.model_priority(): ov.properties.hint.Priority.LOW},
)
# Assume that all the devices (CPU ang GPUs) can support all the networks. # Assume that all the devices (CPU ang GPUs) can support all the networks.
# Result: compiled_model3 will use GPU.1, compiled_model4 will use GPU.1, compiled_model5 will use GPU.0. # Result: compiled_model3 will use GPU.1, compiled_model4 will use GPU.1, compiled_model5 will use GPU.0.
#! [part4] #! [part4]
def part5(): def part5():
#! [part5] #! [part5]
core = Core() core = ov.Core()
model = core.read_model(model_path)
# gpu_config and cpu_config will load during compile_model() # gpu_config and cpu_config will load during compile_model()
compiled_model = core.compile_model(model=model) compiled_model = core.compile_model(model=model)
compiled_model = core.compile_model(model=model, device_name="AUTO") compiled_model = core.compile_model(model=model, device_name="AUTO")
#! [part5] #! [part5]
def part6(): def part6():
#! [part6] #! [part6]
core = Core() core = ov.Core()
# read a network in IR, PaddlePaddle, or ONNX format
model = core.read_model(model_path)
# compile a model on AUTO and set log level to debug # compile a model on AUTO and set log level to debug
compiled_model = core.compile_model(model=model, device_name="AUTO", config={"LOG_LEVEL":"LOG_DEBUG"}) compiled_model = core.compile_model(
model=model,
device_name="AUTO",
config={ov.properties.log.level(): ov.properties.log.Level.DEBUG},
)
# set log level with set_property and compile model # set log level with set_property and compile model
core.set_property(device_name="AUTO", properties={"LOG_LEVEL":"LOG_DEBUG"}) core.set_property(
device_name="AUTO",
properties={ov.properties.log.level(): ov.properties.log.Level.DEBUG},
)
compiled_model = core.compile_model(model=model, device_name="AUTO") compiled_model = core.compile_model(model=model, device_name="AUTO")
#! [part6] #! [part6]
def part7(): def part7():
#! [part7] #! [part7]
core = Core() core = ov.Core()
# read a network in IR, PaddlePaddle, or ONNX format
model = core.read_model(model_path)
# compile a model on AUTO and set log level to debug # compile a model on AUTO and set log level to debug
compiled_model = core.compile_model(model=model, device_name="AUTO") compiled_model = core.compile_model(model=model, device_name="AUTO")
# query the runtime target devices on which the inferences are being executed # query the runtime target devices on which the inferences are being executed
execution_devices = compiled_model.get_property("EXECUTION_DEVICES") execution_devices = compiled_model.get_property(ov.properties.execution_devices())
#! [part7] #! [part7]
def main(): def main():
part0()
part1()
part3() part3()
part4() part4()
part5() part5()
part6() part6()
part7() part7()
core = ov.Core()
if __name__ == '__main__': if "GPU" not in core.available_devices:
sys.exit(main()) return 0
part0()
part1()


@ -1,41 +1,55 @@
from openvino.runtime import Core # Copyright (C) 2018-2023 Intel Corporation
# SPDX-License-Identifier: Apache-2.0
#
core = Core() import openvino as ov
model = core.read_model(model="sample.xml") from utils import get_model
# [compile_model]
config = {"PERFORMANCE_HINT": "THROUGHPUT"}
compiled_model = core.compile_model(model, "GPU", config)
# [compile_model]
# [compile_model_no_auto_batching] def main():
# disabling the automatic batching
# leaving intact other configurations options that the device selects for the 'throughput' hint
config = {"PERFORMANCE_HINT": "THROUGHPUT",
"ALLOW_AUTO_BATCHING": False}
compiled_model = core.compile_model(model, "GPU", config)
# [compile_model_no_auto_batching]
# [query_optimal_num_requests] core = ov.Core()
# when the batch size is automatically selected by the implementation model = get_model()
# it is important to query/create and run the sufficient requests
config = {"PERFORMANCE_HINT": "THROUGHPUT"}
compiled_model = core.compile_model(model, "GPU", config)
num_requests = compiled_model.get_property("OPTIMAL_NUMBER_OF_INFER_REQUESTS")
# [query_optimal_num_requests]
# [hint_num_requests] if "GPU" not in core.available_devices:
config = {"PERFORMANCE_HINT": "THROUGHPUT", return 0
"PERFORMANCE_HINT_NUM_REQUESTS": "4"}
# limiting the available parallel slack for the 'throughput'
# so that certain parameters (like selected batch size) are automatically accommodated accordingly
compiled_model = core.compile_model(model, "GPU", config)
# [hint_num_requests]
# [hint_plus_low_level] # [compile_model]
config = {"PERFORMANCE_HINT": "THROUGHPUT", import openvino.runtime.properties as props
"INFERENCE_NUM_THREADS": "4"} import openvino.runtime.properties.hint as hints
# limiting the available parallel slack for the 'throughput'
# so that certain parameters (like selected batch size) are automatically accommodated accordingly config = {hints.performance_mode(): hints.PerformanceMode.THROUGHPUT}
compiled_model = core.compile_model(model, "CPU", config) compiled_model = core.compile_model(model, "GPU", config)
# [hint_plus_low_level]] # [compile_model]
# [compile_model_no_auto_batching]
# disabling the automatic batching
# leaving intact other configurations options that the device selects for the 'throughput' hint
config = {hints.performance_mode(): hints.PerformanceMode.THROUGHPUT,
hints.allow_auto_batching(): False}
compiled_model = core.compile_model(model, "GPU", config)
# [compile_model_no_auto_batching]
# [query_optimal_num_requests]
# when the batch size is automatically selected by the implementation
# it is important to query/create and run the sufficient requests
config = {hints.performance_mode(): hints.PerformanceMode.THROUGHPUT}
compiled_model = core.compile_model(model, "GPU", config)
num_requests = compiled_model.get_property(props.optimal_number_of_infer_requests())
# [query_optimal_num_requests]
# [hint_num_requests]
config = {hints.performance_mode(): hints.PerformanceMode.THROUGHPUT,
hints.num_requests(): "4"}
# limiting the available parallel slack for the 'throughput'
# so that certain parameters (like selected batch size) are automatically accommodated accordingly
compiled_model = core.compile_model(model, "GPU", config)
# [hint_num_requests]
# [hint_plus_low_level]
config = {hints.performance_mode(): hints.PerformanceMode.THROUGHPUT,
props.inference_num_threads(): "4"}
# limiting the available parallel slack for the 'throughput'
# so that certain parameters (like selected batch size) are automatically accommodated accordingly
compiled_model = core.compile_model(model, "CPU", config)
# [hint_plus_low_level]]


@ -2,35 +2,39 @@
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
# #
from openvino.runtime import Core from utils import get_path_to_model, get_temp_dir
import openvino as ov
import openvino.runtime.properties as props
device_name = 'GNA' device_name = 'GNA'
xml_path = '/tmp/myModel.xml' model_path = get_path_to_model()
path_to_cache_dir = get_temp_dir()
# ! [ov:caching:part0] # ! [ov:caching:part0]
core = Core() core = ov.Core()
core.set_property({'CACHE_DIR': '/path/to/cache/dir'}) core.set_property({props.cache_dir(): path_to_cache_dir})
model = core.read_model(model=xml_path) model = core.read_model(model=model_path)
compiled_model = core.compile_model(model=model, device_name=device_name) compiled_model = core.compile_model(model=model, device_name=device_name)
# ! [ov:caching:part0] # ! [ov:caching:part0]
assert compiled_model assert compiled_model
# ! [ov:caching:part1] # ! [ov:caching:part1]
core = Core() core = ov.Core()
compiled_model = core.compile_model(model_path=xml_path, device_name=device_name) compiled_model = core.compile_model(model=model_path, device_name=device_name)
# ! [ov:caching:part1] # ! [ov:caching:part1]
assert compiled_model assert compiled_model
# ! [ov:caching:part2] # ! [ov:caching:part2]
core = Core() core = ov.Core()
core.set_property({'CACHE_DIR': '/path/to/cache/dir'}) core.set_property({props.cache_dir(): path_to_cache_dir})
compiled_model = core.compile_model(model_path=xml_path, device_name=device_name) compiled_model = core.compile_model(model=model_path, device_name=device_name)
# ! [ov:caching:part2] # ! [ov:caching:part2]
assert compiled_model assert compiled_model
# ! [ov:caching:part3] # ! [ov:caching:part3]
# Find 'EXPORT_IMPORT' capability in supported capabilities # Find 'EXPORT_IMPORT' capability in supported capabilities
caching_supported = 'EXPORT_IMPORT' in core.get_property(device_name, 'OPTIMIZATION_CAPABILITIES') caching_supported = 'EXPORT_IMPORT' in core.get_property(device_name, props.device.capabilities())
# ! [ov:caching:part3] # ! [ov:caching:part3]


@ -4,14 +4,17 @@
import numpy as np import numpy as np
from utils import get_image, get_path_to_extension_library, get_path_to_model
#! [ov_api_2_0:create_core] #! [ov_api_2_0:create_core]
import openvino.runtime as ov import openvino as ov
core = ov.Core() core = ov.Core()
#! [ov_api_2_0:create_core] #! [ov_api_2_0:create_core]
model_path = get_path_to_model()
#! [ov_api_2_0:read_model] #! [ov_api_2_0:read_model]
model = core.read_model("model.xml") model = core.read_model(model_path)
#! [ov_api_2_0:read_model] #! [ov_api_2_0:read_model]
#! [ov_api_2_0:compile_model] #! [ov_api_2_0:compile_model]
@ -30,7 +33,7 @@ assert input_tensor1.data.dtype == np.int64
# Fill the first data ... # Fill the first data ...
# Get input tensor by tensor name # Get input tensor by tensor name
input_tensor2 = infer_request.get_tensor("data2_t") input_tensor2 = infer_request.get_tensor("input")
assert input_tensor2.data.dtype == np.int64 assert input_tensor2.data.dtype == np.int64
# Fill the second data ... # Fill the second data ...
#! [ov_api_2_0:get_input_tensor_aligned] #! [ov_api_2_0:get_input_tensor_aligned]
@ -43,7 +46,7 @@ assert input_tensor1.data.dtype == np.int32
# Fill the first data ... # Fill the first data ...
# Get input tensor by tensor name # Get input tensor by tensor name
input_tensor2 = infer_request.get_tensor("data2_t") input_tensor2 = infer_request.get_tensor("input")
# IR v10 works with converted precisions (i64 -> i32) # IR v10 works with converted precisions (i64 -> i32)
assert input_tensor2.data.dtype == np.int32 assert input_tensor2.data.dtype == np.int32
# Fill the second data .. # Fill the second data ..
@ -53,7 +56,7 @@ assert input_tensor2.data.dtype == np.int32
results = infer_request.infer() results = infer_request.infer()
#! [ov_api_2_0:inference] #! [ov_api_2_0:inference]
input_data = iter(list()) input_data = get_image()
def process_results(results, frame_id): def process_results(results, frame_id):
pass pass
@ -83,7 +86,7 @@ infer_queue.set_callback(callback)
total_frames = 100 total_frames = 100
for i in range(total_frames): for i in range(total_frames):
# Wait for at least one available infer request and start asynchronous inference # Wait for at least one available infer request and start asynchronous inference
infer_queue.start_async(next(input_data), userdata=i) infer_queue.start_async(input_data, userdata=i)
# Wait for all requests to complete # Wait for all requests to complete
infer_queue.wait_all() infer_queue.wait_all()
#! [ov_api_2_0:start_async_and_wait] #! [ov_api_2_0:start_async_and_wait]
@ -104,6 +107,8 @@ assert output_tensor.data.dtype == np.int32
# process output data ... # process output data ...
#! [ov_api_2_0:get_output_tensor_v10] #! [ov_api_2_0:get_output_tensor_v10]
path_to_extension_library = get_path_to_extension_library()
#! [ov_api_2_0:load_old_extension] #! [ov_api_2_0:load_old_extension]
core.add_extension("path_to_extension_library.so") core.add_extension(path_to_extension_library)
#! [ov_api_2_0:load_old_extension] #! [ov_api_2_0:load_old_extension]


@ -2,14 +2,15 @@
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
# #
import openvino.runtime as ov import openvino as ov
from utils import get_model
device_name = 'CPU' device_name = 'CPU'
xml_path = 'modelWithDenormals.xml' model = get_model()
# ! [ov:intel_cpu:denormals_optimization:part0] # ! [ov:intel_cpu:denormals_optimization:part0]
core = ov.Core() core = ov.Core()
core.set_property("CPU", ov.properties.intel_cpu.denormals_optimization(True)) core.set_property("CPU", ov.properties.intel_cpu.denormals_optimization(True))
model = core.read_model(model=xml_path)
compiled_model = core.compile_model(model=model, device_name=device_name) compiled_model = core.compile_model(model=model, device_name=device_name)
# ! [ov:intel_cpu:denormals_optimization:part0] # ! [ov:intel_cpu:denormals_optimization:part0]
assert compiled_model assert compiled_model


@ -2,13 +2,16 @@
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
import numpy as np import numpy as np
from utils import get_dynamic_model
#! [import] #! [import]
import openvino.runtime as ov import openvino as ov
#! [import] #! [import]
model = get_dynamic_model()
#! [reshape_undefined] #! [reshape_undefined]
Core = ov.Core() core = ov.Core()
model = core.read_model(model.xml)
# Set first dimension to be dynamic while keeping others static # Set first dimension to be dynamic while keeping others static
model.reshape([-1, 3, 224, 224]) model.reshape([-1, 3, 224, 224])
@ -25,7 +28,7 @@ model.reshape([-1, 3, (112, 448), (112, 448)])
model.reshape([(1, 8), 3, (112, 448), (112, 448)]) model.reshape([(1, 8), 3, (112, 448), (112, 448)])
#! [reshape_bounds] #! [reshape_bounds]
model = core.read_model("model.xml") model = get_dynamic_model()
#! [print_dynamic] #! [print_dynamic]
# Print output partial shape # Print output partial shape
@ -35,41 +38,43 @@ print(model.output().partial_shape)
print(model.input().partial_shape) print(model.input().partial_shape)
#! [print_dynamic] #! [print_dynamic]
#! [detect_dynamic] model = get_dynamic_model()
model = core.read_model("model.xml")
if model.input(0).partial_shape.is_dynamic(): #! [detect_dynamic]
if model.input(0).partial_shape.is_dynamic:
# input is dynamic # input is dynamic
pass pass
if model.output(0).partial_shape.is_dynamic(): if model.output(0).partial_shape.is_dynamic:
# output is dynamic # output is dynamic
pass pass
if model.output(0).partial_shape[1].is_dynamic(): if model.output(0).partial_shape[1].is_dynamic:
# 1-st dimension of output is dynamic # 1-st dimension of output is dynamic
pass pass
#! [detect_dynamic] #! [detect_dynamic]
executable = core.compile_model(model) executable = core.compile_model(model)
infer_request = executable.create_infer_request() infer_request = executable.create_infer_request()
input_tensor_name = "input"
#! [set_input_tensor] #! [set_input_tensor]
# For first inference call, prepare an input tensor with 1x128 shape and run inference request # For first inference call, prepare an input tensor with 1x128 shape and run inference request
Input_data1 = np.ones(shape=[1,128]) input_data1 = np.ones(shape=[1,128])
infer_request.infer([input_data1]) infer_request.infer({input_tensor_name: input_data1})
# Get resulting outputs # Get resulting outputs
Output_tensor1 = infer_request.get_output_tensor() output_tensor1 = infer_request.get_output_tensor()
Output_data1 = output_tensor.data[:] output_data1 = output_tensor1.data[:]
# For second inference call, prepare a 1x200 input tensor and run inference request # For second inference call, prepare a 1x200 input tensor and run inference request
Input_data2 = np.ones(shape=[1,200]) input_data2 = np.ones(shape=[1,200])
infer_request.infer([input_data2]) infer_request.infer({input_tensor_name: input_data2})
# Get resulting outputs # Get resulting outputs
Output_tensor2 = infer_request.get_output_tensor() output_tensor2 = infer_request.get_output_tensor()
Output_data2 = output_tensor.data[:] output_data2 = output_tensor2.data[:]
#! [set_input_tensor] #! [set_input_tensor]
infer_request = executable.create_infer_request() infer_request = executable.create_infer_request()
@ -96,10 +101,9 @@ infer_request.infer()
data2 = output_tensor.data[:] data2 = output_tensor.data[:]
#! [get_input_tensor] #! [get_input_tensor]
#! [check_inputs] model = get_dynamic_model()
core = ov.Core()
model = core.read_model("model.xml")
#! [check_inputs]
# Print model input layer info # Print model input layer info
for input_layer in model.inputs: for input_layer in model.inputs:
print(input_layer.names, input_layer.partial_shape) print(input_layer.names, input_layer.partial_shape)


@ -2,10 +2,10 @@
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
# #
import openvino.runtime as ov import openvino as ov
#! [py_frontend_extension_ThresholdedReLU_header] #! [py_frontend_extension_ThresholdedReLU_header]
import openvino.runtime.opset8 as ops import openvino.runtime.opset12 as ops
from openvino.frontend import ConversionExtension from openvino.frontend import ConversionExtension
#! [py_frontend_extension_ThresholdedReLU_header] #! [py_frontend_extension_ThresholdedReLU_header]
@ -17,10 +17,14 @@ from openvino.frontend import ConversionExtension
# Not implemented # Not implemented
#! [add_frontend_extension] #! [add_frontend_extension]
from utils import get_path_to_extension_library
path_to_extension_lib = get_path_to_extension_library()
#! [add_extension_lib] #! [add_extension_lib]
core = ov.Core() core = ov.Core()
# Load extensions library to ov.Core # Load extensions library to ov.Core
core.add_extension("libopenvino_template_extension.so") core.add_extension(path_to_extension_lib)
#! [add_extension_lib] #! [add_extension_lib]
#! [py_frontend_extension_MyRelu] #! [py_frontend_extension_MyRelu]

@ -25,21 +25,22 @@
# | result | # | result |
# |_____________| # |_____________|
import openvino.runtime as ov import openvino as ov
import openvino.runtime.opset12 as ops
data1 = ov.opset8.parameter([1, 3, 2, 2], ov.Type.i64) data1 = ops.parameter([1, 3, 2, 2], ov.Type.i64)
data1.friendly_name = "data1" # operation name data1.friendly_name = "data1" # operation name
data1.output(0).name = "data1_t" # tensor name data1.output(0).name = "data1_t" # tensor name
data2 = ov.opset8.parameter([1, 2, 2, 2], ov.Type.i64) data2 = ops.parameter([1, 2, 2, 2], ov.Type.i64)
data2.friendly_name = "data2" # operation name data2.friendly_name = "data2" # operation name
data2.output(0).name = "data2_t" # tensor name data2.output(0).name = "data2_t" # tensor name
concat = ov.opset8.concat([data1, data2], 1) concat = ops.concat([data1, data2], 1)
concat.friendly_name = "concat" # operation name concat.friendly_name = "concat" # operation name
concat.output(0).name = "concat_t" # tensor name concat.output(0).name = "concat_t" # tensor name
result = ov.opset8.result(concat) result = ops.result(concat)
result.friendly_name = "result" # operation name result.friendly_name = "result" # operation name
model = ov.Model(result, [data1, data2], "model_name") model = ov.Model(result, [data1, data2], "model_name")
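Once assigned, both kinds of names can be read back from the model. A minimal sketch, assuming the model built above:
print(model.get_friendly_name())   # model name, i.e. "model_name"
for model_input in model.inputs:
    print(model_input.names)       # tensor names, e.g. {"data1_t"}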

@ -1,43 +1,50 @@
import openvino.runtime as ov import openvino as ov
from utils import get_model
core = ov.Core() def main():
model = core.read_model("sample.xml") model = get_model()
core = ov.Core()
#! [set_manual_affinities] #! [set_manual_affinities]
for op in model.get_ops(): for op in model.get_ops():
rt_info = op.get_rt_info() rt_info = op.get_rt_info()
rt_info["affinity"] = "CPU" rt_info["affinity"] = "CPU"
#! [set_manual_affinities] #! [set_manual_affinities]
#! [fix_automatic_affinities] if "GPU" not in core.available_devices:
# This example demonstrates how to perform default affinity initialization and then return 0
# correct affinity manually for some layers
device = "HETERO:GPU,CPU"
# query_model result contains mapping of supported operations to devices #! [fix_automatic_affinities]
supported_ops = core.query_model(model, device) # This example demonstrates how to perform default affinity initialization and then
# correct affinity manually for some layers
device = "HETERO:GPU,CPU"
# update default affinities manually for specific operations # query_model result contains mapping of supported operations to devices
supported_ops["operation_name"] = "CPU" supported_ops = core.query_model(model, device)
# set affinities to a model # update default affinities manually for specific operations
for node in model.get_ops(): supported_ops["operation_name"] = "CPU"
# set affinities to a model
for node in model.get_ops():
affinity = supported_ops[node.get_friendly_name()] affinity = supported_ops[node.get_friendly_name()]
node.get_rt_info()["affinity"] = "CPU" node.get_rt_info()["affinity"] = affinity
# load model with manually set affinities # load model with manually set affinities
compiled_model = core.compile_model(model, device) compiled_model = core.compile_model(model, device)
#! [fix_automatic_affinities] #! [fix_automatic_affinities]
#! [compile_model] #! [compile_model]
compiled_model = core.compile_model(model, device_name="HETERO:GPU,CPU") compiled_model = core.compile_model(model, device_name="HETERO:GPU,CPU")
# device priorities via configuration property # device priorities via configuration property
compiled_model = core.compile_model(model, device_name="HETERO", config={"MULTI_DEVICE_PRIORITIES": "GPU,CPU"}) compiled_model = core.compile_model(
#! [compile_model] model, device_name="HETERO", config={ov.properties.device.priorities(): "GPU,CPU"}
)
#! [compile_model]
#! [configure_fallback_devices] #! [configure_fallback_devices]
core.set_property("HETERO", {"MULTI_DEVICE_PRIORITIES": "GPU,CPU"}) core.set_property("HETERO", {ov.properties.device.priorities(): "GPU,CPU"})
core.set_property("GPU", {"PERF_COUNT": "YES"}) core.set_property("GPU", {ov.properties.enable_profiling(): True})
core.set_property("CPU", {"INFERENCE_PRECISION_HINT": "f32"}) core.set_property("CPU", {ov.properties.hint.inference_precision(): ov.Type.f32})
compiled_model = core.compile_model(model=model, device_name="HETERO") compiled_model = core.compile_model(model=model, device_name="HETERO")
#! [configure_fallback_devices] #! [configure_fallback_devices]
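A hypothetical continuation showing that the HETERO compiled model is used like any other one (get_model() from utils produces a [1, 3, 32, 32] f32 input):
import numpy as np
infer_request = compiled_model.create_infer_request()
# Inputs can be addressed by index when tensor names are not set explicitly
results = infer_request.infer({0: np.zeros((1, 3, 32, 32), dtype=np.float32)})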

@ -1,13 +1,14 @@
# Copyright (C) 2018-2023 Intel Corporation # Copyright (C) 2018-2023 Intel Corporation
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
import numpy as np from utils import get_model
#! [import] #! [import]
import openvino.runtime as ov import openvino as ov
#! [import] #! [import]
core = ov.Core() core = ov.Core()
model = core.read_model("model.xml") model = get_model()
compiled_model = core.compile_model(model, "AUTO") compiled_model = core.compile_model(model, "AUTO")
#! [create_infer_request] #! [create_infer_request]
@ -31,10 +32,12 @@ infer_request.wait_for(10)
#! [wait_for] #! [wait_for]
#! [set_callback] #! [set_callback]
def callback(request, userdata): def callback(request, _):
request.start_async() request.start_async()
infer_request.set_callback(callback) callbacks_info = {}
callbacks_info["finished"] = 0
infer_request.set_callback(callback, callbacks_info)
#! [set_callback] #! [set_callback]
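When many requests are in flight, the same callback pattern is usually easier to manage with ov.AsyncInferQueue. A rough sketch, assuming the compiled_model and callbacks_info defined above:
import numpy as np
infer_queue = ov.AsyncInferQueue(compiled_model, 4)  # pool of 4 requests
def queue_callback(request, userdata):
    # userdata is whatever was passed to start_async
    callbacks_info["finished"] += 1
infer_queue.set_callback(queue_callback)
for i in range(8):
    infer_queue.start_async({0: np.zeros((1, 3, 32, 32), dtype=np.float32)}, userdata=i)
infer_queue.wait_all()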
#! [cancel] #! [cancel]
@ -48,24 +51,21 @@ output_tensor = infer_request.get_output_tensor()
#! [get_set_index_tensor] #! [get_set_index_tensor]
input_tensor = infer_request.get_input_tensor(0) input_tensor = infer_request.get_input_tensor(0)
output_tensor = infer_request.get_output_tensor(1) output_tensor = infer_request.get_output_tensor(0)
#! [get_set_index_tensor] #! [get_set_index_tensor]
#! [get_set_name_tensor] input_tensor_name = "input"
input_tensor = infer_request.get_tensor("input_name")
output_tensor = infer_request.get_tensor("output_name")
#! [get_set_name_tensor]
#! [get_set_tensor] #! [get_set_tensor]
tensor1 = infer_request.get_tensor("tensor_name1") tensor1 = infer_request.get_tensor("result")
tensor2 = ov.Tensor() tensor2 = ov.Tensor(ov.Type.f32, [1, 3, 32, 32])
infer_request.set_tensor("tensor_name2", tensor2) infer_request.set_tensor(input_tensor_name, tensor2)
#! [get_set_tensor] #! [get_set_tensor]
#! [get_set_tensor_by_port] #! [get_set_tensor_by_port]
input_port = model.input(0) input_port = model.input(0)
output_port = model.input("tensor_name") output_port = model.input(input_tensor_name)
input_tensor = ov.Tensor() input_tensor = ov.Tensor(ov.Type.f32, [1, 3, 32, 32])
infer_request.set_tensor(input_port, input_tensor) infer_request.set_tensor(input_port, input_tensor)
output_tensor = infer_request.get_tensor(output_port) output_tensor = infer_request.get_tensor(output_port)
#! [get_set_tensor_by_port] #! [get_set_tensor_by_port]
@ -81,15 +81,15 @@ infer_request2.set_input_tensor(0, output)
#! [roi_tensor] #! [roi_tensor]
# input_tensor points to input of a previous network and # input_tensor points to input of a previous network and
# cropROI contains coordinates of output bounding box # cropROI contains coordinates of output bounding box
input_tensor = ov.Tensor(type=ov.Type.f32, shape=ov.Shape([1, 3, 20, 20])) input_tensor = ov.Tensor(type=ov.Type.f32, shape=ov.Shape([1, 3, 100, 100]))
begin = [0, 0, 0, 0] begin = [0, 0, 0, 0]
end = [1, 2, 3, 3] end = [1, 3, 32, 32]
# ... # ...
# roi_tensor uses shared memory of input_tensor and describes cropROI # roi_tensor uses shared memory of input_tensor and describes cropROI
# according to its coordinates # according to its coordinates
roi_tensor = ov.Tensor(input_tensor, begin, end) roi_tensor = ov.Tensor(input_tensor, begin, end)
infer_request2.set_tensor("input_name", roi_tensor) infer_request2.set_tensor(input_tensor_name, roi_tensor)
#! [roi_tensor] #! [roi_tensor]
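Because the ROI tensor is only a view over the parent tensor, no data is copied; an illustrative sketch, not part of the original snippet:
# The ROI spans end - begin elements in every dimension and
# shares memory with input_tensor
assert list(roi_tensor.shape) == [e - b for e, b in zip(end, begin)]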
#! [remote_tensor] #! [remote_tensor]

@ -1,10 +1,11 @@
# Copyright (C) 2018-2023 Intel Corporation # Copyright (C) 2018-2023 Intel Corporation
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
# #
import openvino.runtime as ov import openvino as ov
import openvino.runtime.opset12 as ops
# ! [ov:layout:simple] # ! [ov:layout:simple]
from openvino.runtime import Layout from openvino import Layout
layout = Layout('NCHW') layout = Layout('NCHW')
# ! [ov:layout:simple] # ! [ov:layout:simple]
# ! [ov:layout:complex] # ! [ov:layout:complex]
@ -61,12 +62,12 @@ def create_simple_model():
# Parameter--->Multiply--->Add--->Result # Parameter--->Multiply--->Add--->Result
# Constant---' / # Constant---' /
# Constant---' # Constant---'
data = ov.opset8.parameter([3, 1, 2], ov.Type.f32) data = ops.parameter([3, 1, 2], ov.Type.f32, name="input_tensor_name")
mul_constant = ov.opset8.constant([1.5], ov.Type.f32) mul_constant = ops.constant([1.5], ov.Type.f32)
mul = ov.opset8.multiply(data, mul_constant) mul = ops.multiply(data, mul_constant)
add_constant = ov.opset8.constant([0.5], ov.Type.f32) add_constant = ops.constant([0.5], ov.Type.f32)
add = ov.opset8.add(mul, add_constant) add = ops.add(mul, add_constant)
res = ov.opset8.result(add) res = ops.result(add)
return ov.Model([res], [data], "model") return ov.Model([res], [data], "model")
model = create_simple_model() model = create_simple_model()

@ -3,10 +3,13 @@
import numpy as np import numpy as np
#! [import] #! [import]
import openvino.runtime as ov import openvino as ov
#! [import]
import openvino.runtime.opset12 as ops
#! [import] #! [import]
import openvino.runtime.passes as passes import openvino.runtime.passes as passes
# ! [ov:create_simple_model] # ! [ov:create_simple_model]
def create_simple_model(): def create_simple_model():
# This example shows how to create ov::Function # This example shows how to create ov::Function
@ -14,12 +17,12 @@ def create_simple_model():
# Parameter--->Multiply--->Add--->Result # Parameter--->Multiply--->Add--->Result
# Constant---' / # Constant---' /
# Constant---' # Constant---'
data = ov.opset8.parameter([3, 1, 2], ov.Type.f32) data = ops.parameter([3, 1, 2], ov.Type.f32)
mul_constant = ov.opset8.constant([1.5], ov.Type.f32) mul_constant = ops.constant([1.5], ov.Type.f32)
mul = ov.opset8.multiply(data, mul_constant) mul = ops.multiply(data, mul_constant)
add_constant = ov.opset8.constant([0.5], ov.Type.f32) add_constant = ops.constant([0.5], ov.Type.f32)
add = ov.opset8.add(mul, add_constant) add = ops.add(mul, add_constant)
res = ov.opset8.result(add) res = ops.result(add)
return ov.Model([res], [data], "model") return ov.Model([res], [data], "model")
# ! [ov:create_simple_model] # ! [ov:create_simple_model]
@ -30,24 +33,24 @@ def create_advanced_model():
# Parameter->Split---0-->Result # Parameter->Split---0-->Result
# | `--1-->Relu-->Result # | `--1-->Relu-->Result
# `----2-->Result # `----2-->Result
data = ov.opset8.parameter(ov.Shape([1, 3, 64, 64]), ov.Type.f32) data = ops.parameter(ov.Shape([1, 3, 64, 64]), ov.Type.f32)
# Create Constant for axis value # Create Constant for axis value
axis_const = ov.opset8.constant(ov.Type.i64, ov.Shape({}), [1]) axis_const = ops.constant(1, dtype=ov.Type.i64)
# Create opset8::Split operation that splits input to three slices across 1st dimension # Create opset12::Split operation that splits input to three slices across 1st dimension
split = ov.opset8.split(data, axis_const, 3) split = ops.split(data, axis_const, 3)
# Create opset8::Relu operation that takes 1st Split output as input # Create opset12::Relu operation that takes 1st Split output as input
relu = ov.opset8.relu(split.output(1)) relu = ops.relu(split.output(1))
# Results operations will be created automatically based on provided OutputVector # Results operations will be created automatically based on provided OutputVector
return ov.Model([split.output(0), relu, split.output[2]], [data], "model") return ov.Model([split.output(0), relu.output(0), split.output(2)], [data], "model")
# ! [ov:create_advanced_model] # ! [ov:create_advanced_model]
def ov_api_examples(): def ov_api_examples():
# Doesn't work # Doesn't work
# node = ov.opset8.parameter(ov.PartialShape([ov.Dimension.dynamic(), 3, 64, 64]), np.float32) # node = ov.opset8.parameter(ov.PartialShape([ov.Dimension.dynamic(), 3, 64, 64]), np.float32)
node = ov.opset8.parameter(ov.PartialShape([ov.Dimension.dynamic(), ov.Dimension(3), ov.Dimension(64), ov.Dimension(64)]), np.float32) node = ops.parameter(ov.PartialShape([ov.Dimension.dynamic(), ov.Dimension(3), ov.Dimension(64), ov.Dimension(64)]), np.float32)
# it doesn't work: # it doesn't work:
# static_shape = ov.Shape() # static_shape = ov.Shape()
@ -59,8 +62,7 @@ def ov_api_examples():
# ! [ov:serialize] # ! [ov:serialize]
def serialize_example(m : ov.Model): def serialize_example(m : ov.Model):
from openvino.runtime import serialize ov.serialize(m, xml_path='model.xml', bin_path='model.bin')
serialize(m, xml_path='model.xml', bin_path='model.bin')
# ! [ov:serialize] # ! [ov:serialize]
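A hypothetical round-trip counterpart: the IR written by serialize_example can be read back with the regular Core API, using the same paths:
def deserialize_example():
    core = ov.Core()
    # model.bin is picked up automatically next to model.xml
    return core.read_model('model.xml')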
# ! [ov:visualize] # ! [ov:visualize]
@ -68,7 +70,7 @@ def visualize_example(m : ov.Model):
# Need import: # Need import:
# * import openvino.runtime.passes as passes # * import openvino.runtime.passes as passes
pass_manager = passes.Manager() pass_manager = passes.Manager()
pass_manager.register_pass(pass_name="VisualTree", file_name='image.svg') pass_manager.register_pass(passes.VisualizeTree(file_name='image.svg'))
pass_manager.run_passes(m) pass_manager.run_passes(m)
# ! [ov:visualize] # ! [ov:visualize]
@ -79,7 +81,10 @@ def model_inputs_outputs(model : ov.Model):
#! [all_inputs_ouputs] #! [all_inputs_ouputs]
if __name__ == '__main__': def main():
ov_api_examples() ov_api_examples()
create_simple_model() create_simple_model()
create_advanced_model() model = create_advanced_model()
serialize_example(model)
visualize_example(model)
model_inputs_outputs(model)

@ -0,0 +1,84 @@
# Copyright (C) 2018-2023 Intel Corporation
# SPDX-License-Identifier: Apache-2.0
import logging as log
import numpy as np
import sys
import openvino as ov
from openvino.runtime import opset12 as ops
def main():
# 1. Load inference engine
log.info("Loading OpenVINO")
core = ov.Core()
#! [ov:model_create]
# 2. Creating ov.Model
input = ops.parameter([1, 1], dtype=np.float32, name="data")
init_const = ops.constant([0], dtype=np.float32)
read = ops.read_value(init_const, "variable0")
add = ops.add(input, read)
assign = ops.assign(add, "variable0")
add2 = ops.add(add, read)
result = ops.result(add2)
model = ov.Model(results=[result], sinks=[assign], parameters=[input], name="model")
#! [ov:model_create]
log.info("Loading network files")
# 3. Load network to CPU
compiled_model = core.compile_model(model, "CPU")
# 4. Create Infer Request
infer_request = compiled_model.create_infer_request()
# 5. Prepare inputs
input_tensors = []
for input in compiled_model.inputs:
input_tensors.append(infer_request.get_tensor(input))
# 6. Prepare outputs
output_tensors = []
for output in compiled_model.outputs:
output_tensors.append(infer_request.get_tensor(output))
# 7. Initialize memory state before starting
for state in infer_request.query_state():
state.reset()
#! [ov:part1]
# input data
input_data = np.arange(start=1, stop=7, dtype=np.float32)
log.info("Infer the first utterance")
for next_input in range(int(len(input_data)/2)):
infer_request.infer({"data" : np.asarray([input_data[next_input]]).reshape([1,1])})
# check states
states = infer_request.query_state()
if len(states) == 0:
log.error("Queried states are empty")
return -1
mstate = states[0].state
if not mstate:
log.error("Can't cast state to MemoryBlob")
return -1
state_buf = mstate.data
log.info(state_buf[0])
log.info("\nReset state between utterances...\n")
for state in infer_request.query_state():
state.reset()
log.info("Infer the second utterance")
for next_input in range(int(len(input_data)/2), len(input_data)):
infer_request.infer({0 : np.asarray([input_data[next_input]]).reshape([1,1])})
# check states
states = infer_request.query_state()
mstate = states[0].state
state_buf = mstate.data
log.info(state_buf[0])
#! [ov:part1]
log.info("Execution successful")
return 0
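For orientation, with input_data = [1, 2, 3, 4, 5, 6] the state holds the running sum of the processed inputs, so the values logged above are expected to be:
# utterance 1: 1.0, 3.0, 6.0   (running sum of 1, 2, 3)
# utterance 2: 4.0, 9.0, 15.0  (running sum of 4, 5, 6 after the reset)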

@ -1,108 +1,121 @@
import sys import sys
from openvino.runtime import Core import openvino as ov
model_path = "/openvino_CI_CD/result/install_pkg/tests/test_model_zoo/core/models/ir/add_abc.xml" from utils import get_model
path_to_model = "/openvino_CI_CD/result/install_pkg/tests/test_model_zoo/core/models/ir/add_abc.xml"
def MULTI_0():
#! [MULTI_0]
core = Core()
# Read a network in IR, PaddlePaddle, or ONNX format: model = get_model()
model = core.read_model(model_path)
def MULTI_0():
#! [MULTI_0]
core = ov.Core()
# Option 1 # Option 1
# Pre-configure MULTI globally with explicitly defined devices, # Pre-configure MULTI globally with explicitly defined devices,
# and compile the model on MULTI using the newly specified default device list. # and compile the model on MULTI using the newly specified default device list.
core.set_property(device_name="MULTI", properties={"MULTI_DEVICE_PRIORITIES":"GPU,CPU"}) core.set_property(
device_name="MULTI", properties={ov.properties.device.priorities(): "GPU,CPU"}
)
compiled_model = core.compile_model(model=model, device_name="MULTI") compiled_model = core.compile_model(model=model, device_name="MULTI")
# Option 2 # Option 2
# Specify the devices to be used by MULTI explicitly at compilation. # Specify the devices to be used by MULTI explicitly at compilation.
# The following lines are equivalent: # The following lines are equivalent:
compiled_model = core.compile_model(model=model, device_name="MULTI:GPU,CPU") compiled_model = core.compile_model(model=model, device_name="MULTI:GPU,CPU")
compiled_model = core.compile_model(model=model, device_name="MULTI", config={"MULTI_DEVICE_PRIORITIES": "GPU,CPU"}) compiled_model = core.compile_model(
model=model,
device_name="MULTI",
config={ov.properties.device.priorities(): "GPU,CPU"},
)
#! [MULTI_0]
#! [MULTI_0]
def MULTI_1(): def MULTI_1():
#! [MULTI_1] #! [MULTI_1]
core = Core() core = ov.Core()
model = core.read_model(model_path)
core.set_property(device_name="MULTI", properties={"MULTI_DEVICE_PRIORITIES":"CPU,GPU"}) core.set_property(
device_name="MULTI", properties={ov.properties.device.priorities(): "CPU,GPU"}
)
# Once the priority list is set, you can alter it on the fly: # Once the priority list is set, you can alter it on the fly:
# reverse the order of priorities # reverse the order of priorities
core.set_property(device_name="MULTI", properties={"MULTI_DEVICE_PRIORITIES":"GPU,CPU"}) core.set_property(
device_name="MULTI", properties={ov.properties.device.priorities(): "GPU,CPU"}
)
# exclude some devices (in this case, CPU) # exclude some devices (in this case, CPU)
core.set_property(device_name="MULTI", properties={"MULTI_DEVICE_PRIORITIES":"GPU"}) core.set_property(
device_name="MULTI", properties={ov.properties.device.priorities(): "GPU"}
)
# bring back the excluded devices # bring back the excluded devices
core.set_property(device_name="MULTI", properties={"MULTI_DEVICE_PRIORITIES":"GPU,CPU"}) core.set_property(
device_name="MULTI", properties={ov.properties.device.priorities(): "GPU,CPU"}
)
# You cannot add new devices on the fly! # You cannot add new devices on the fly!
# Attempting to do so will trigger the following exception: # Attempting to do so will trigger the following exception:
# [ ERROR ] [NOT_FOUND] You can only change device # [ ERROR ] [NOT_FOUND] You can only change device
# priorities but not add new devices with the model's # priorities but not add new devices with the model's
# ov::device::priorities. CPU device was not in the original device list! # ov::device::priorities. CPU device was not in the original device list!
#! [MULTI_1]
#! [MULTI_1]
# the following two pieces of code appear not to be used anywhere # the following two pieces of code appear not to be used anywhere
# they should be considered for removal # they should be considered for removal
def available_devices_1(): def available_devices_1():
#! [available_devices_1] #! [available_devices_1]
all_devices = "MULTI:" all_devices = "MULTI:"
core = Core() core = ov.Core()
model = core.read_model(model_path)
all_devices += ",".join(core.available_devices) all_devices += ",".join(core.available_devices)
compiled_model = core.compile_model(model=model, device_name=all_devices) compiled_model = core.compile_model(model=model, device_name=all_devices)
#! [available_devices_1] #! [available_devices_1]
def available_devices_2(): def available_devices_2():
#! [available_devices_2] #! [available_devices_2]
match_list = [] match_list = []
all_devices = "MULTI:" all_devices = "MULTI:"
dev_match_str = "GPU" dev_match_str = "GPU"
core = Core() core = ov.Core()
model = core.read_model(model_path)
for d in core.available_devices: for d in core.available_devices:
if dev_match_str in d: if dev_match_str in d:
match_list.append(d) match_list.append(d)
all_devices += ",".join(match_list) all_devices += ",".join(match_list)
compiled_model = core.compile_model(model=model, device_name=all_devices) compiled_model = core.compile_model(model=model, device_name=all_devices)
#! [available_devices_2] #! [available_devices_2]
def MULTI_4(): def MULTI_4():
#! [MULTI_4] #! [MULTI_4]
core = Core() core = ov.Core()
cpu_config = {} cpu_config = {}
gpu_config = {} gpu_config = {}
# Read a network in IR, PaddlePaddle, or ONNX format:
model = core.read_model(model_path)
# When compiling the model on MULTI, configure CPU and GPU # When compiling the model on MULTI, configure CPU and GPU
# (devices, priorities, and device configurations; gpu_config and cpu_config will load during compile_model() ): # (devices, priorities, and device configurations; gpu_config and cpu_config will load during compile_model() ):
compiled_model = core.compile_model(model=model, device_name="MULTI:GPU,CPU", config={"CPU":"NUM_STREAMS 4", "GPU":"NUM_STREAMS 8"}) compiled_model = core.compile_model(
model=model,
device_name="MULTI:GPU,CPU",
config={"CPU": "NUM_STREAMS 4", "GPU": "NUM_STREAMS 8"},
)
# Optionally, query the optimal number of requests: # Optionally, query the optimal number of requests:
nireq = compiled_model.get_property("OPTIMAL_NUMBER_OF_INFER_REQUESTS") nireq = compiled_model.get_property(
#! [MULTI_4] ov.properties.optimal_number_of_infer_requests()
)
#! [MULTI_4]
def main(): def main():
core = ov.Core()
if "GPU" not in core.available_devices:
return 0
MULTI_0() MULTI_0()
MULTI_1() MULTI_1()
available_devices_1() available_devices_1()
available_devices_2() available_devices_2()
MULTI_4() MULTI_4()
if __name__ == '__main__':
sys.exit(main())

@ -1,172 +0,0 @@
# Copyright (C) 2018-2023 Intel Corporation
# SPDX-License-Identifier: Apache-2.0
import logging as log
import numpy as np
import sys
from openvino.runtime import opset10 as ops
from openvino.runtime import Core, Model, PartialShape, Tensor, Type
from openvino.runtime.passes import LowLatency2, MakeStateful, Manager
def state_network_example():
#! [ov:state_network]
input = ops.parameter([1, 1], dtype=np.float32)
read = ops.read_value(input, "variable0")
add = ops.add(read, input)
save = ops.assign(add, "variable0")
result = ops.result(add)
model = Model(results=[result], sinks=[save], parameters=[input])
#! [ov:state_network]
def low_latency_2_example():
#! [ov:low_latency_2]
# Precondition for Model.
# TensorIterator and Parameter are created in body of TensorIterator with names
tensor_iterator_name = "TI_name"
body_parameter_name = "body_parameter_name"
idx = "0" # this is a first variable in the network
# The State will be named "TI_name/param_name/variable_0"
state_name = tensor_iterator_name + "//" + body_parameter_name + "//" + "variable_" + idx # todo
#! [ov:get_ov_model]
core = Core()
ov_model = core.read_model("path_to_the_model")
#! [ov:get_ov_model]
# reshape input if needed
#! [ov:reshape_ov_model]
ov_model.reshape({"X": PartialShape([1, 1, 16])})
#! [ov:reshape_ov_model]
#! [ov:apply_low_latency_2]
manager = Manager()
manager.register_pass(LowLatency2())
manager.run_passes(ov_model)
#! [ov:apply_low_latency_2]
hd_specific_model = core.compile_model(ov_model)
# Try to find the Variable by name
infer_request = hd_specific_model.create_infer_request()
states = infer_request.query_state()
for state in states:
name = state.get_name()
if (name == state_name):
# some actions
#! [ov:low_latency_2]
#! [ov:low_latency_2_use_parameters]
manager.register_pass(LowLatency2(False))
#! [ov:low_latency_2_use_parameters]
def apply_make_stateful_tensor_names():
#! [ov:make_stateful_tensor_names]
core = Core()
ov_model = core.read_model("path_to_the_model")
tensor_names = {"tensor_name_1": "tensor_name_4",
"tensor_name_3": "tensor_name_6"}
manager = Manager()
manager.register_pass(MakeStateful(tensor_names))
manager.run_passes(ov_model)
#! [ov:make_stateful_tensor_names]
def apply_make_stateful_ov_nodes():
#! [ov:make_stateful_ov_nodes]
core = Core()
ov_model = core.read_model("path_to_the_model")
# Parameter_1, Result_1, Parameter_3, Result_3 are
# ops.parameter/ops.result in the ov_model
pairs = ["""(Parameter_1, Result_1), (Parameter_3, Result_3)"""]
manager = Manager()
manager.register_pass(MakeStateful(pairs))
manager.run_passes(ov_model)
#! [ov:make_stateful_ov_nodes]
def main():
#! [ov:state_api_usage]
# 1. Load inference engine
log.info("Loading Inference Engine")
core = Core()
# 2. Read a model
log.info("Loading network files")
model = core.read_model("path_to_the_model")
# 3. Load network to CPU
hw_specific_model = core.compile_model(model, "CPU")
# 4. Create Infer Request
infer_request = hw_specific_model.create_infer_request()
# 5. Reset memory states before starting
states = infer_request.query_state()
if (states.size() != 1):
log.error(f"Invalid queried state number. Expected 1, but got {str(states.size())}")
return -1
for state in states:
state.reset()
# 6. Inference
input_data = np.arange(start=1, stop=12, dtype=np.float32)
# This example demonstrates how to work with OpenVINO State API.
# Input_data: some array with 12 float numbers
# Part1: read the first four elements of the input_data array sequentially.
# Expected output for the first utterance:
# sum of the previously processed elements [ 1, 3, 6, 10]
# Part2: reset state value (set to 0) and read the next four elements.
# Expected output for the second utterance:
# sum of the previously processed elements [ 5, 11, 18, 26]
# Part3: set state value to 5 and read the next four elements.
# Expected output for the third utterance:
# sum of the previously processed elements + 5 [ 14, 24, 35, 47]
target_state = states[0]
# Part 1
log.info("Infer the first utterance")
for next_input in range(len(input_data)/3):
infer_request.infer({0 : input_data[next_input]})
state_buf = target_state.state.data
log.info(state_buf[0])
# Part 2
log.info("\nReset state between utterances...\n")
target_state.reset()
log.info("Infer the second utterance")
for next_input in range(len(input_data)/3, (len(input_data)/3 * 2)):
infer_request.infer({0 : input_data[next_input]})
state_buf = target_state.state.data
log.info(state_buf[0])
# Part 3
log.info("\nSet state value between utterances to 5...\n")
v = np.asarray([5], dtype=np.float32)
tensor = Tensor(v, shared_memory=True)
target_state.state = tensor
log.info("Infer the third utterance")
for next_input in range((input_data.size()/3 * 2), input_data.size()):
infer_request.infer({0 : input_data[next_input]})
state_buf = target_state.state.data
log.info(state_buf[0])
log.info("Execution successful")
#! [ov:state_api_usage]
return 0
if __name__ == '__main__':
sys.exit(main())

@ -2,24 +2,26 @@
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
# #
import openvino.runtime.properties as props
from openvino.preprocess import ResizeAlgorithm, ColorFormat from openvino.preprocess import ResizeAlgorithm, ColorFormat
from openvino.runtime import Layout, Type, serialize from openvino import Layout, Type, serialize
import openvino as ov
import openvino.runtime.opset12 as ops
from utils import get_model, get_temp_dir, get_path_to_model, get_advanced_model
xml_path = '' input_name = "input"
input_name = '' model = get_advanced_model()
# ! [ov:preprocess:create] # ! [ov:preprocess:create]
from openvino.preprocess import PrePostProcessor from openvino.preprocess import PrePostProcessor
from openvino.runtime import Core
core = Core()
model = core.read_model(model=xml_path)
ppp = PrePostProcessor(model) ppp = PrePostProcessor(model)
# ! [ov:preprocess:create] # ! [ov:preprocess:create]
# ! [ov:preprocess:tensor] # ! [ov:preprocess:tensor]
from openvino.preprocess import ColorFormat from openvino.preprocess import ColorFormat
from openvino.runtime import Layout, Type from openvino import Layout, Type
ppp.input(input_name).tensor() \ ppp.input(input_name).tensor() \
.set_element_type(Type.u8) \ .set_element_type(Type.u8) \
.set_shape([1, 480, 640, 3]) \ .set_shape([1, 480, 640, 3]) \
@ -52,10 +54,13 @@ ppp.output(2) # Gets output with index=2 (3rd one) in a model
# ! [ov:preprocess:input_name] # ! [ov:preprocess:input_name]
ppp.input('image') ppp.input(input_name)
ppp.output('result') ppp.output('result')
# ! [ov:preprocess:input_name] # ! [ov:preprocess:input_name]
model = get_model()
ppp = PrePostProcessor(model)
# ! [ov:preprocess:input_1] # ! [ov:preprocess:input_1]
# no index/name is needed if model has one input # no index/name is needed if model has one input
ppp.input().preprocess().scale(50.) ppp.input().preprocess().scale(50.)
@ -65,75 +70,75 @@ ppp.output() \
.postprocess().convert_element_type(Type.u8) .postprocess().convert_element_type(Type.u8)
# ! [ov:preprocess:input_1] # ! [ov:preprocess:input_1]
# ! [ov:preprocess:mean_scale] # ! [ov:preprocess:mean_scale]
ppp.input('input').preprocess().mean(128).scale(127) ppp.input(input_name).preprocess().mean(128).scale(127)
# ! [ov:preprocess:mean_scale] # ! [ov:preprocess:mean_scale]
# ! [ov:preprocess:mean_scale_array] # ! [ov:preprocess:mean_scale_array]
# Suppose model's shape is {1, 3, 224, 224} # Suppose model's shape is {1, 3, 224, 224}
# N=1, C=3, H=224, W=224 # N=1, C=3, H=224, W=224
ppp.input('input').model().set_layout(Layout('NCHW')) ppp.input(input_name).model().set_layout(Layout('NCHW'))
# Mean/Scale has 3 values which matches with C=3 # Mean/Scale has 3 values which matches with C=3
ppp.input('input').preprocess() \ ppp.input(input_name).preprocess() \
.mean([103.94, 116.78, 123.68]).scale([57.21, 57.45, 57.73]) .mean([103.94, 116.78, 123.68]).scale([57.21, 57.45, 57.73])
# ! [ov:preprocess:mean_scale_array] # ! [ov:preprocess:mean_scale_array]
# ! [ov:preprocess:convert_element_type] # ! [ov:preprocess:convert_element_type]
# First define data type for your tensor # First define data type for your tensor
ppp.input('input').tensor().set_element_type(Type.u8) ppp.input(input_name).tensor().set_element_type(Type.u8)
# Then define preprocessing step # Then define preprocessing step
ppp.input('input').preprocess().convert_element_type(Type.f32) ppp.input(input_name).preprocess().convert_element_type(Type.f32)
# If conversion is needed to `model's` element type, 'f32' can be omitted # If conversion is needed to `model's` element type, 'f32' can be omitted
ppp.input('input').preprocess().convert_element_type() ppp.input(input_name).preprocess().convert_element_type()
# ! [ov:preprocess:convert_element_type] # ! [ov:preprocess:convert_element_type]
# ! [ov:preprocess:convert_layout] # ! [ov:preprocess:convert_layout]
# First define layout for your tensor # First define layout for your tensor
ppp.input('input').tensor().set_layout(Layout('NHWC')) ppp.input(input_name).tensor().set_layout(Layout('NHWC'))
# Then define layout of model # Then define layout of model
ppp.input('input').model().set_layout(Layout('NCHW')) ppp.input(input_name).model().set_layout(Layout('NCHW'))
print(ppp) # Will print 'implicit layout conversion step' print(ppp) # Will print 'implicit layout conversion step'
# ! [ov:preprocess:convert_layout] # ! [ov:preprocess:convert_layout]
# ! [ov:preprocess:convert_layout_2] # ! [ov:preprocess:convert_layout_2]
ppp.input('input').tensor().set_shape([1, 480, 640, 3]) ppp.input(input_name).tensor().set_shape([1, 480, 640, 3])
# Model expects shape {1, 3, 480, 640} # Model expects shape {1, 3, 480, 640}
ppp.input('input').preprocess()\ ppp.input(input_name).preprocess()\
.convert_layout([0, 3, 1, 2]) .convert_layout([0, 3, 1, 2])
# 0 -> 0; 3 -> 1; 1 -> 2; 2 -> 3 # 0 -> 0; 3 -> 1; 1 -> 2; 2 -> 3
# ! [ov:preprocess:convert_layout_2] # ! [ov:preprocess:convert_layout_2]
# ! [ov:preprocess:resize_1] # ! [ov:preprocess:resize_1]
ppp.input('input').tensor().set_shape([1, 3, 960, 1280]) ppp.input(input_name).tensor().set_shape([1, 3, 960, 1280])
ppp.input('input').model().set_layout(Layout('??HW')) ppp.input(input_name).model().set_layout(Layout('??HW'))
ppp.input('input').preprocess()\ ppp.input(input_name).preprocess()\
.resize(ResizeAlgorithm.RESIZE_LINEAR, 480, 640) .resize(ResizeAlgorithm.RESIZE_LINEAR, 480, 640)
# ! [ov:preprocess:resize_1] # ! [ov:preprocess:resize_1]
# ! [ov:preprocess:resize_2] # ! [ov:preprocess:resize_2]
ppp.input('input').tensor().set_shape([1, 3, 960, 1280]) ppp.input(input_name).tensor().set_shape([1, 3, 960, 1280])
# Model accepts {1, 3, 480, 640} shape, thus last dimensions are 'H' and 'W' # Model accepts {1, 3, 480, 640} shape, thus last dimensions are 'H' and 'W'
ppp.input('input').model().set_layout(Layout('??HW')) ppp.input(input_name).model().set_layout(Layout('??HW'))
# Resize to model's dimension # Resize to model's dimension
ppp.input('input').preprocess().resize(ResizeAlgorithm.RESIZE_LINEAR) ppp.input(input_name).preprocess().resize(ResizeAlgorithm.RESIZE_LINEAR)
# ! [ov:preprocess:resize_2] # ! [ov:preprocess:resize_2]
# ! [ov:preprocess:convert_color_1] # ! [ov:preprocess:convert_color_1]
ppp.input('input').tensor().set_color_format(ColorFormat.BGR) ppp.input(input_name).tensor().set_color_format(ColorFormat.BGR)
ppp.input('input').preprocess().convert_color(ColorFormat.RGB) ppp.input(input_name).preprocess().convert_color(ColorFormat.RGB)
# ! [ov:preprocess:convert_color_1] # ! [ov:preprocess:convert_color_1]
# ! [ov:preprocess:convert_color_2] # ! [ov:preprocess:convert_color_2]
# This will split original `input` to 2 separate inputs: `input/y' and 'input/uv' # This will split original `input` to 2 separate inputs: `input/y' and 'input/uv'
ppp.input('input').tensor()\ ppp.input(input_name).tensor()\
.set_color_format(ColorFormat.NV12_TWO_PLANES) .set_color_format(ColorFormat.NV12_TWO_PLANES)
ppp.input('input').preprocess()\ ppp.input(input_name).preprocess()\
.convert_color(ColorFormat.RGB) .convert_color(ColorFormat.RGB)
print(ppp) # Dump preprocessing steps to see what will happen print(ppp) # Dump preprocessing steps to see what will happen
# ! [ov:preprocess:convert_color_2] # ! [ov:preprocess:convert_color_2]
# ! [ov:preprocess:custom] # ! [ov:preprocess:custom]
# It is possible to insert some custom operations # It is possible to insert some custom operations
import openvino.runtime.opset8 as ops import openvino.runtime.opset12 as ops
from openvino.runtime import Output from openvino.runtime import Output
from openvino.runtime.utils.decorators import custom_preprocess_function from openvino.runtime.utils.decorators import custom_preprocess_function
@ -142,22 +147,22 @@ def custom_abs(output: Output):
# Custom nodes can be inserted as Preprocessing steps # Custom nodes can be inserted as Preprocessing steps
return ops.abs(output) return ops.abs(output)
ppp.input("input_image").preprocess() \ ppp.input("input").preprocess() \
.custom(custom_abs) .custom(custom_abs)
# ! [ov:preprocess:custom] # ! [ov:preprocess:custom]
# ! [ov:preprocess:postprocess] # ! [ov:preprocess:postprocess]
# Model's output has 'NCHW' layout # Model's output has 'NCHW' layout
ppp.output('result_image').model().set_layout(Layout('NCHW')) ppp.output('result').model().set_layout(Layout('NCHW'))
# Set target user's tensor to U8 type + 'NHWC' layout # Set target user's tensor to U8 type + 'NHWC' layout
# Precision & layout conversions will be done implicitly # Precision & layout conversions will be done implicitly
ppp.output('result_image').tensor()\ ppp.output('result').tensor()\
.set_layout(Layout("NHWC"))\ .set_layout(Layout("NHWC"))\
.set_element_type(Type.u8) .set_element_type(Type.u8)
# Also it is possible to insert some custom operations # Also it is possible to insert some custom operations
import openvino.runtime.opset8 as ops import openvino.runtime.opset12 as ops
from openvino.runtime import Output from openvino.runtime import Output
from openvino.runtime.utils.decorators import custom_preprocess_function from openvino.runtime.utils.decorators import custom_preprocess_function
@ -166,23 +171,23 @@ def custom_abs(output: Output):
# Custom nodes can be inserted as Post-processing steps # Custom nodes can be inserted as Post-processing steps
return ops.abs(output) return ops.abs(output)
ppp.output("result_image").postprocess()\ ppp.output("result").postprocess()\
.custom(custom_abs) .custom(custom_abs)
# ! [ov:preprocess:postprocess] # ! [ov:preprocess:postprocess]
# ! [ov:preprocess:save_headers] # ! [ov:preprocess:save_headers]
from openvino.preprocess import PrePostProcessor, ColorFormat, ResizeAlgorithm from openvino.preprocess import PrePostProcessor, ColorFormat, ResizeAlgorithm
from openvino.runtime import Core, Layout, Type, set_batch from openvino import Core, Layout, Type, set_batch
# First method - imports from openvino import serialize
from openvino.runtime import serialize
# Second method - imports
from openvino.runtime.passes import Manager, Serialize
# ! [ov:preprocess:save_headers] # ! [ov:preprocess:save_headers]
model_path = get_path_to_model()
serialized_model_path = get_path_to_model()
# ! [ov:preprocess:save] # ! [ov:preprocess:save]
# ======== Step 0: read original model ========= # ======== Step 0: read original model =========
core = Core() core = Core()
model = core.read_model(model='/path/to/some_model.onnx') model = core.read_model(model=model_path)
# ======== Step 1: Preprocessing ================ # ======== Step 1: Preprocessing ================
ppp = PrePostProcessor(model) ppp = PrePostProcessor(model)
@ -213,20 +218,16 @@ model = ppp.build()
set_batch(model, 2) set_batch(model, 2)
# ======== Step 3: Save the model ================ # ======== Step 3: Save the model ================
# First method - using serialize runtime wrapper serialize(model, serialized_model_path)
serialize(model, '/path/to/some_model_saved.xml', '/path/to/some_model_saved.bin')
# Second method - using Manager and Serialize pass
manager = Manager()
manager.register_pass(Serialize('/path/to/some_model_saved.xml', '/path/to/some_model_saved.bin'))
manager.run_passes(model)
# ! [ov:preprocess:save] # ! [ov:preprocess:save]
path_to_cache_dir = get_temp_dir()
# ! [ov:preprocess:save_load] # ! [ov:preprocess:save_load]
core = Core() core = Core()
core.set_property({'CACHE_DIR': '/path/to/cache/dir'}) core.set_property({props.cache_dir(): path_to_cache_dir})
# In case that no preprocessing is needed anymore, we can load model on target device directly # In case that no preprocessing is needed anymore, we can load model on target device directly
# With cached model available, it will also save some time on reading original model # With cached model available, it will also save some time on reading original model
compiled_model = core.compile_model('/path/to/some_model_saved.xml', 'CPU') compiled_model = core.compile_model(serialized_model_path, 'CPU')
# ! [ov:preprocess:save_load] # ! [ov:preprocess:save_load]
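A hypothetical follow-up that illustrates the effect of the cache directory: compiling the same model again is served from the cache written by the first call.
# Second compilation with the same CACHE_DIR hits the cache created above,
# so it typically completes noticeably faster
compiled_model_again = core.compile_model(serialized_model_path, 'CPU')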

@ -2,8 +2,10 @@
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
# #
from utils import get_model, get_ngraph_model
#! [ov_imports] #! [ov_imports]
from openvino.runtime import Core, Layout, Type from openvino import Core, Layout, Type
from openvino.preprocess import ColorFormat, PrePostProcessor, ResizeAlgorithm from openvino.preprocess import ColorFormat, PrePostProcessor, ResizeAlgorithm
#! [ov_imports] #! [ov_imports]
@ -13,11 +15,9 @@ import openvino.inference_engine as ie
#include "inference_engine.hpp" #include "inference_engine.hpp"
model_path = '' tensor_name="input"
tensor_name = ''
core = Core() core = Core()
model = core.read_model(model=model_path) model = get_model([1,32,32,3])
#! [ov_mean_scale] #! [ov_mean_scale]
ppp = PrePostProcessor(model) ppp = PrePostProcessor(model)
@ -30,6 +30,8 @@ input.preprocess().mean([116.78]).scale([57.21, 57.45, 57.73])
model = ppp.build() model = ppp.build()
#! [ov_mean_scale] #! [ov_mean_scale]
model = get_model()
#! [ov_conversions] #! [ov_conversions]
ppp = PrePostProcessor(model) ppp = PrePostProcessor(model)
input = ppp.input(tensor_name) input = ppp.input(tensor_name)
@ -50,9 +52,11 @@ input.preprocess().convert_color(ColorFormat.BGR)
model = ppp.build() model = ppp.build()
#! [ov_color_space] #! [ov_color_space]
model = get_model([1, 3, 448, 448])
#! [ov_image_scale] #! [ov_image_scale]
ppp = PrePostProcessor(model) ppp = PrePostProcessor(model)
input = ppp.input(tensor_name) input = ppp.input("input")
# need to specify H and W dimensions in model, others are not important # need to specify H and W dimensions in model, others are not important
input.model().set_layout(Layout('??HW')) input.model().set_layout(Layout('??HW'))
# scale to model shape # scale to model shape
@ -61,39 +65,38 @@ input.preprocess().resize(ResizeAlgorithm.RESIZE_LINEAR, 448, 448)
model = ppp.build() model = ppp.build()
#! [ov_image_scale] #! [ov_image_scale]
import openvino.inference_engine as ie
import ngraph as ng
operation_name = "data"
model_path = '' core = ie.IECore()
operation_name = '' network = get_ngraph_model()
core = Core()
network = core.ReadNetwork(model_path)
#! [mean_scale] #! [mean_scale]
preProcess = network.getInputsInfo()[operation_name].getPreProcess() preprocess_info = network.input_info[operation_name].preprocess_info
preProcess.init(3) preprocess_info.init(3)
preProcess[0].meanValue = 116.78 preprocess_info[0].mean_value = 116.78
preProcess[1].meanValue = 116.78 preprocess_info[1].mean_value = 116.78
preProcess[2].meanValue = 116.78 preprocess_info[2].mean_value = 116.78
preProcess[0].stdScale = 57.21 preprocess_info[0].std_scale = 57.21
preProcess[1].stdScale = 57.45 preprocess_info[1].std_scale = 57.45
preProcess[2].stdScale = 57.73 preprocess_info[2].std_scale = 57.73
preProcess.setVariant(ie.MEAN_VALUE) preprocess_info.mean_variant = ie.MeanVariant.MEAN_VALUE
#! [mean_scale] #! [mean_scale]
#! [conversions] #! [conversions]
inputInfo = network.getInputsInfo()[operation_name] input_info = network.input_info[operation_name]
inputInfo.setPrecision(ie.Precision.U8) input_info.precision = "U8"
inputInfo.setLayout(ie.Layout.NHWC) input_info.layout = "NHWC"
# model input layout is always NCHW in Inference Engine # model input layout is always NCHW in Inference Engine
# for shapes with 4 dimensions # for shapes with 4 dimensions
#! [conversions] #! [conversions]
#! [image_scale] #! [image_scale]
preProcess = network.getInputsInfo()[operation_name].getPreProcess() preprocess_info = network.input_info[operation_name].preprocess_info
# Inference Engine supposes input for resize is always in NCHW layout # Inference Engine supposes input for resize is always in NCHW layout
# while for OpenVINO Runtime API 2.0 `H` and `W` dimensions must be specified # while for OpenVINO Runtime API 2.0 `H` and `W` dimensions must be specified
# Also, the current code snippet assumes resizing from dynamic shapes # Also, the current code snippet assumes resizing from dynamic shapes
preProcess.setResizeAlgorithm(ie.ResizeAlgorithm.RESIZE_BILINEAR) preprocess_info.resize_algorithm = ie.ResizeAlgorithm.RESIZE_BILINEAR
#! [image_scale]

@ -2,53 +2,62 @@
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
# #
from openvino.runtime import Core import openvino as ov
import openvino.runtime.properties as props
import openvino.runtime.properties.hint as hints
# [get_available_devices] from utils import get_model
core = Core()
available_devices = core.available_devices
# [get_available_devices]
# [hetero_priorities]
device_priorites = core.get_property("HETERO", "MULTI_DEVICE_PRIORITIES")
# [hetero_priorities]
# [cpu_device_name]
cpu_device_name = core.get_property("CPU", "FULL_DEVICE_NAME")
# [cpu_device_name]
model = core.read_model(model="sample.xml")
# [compile_model_with_property]
config = {"PERFORMANCE_HINT": "THROUGHPUT",
"INFERENCE_PRECISION_HINT": "f32"}
compiled_model = core.compile_model(model, "CPU", config)
# [compile_model_with_property]
# [optimal_number_of_infer_requests]
compiled_model = core.compile_model(model, "CPU")
nireq = compiled_model.get_property("OPTIMAL_NUMBER_OF_INFER_REQUESTS")
# [optimal_number_of_infer_requests]
# [core_set_property_then_compile] def main():
# latency hint is a default for CPU # [get_available_devices]
core.set_property("CPU", {"PERFORMANCE_HINT": "LATENCY"}) core = ov.Core()
# compiled with latency configuration hint available_devices = core.available_devices
compiled_model_latency = core.compile_model(model, "CPU") # [get_available_devices]
# compiled with overriden performance hint value
config = {"PERFORMANCE_HINT": "THROUGHPUT"} # [hetero_priorities]
compiled_model_thrp = core.compile_model(model, "CPU", config) device_priorites = core.get_property("HETERO", props.device.priorities())
# [core_set_property_then_compile] # [hetero_priorities]
# [cpu_device_name]
cpu_device_name = core.get_property("CPU", props.device.full_name())
# [cpu_device_name]
model = get_model()
# [compile_model_with_property]
config = {hints.performance_mode(): hints.PerformanceMode.THROUGHPUT,
hints.inference_precision(): ov.Type.f32}
compiled_model = core.compile_model(model, "CPU", config)
# [compile_model_with_property]
# [optimal_number_of_infer_requests]
compiled_model = core.compile_model(model, "CPU")
nireq = compiled_model.get_property(props.optimal_number_of_infer_requests())
# [optimal_number_of_infer_requests]
# [inference_num_threads] # [core_set_property_then_compile]
compiled_model = core.compile_model(model, "CPU") # latency hint is a default for CPU
nthreads = compiled_model.get_property("INFERENCE_NUM_THREADS") core.set_property("CPU", {hints.performance_mode(): hints.PerformanceMode.LATENCY})
# [inference_num_threads] # compiled with latency configuration hint
compiled_model_latency = core.compile_model(model, "CPU")
# compiled with overriden performance hint value
config = {hints.performance_mode(): hints.PerformanceMode.THROUGHPUT}
compiled_model_thrp = core.compile_model(model, "CPU", config)
# [core_set_property_then_compile]
# [multi_device]
config = {"MULTI_DEVICE_PRIORITIES": "CPU,GPU"} # [inference_num_threads]
compiled_model = core.compile_model(model, "MULTI", config) compiled_model = core.compile_model(model, "CPU")
# change the order of priorities nthreads = compiled_model.get_property(props.inference_num_threads())
compiled_model.set_property({"MULTI_DEVICE_PRIORITIES": "GPU,CPU"}) # [inference_num_threads]
# [multi_device]
if "GPU" not in available_devices:
return 0
# [multi_device]
config = {props.device.priorities(): "CPU,GPU"}
compiled_model = core.compile_model(model, "MULTI", config)
# change the order of priorities
compiled_model.set_property({props.device.priorities(): "GPU,CPU"})
# [multi_device]

@ -2,81 +2,88 @@
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
# #
from openvino.runtime import Core import openvino as ov
import openvino.runtime.properties as props
import openvino.runtime.properties.hint as hints
from utils import get_model
core = Core() def main():
core = ov.Core()
# ! [core_set_property] # ! [core_set_property]
core.set_property(device_name="CPU", properties={"PERF_COUNT": "YES"}) core.set_property(device_name="CPU", properties={props.enable_profiling(): True})
# ! [core_set_property] # ! [core_set_property]
model = core.read_model("sample.xml") model = get_model()
# ! [core_compile_model] if "GPU" not in core.available_devices:
compiled_model = core.compile_model(model=model, device_name="MULTI", config= return 0
# ! [core_compile_model]
compiled_model = core.compile_model(model=model, device_name="MULTI", config=
{ {
"MULTI_DEVICE_PRIORITIES": "GPU,CPU", props.device.priorities(): "GPU,CPU",
"PERFORMANCE_HINT": "THROUGHPUT", hints.performance_mode(): hints.PerformanceMode.THROUGHPUT,
"INFERENCE_PRECISION_HINT": "f32" hints.inference_precision(): ov.Type.f32
}) })
# ! [core_compile_model] # ! [core_compile_model]
# ! [compiled_model_set_property] # ! [compiled_model_set_property]
# turn CPU off for multi-device execution # turn CPU off for multi-device execution
compiled_model.set_property(properties={"MULTI_DEVICE_PRIORITIES": "GPU"}) compiled_model.set_property(properties={props.device.priorities(): "GPU"})
# ! [compiled_model_set_property] # ! [compiled_model_set_property]
# ! [core_get_rw_property] # ! [core_get_rw_property]
num_streams = core.get_property("CPU", "NUM_STREAMS") num_streams = core.get_property("CPU", props.streams.num())
# ! [core_get_rw_property] # ! [core_get_rw_property]
# ! [core_get_ro_property] # ! [core_get_ro_property]
full_device_name = core.get_property("CPU", "FULL_DEVICE_NAME") full_device_name = core.get_property("CPU", props.device.full_name())
# ! [core_get_ro_property] # ! [core_get_ro_property]
# ! [compiled_model_get_rw_property] # ! [compiled_model_get_rw_property]
perf_mode = compiled_model.get_property("PERFORMANCE_HINT") perf_mode = compiled_model.get_property(hints.performance_mode())
# ! [compiled_model_get_rw_property] # ! [compiled_model_get_rw_property]
# ! [compiled_model_get_ro_property] # ! [compiled_model_get_ro_property]
nireq = compiled_model.get_property("OPTIMAL_NUMBER_OF_INFER_REQUESTS") nireq = compiled_model.get_property(props.optimal_number_of_infer_requests())
# ! [compiled_model_get_ro_property] # ! [compiled_model_get_ro_property]
import ngraph as ng
import openvino.inference_engine as ie
from utils import get_ngraph_model
from openvino.inference_engine import IECore core = ie.IECore()
#! [core_get_metric]
full_device_name = core.get_metric("CPU", "FULL_DEVICE_NAME")
#! [core_get_metric]
#! [core_get_config]
num_streams = core.get_config("CPU", "CPU_THROUGHPUT_STREAMS")
#! [core_get_config]
core = IECore() #! [core_set_config]
#! [core_get_metric] core.set_config({"PERF_COUNT": "YES"}, "CPU")
full_device_name = core.get_metric("CPU", "FULL_DEVICE_NAME") #! [core_set_config]
#! [core_get_metric]
#! [core_get_config] net = get_ngraph_model()
num_streams = core.get_config("CPU", "CPU_THROUGHPUT_STREAMS")
#! [core_get_config]
#! [core_set_config] #! [core_load_network]
core.set_config({"PERF_COUNT": "YES"}, "CPU") exec_network = core.load_network(net, "MULTI", {"DEVICE_PRIORITIES": "CPU, GPU",
#! [core_set_config]
net = core.read_network("sample.xml")
#! [core_load_network]
exec_network = core.load_network(net, "MULTI", {"DEVICE_PRIORITIES": "CPU, GPU",
"PERFORMANCE_HINT": "THROUGHPUT", "PERFORMANCE_HINT": "THROUGHPUT",
"ENFORCE_BF16": "NO"}) "ENFORCE_BF16": "NO"})
#! [core_load_network] #! [core_load_network]
#! [executable_network_set_config] #! [executable_network_set_config]
# turn CPU off for multi-device execution # turn CPU off for multi-device execution
exec_network.set_config({"DEVICE_PRIORITIES": "GPU"}) exec_network.set_config({"DEVICE_PRIORITIES": "GPU"})
#! [executable_network_set_config] #! [executable_network_set_config]
#! [executable_network_get_metric] #! [executable_network_get_metric]
nireq = exec_network.get_metric("OPTIMAL_NUMBER_OF_INFER_REQUESTS") nireq = exec_network.get_metric("OPTIMAL_NUMBER_OF_INFER_REQUESTS")
#! [executable_network_get_metric] #! [executable_network_get_metric]
#! [executable_network_get_config] #! [executable_network_get_config]
perf_hint = exec_network.get_config("PERFORMANCE_HINT") perf_hint = exec_network.get_config("PERFORMANCE_HINT")
#! [executable_network_get_config] #! [executable_network_get_config]

@ -2,18 +2,17 @@
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
import numpy as np import numpy as np
import openvino as ov
#! [auto_compilation] from utils import get_path_to_model, get_image
import openvino.runtime as ov
compiled_model = ov.compile_model("model.xml")
#! [auto_compilation]
#! [properties_example] #! [properties_example]
import openvino.runtime.opset12 as ops
core = ov.Core() core = ov.Core()
input_a = ov.opset11.parameter([8], name="input_a") input_a = ops.parameter([8], name="input_a")
res = ov.opset11.absolute(input_a) res = ops.absolute(input_a)
model = ov.Model(res, [input_a]) model = ov.Model(res, [input_a])
compiled = core.compile_model(model, "CPU") compiled = core.compile_model(model, "CPU")
model.outputs[0].tensor.set_names({"result_0"}) # Add name for Output model.outputs[0].tensor.set_names({"result_0"}) # Add name for Output
@ -25,6 +24,10 @@ print(compiled.inputs)
print(compiled.outputs) print(compiled.outputs)
#! [properties_example] #! [properties_example]
#! [auto_compilation]
compiled_model = ov.compile_model(model)
#! [auto_compilation]
#! [tensor_basics] #! [tensor_basics]
data_float64 = np.ones(shape=(2,8)) data_float64 = np.ones(shape=(2,8))
@ -37,6 +40,7 @@ tensor = ov.Tensor(data_int32)
assert tensor.element_type == ov.Type.i32 assert tensor.element_type == ov.Type.i32
#! [tensor_basics] #! [tensor_basics]
#! [tensor_shared_mode] #! [tensor_shared_mode]
data_to_share = np.ones(shape=(2,8)) data_to_share = np.ones(shape=(2,8))
@ -98,9 +102,9 @@ _ = next(iter(results.values()))
core = ov.Core() core = ov.Core()
# Simple model that adds two inputs together # Simple model that adds two inputs together
input_a = ov.opset8.parameter([8]) input_a = ops.parameter([8])
input_b = ov.opset8.parameter([8]) input_b = ops.parameter([8])
res = ov.opset8.add(input_a, input_b) res = ops.add(input_a, input_b)
model = ov.Model(res, [input_a, input_b]) model = ov.Model(res, [input_a, input_b])
compiled = core.compile_model(model, "CPU") compiled = core.compile_model(model, "CPU")
@ -137,14 +141,14 @@ infer_queue.wait_all()
assert all(data_done) assert all(data_done)
#! [asyncinferqueue_set_callback] #! [asyncinferqueue_set_callback]
unt8_data = np.ones([100]) unt8_data = np.ones([100], dtype=np.uint8)
#! [packing_data] #! [packing_data]
from openvino.helpers import pack_data from openvino.helpers import pack_data
packed_buffer = pack_data(unt8_data, ov.Type.u4) packed_buffer = pack_data(unt8_data, ov.Type.u4)
# Create tensor with shape in element types # Create tensor with shape in element types
t = ov.Tensor(packed_buffer, [1, 128], ov.Type.u4) t = ov.Tensor(packed_buffer, [100], ov.Type.u4)
#! [packing_data] #! [packing_data]
#! [unpacking] #! [unpacking]
@ -154,9 +158,12 @@ unpacked_data = unpack_data(t.data, t.element_type, t.shape)
assert np.array_equal(unpacked_data , unt8_data) assert np.array_equal(unpacked_data , unt8_data)
#! [unpacking] #! [unpacking]
model_path = get_path_to_model()
image = get_image()
#! [releasing_gil] #! [releasing_gil]
import openvino.runtime as ov import openvino as ov
import cv2 as cv
from threading import Thread from threading import Thread
input_data = [] input_data = []
@ -164,23 +171,20 @@ input_data = []
# Processing input data will be done in a separate thread # Processing input data will be done in a separate thread
# while compilation of the model and creation of the infer request # while compilation of the model and creation of the infer request
# is going to be executed in the main thread. # is going to be executed in the main thread.
def prepare_data(input, image_path): def prepare_data(input, image):
image = cv.imread(image_path) shape = list(input.shape)
h, w = list(input.shape)[-2:] resized_img = np.resize(image, shape)
image = cv.resize(image, (h, w)) input_data.append(resized_img)
image = image.transpose((2, 0, 1))
image = np.expand_dims(image, 0)
input_data.append(image)
core = ov.Core() core = ov.Core()
model = core.read_model("model.xml") model = core.read_model(model_path)
# Create thread with prepare_data function as target and start it # Create thread with prepare_data function as target and start it
thread = Thread(target=prepare_data, args=[model.input(), "path/to/image"]) thread = Thread(target=prepare_data, args=[model.input(), image])
thread.start() thread.start()
# The GIL will be released in compile_model. # The GIL will be released in compile_model.
# It allows a thread above to start the job, # It allows a thread above to start the job,
# while main thread is running in the background. # while main thread is running in the background.
compiled = core.compile_model(model, "GPU") compiled = core.compile_model(model, "CPU")
# After returning from compile_model, the main thread acquires the GIL # After returning from compile_model, the main thread acquires the GIL
# and starts create_infer_request which releases it once again. # and starts create_infer_request which releases it once again.
request = compiled.create_infer_request() request = compiled.create_infer_request()
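A possible continuation of this snippet (a sketch, assuming `prepare_data` produced exactly one array that matches the model's single input):

```python
# Wait for the preprocessing thread, then run inference on the prepared data
thread.join()
results = request.infer({0: input_data[0]})
```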


@ -2,14 +2,15 @@
# SPDX-License-Identifier: Apache-2.0 # SPDX-License-Identifier: Apache-2.0
import numpy as np import numpy as np
import openvino.runtime as ov import openvino as ov
import openvino.runtime.opset12 as ops
INPUT_SIZE = 1_000_000 # Use bigger values if necessary, e.g. 300_000_000 INPUT_SIZE = 1_000_000 # Use bigger values if necessary, e.g. 300_000_000
input_0 = ov.opset11.parameter([INPUT_SIZE], name="input_0") input_0 = ops.parameter([INPUT_SIZE], name="input_0")
input_1 = ov.opset11.parameter([INPUT_SIZE], name="input_1") input_1 = ops.parameter([INPUT_SIZE], name="input_1")
add_inputs = ov.opset11.add(input_0, input_1) add_inputs = ops.add(input_0, input_1)
res = ov.opset11.reduce_sum(add_inputs, reduction_axes=0, name="reduced") res = ops.reduce_sum(add_inputs, reduction_axes=0, name="reduced")
model = ov.Model(res, [input_0, input_1], name="my_model") model = ov.Model(res, [input_0, input_1], name="my_model")
model.outputs[0].tensor.set_names({"reduced_result"}) # Add name for Output model.outputs[0].tensor.set_names({"reduced_result"}) # Add name for Output
@ -56,7 +57,7 @@ run(time_in_sec)
# Hiding latency # Hiding latency
request.start_async({"input_0": data_0, "input_1": data_1}) request.start_async({"input_0": data_0, "input_1": data_1})
run() run(time_in_sec)
request.wait() request.wait()
results = request.get_output_tensor(0).data # Gather data from InferRequest results = request.get_output_tensor(0).data # Gather data from InferRequest
#! [hiding_latency] #! [hiding_latency]


@ -3,7 +3,7 @@
import numpy as np import numpy as np
#! [import] #! [import]
import openvino.runtime as ov import openvino as ov
#! [import] #! [import]
#! [part1] #! [part1]

docs/snippets/utils.py (new file, 86 lines)

@ -0,0 +1,86 @@
# Copyright (C) 2018-2023 Intel Corporation
# SPDX-License-Identifier: Apache-2.0
#
import tempfile
import numpy as np
from sys import platform
import openvino as ov
import openvino.runtime.opset12 as ops
import ngraph as ng
from ngraph.impl import Function
import openvino.inference_engine as ie
def get_dynamic_model():
param = ops.parameter(ov.PartialShape([-1, -1]), name="input")
return ov.Model(ops.relu(param), [param])
def get_model(input_shape = None, input_dtype=np.float32) -> ov.Model:
if input_shape is None:
input_shape = [1, 3, 32, 32]
param = ops.parameter(input_shape, input_dtype, name="input")
relu = ops.relu(param, name="relu")
relu.output(0).get_tensor().set_names({"result"})
model = ov.Model([relu], [param], "test_model")
assert model is not None
return model
def get_advanced_model():
data = ops.parameter(ov.Shape([1, 3, 64, 64]), ov.Type.f32, name="input")
data1 = ops.parameter(ov.Shape([1, 3, 64, 64]), ov.Type.f32, name="input2")
axis_const = ops.constant(1, dtype=ov.Type.i64)
split = ops.split(data, axis_const, 3)
relu = ops.relu(split.output(1))
relu.output(0).get_tensor().set_names({"result"})
return ov.Model([split.output(0), relu.output(0), split.output(2)], [data, data1], "model")
def get_ngraph_model(input_shape = None, input_dtype=np.float32):
if input_shape is None:
input_shape = [1, 3, 32, 32]
param = ng.opset11.parameter(input_shape, input_dtype, name="data")
relu = ng.opset11.relu(param, name="relu")
func = Function([relu], [param], "test_model")
caps = ng.Function.to_capsule(func)
net = ie.IENetwork(caps)
assert net is not None
return net
def get_image(shape = (1, 3, 32, 32), dtype = "float32"):
np.random.seed(42)
return np.random.rand(*shape).astype(dtype)
def get_path_to_extension_library():
library_path=""
if platform == "win32":
library_path="openvino_template_extension.dll"
else:
library_path="libopenvino_template_extension.so"
return library_path
def get_path_to_model(input_shape = None, is_old_api=False):
if input_shape is None:
input_shape = [1, 3, 32, 32]
path_to_xml = tempfile.NamedTemporaryFile(suffix="_model.xml").name
if is_old_api:
net = get_ngraph_model(input_shape, input_dtype=np.int64)
net.serialize(path_to_xml)
else:
model = get_model(input_shape)
ov.serialize(model, path_to_xml)
return path_to_xml
def get_temp_dir():
temp_dir = tempfile.TemporaryDirectory()
return temp_dir
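A rough illustration of how the snippets are expected to consume these helpers (the shapes are illustrative):

```python
model = get_model([1, 3, 224, 224])      # static ov.Model with a single ReLU
dynamic_model = get_dynamic_model()      # model with a fully dynamic 2D input
image = get_image((1, 3, 224, 224))      # deterministic random NCHW float32 data
model_path = get_path_to_model()         # model serialized to a temporary .xml file
```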


@ -10,7 +10,8 @@ import tempfile
from time import perf_counter from time import perf_counter
import datasets import datasets
from openvino.runtime import Core, get_version, AsyncInferQueue, PartialShape import openvino as ov
from openvino.runtime import get_version
from transformers import AutoTokenizer from transformers import AutoTokenizer
from transformers.onnx import export from transformers.onnx import export
from transformers.onnx.features import FeaturesManager from transformers.onnx.features import FeaturesManager
@ -28,7 +29,7 @@ def main():
# Download the tokenizer # Download the tokenizer
tokenizer = AutoTokenizer.from_pretrained(model_name) tokenizer = AutoTokenizer.from_pretrained(model_name)
core = Core() core = ov.Core()
with tempfile.TemporaryDirectory() as tmp: with tempfile.TemporaryDirectory() as tmp:
onnx_path = Path(tmp) / f'{model_name}.onnx' onnx_path = Path(tmp) / f'{model_name}.onnx'
@ -39,7 +40,7 @@ def main():
# Enforce dynamic input shape # Enforce dynamic input shape
try: try:
model.reshape({model_input.any_name: PartialShape([1, '?']) for model_input in model.inputs}) model.reshape({model_input.any_name: ov.PartialShape([1, '?']) for model_input in model.inputs})
except RuntimeError: except RuntimeError:
log.error("Can't set dynamic shape") log.error("Can't set dynamic shape")
raise raise
@ -50,7 +51,7 @@ def main():
# It is possible to set CUMULATIVE_THROUGHPUT as PERFORMANCE_HINT for AUTO device # It is possible to set CUMULATIVE_THROUGHPUT as PERFORMANCE_HINT for AUTO device
compiled_model = core.compile_model(model, 'CPU', tput) compiled_model = core.compile_model(model, 'CPU', tput)
# AsyncInferQueue creates optimal number of InferRequest instances # AsyncInferQueue creates optimal number of InferRequest instances
ireqs = AsyncInferQueue(compiled_model) ireqs = ov.AsyncInferQueue(compiled_model)
sst2 = datasets.load_dataset('glue', 'sst2') sst2 = datasets.load_dataset('glue', 'sst2')
sst2_sentences = sst2['validation']['sentence'] sst2_sentences = sst2['validation']['sentence']
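The rest of the sample (outside this hunk) feeds the queue; a minimal sketch of that loop with the updated API, assuming the tokenizer output names match the model inputs:

```python
results = [None] * len(sst2_sentences)

def callback(request: ov.InferRequest, idx: int) -> None:
    # Keep the raw output tensor; post-processing happens after wait_all()
    results[idx] = next(iter(request.results.values()))

ireqs.set_callback(callback)
for idx, sentence in enumerate(sst2_sentences):
    ireqs.start_async(dict(tokenizer(sentence, return_tensors='np')), idx)
ireqs.wait_all()
```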


@ -9,7 +9,8 @@ import sys
from time import perf_counter from time import perf_counter
import numpy as np import numpy as np
from openvino.runtime import Core, get_version import openvino as ov
from openvino.runtime import get_version
from openvino.runtime.utils.types import get_dtype from openvino.runtime.utils.types import get_dtype
@ -40,7 +41,7 @@ def main():
# Pick a device by replacing CPU, for example AUTO:GPU,CPU. # Pick a device by replacing CPU, for example AUTO:GPU,CPU.
# Using MULTI device is pointless in sync scenario # Using MULTI device is pointless in sync scenario
# because only one instance of openvino.runtime.InferRequest is used # because only one instance of openvino.runtime.InferRequest is used
core = Core() core = ov.Core()
compiled_model = core.compile_model(sys.argv[1], 'CPU', latency) compiled_model = core.compile_model(sys.argv[1], 'CPU', latency)
ireq = compiled_model.create_infer_request() ireq = compiled_model.create_infer_request()
# Fill input data for the ireq # Fill input data for the ireq


@ -9,7 +9,8 @@ import statistics
from time import perf_counter from time import perf_counter
import numpy as np import numpy as np
from openvino.runtime import Core, get_version, AsyncInferQueue import openvino as ov
from openvino.runtime import get_version
from openvino.runtime.utils.types import get_dtype from openvino.runtime.utils.types import get_dtype
@ -39,10 +40,10 @@ def main():
# Create Core and use it to compile a model. # Create Core and use it to compile a model.
# Pick a device by replacing CPU, for example MULTI:CPU(4),GPU(8). # Pick a device by replacing CPU, for example MULTI:CPU(4),GPU(8).
# It is possible to set CUMULATIVE_THROUGHPUT as PERFORMANCE_HINT for AUTO device # It is possible to set CUMULATIVE_THROUGHPUT as PERFORMANCE_HINT for AUTO device
core = Core() core = ov.Core()
compiled_model = core.compile_model(sys.argv[1], 'CPU', tput) compiled_model = core.compile_model(sys.argv[1], 'CPU', tput)
# AsyncInferQueue creates optimal number of InferRequest instances # AsyncInferQueue creates optimal number of InferRequest instances
ireqs = AsyncInferQueue(compiled_model) ireqs = ov.AsyncInferQueue(compiled_model)
# Fill input data for ireqs # Fill input data for ireqs
for ireq in ireqs: for ireq in ireqs:
for model_input in compiled_model.inputs: for model_input in compiled_model.inputs:


@ -9,8 +9,7 @@ import sys
import cv2 import cv2
import numpy as np import numpy as np
from openvino.preprocess import PrePostProcessor import openvino as ov
from openvino.runtime import AsyncInferQueue, Core, InferRequest, Layout, Type
def parse_args() -> argparse.Namespace: def parse_args() -> argparse.Namespace:
@ -32,7 +31,7 @@ def parse_args() -> argparse.Namespace:
return parser.parse_args() return parser.parse_args()
def completion_callback(infer_request: InferRequest, image_path: str) -> None: def completion_callback(infer_request: ov.InferRequest, image_path: str) -> None:
predictions = next(iter(infer_request.results.values())) predictions = next(iter(infer_request.results.values()))
# Change a shape of a numpy.ndarray with results to get another one with one dimension # Change a shape of a numpy.ndarray with results to get another one with one dimension
@ -61,7 +60,7 @@ def main() -> int:
# --------------------------- Step 1. Initialize OpenVINO Runtime Core ------------------------------------------------ # --------------------------- Step 1. Initialize OpenVINO Runtime Core ------------------------------------------------
log.info('Creating OpenVINO Runtime Core') log.info('Creating OpenVINO Runtime Core')
core = Core() core = ov.Core()
# --------------------------- Step 2. Read a model -------------------------------------------------------------------- # --------------------------- Step 2. Read a model --------------------------------------------------------------------
log.info(f'Reading the model: {args.model}') log.info(f'Reading the model: {args.model}')
@ -88,22 +87,22 @@ def main() -> int:
input_tensors = [np.expand_dims(image, 0) for image in resized_images] input_tensors = [np.expand_dims(image, 0) for image in resized_images]
# --------------------------- Step 4. Apply preprocessing ------------------------------------------------------------- # --------------------------- Step 4. Apply preprocessing -------------------------------------------------------------
ppp = PrePostProcessor(model) ppp = ov.preprocess.PrePostProcessor(model)
# 1) Set input tensor information: # 1) Set input tensor information:
# - input() provides information about a single model input # - input() provides information about a single model input
# - precision of tensor is supposed to be 'u8' # - precision of tensor is supposed to be 'u8'
# - layout of data is 'NHWC' # - layout of data is 'NHWC'
ppp.input().tensor() \ ppp.input().tensor() \
.set_element_type(Type.u8) \ .set_element_type(ov.Type.u8) \
.set_layout(Layout('NHWC')) # noqa: N400 .set_layout(ov.Layout('NHWC')) # noqa: N400
# 2) Here we suppose model has 'NCHW' layout for input # 2) Here we suppose model has 'NCHW' layout for input
ppp.input().model().set_layout(Layout('NCHW')) ppp.input().model().set_layout(ov.Layout('NCHW'))
# 3) Set output tensor information: # 3) Set output tensor information:
# - precision of tensor is supposed to be 'f32' # - precision of tensor is supposed to be 'f32'
ppp.output().tensor().set_element_type(Type.f32) ppp.output().tensor().set_element_type(ov.Type.f32)
# 4) Apply preprocessing modifying the original 'model' # 4) Apply preprocessing modifying the original 'model'
model = ppp.build() model = ppp.build()
@ -115,7 +114,7 @@ def main() -> int:
# --------------------------- Step 6. Create infer request queue ------------------------------------------------------ # --------------------------- Step 6. Create infer request queue ------------------------------------------------------
log.info('Starting inference in asynchronous mode') log.info('Starting inference in asynchronous mode')
# create async queue with optimal number of infer requests # create async queue with optimal number of infer requests
infer_queue = AsyncInferQueue(compiled_model) infer_queue = ov.AsyncInferQueue(compiled_model)
infer_queue.set_callback(completion_callback) infer_queue.set_callback(completion_callback)
# --------------------------- Step 7. Do inference -------------------------------------------------------------------- # --------------------------- Step 7. Do inference --------------------------------------------------------------------
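For reference, the inference step that follows (outside this hunk) drives the queue roughly as below; `image_paths` is an assumed name for the list of input files parsed earlier in the sample:

```python
for image_path, input_tensor in zip(image_paths, input_tensors):
    infer_queue.start_async({0: input_tensor}, image_path)
infer_queue.wait_all()
```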


@ -8,8 +8,7 @@ import sys
import cv2 import cv2
import numpy as np import numpy as np
from openvino.preprocess import PrePostProcessor, ResizeAlgorithm import openvino as ov
from openvino.runtime import Core, Layout, Type
def main(): def main():
@ -26,7 +25,7 @@ def main():
# --------------------------- Step 1. Initialize OpenVINO Runtime Core ------------------------------------------------ # --------------------------- Step 1. Initialize OpenVINO Runtime Core ------------------------------------------------
log.info('Creating OpenVINO Runtime Core') log.info('Creating OpenVINO Runtime Core')
core = Core() core = ov.Core()
# --------------------------- Step 2. Read a model -------------------------------------------------------------------- # --------------------------- Step 2. Read a model --------------------------------------------------------------------
log.info(f'Reading the model: {model_path}') log.info(f'Reading the model: {model_path}')
@ -48,7 +47,7 @@ def main():
input_tensor = np.expand_dims(image, 0) input_tensor = np.expand_dims(image, 0)
# --------------------------- Step 4. Apply preprocessing ------------------------------------------------------------- # --------------------------- Step 4. Apply preprocessing -------------------------------------------------------------
ppp = PrePostProcessor(model) ppp = ov.preprocess.PrePostProcessor(model)
_, h, w, _ = input_tensor.shape _, h, w, _ = input_tensor.shape
@ -58,19 +57,19 @@ def main():
# - layout of data is 'NHWC' # - layout of data is 'NHWC'
ppp.input().tensor() \ ppp.input().tensor() \
.set_shape(input_tensor.shape) \ .set_shape(input_tensor.shape) \
.set_element_type(Type.u8) \ .set_element_type(ov.Type.u8) \
.set_layout(Layout('NHWC')) # noqa: ECE001, N400 .set_layout(ov.Layout('NHWC')) # noqa: ECE001, N400
# 2) Adding explicit preprocessing steps: # 2) Adding explicit preprocessing steps:
# - apply linear resize from tensor spatial dims to model spatial dims # - apply linear resize from tensor spatial dims to model spatial dims
ppp.input().preprocess().resize(ResizeAlgorithm.RESIZE_LINEAR) ppp.input().preprocess().resize(ov.preprocess.ResizeAlgorithm.RESIZE_LINEAR)
# 3) Here we suppose model has 'NCHW' layout for input # 3) Here we suppose model has 'NCHW' layout for input
ppp.input().model().set_layout(Layout('NCHW')) ppp.input().model().set_layout(ov.Layout('NCHW'))
# 4) Set output tensor information: # 4) Set output tensor information:
# - precision of tensor is supposed to be 'f32' # - precision of tensor is supposed to be 'f32'
ppp.output().tensor().set_element_type(Type.f32) ppp.output().tensor().set_element_type(ov.Type.f32)
# 5) Apply preprocessing modifying the original 'model' # 5) Apply preprocessing modifying the original 'model'
model = ppp.build() model = ppp.build()


@ -5,7 +5,7 @@
import logging as log import logging as log
import sys import sys
from openvino.runtime import Core import openvino as ov
def param_to_string(parameters) -> str: def param_to_string(parameters) -> str:
@ -20,7 +20,7 @@ def main():
log.basicConfig(format='[ %(levelname)s ] %(message)s', level=log.INFO, stream=sys.stdout) log.basicConfig(format='[ %(levelname)s ] %(message)s', level=log.INFO, stream=sys.stdout)
# --------------------------- Step 1. Initialize OpenVINO Runtime Core -------------------------------------------- # --------------------------- Step 1. Initialize OpenVINO Runtime Core --------------------------------------------
core = Core() core = ov.Core()
# --------------------------- Step 2. Get metrics of available devices -------------------------------------------- # --------------------------- Step 2. Get metrics of available devices --------------------------------------------
log.info('Available devices:') log.info('Available devices:')


@ -9,8 +9,7 @@ import sys
import cv2 import cv2
import numpy as np import numpy as np
from openvino.preprocess import PrePostProcessor import openvino as ov
from openvino.runtime import Core, Layout, PartialShape, Type
def main(): def main():
@ -27,7 +26,7 @@ def main():
# --------------------------- Step 1. Initialize OpenVINO Runtime Core ------------------------------------------------ # --------------------------- Step 1. Initialize OpenVINO Runtime Core ------------------------------------------------
log.info('Creating OpenVINO Runtime Core') log.info('Creating OpenVINO Runtime Core')
core = Core() core = ov.Core()
# --------------------------- Step 2. Read a model -------------------------------------------------------------------- # --------------------------- Step 2. Read a model --------------------------------------------------------------------
log.info(f'Reading the model: {model_path}') log.info(f'Reading the model: {model_path}')
@ -50,25 +49,25 @@ def main():
log.info('Reshaping the model to the height and width of the input image') log.info('Reshaping the model to the height and width of the input image')
n, h, w, c = input_tensor.shape n, h, w, c = input_tensor.shape
model.reshape({model.input().get_any_name(): PartialShape((n, c, h, w))}) model.reshape({model.input().get_any_name(): ov.PartialShape((n, c, h, w))})
# --------------------------- Step 4. Apply preprocessing ------------------------------------------------------------- # --------------------------- Step 4. Apply preprocessing -------------------------------------------------------------
ppp = PrePostProcessor(model) ppp = ov.preprocess.PrePostProcessor(model)
# 1) Set input tensor information: # 1) Set input tensor information:
# - input() provides information about a single model input # - input() provides information about a single model input
# - precision of tensor is supposed to be 'u8' # - precision of tensor is supposed to be 'u8'
# - layout of data is 'NHWC' # - layout of data is 'NHWC'
ppp.input().tensor() \ ppp.input().tensor() \
.set_element_type(Type.u8) \ .set_element_type(ov.Type.u8) \
.set_layout(Layout('NHWC')) # noqa: N400 .set_layout(ov.Layout('NHWC')) # noqa: N400
# 2) Here we suppose model has 'NCHW' layout for input # 2) Here we suppose model has 'NCHW' layout for input
ppp.input().model().set_layout(Layout('NCHW')) ppp.input().model().set_layout(ov.Layout('NCHW'))
# 3) Set output tensor information: # 3) Set output tensor information:
# - precision of tensor is supposed to be 'f32' # - precision of tensor is supposed to be 'f32'
ppp.output().tensor().set_element_type(Type.f32) ppp.output().tensor().set_element_type(ov.Type.f32)
# 4) Apply preprocessing modifying the original 'model' # 4) Apply preprocessing modifying the original 'model'
model = ppp.build() model = ppp.build()
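If the target resolution is not known up front, `reshape` also accepts dynamic dimensions; a sketch of that variant:

```python
# Keep batch and channels fixed, let the spatial dimensions stay dynamic
model.reshape({model.input().get_any_name(): ov.PartialShape([1, 3, -1, -1])})
```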


@ -8,14 +8,13 @@ import typing
from functools import reduce from functools import reduce
import numpy as np import numpy as np
from openvino.preprocess import PrePostProcessor import openvino as ov
from openvino.runtime import (Core, Layout, Model, Shape, Type, op, opset1, from openvino.runtime import op, opset1, opset8
opset8, set_batch)
from data import digits from data import digits
def create_ngraph_function(model_path: str) -> Model: def create_ngraph_function(model_path: str) -> ov.Model:
"""Create a model on the fly from the source code using ngraph.""" """Create a model on the fly from the source code using ngraph."""
def shape_and_length(shape: list) -> typing.Tuple[list, int]: def shape_and_length(shape: list) -> typing.Tuple[list, int]:
@ -28,17 +27,17 @@ def create_ngraph_function(model_path: str) -> Model:
# input # input
input_shape = [64, 1, 28, 28] input_shape = [64, 1, 28, 28]
param_node = op.Parameter(Type.f32, Shape(input_shape)) param_node = op.Parameter(ov.Type.f32, ov.Shape(input_shape))
# convolution 1 # convolution 1
conv_1_kernel_shape, conv_1_kernel_length = shape_and_length([20, 1, 5, 5]) conv_1_kernel_shape, conv_1_kernel_length = shape_and_length([20, 1, 5, 5])
conv_1_kernel = op.Constant(Type.f32, Shape(conv_1_kernel_shape), weights[0:conv_1_kernel_length].tolist()) conv_1_kernel = op.Constant(ov.Type.f32, ov.Shape(conv_1_kernel_shape), weights[0:conv_1_kernel_length].tolist())
weights_offset += conv_1_kernel_length weights_offset += conv_1_kernel_length
conv_1_node = opset8.convolution(param_node, conv_1_kernel, [1, 1], padding_begin, padding_end, [1, 1]) conv_1_node = opset8.convolution(param_node, conv_1_kernel, [1, 1], padding_begin, padding_end, [1, 1])
# add 1 # add 1
add_1_kernel_shape, add_1_kernel_length = shape_and_length([1, 20, 1, 1]) add_1_kernel_shape, add_1_kernel_length = shape_and_length([1, 20, 1, 1])
add_1_kernel = op.Constant(Type.f32, Shape(add_1_kernel_shape), add_1_kernel = op.Constant(ov.Type.f32, ov.Shape(add_1_kernel_shape),
weights[weights_offset : weights_offset + add_1_kernel_length]) weights[weights_offset : weights_offset + add_1_kernel_length])
weights_offset += add_1_kernel_length weights_offset += add_1_kernel_length
add_1_node = opset8.add(conv_1_node, add_1_kernel) add_1_node = opset8.add(conv_1_node, add_1_kernel)
@ -48,7 +47,7 @@ def create_ngraph_function(model_path: str) -> Model:
# convolution 2 # convolution 2
conv_2_kernel_shape, conv_2_kernel_length = shape_and_length([50, 20, 5, 5]) conv_2_kernel_shape, conv_2_kernel_length = shape_and_length([50, 20, 5, 5])
conv_2_kernel = op.Constant(Type.f32, Shape(conv_2_kernel_shape), conv_2_kernel = op.Constant(ov.Type.f32, ov.Shape(conv_2_kernel_shape),
weights[weights_offset : weights_offset + conv_2_kernel_length], weights[weights_offset : weights_offset + conv_2_kernel_length],
) )
weights_offset += conv_2_kernel_length weights_offset += conv_2_kernel_length
@ -56,7 +55,7 @@ def create_ngraph_function(model_path: str) -> Model:
# add 2 # add 2
add_2_kernel_shape, add_2_kernel_length = shape_and_length([1, 50, 1, 1]) add_2_kernel_shape, add_2_kernel_length = shape_and_length([1, 50, 1, 1])
add_2_kernel = op.Constant(Type.f32, Shape(add_2_kernel_shape), add_2_kernel = op.Constant(ov.Type.f32, ov.Shape(add_2_kernel_shape),
weights[weights_offset : weights_offset + add_2_kernel_length], weights[weights_offset : weights_offset + add_2_kernel_length],
) )
weights_offset += add_2_kernel_length weights_offset += add_2_kernel_length
@ -72,13 +71,13 @@ def create_ngraph_function(model_path: str) -> Model:
weights[weights_offset : weights_offset + 2 * reshape_1_length], weights[weights_offset : weights_offset + 2 * reshape_1_length],
dtype=np.int64, dtype=np.int64,
) )
reshape_1_kernel = op.Constant(Type.i64, Shape(list(dtype_weights.shape)), dtype_weights) reshape_1_kernel = op.Constant(ov.Type.i64, ov.Shape(list(dtype_weights.shape)), dtype_weights)
weights_offset += 2 * reshape_1_length weights_offset += 2 * reshape_1_length
reshape_1_node = opset8.reshape(maxpool_2_node, reshape_1_kernel, True) reshape_1_node = opset8.reshape(maxpool_2_node, reshape_1_kernel, True)
# matmul 1 # matmul 1
matmul_1_kernel_shape, matmul_1_kernel_length = shape_and_length([500, 800]) matmul_1_kernel_shape, matmul_1_kernel_length = shape_and_length([500, 800])
matmul_1_kernel = op.Constant(Type.f32, Shape(matmul_1_kernel_shape), matmul_1_kernel = op.Constant(ov.Type.f32, ov.Shape(matmul_1_kernel_shape),
weights[weights_offset : weights_offset + matmul_1_kernel_length], weights[weights_offset : weights_offset + matmul_1_kernel_length],
) )
weights_offset += matmul_1_kernel_length weights_offset += matmul_1_kernel_length
@ -86,7 +85,7 @@ def create_ngraph_function(model_path: str) -> Model:
# add 3 # add 3
add_3_kernel_shape, add_3_kernel_length = shape_and_length([1, 500]) add_3_kernel_shape, add_3_kernel_length = shape_and_length([1, 500])
add_3_kernel = op.Constant(Type.f32, Shape(add_3_kernel_shape), add_3_kernel = op.Constant(ov.Type.f32, ov.Shape(add_3_kernel_shape),
weights[weights_offset : weights_offset + add_3_kernel_length], weights[weights_offset : weights_offset + add_3_kernel_length],
) )
weights_offset += add_3_kernel_length weights_offset += add_3_kernel_length
@ -96,12 +95,12 @@ def create_ngraph_function(model_path: str) -> Model:
relu_node = opset8.relu(add_3_node) relu_node = opset8.relu(add_3_node)
# reshape 2 # reshape 2
reshape_2_kernel = op.Constant(Type.i64, Shape(list(dtype_weights.shape)), dtype_weights) reshape_2_kernel = op.Constant(ov.Type.i64, ov.Shape(list(dtype_weights.shape)), dtype_weights)
reshape_2_node = opset8.reshape(relu_node, reshape_2_kernel, True) reshape_2_node = opset8.reshape(relu_node, reshape_2_kernel, True)
# matmul 2 # matmul 2
matmul_2_kernel_shape, matmul_2_kernel_length = shape_and_length([10, 500]) matmul_2_kernel_shape, matmul_2_kernel_length = shape_and_length([10, 500])
matmul_2_kernel = op.Constant(Type.f32, Shape(matmul_2_kernel_shape), matmul_2_kernel = op.Constant(ov.Type.f32, ov.Shape(matmul_2_kernel_shape),
weights[weights_offset : weights_offset + matmul_2_kernel_length], weights[weights_offset : weights_offset + matmul_2_kernel_length],
) )
weights_offset += matmul_2_kernel_length weights_offset += matmul_2_kernel_length
@ -109,7 +108,7 @@ def create_ngraph_function(model_path: str) -> Model:
# add 4 # add 4
add_4_kernel_shape, add_4_kernel_length = shape_and_length([1, 10]) add_4_kernel_shape, add_4_kernel_length = shape_and_length([1, 10])
add_4_kernel = op.Constant(Type.f32, Shape(add_4_kernel_shape), add_4_kernel = op.Constant(ov.Type.f32, ov.Shape(add_4_kernel_shape),
weights[weights_offset : weights_offset + add_4_kernel_length], weights[weights_offset : weights_offset + add_4_kernel_length],
) )
weights_offset += add_4_kernel_length weights_offset += add_4_kernel_length
@ -119,7 +118,7 @@ def create_ngraph_function(model_path: str) -> Model:
softmax_axis = 1 softmax_axis = 1
softmax_node = opset8.softmax(add_4_node, softmax_axis) softmax_node = opset8.softmax(add_4_node, softmax_axis)
return Model(softmax_node, [param_node], 'lenet') return ov.Model(softmax_node, [param_node], 'lenet')
def main(): def main():
@ -135,35 +134,35 @@ def main():
number_top = 1 number_top = 1
# ---------------------------Step 1. Initialize OpenVINO Runtime Core-------------------------------------------------- # ---------------------------Step 1. Initialize OpenVINO Runtime Core--------------------------------------------------
log.info('Creating OpenVINO Runtime Core') log.info('Creating OpenVINO Runtime Core')
core = Core()
# ---------------------------Step 2. Read a model in OpenVINO Intermediate Representation------------------------------ # ---------------------------Step 2. Read a model in OpenVINO Intermediate Representation------------------------------
log.info(f'Loading the model using ngraph function with weights from {model_path}') log.info(f'Loading the model using ngraph function with weights from {model_path}')
model = create_ngraph_function(model_path) model = create_ngraph_function(model_path)
# ---------------------------Step 3. Apply preprocessing---------------------------------------------------------- # ---------------------------Step 3. Apply preprocessing----------------------------------------------------------
# Get names of input and output blobs # Get names of input and output blobs
ppp = PrePostProcessor(model) ppp = ov.preprocess.PrePostProcessor(model)
# 1) Set input tensor information: # 1) Set input tensor information:
# - input() provides information about a single model input # - input() provides information about a single model input
# - precision of tensor is supposed to be 'u8' # - precision of tensor is supposed to be 'u8'
# - layout of data is 'NHWC' # - layout of data is 'NHWC'
ppp.input().tensor() \ ppp.input().tensor() \
.set_element_type(Type.u8) \ .set_element_type(ov.Type.u8) \
.set_layout(Layout('NHWC')) # noqa: N400 .set_layout(ov.Layout('NHWC')) # noqa: N400
# 2) Here we suppose model has 'NCHW' layout for input # 2) Here we suppose model has 'NCHW' layout for input
ppp.input().model().set_layout(Layout('NCHW')) ppp.input().model().set_layout(ov.Layout('NCHW'))
# 3) Set output tensor information: # 3) Set output tensor information:
# - precision of tensor is supposed to be 'f32' # - precision of tensor is supposed to be 'f32'
ppp.output().tensor().set_element_type(Type.f32) ppp.output().tensor().set_element_type(ov.Type.f32)
# 4) Apply preprocessing modifying the original 'model' # 4) Apply preprocessing modifying the original 'model'
model = ppp.build() model = ppp.build()
# Set a batch size equal to number of input images # Set a batch size equal to number of input images
set_batch(model, digits.shape[0]) ov.set_batch(model, digits.shape[0])
# ---------------------------Step 4. Loading model to the device------------------------------------------------------- # ---------------------------Step 4. Loading model to the device-------------------------------------------------------
log.info('Loading the model to the plugin') log.info('Loading the model to the plugin')
core = ov.Core()
compiled_model = core.compile_model(model, device_name) compiled_model = core.compile_model(model, device_name)
# ---------------------------Step 5. Prepare input--------------------------------------------------------------------- # ---------------------------Step 5. Prepare input---------------------------------------------------------------------


@ -9,8 +9,7 @@ from timeit import default_timer
from typing import Dict from typing import Dict
import numpy as np import numpy as np
from openvino.preprocess import PrePostProcessor import openvino as ov
from openvino.runtime import Core, InferRequest, Layout, Type, set_batch
from arg_parser import parse_args from arg_parser import parse_args
from file_options import read_utterance_file, write_utterance_file from file_options import read_utterance_file, write_utterance_file
@ -20,7 +19,7 @@ from utils import (GNA_ATOM_FREQUENCY, GNA_CORE_FREQUENCY,
set_scale_factors) set_scale_factors)
def do_inference(data: Dict[str, np.ndarray], infer_request: InferRequest, cw_l: int = 0, cw_r: int = 0) -> np.ndarray: def do_inference(data: Dict[str, np.ndarray], infer_request: ov.InferRequest, cw_l: int = 0, cw_r: int = 0) -> np.ndarray:
"""Do a synchronous matrix inference.""" """Do a synchronous matrix inference."""
frames_to_infer = {} frames_to_infer = {}
result = {} result = {}
@ -69,7 +68,7 @@ def main():
# --------------------------- Step 1. Initialize OpenVINO Runtime Core ------------------------------------------------ # --------------------------- Step 1. Initialize OpenVINO Runtime Core ------------------------------------------------
log.info('Creating OpenVINO Runtime Core') log.info('Creating OpenVINO Runtime Core')
core = Core() core = ov.Core()
# --------------------------- Step 2. Read a model -------------------------------------------------------------------- # --------------------------- Step 2. Read a model --------------------------------------------------------------------
if args.model: if args.model:
@ -83,19 +82,19 @@ def main():
if args.layout: if args.layout:
layouts = get_input_layouts(args.layout, model.inputs) layouts = get_input_layouts(args.layout, model.inputs)
ppp = PrePostProcessor(model) ppp = ov.preprocess.PrePostProcessor(model)
for i in range(len(model.inputs)): for i in range(len(model.inputs)):
ppp.input(i).tensor().set_element_type(Type.f32) ppp.input(i).tensor().set_element_type(ov.Type.f32)
input_name = model.input(i).get_any_name() input_name = model.input(i).get_any_name()
if args.layout and input_name in layouts.keys(): if args.layout and input_name in layouts.keys():
ppp.input(i).tensor().set_layout(Layout(layouts[input_name])) ppp.input(i).tensor().set_layout(ov.Layout(layouts[input_name]))
ppp.input(i).model().set_layout(Layout(layouts[input_name])) ppp.input(i).model().set_layout(ov.Layout(layouts[input_name]))
for i in range(len(model.outputs)): for i in range(len(model.outputs)):
ppp.output(i).tensor().set_element_type(Type.f32) ppp.output(i).tensor().set_element_type(ov.Type.f32)
model = ppp.build() model = ppp.build()
@ -103,7 +102,7 @@ def main():
batch_size = args.batch_size if args.context_window_left == args.context_window_right == 0 else 1 batch_size = args.batch_size if args.context_window_left == args.context_window_right == 0 else 1
if any((not _input.node.layout.empty for _input in model.inputs)): if any((not _input.node.layout.empty for _input in model.inputs)):
set_batch(model, batch_size) ov.set_batch(model, batch_size)
else: else:
log.warning('Layout is not set for any input, so custom batch size is not set') log.warning('Layout is not set for any input, so custom batch size is not set')


@ -107,7 +107,7 @@ Let's add a test case for new class. Start with imports and simple test of the c
```python ```python
import pytest import pytest
import numpy as np import numpy as np
import openvino.runtime as ov import openvino as ov
def test_mytensor_creation(): def test_mytensor_creation():
tensor = ov.MyTensor([1, 2, 3]) tensor = ov.MyTensor([1, 2, 3])