Files
openvino/docs/notebooks/224-3D-segmentation-point-clouds-with-output.rst
Maciej Smyk 3677dda457 [DOCS] 23.0 to 23.1 link update for master (#19584)
* 2023.1 link fix

* 2023.1 link fix

* 2023.1 link fix

* 2023.1 link fix

* 2023.1 link fix
2023-09-04 15:40:02 +02:00

273 lines
8.9 KiB
ReStructuredText
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

Part Segmentation of 3D Point Clouds with OpenVINO™
===================================================
This notebook demonstrates how to process `point
cloud <https://en.wikipedia.org/wiki/Point_cloud>`__ data and run 3D
Part Segmentation with OpenVINO. We use the
`PointNet <https://arxiv.org/abs/1612.00593>`__ pre-trained model to
detect each part of a chair and return its category.
PointNet
--------
PointNet was proposed by Charles Ruizhongtai Qi, a researcher at
Stanford University in 2016: `PointNet: Deep Learning on Point Sets for
3D Classification and
Segmentation <https://arxiv.org/abs/1612.00593>`__. The motivation
behind the research is to classify and segment 3D representations of
images. They use a data structure called point cloud, which is a set of
points that represents a 3D shape or object. PointNet provides a unified
architecture for applications ranging from object classification, part
segmentation, to scene semantic parsing. It is highly efficient and
effective, showing strong performance on par or even better than state
of the art.
.. _top:
**Table of contents**:
- `Imports <#imports>`__
- `Prepare the Model <#prepare-the-model>`__
- `Data Processing Module <#data-processing-module>`__
- `Visualize the original 3D data <#visualize-the-original-3d-data>`__
- `Run inference <#run-inference>`__
- `Select inference device <#select-inference-device>`__
Imports `⇑ <#top>`__
###############################################################################################################################
.. code:: ipython3
import sys
from pathlib import Path
from typing import Union
import numpy as np
import matplotlib.pyplot as plt
from openvino.runtime import Core, serialize
from openvino.tools import mo
sys.path.append("../utils")
from notebook_utils import download_file
Prepare the Model `⇑ <#top>`__
###############################################################################################################################
Download the pre-trained PointNet ONNX model. This pre-trained model is provided by
`axinc-ai <https://github.com/axinc-ai>`__, and you can find more
point clouds examples
`here <https://github.com/axinc-ai/ailia-models/tree/master/point_segmentation>`__.
.. code:: ipython3
# Set the data and model directories, model source URL and model filename
MODEL_DIR = Path("model")
MODEL_DIR.mkdir(exist_ok=True)
download_file("https://storage.googleapis.com/ailia-models/pointnet_pytorch/chair_100.onnx", directory=Path(MODEL_DIR), show_progress=False)
onnx_model_path = MODEL_DIR / "chair_100.onnx"
Convert the ONNX model to OpenVINO IR. An OpenVINO IR (Intermediate
Representation) model consists of an ``.xml`` file, containing
information about network topology, and a ``.bin`` file, containing the
weights and biases binary data. Model conversion Python API is used for
conversion of ONNX model to OpenVINO IR. The ``mo.convert_model`` Python
function returns an OpenVINO model ready to load on a device and start
making predictions. We can save it on a disk for next usage with
``openvino.runtime.serialize``. For more information about model
conversion Python API, see this
`page <https://docs.openvino.ai/2023.1/openvino_docs_model_processing_introduction.html>`__.
.. code:: ipython3
ir_model_xml = onnx_model_path.with_suffix(".xml")
core = Core()
if not ir_model_xml.exists():
# Convert model to OpenVINO Model
model = mo.convert_model(onnx_model_path, compress_to_fp16=True)
# Serialize model in OpenVINO IR format xml + bin
serialize(model, str(ir_model_xml))
else:
# Read model
model = core.read_model(model=ir_model_xml)
Data Processing Module `⇑ <#top>`__
###############################################################################################################################
.. code:: ipython3
def load_data(point_file: Union[str, Path]):
"""
Load the point cloud data and convert it to ndarray
Parameters:
point_file: string, path of .pts data
Returns:
point_set: point clound represented in np.array format
"""
point_set = np.loadtxt(point_file).astype(np.float32)
# normailization
point_set = point_set - np.expand_dims(np.mean(point_set, axis=0), 0) # center
dist = np.max(np.sqrt(np.sum(point_set ** 2, axis=1)), 0)
point_set = point_set / dist # scale
return point_set
def visualize(point_set:np.ndarray):
"""
Create a 3D view for data visualization
Parameters:
point_set: np.ndarray, the coordinate data in X Y Z format
"""
fig = plt.figure(dpi=192, figsize=(4, 4))
ax = fig.add_subplot(111, projection='3d')
X = point_set[:, 0]
Y = point_set[:, 2]
Z = point_set[:, 1]
# Scale the view of each axis to adapt to the coordinate data distribution
max_range = np.array([X.max() - X.min(), Y.max() - Y.min(), Z.max() - Z.min()]).max() * 0.5
mid_x = (X.max() + X.min()) * 0.5
mid_y = (Y.max() + Y.min()) * 0.5
mid_z = (Z.max() + Z.min()) * 0.5
ax.set_xlim(mid_x - max_range, mid_x + max_range)
ax.set_ylim(mid_y - max_range, mid_y + max_range)
ax.set_zlim(mid_z - max_range, mid_z + max_range)
plt.tick_params(labelsize=5)
ax.set_xlabel('X', fontsize=10)
ax.set_ylabel('Y', fontsize=10)
ax.set_zlabel('Z', fontsize=10)
return ax
Visualize the original 3D data `⇑ <#top>`__
###############################################################################################################################
The point cloud data can be downloaded from
`ShapeNet <https://shapenet.cs.stanford.edu/ericyi/shapenetcore_partanno_segmentation_benchmark_v0.zip>`__,
a large-scale dataset of 3D shapes. Here, we select the 3D data of a
chair for example.
.. code:: ipython3
point_data = "../data/pts/chair.pts"
points = load_data(point_data)
X = points[:, 0]
Y = points[:, 2]
Z = points[:, 1]
ax = visualize(points)
ax.scatter3D(X, Y, Z, s=5, cmap="jet", marker="o", label='chair')
ax.set_title('3D Visualization')
plt.legend(loc='upper right', fontsize=8)
plt.show()
.. image:: 224-3D-segmentation-point-clouds-with-output_files/224-3D-segmentation-point-clouds-with-output_10_0.png
Run inference `⇑ <#top>`__
###############################################################################################################################
Run inference and visualize the results of 3D segmentation. - The input data is a point cloud with
``1 batch size``\ \ ``3 axis value`` (x, y, z) and
``arbitrary number of points`` (dynamic shape). - The output data is a
mask with ``1 batch size`` and ``4 classification confidence`` for each
input point.
.. code:: ipython3
# Parts of a chair
classes = ['back', 'seat', 'leg', 'arm']
# Preprocess the input data
point = points.transpose(1, 0)
point = np.expand_dims(point, axis=0)
# Print info about model input and output shape
print(f"input shape: {model.input(0).partial_shape}")
print(f"output shape: {model.output(0).partial_shape}")
.. parsed-literal::
input shape: [1,3,?]
output shape: [1,?,4]
Select inference device `⇑ <#top>`__
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Select device from dropdown list for running inference using OpenVINO:
.. code:: ipython3
import ipywidgets as widgets
device = widgets.Dropdown(
options=core.available_devices + ["AUTO"],
value='AUTO',
description='Device:',
disabled=False,
)
device
.. parsed-literal::
Dropdown(description='Device:', index=1, options=('CPU', 'AUTO'), value='AUTO')
.. code:: ipython3
# Inference
compiled_model = core.compile_model(model=model, device_name=device.value)
output_layer = compiled_model.output(0)
result = compiled_model([point])[output_layer]
# Find the label map for all points of chair with highest confidence
pred = np.argmax(result[0], axis=1)
ax = visualize(point)
for i, name in enumerate([0, 1, 2, 3]):
XCur = []
YCur = []
ZCur = []
for j, nameCur in enumerate(pred):
if name == nameCur:
XCur.append(X[j])
YCur.append(Y[j])
ZCur.append(Z[j])
XCur = np.array(XCur)
YCur = np.array(YCur)
ZCur = np.array(ZCur)
# add current point of the part
ax.scatter(XCur, YCur, ZCur, s=5, cmap="jet", marker="o", label=classes[i])
ax.set_title('3D Segmentation Visualization')
plt.legend(loc='upper right', fontsize=8)
plt.show()
.. image:: 224-3D-segmentation-point-clouds-with-output_files/224-3D-segmentation-point-clouds-with-output_15_0.png