Files
openvino/docs/ops/detection/PSROIPooling_1.md
Tatiana Savina 2ae7d4428b [DOCS] shift to rst - opsets N (#17267)
* opset to rst

* change list indentations

* fix formula

* add n operations

* add negative and nonzero

* fix link

* specs to rst

* fix matrixnms path

* change path to if

* fix list

* fix format
2023-04-28 13:04:07 +02:00

3.4 KiB

PSROIPooling

@sphinxdirective

Versioned name: PSROIPooling-1

Category: Object detection

Short description: PSROIPooling computes position-sensitive pooling on regions of interest specified by input.

Detailed description: Reference <https://arxiv.org/pdf/1703.06211.pdf>__.

PSROIPooling operation takes two input blobs: with feature maps and with regions of interests (box coordinates). The latter is specified as five element tuples: [batch_id, x_1, y_1, x_2, y_2]. ROIs coordinates are specified in absolute values for the average mode and in normalized values (to [0,1] interval) for bilinear interpolation.

Attributes

  • output_dim

    • Description: output_dim is a pooled output channel number.
    • Range of values: a positive integer
    • Type: int
    • Required: yes
  • group_size

    • Description: group_size is the number of groups to encode position-sensitive score maps.
    • Range of values: a positive integer
    • Type: int
    • Default value: 1
    • Required: no
  • spatial_scale

    • Description: spatial_scale is a multiplicative spatial scale factor to translate ROI coordinates from their input scale to the scale used when pooling.
    • Range of values: a positive floating-point number
    • Type: float
    • Required: yes
  • mode

    • Description: mode specifies mode for pooling.

    • Range of values:

      • average - perform average pooling
      • bilinear - perform pooling with bilinear interpolation
    • Type: string

    • Default value: average

    • Required: no

  • spatial_bins_x

    • Description: spatial_bins_x specifies numbers of bins to divide the input feature maps over width. Used for "bilinear" mode only.
    • Range of values: a positive integer
    • Type: int
    • Default value: 1
    • Required: no
  • spatial_bins_y

    • Description: spatial_bins_y specifies numbers of bins to divide the input feature maps over height. Used for "bilinear" mode only.
    • Range of values: a positive integer
    • Type: int
    • Default value: 1
    • Required: no

Inputs:

  • 1: 4D input tensor with shape [N, C, H, W] and type T with feature maps. Required.

  • 2: 2D input tensor with shape [num_boxes, 5]. It contains a list of five element tuples that describe a region of interest: [batch_id, x_1, y_1, x_2, y_2]. Required. Batch indices must be in the range of [0, N-1].

Outputs:

  • 1: 4D output tensor with areas copied and interpolated from the 1st input tensor by coordinates of boxes from the 2nd input.

Types

  • T: any supported floating-point type.

Example

.. code-block:: cpp

<layer ... type="PSROIPooling" ... > 1 3240 38 38 100 5 100 360 6 6

@endsphinxdirective