openvino/PSROIPooling_1.md at 96bae7bd2c7e4135fcfa67c813d028c811f06463

Files

Nikolay Tyukaev ef45b5da8d Doc Migration (master) (#1377 )

* Doc Migration from Gitlab (#1289)

* doc migration

* fix

* Update FakeQuantize_1.md

* Update performance_benchmarks.md

* Updates graphs for FPGA

* Update performance_benchmarks.md

* Change DL Workbench structure (#1)

* Changed DL Workbench structure

* Fixed tags

* fixes

* Update ie_docs.xml

* Update performance_benchmarks_faq.md

* Fixes in DL Workbench layout

* Fixes for CVS-31290

* [DL Workbench] Minor correction

* Fix for CVS-30955

* Added nGraph deprecation notice as requested by Zoe

* fix broken links in api doxy layouts

* CVS-31131 fixes

* Additional fixes

* Fixed POT TOC

* Update PAC_Configure.md

PAC DCP 1.2.1 install guide.

* Update inference_engine_intro.md

* fix broken link

* Update opset.md

* fix

* added opset4 to layout

* added new opsets to layout, set labels for them

* Update VisionAcceleratorFPGA_Configure.md

Updated from 2020.3 to 2020.4

Co-authored-by: domi2000 <domi2000@users.noreply.github.com>

2020-07-20 17:36:08 +03:00

3.2 KiB

Raw Blame History

PSROIPooling

Versioned name: PSROIPooling-1

Category: Object detection

Short description: PSROIPooling computes position-sensitive pooling on regions of interest specified by input.

Detailed description: Reference.

PSROIPooling operation takes two input blobs: with feature maps and with regions of interests (box coordinates). The latter is specified as five element tuples: [batch_id, x_1, y_1, x_2, y_2]. ROIs coordinates are specified in absolute values for the average mode and in normalized values (to [0,1] interval) for bilinear interpolation.

Attributes

output_dim
- Description: output_dim is a pooled output channel number.
- Range of values: a positive integer
- Type: int
- Default value: None
- Required: yes
group_size
- Description: group_size is the number of groups to encode position-sensitive score maps. Use for average mode only.
- Range of values: a positive integer
- Type: int
- Default value: 1
- Required: no
spatial_scale
- Description: spatial_scale is a multiplicative spatial scale factor to translate ROI coordinates from their input scale to the scale used when pooling.
- Range of values: a positive floating-point number
- Type: float
- Default value: None
- Required: yes
mode
- Description: mode specifies mode for pooling.
- Range of values:
  - average - perform average pooling
  - bilinear - perform pooling with bilinear interpolation
- Type: string
- Default value: average
- Required: no
spatial_bins_x
- Description: spatial_bins_x specifies numbers of bins to divide the input feature maps over width. Used for "bilinear" mode only.
- Range of values: a positive integer
- Type: int
- Default value: 1
- Required: no
spatial_bins_y
- Description: spatial_bins_y specifies numbers of bins to divide the input feature maps over height. Used for "bilinear" mode only.
- Range of values: a positive integer
- Type: int
- Default value: 1
- Required: no

Inputs:

1: 4D input blob with feature maps. Required.
2: 2D input blob describing box consisting of five element tuples: [batch_id, x_1, y_1, x_2, y_2]. Required.

Outputs:

1: 4D output tensor with areas copied and interpolated from the 1st input tensor by coordinates of boxes from the 2nd input.

Example

<layer ... type="PSROIPooling" ... >
    <data group_size="6" mode="bilinear" output_dim="360" spatial_bins_x="3" spatial_bins_y="3" spatial_scale="1"/>
    <input>
        <port id="0">
            <dim>1</dim>
            <dim>3240</dim>
            <dim>38</dim>
            <dim>38</dim>
        </port>
        <port id="1">
            <dim>100</dim>
            <dim>5</dim>
        </port>
    </input>
    <output>
        <port id="2">
            <dim>100</dim>
            <dim>360</dim>
            <dim>6</dim>
            <dim>6</dim>
        </port>
    </output>
</layer>

3.2 KiB Raw Blame History

PSROIPooling

3.2 KiB

Raw Blame History