* benchmark_app: factor out advanced options
* cpp/benchmark_app: add usage word
* move api to advanced, group b/shape/data_shape/layout into Input shapes, factor out Statistics dumping options
* Factor out Device-specific performance options
* Factor out Preprocessing options
* Minor regroup