Benchmarks¶

ResNet-101 Speed Benchmark¶

The code is reproducible on Tesla P40 GPUs, and the experiment details can be found in examples/resnet101_speed_benchmark.

The code is reproducible on Tesla P40 GPUs, and the experiment details can be found in examples/resnet101_accuracy_benchmark.

The code is reproducible on Tesla P40 GPUs, and the experiment details can be found in examples/amoebanetd_speed_benchmark.

Experiment	AmoebaNet-D (L, F)	# of Model Parameters	Total Model Parameter Memory	Total Peak Activation Memory
naive-1	(6, 208)	90M	1.00GB	–
pipeline-1	(6, 416)	358M	4.01GB	6.64GB
pipeline-2	(6, 544)	613M	6.45GB	11.31GB
pipeline-4	(12, 544)	1.16B	13.00GB	18.72GB
pipeline-8	(24, 512)	2.01B	22.42GB	35.78GB