rahul003 commented on issue #9774: mx.io.ImageRecordIter does not respect dtype argument / FP16 performance on Volta URL: https://github.com/apache/incubator-mxnet/issues/9774#issuecomment-367141388 Okay cool, I'll try to document that. I was using the maximum batch size which is a multiple of 8 that I could fit with p3.16x, i.e. 960 for the imagenet script, and set data-nthreads to 24. ```python train_imagenet.py --data-train ... --data-val ... --batch-size 960 --gpus 0,1,2,3,4,5,6,7 --dtype float16```
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org With regards, Apache Git Services