renganxu commented on issue #14047: mxnet.base.MXNetError: Cannot find argument
'cudnn_algo_verbose'
URL:
https://github.com/apache/incubator-mxnet/issues/14047#issuecomment-479182021
Hi @ptrendx, Horovod sees all 8 ranks there. Because with 1 node, the speed
is ~5000 images/sec, and wit
renganxu commented on issue #14047: mxnet.base.MXNetError: Cannot find argument
'cudnn_algo_verbose'
URL:
https://github.com/apache/incubator-mxnet/issues/14047#issuecomment-477673243
Hi @ptrendx, since my system could not install Docker, I converted the mxnet
Docker container to Singular
renganxu commented on issue #14047: mxnet.base.MXNetError: Cannot find argument
'cudnn_algo_verbose'
URL:
https://github.com/apache/incubator-mxnet/issues/14047#issuecomment-475767300
Thanks very much for helping check this issue! I really want to know why the
result is not reproducible.
renganxu commented on issue #14047: mxnet.base.MXNetError: Cannot find argument
'cudnn_algo_verbose'
URL:
https://github.com/apache/incubator-mxnet/issues/14047#issuecomment-475270354
Hi @ptrendx, I used the same commands as in the NVIDIA GitHub to create the
training dataset, but still coul
renganxu commented on issue #14047: mxnet.base.MXNetError: Cannot find argument
'cudnn_algo_verbose'
URL:
https://github.com/apache/incubator-mxnet/issues/14047#issuecomment-470243418
Hi @ptrendx, could you give some guidance on the question in my previous
post? Also what is "--dali-nvjpeg