[GitHub] VGalata commented on issue #8792: Different training performance between mxnet v. 0.11.0 and v. 0.12.1

2017-11-23 Thread GitBox
VGalata commented on issue #8792: Different training performance between mxnet v. 0.11.0 and v. 0.12.1 URL: https://github.com/apache/incubator-mxnet/issues/8792#issuecomment-346757566 Dear @jeremiedb, Thank you for your fast reply! I downloaded the `master` branch (`v0.9.3a-

[GitHub] szha closed pull request #8788: fix build

2017-11-23 Thread GitBox
szha closed pull request #8788: fix build URL: https://github.com/apache/incubator-mxnet/pull/8788 This is a PR merged from a forked repository. As GitHub hides the original diff on merge, it is displayed below for the sake of provenance: As this is a foreign pull request (from a fork), t

[incubator-mxnet] branch master updated: fix build (#8788)

2017-11-23 Thread zhasheng
This is an automated email from the ASF dual-hosted git repository. zhasheng pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/incubator-mxnet.git The following commit(s) were added to refs/heads/master by this push: new 3c32f76 fix build (#8788) 3c32f76 is

[GitHub] tanhm07 commented on issue #8777: Error: package or namespace load failed for ‘mxnet’:

2017-11-23 Thread GitBox
tanhm07 commented on issue #8777: Error: package or namespace load failed for ‘mxnet’: URL: https://github.com/apache/incubator-mxnet/issues/8777#issuecomment-346750754 I downgraded to CUDA 8 with CUDNN v5.1 and it works now! Thanks very much for your time and help.

[GitHub] cjolivier01 closed pull request #8774: remove meaningless type qualifier

2017-11-23 Thread GitBox
cjolivier01 closed pull request #8774: remove meaningless type qualifier URL: https://github.com/apache/incubator-mxnet/pull/8774 This is a PR merged from a forked repository. As GitHub hides the original diff on merge, it is displayed below for the sake of provenance: As this is a foreig

[incubator-mxnet] branch master updated: remove meaningless type qualifier (#8774)

2017-11-23 Thread cjolivier01
This is an automated email from the ASF dual-hosted git repository. cjolivier01 pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/incubator-mxnet.git The following commit(s) were added to refs/heads/master by this push: new 5088477 remove meaningless type q

[GitHub] KellenSunderland opened a new issue #8805: Release Branch 1.0 crashing for several tests in CUDA 7.5 docker containers.

2017-11-23 Thread GitBox
KellenSunderland opened a new issue #8805: Release Branch 1.0 crashing for several tests in CUDA 7.5 docker containers. URL: https://github.com/apache/incubator-mxnet/issues/8805 ## Description The release v1.0 branch is crashing for several tests with the same stacktrace. Branch: h

[GitHub] wkcn closed issue #8569: The conflict between MXNet and OpenCV

2017-11-23 Thread GitBox
wkcn closed issue #8569: The conflict between MXNet and OpenCV URL: https://github.com/apache/incubator-mxnet/issues/8569 This is an automated message from the Apache Git Service. To respond to the message, please log on GitH

[GitHub] eric-haibin-lin commented on issue #8500: program crash when run sparse model predict

2017-11-23 Thread GitBox
eric-haibin-lin commented on issue #8500: program crash when run sparse model predict URL: https://github.com/apache/incubator-mxnet/issues/8500#issuecomment-346721500 There's a 2-second gap between batches. What metric are you using? Did you verify how much time computing the metric take

[GitHub] ZiyueHuang commented on issue #8774: remove meaningless type qualifier

2017-11-23 Thread GitBox
ZiyueHuang commented on issue #8774: remove meaningless type qualifier URL: https://github.com/apache/incubator-mxnet/pull/8774#issuecomment-346719663 CI has passed. @szha @cjolivier01

[GitHub] jeremiedb commented on issue #8792: Different training performance between mxnet v. 0.11.0 and v. 0.12.1

2017-11-23 Thread GitBox
jeremiedb commented on issue #8792: Different training performance between mxnet v. 0.11.0 and v. 0.12.1 URL: https://github.com/apache/incubator-mxnet/issues/8792#issuecomment-346719088 There's been a bug introduced for a short time in master after `mx.runif` and `mx.rnorm` used by the i

[GitHub] lx75249 closed pull request #8796: Avoid odr-using static member that will cause undefined references in…

2017-11-23 Thread GitBox
lx75249 closed pull request #8796: Avoid odr-using static member that will cause undefined references in… URL: https://github.com/apache/incubator-mxnet/pull/8796 This is a PR merged from a forked repository. As GitHub hides the original diff on merge, it is displayed below for the sake o

[GitHub] szha commented on issue #8784: Fix warning on meaningless return type qualifier

2017-11-23 Thread GitBox
szha commented on issue #8784: Fix warning on meaningless return type qualifier URL: https://github.com/apache/incubator-mxnet/pull/8784#issuecomment-346713299 @larroy let's use Ziyue's #8774 for patching master.

[GitHub] szha commented on issue #8796: Avoid odr-using static member that will cause undefined references in…

2017-11-23 Thread GitBox
szha commented on issue #8796: Avoid odr-using static member that will cause undefined references in… URL: https://github.com/apache/incubator-mxnet/pull/8796#issuecomment-346712598 I have a fix for this problem at #8788

[GitHub] szha closed issue #6803: There are some problems running MNIST on the CPU cluster

2017-11-23 Thread GitBox
szha closed issue #6803: There are some problems running MNIST on the CPU cluster URL: https://github.com/apache/incubator-mxnet/issues/6803

[GitHub] szha closed pull request #7423: Do not use LG in case of DMLC_GLOG_DEFINED enabled

2017-11-23 Thread GitBox
szha closed pull request #7423: Do not use LG in case of DMLC_GLOG_DEFINED enabled URL: https://github.com/apache/incubator-mxnet/pull/7423 This is a PR merged from a forked repository. As GitHub hides the original diff on merge, it is displayed below for the sake of provenance: As this

[GitHub] szha commented on issue #6803: There are some problems running MNIST on the CPU cluster

2017-11-23 Thread GitBox
szha commented on issue #6803: There are some problems running MNIST on the CPU cluster URL: https://github.com/apache/incubator-mxnet/issues/6803#issuecomment-346712430 This issue is closed due to lack of activity in the last 90 days. Feel free to ping me to reopen if this is still an ac

[GitHub] szha commented on issue #7423: Do not use LG in case of DMLC_GLOG_DEFINED enabled

2017-11-23 Thread GitBox
szha commented on issue #7423: Do not use LG in case of DMLC_GLOG_DEFINED enabled URL: https://github.com/apache/incubator-mxnet/pull/7423#issuecomment-346712431 This issue is closed due to lack of activity in the last 90 days. Feel free to ping me to reopen if this is still an active issu

[GitHub] hapownli closed issue #8756: As there are many JAVA programer, and JDK9 has HAD JAVA REPL JSHELL like PYTHON OR SCALA shell, it will be easy to use JAVA training model. , When will MXNET sup

2017-11-23 Thread GitBox
hapownli closed issue #8756: As there are many JAVA programer, and JDK9 has HAD JAVA REPL JSHELL like PYTHON OR SCALA shell, it will be easy to use JAVA training model. , When will MXNET support JAVA programing language? URL: https://github.com/apache/incubator-mxnet/issues/8756

[GitHub] lucidfrontier45 commented on issue #8800: Is PyPI wheel linked to cudnn?

2017-11-23 Thread GitBox
lucidfrontier45 commented on issue #8800: Is PyPI wheel linked to cudnn? URL: https://github.com/apache/incubator-mxnet/issues/8800#issuecomment-346710579 I got it. Thank you.

[GitHub] lucidfrontier45 closed issue #8800: Is PyPI wheel linked to cudnn?

2017-11-23 Thread GitBox
lucidfrontier45 closed issue #8800: Is PyPI wheel linked to cudnn? URL: https://github.com/apache/incubator-mxnet/issues/8800

[incubator-mxnet] branch v1.0.0 updated: Fix weird hang bug due to cuInit sometimes calls fork (#8790)

2017-11-23 Thread zhasheng
This is an automated email from the ASF dual-hosted git repository. zhasheng pushed a commit to branch v1.0.0 in repository https://gitbox.apache.org/repos/asf/incubator-mxnet.git The following commit(s) were added to refs/heads/v1.0.0 by this push: new 7b40c03 Fix weird hang bug due to cu

[incubator-mxnet] branch master updated: Fix weird hang bug due to cuInit sometimes calls fork (#8804)

2017-11-23 Thread zhasheng
This is an automated email from the ASF dual-hosted git repository. zhasheng pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/incubator-mxnet.git The following commit(s) were added to refs/heads/master by this push: new 0bf76a8 Fix weird hang bug due to cu

[GitHub] szha opened a new pull request #8804: Fix weird hang bug due to cuInit sometimes calls fork

2017-11-23 Thread GitBox
szha opened a new pull request #8804: Fix weird hang bug due to cuInit sometimes calls fork URL: https://github.com/apache/incubator-mxnet/pull/8804 ## Description ## (Brief description on what this PR is about) ## Checklist ## ### Essentials ### - [ ] Passed code style chec

[GitHub] szha closed pull request #8804: Fix weird hang bug due to cuInit sometimes calls fork

2017-11-23 Thread GitBox
szha closed pull request #8804: Fix weird hang bug due to cuInit sometimes calls fork URL: https://github.com/apache/incubator-mxnet/pull/8804 This is a PR merged from a forked repository. As GitHub hides the original diff on merge, it is displayed below for the sake of provenance: As th

[incubator-mxnet] branch master updated: Fix weird hang bug due to cuInit sometimes calls fork (#8790)

2017-11-23 Thread jxie
This is an automated email from the ASF dual-hosted git repository. jxie pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/incubator-mxnet.git The following commit(s) were added to refs/heads/master by this push: new 3953e79 Fix weird hang bug due to cuInit

[GitHub] piiswrong closed pull request #8790: Fix weird hang bug due to cuInit sometimes calls fork

2017-11-23 Thread GitBox
piiswrong closed pull request #8790: Fix weird hang bug due to cuInit sometimes calls fork URL: https://github.com/apache/incubator-mxnet/pull/8790 This is a PR merged from a forked repository. As GitHub hides the original diff on merge, it is displayed below for the sake of provenance:

[GitHub] szha opened a new pull request #8788: [WIP] fix build

2017-11-23 Thread GitBox
szha opened a new pull request #8788: [WIP] fix build URL: https://github.com/apache/incubator-mxnet/pull/8788 ## Description ## Addresses #8786 with the same fix as #8692

[GitHub] larroy commented on issue #8793: Fix link error on DEBUG=1. Need mshadow::gpu::kDevMask def in TU

2017-11-23 Thread GitBox
larroy commented on issue #8793: Fix link error on DEBUG=1. Need mshadow::gpu::kDevMask def in TU URL: https://github.com/apache/incubator-mxnet/pull/8793#issuecomment-346689786 @szha yep, seems your fix is better, and they don't want any more changes for v1.0.0, so closing this.

[GitHub] larroy closed pull request #8793: Fix link error on DEBUG=1. Need mshadow::gpu::kDevMask def in TU

2017-11-23 Thread GitBox
larroy closed pull request #8793: Fix link error on DEBUG=1. Need mshadow::gpu::kDevMask def in TU URL: https://github.com/apache/incubator-mxnet/pull/8793 This is a PR merged from a forked repository. As GitHub hides the original diff on merge, it is displayed below for the sake of prove

[GitHub] szha commented on issue #8793: Fix link error on DEBUG=1. Need mshadow::gpu::kDevMask def in TU

2017-11-23 Thread GitBox
szha commented on issue #8793: Fix link error on DEBUG=1. Need mshadow::gpu::kDevMask def in TU URL: https://github.com/apache/incubator-mxnet/pull/8793#issuecomment-346686152 You can use the fix I had in #8788 instead, which is the same fix as in #8692

[GitHub] larroy commented on issue #8793: Fix link error on DEBUG=1. Need mshadow::gpu::kDevMask def in TU

2017-11-23 Thread GitBox
larroy commented on issue #8793: Fix link error on DEBUG=1. Need mshadow::gpu::kDevMask def in TU URL: https://github.com/apache/incubator-mxnet/pull/8793#issuecomment-346685420 I don't think it's the right place to do it, but right now it fixes the build for the release, we should do it i

[GitHub] larroy commented on issue #8790: Fix weird hang bug due to cuInit sometimes calls fork

2017-11-23 Thread GitBox
larroy commented on issue #8790: Fix weird hang bug due to cuInit sometimes calls fork URL: https://github.com/apache/incubator-mxnet/pull/8790#issuecomment-346683066 ? as a hotfix I think it's ok but I think we should refactor this code, I don't like what we are doing at pthread_atfork,

[GitHub] szha commented on issue #8800: Is PyPI wheel linked to cudnn?

2017-11-23 Thread GitBox
szha commented on issue #8800: Is PyPI wheel linked to cudnn? URL: https://github.com/apache/incubator-mxnet/issues/8800#issuecomment-346680810 yes. libcudnn is statically linked.

[GitHub] cjolivier01 commented on a change in pull request #8803: Small first iteration for profiler: use nonblocking queue

2017-11-23 Thread GitBox
cjolivier01 commented on a change in pull request #8803: Small first iteration for profiler: use nonblocking queue URL: https://github.com/apache/incubator-mxnet/pull/8803#discussion_r152858233 ## File path: src/engine/profiler.h ## @@ -65,11 +66,24 @@ struct OprExecStat {

[GitHub] bhavinthaker commented on issue #8790: Fix weird hang bug due to cuInit sometimes calls fork

2017-11-23 Thread GitBox
bhavinthaker commented on issue #8790: Fix weird hang bug due to cuInit sometimes calls fork URL: https://github.com/apache/incubator-mxnet/pull/8790#issuecomment-346679938 Thanks ptrendx@ for the quick review.

[GitHub] cjolivier01 commented on a change in pull request #8803: Small first iteration for profiler: use nonblocking queue

2017-11-23 Thread GitBox
cjolivier01 commented on a change in pull request #8803: Small first iteration for profiler: use nonblocking queue URL: https://github.com/apache/incubator-mxnet/pull/8803#discussion_r152858169 ## File path: CMakeLists.txt ## @@ -38,7 +38,7 @@ mxnet_option(USE_MKL_EXPERIME

[GitHub] cjolivier01 opened a new pull request #8803: Small first iteration for profiler: use nonblocking queue

2017-11-23 Thread GitBox
cjolivier01 opened a new pull request #8803: Small first iteration for profiler: use nonblocking queue URL: https://github.com/apache/incubator-mxnet/pull/8803 ## Description ## Use nonblocking queue for profiler Next up: buffer server so allocation not done in-line ## Chec

[GitHub] ptrendx commented on issue #8790: Fix weird hang bug due to cuInit sometimes calls fork

2017-11-23 Thread GitBox
ptrendx commented on issue #8790: Fix weird hang bug due to cuInit sometimes calls fork URL: https://github.com/apache/incubator-mxnet/pull/8790#issuecomment-346676190 I did not see such behavior before (cuinit calling fork), but the PR itself seems fine and should not affect anything nega

[GitHub] cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346672550 I am not one to pass judgement on anything python -- I believe that everything should b

[GitHub] SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346672168 Linux (Ubuntu 16.04 LTS - Gnome 3 flavor) I meant more along the lines as to if thi

[GitHub] cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346671978 What's your OS? DGX I assume is a Linux image?

[GitHub] cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346671978 What's your OS?

[GitHub] cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346671883 I don't see anything strange about the script -- like you said, just about everything i

[GitHub] SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346671750 @cjolivier01 what do you think of script though? Likewise :) (abroad in Germany)

[GitHub] cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346671698 ok :) Happy Thanksgiving! (If you're in the US)

[GitHub] cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346671639 maybe. CPU usage profiles look about the same for both...

[GitHub] SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346671615 I will contact them and have them look into it

[GitHub] SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346671587 Ok, then it appears that this might be nvidia docker image specific

[GitHub] cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346671211 2: /usr/bin/python3.5 /home/coolivie/src/DeepLearning/python/file_2.py --gpus 2 --in

[GitHub] cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346671211 2: (3): MaxPool2D(size=(2, 2), stride=(2, 2), padding=(0, 0), ceil_mode=False)

[GitHub] SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346671199 personal machine is built from master

[GitHub] SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346671164 DGX has a nvidia docker image. Local machine is pip install of the cuda version

[GitHub] cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346671077 1: /usr/bin/python3.5 /home/coolivie/src/DeepLearning/python/file_1.py --gpus 2 De

[GitHub] cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346670844 I am using 2.7 here, let me try 3.5...

[GitHub] cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346670934 are you using latest master, by the way? or a pip install?

[GitHub] SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346670791 haha :) yeah, I didn't implement it there though. That is odd, I ran mine on a DGX..

[GitHub] cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346670648 terrible accuracy on the second one, though :)

[GitHub] cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346670273 I have two GTX 1080's on my home machine here, and I get roughly the same speed on two

[GitHub] SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346669382 cool :)

[GitHub] cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346669264 nvm, file naming, i get it..

[GitHub] SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346669200 is file 3 also in the same directory?

[GitHub] cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346668952 when running file 2, i get: Connected to pydev debugger (build 172.4343.16) [ERRO

[GitHub] cjolivier01 closed issue #8617: Failed to detect tcmalloc or jemalloc for Fedora 26

2017-11-23 Thread GitBox
cjolivier01 closed issue #8617: Failed to detect tcmalloc or jemalloc for Fedora 26 URL: https://github.com/apache/incubator-mxnet/issues/8617

[GitHub] cjolivier01 commented on issue #8617: Failed to detect tcmalloc or jemalloc for Fedora 26

2017-11-23 Thread GitBox
cjolivier01 commented on issue #8617: Failed to detect tcmalloc or jemalloc for Fedora 26 URL: https://github.com/apache/incubator-mxnet/issues/8617#issuecomment-346664647 Please reopen if necessary, but I think we're just going to standardize on cmake. Let me know if cmake doesn't find i

[GitHub] expnn opened a new issue #8617: Failed to detect tcmalloc or jemalloc for Fedora 26

2017-11-23 Thread GitBox
expnn opened a new issue #8617: Failed to detect tcmalloc or jemalloc for Fedora 26 URL: https://github.com/apache/incubator-mxnet/issues/8617 The wildcard given by the current Makefile failed to detect the gperftools/jemalloc malloc libraries. Currently, it uses the following snippet

[GitHub] cjolivier01 closed issue #8617: Failed to detect tcmalloc or jemalloc for Fedora 26

2017-11-23 Thread GitBox
cjolivier01 closed issue #8617: Failed to detect tcmalloc or jemalloc for Fedora 26 URL: https://github.com/apache/incubator-mxnet/issues/8617

[GitHub] SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346660854 @cjolivier01 No worries. I appreciate your assistance. However, this behavior could signa

[GitHub] aijanai commented on issue #8802: Predicting with batch size lower than the one used for training gives error

2017-11-23 Thread GitBox
aijanai commented on issue #8802: Predicting with batch size lower than the one used for training gives error URL: https://github.com/apache/incubator-mxnet/issues/8802#issuecomment-346658576 #6752

[GitHub] aijanai closed issue #8802: Predicting with batch size lower than the one used for training gives error

2017-11-23 Thread GitBox
aijanai closed issue #8802: Predicting with batch size lower than the one used for training gives error URL: https://github.com/apache/incubator-mxnet/issues/8802

[GitHub] aijanai commented on issue #8802: Predicting with batch size lower than the one used for training gives error

2017-11-23 Thread GitBox
aijanai commented on issue #8802: Predicting with batch size lower than the one used for training gives error URL: https://github.com/apache/incubator-mxnet/issues/8802#issuecomment-346657432 Ok, looks like it was dealt with in https://github.com/apache/incubator-mxnet/issues/6752: adding pad=

[GitHub] cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
cjolivier01 commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346655435 I apologize, but I am not very good yet at reading large chunks of python without a deb

[GitHub] aijanai opened a new issue #8802: Predicting with batch size lower than the one used for training gives error

2017-11-23 Thread GitBox
aijanai opened a new issue #8802: Predicting with batch size lower than the one used for training gives error URL: https://github.com/apache/incubator-mxnet/issues/8802 ## Description I train with dataset size = 904 and batch size = 50. If I try then to predict with the model with a t

[GitHub] LeonJWH opened a new issue #8801: Cannot find custom operator when visualizing network.

2017-11-23 Thread GitBox
LeonJWH opened a new issue #8801: Cannot find custom operator when visualizing network. URL: https://github.com/apache/incubator-mxnet/issues/8801 When I tried to visualize the mask rcnn network, I used the script below: ``` import mxnet as mx sym = mx.sym.load('rcnn1-symbol.json')

[GitHub] lucidfrontier45 opened a new issue #8800: Is PyPI wheel linked to cudnn?

2017-11-23 Thread GitBox
lucidfrontier45 opened a new issue #8800: Is PyPI wheel linked to cudnn? URL: https://github.com/apache/incubator-mxnet/issues/8800 ## Description mxnet-cu90 package in PyPI seems to not be compiled against CuDNN. ## Environment info (Required) OS: Ubuntu 16.04 Python: 3.6.3

[GitHub] SumNeuron commented on issue #8794: GPU throws out of index error?

2017-11-23 Thread GitBox
SumNeuron commented on issue #8794: GPU throws out of index error? URL: https://github.com/apache/incubator-mxnet/issues/8794#issuecomment-346627089 Is there a way to set the global context? or for mxnet to auto switch to the greater of two contexts automatically (e.g. if I accidentally i

[GitHub] SumNeuron commented on issue #8794: GPU throws out of index error?

2017-11-23 Thread GitBox
SumNeuron commented on issue #8794: GPU throws out of index error? URL: https://github.com/apache/incubator-mxnet/issues/8794#issuecomment-346626718 to clarify, *both* fixes are needed This is an automated message from the Ap

[GitHub] SumNeuron commented on issue #8794: GPU throws out of index error?

2017-11-23 Thread GitBox
SumNeuron commented on issue #8794: GPU throws out of index error? URL: https://github.com/apache/incubator-mxnet/issues/8794#issuecomment-346626446 doing just that results in: ``` MXNetErrorTraceback (most recent call last) in () 5

[GitHub] ZiyueHuang commented on issue #8794: GPU throws out of index error?

2017-11-23 Thread GitBox
ZiyueHuang commented on issue #8794: GPU throws out of index error? URL: https://github.com/apache/incubator-mxnet/issues/8794#issuecomment-346624746 Please try `data.as_in_context(ctx)`. This is an automated message from the

[GitHub] SumNeuron commented on issue #8716: ImageDetIter uses type list as inside making as_in_context break

2017-11-23 Thread GitBox
SumNeuron commented on issue #8716: ImageDetIter uses type list as inside making as_in_context break URL: https://github.com/apache/incubator-mxnet/issues/8716#issuecomment-346624735 @zhreshold is this intended behavior? Th

[GitHub] SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!)

2017-11-23 Thread GitBox
SumNeuron commented on issue #8751: Distributed Training has inverse results when imported (8 GPUS is slower than 1!) URL: https://github.com/apache/incubator-mxnet/issues/8751#issuecomment-346624513 @cjolivier01 any ideas?

[GitHub] SumNeuron commented on issue #8756: As there are many JAVA programer, and JDK9 has HAD JAVA REPL JSHELL like PYTHON OR SCALA shell, it will be easy to use JAVA training model. , When will MX

2017-11-23 Thread GitBox
SumNeuron commented on issue #8756: As there are many JAVA programer, and JDK9 has HAD JAVA REPL JSHELL like PYTHON OR SCALA shell, it will be easy to use JAVA training model. , When will MXNET support JAVA programing language? URL: https://github.com/apache/incubator-mxnet/issues/8756#issueco

[GitHub] mseeger opened a new issue #8799: Dangling outputs and dtype != float32: Gradient computation fails

2017-11-23 Thread GitBox
mseeger opened a new issue #8799: Dangling outputs and dtype != float32: Gradient computation fails URL: https://github.com/apache/incubator-mxnet/issues/8799 Hello, please see the complete example below for full details. The problem is that if in the course of a computation, cert

[GitHub] KangGrandesty opened a new pull request #8798: new operator

2017-11-23 Thread GitBox
KangGrandesty opened a new pull request #8798: new operator URL: https://github.com/apache/incubator-mxnet/pull/8798 ## Description ## (Brief description on what this PR is about) ## Checklist ## ### Essentials ### - [ ] Passed code style checking (`make lint`) - [ ] Chang

[GitHub] KangGrandesty closed pull request #8798: new operator

2017-11-23 Thread GitBox
KangGrandesty closed pull request #8798: new operator URL: https://github.com/apache/incubator-mxnet/pull/8798 This is a PR merged from a forked repository. As GitHub hides the original diff on merge, it is displayed below for the sake of provenance: As this is a foreign pull request (fro

[GitHub] xinghedyc opened a new pull request #8797: fix errors for reinterpret_cast usage

2017-11-23 Thread GitBox
xinghedyc opened a new pull request #8797: fix errors for reinterpret_cast usage URL: https://github.com/apache/incubator-mxnet/pull/8797 ## Description ## (Brief description on what this PR is about) fix error for reinterpret_cast ## Checklist ## ### Essentials ### - [ ] Pass

[GitHub] szha commented on issue #7557: Parsing Training accuracy, Training loss log files to plot Training accuracy, Training loss curves

2017-11-23 Thread GitBox
szha commented on issue #7557: Parsing Training accuracy, Training loss log files to plot Training accuracy, Training loss curves URL: https://github.com/apache/incubator-mxnet/issues/7557#issuecomment-346605404 This issue is closed due to lack of activity in the last 90 days. Feel free t

[GitHub] szha commented on issue #7591: Error:include/mxnet/./ndarray.h:109: Unknown type enum 1794200432

2017-11-23 Thread GitBox
szha commented on issue #7591: Error:include/mxnet/./ndarray.h:109: Unknown type enum 1794200432 URL: https://github.com/apache/incubator-mxnet/issues/7591#issuecomment-346605408 This issue is closed due to lack of activity in the last 90 days. Feel free to ping me to reopen if this is st

[GitHub] szha closed issue #7557: Parsing Training accuracy, Training loss log files to plot Training accuracy, Training loss curves

2017-11-23 Thread GitBox
szha closed issue #7557: Parsing Training accuracy, Training loss log files to plot Training accuracy, Training loss curves URL: https://github.com/apache/incubator-mxnet/issues/7557 This is an automated message from the Apa

[GitHub] szha closed issue #7591: Error:include/mxnet/./ndarray.h:109: Unknown type enum 1794200432

2017-11-23 Thread GitBox
szha closed issue #7591: Error:include/mxnet/./ndarray.h:109: Unknown type enum 1794200432 URL: https://github.com/apache/incubator-mxnet/issues/7591 This is an automated message from the Apache Git Service. To respond to th

[GitHub] beeva-enriqueotero commented on issue #6179: SoftmaxOutput - how to get probabilities?

2017-11-23 Thread GitBox
beeva-enriqueotero commented on issue #6179: SoftmaxOutput - how to get probabilities? URL: https://github.com/apache/incubator-mxnet/issues/6179#issuecomment-346604827 In order to apply a typical cross-entropy function as eval_metric you need access to these probabilities. Otherwise trai

[GitHub] lx75249 opened a new pull request #8796: Avoid odr-using static member that will cause undefined references in…

2017-11-23 Thread GitBox
lx75249 opened a new pull request #8796: Avoid odr-using static member that will cause undefined references in… URL: https://github.com/apache/incubator-mxnet/pull/8796 Directly passing `mshadow::gpu::kDevMask` to `CHECK_EQ` violates odr-use rule of inplace initialized static data member,

[GitHub] xinghedyc opened a new issue #8795: can cpp-package use operators in contrib directory?

2017-11-23 Thread GitBox
xinghedyc opened a new issue #8795: can cpp-package use operators in contrib directory? URL: https://github.com/apache/incubator-mxnet/issues/8795 1. I found that the op.h generated by OpWrapperGenerator.py doesn't include contrib operators. 2. Is it possible to use C++ to write a predict co

[GitHub] cjolivier01 commented on issue #8737: Use RAII and fix Coverity resource leaks #10371 and others

2017-11-23 Thread GitBox
cjolivier01 commented on issue #8737: Use RAII and fix Coverity resource leaks #10371 and others URL: https://github.com/apache/incubator-mxnet/pull/8737#issuecomment-346571223 Note that SimpleBind tends to be called once, so a shared pointer and its atomic counter isn't going to affect p

[GitHub] SumNeuron opened a new issue #8794: GPU throws out of index error?

2017-11-23 Thread GitBox
SumNeuron opened a new issue #8794: GPU throws out of index error? URL: https://github.com/apache/incubator-mxnet/issues/8794 ## Description (Brief description of the problem in no more than 2 sentences.) I have two files: - VAE.py which defines a variational auto encoder - vae

[GitHub] cjolivier01 commented on issue #8777: Error: package or namespace load failed for ‘mxnet’:

2017-11-23 Thread GitBox
cjolivier01 commented on issue #8777: Error: package or namespace load failed for ‘mxnet’: URL: https://github.com/apache/incubator-mxnet/issues/8777#issuecomment-346569879 Yes. I use twin 1080s at home. As far as I know, 1080 and 1080ti have the same instruction set. On Wed, Nov 22

[GitHub] larroy opened a new pull request #8793: Fix link error on DEBUG=1. Need mshadow::gpu::kDevMask def in TU

2017-11-23 Thread GitBox
larroy opened a new pull request #8793: Fix link error on DEBUG=1. Need mshadow::gpu::kDevMask def in TU URL: https://github.com/apache/incubator-mxnet/pull/8793 Quick fix for mshadow::gpu::kDevMask used in a translation unit (not a header), which causes undefined symbol in debug mode.
