Re: Regressions in NDArrayIter

2018-09-11 Thread Jake Lee
I am looking into it.

Could you provide the shape of NDArrayIter and minimal reproducible code?
Thank you so much!

Alfredo Luque  於 2018年9月11日 週二 下午6:05寫道:

> Looks like https://github.com/apache/incubator-mxnet/pull/12285 broke a
> ton
> of our test cases iterating over 3D NDArray instances (eg; MNIST) by
> creating an index out of range.
>
> Stacktrace:
>
> .com/airbnb/bighead/python/bighead/ml_frameworks/mxnet/gluon.py", line
> 434, in transform
> for batch in data_iter:
>   File "/anaconda3/envs/py36/lib/python3.6/site-packages/mxnet/io/io.py",
> line 228, in __next__
> return self.next()
>   File "/anaconda3/envs/py36/lib/python3.6/site-packages/mxnet/io/io.py",
> line 680, in next
> label = self.getlabel()
>   File "/anaconda3/envs/py36/lib/python3.6/site-packages/mxnet/io/io.py",
> line 750, in getlabel
> return self._batchify(self.label)
>   File "/anaconda3/envs/py36/lib/python3.6/site-packages/mxnet/io/io.py",
> line 732, in _batchify
> first_data = self._getdata(data_source, start=self.cursor)
>   File "/anaconda3/envs/py36/lib/python3.6/site-packages/mxnet/io/io.py",
> line 694, in _getdata
> end = end if end is not None else data_source[0][1].shape[0]
> IndexError: list index out of range
>
> I’ve created an issue at
> https://github.com/apache/incubator-mxnet/issues/12526
>
>
> We’ll be pinning to the previous build until it’s reverted/patched, but let
> us know if we can help provide more regression tests here.
>
> —
> Alfredo Luque
> Software Engineer
> Machine Learning Infrastructure
> Airbnb
> San Francisco, CA
>


Re: Off-Heap Memory Management in MXNet Scala

2018-09-11 Thread Chris Olivier
do you log on finalize() if the object wasn’t properly freed (ie
NDArray.finalize())? is that available in Scala?

On Tue, Sep 11, 2018 at 6:12 PM Qing Lan  wrote:

> Nice document! Way better than current .dispose() in Scala!
>
> Thanks,
> Qing
>
> On 9/11/18, 6:04 PM, "Chris Olivier"  wrote:
>
> wow, incredible document!
>
> On Tue, Sep 11, 2018 at 2:37 PM Naveen Swamy 
> wrote:
>
> > Hi All,
> >
> > I am working on managing Off-Heap Memory Management and have written
> a
> > proposal here based on my prototype and research I did.
> >
> > Please review the doc and provide your feedback ?
> >
> >
> https://cwiki.apache.org/confluence/display/MXNET/JVM+Memory+Management
> >
> > I had offline discussion with a few people I work with and added
> their
> > feedback to the doc as well.
> >
> > Thanks, Naveen
> >
>
>
>


Re: Off-Heap Memory Management in MXNet Scala

2018-09-11 Thread Qing Lan
Nice document! Way better than current .dispose() in Scala!

Thanks,
Qing

On 9/11/18, 6:04 PM, "Chris Olivier"  wrote:

wow, incredible document!

On Tue, Sep 11, 2018 at 2:37 PM Naveen Swamy  wrote:

> Hi All,
>
> I am working on managing Off-Heap Memory Management and have written a
> proposal here based on my prototype and research I did.
>
> Please review the doc and provide your feedback ?
>
> https://cwiki.apache.org/confluence/display/MXNET/JVM+Memory+Management
>
> I had offline discussion with a few people I work with and added their
> feedback to the doc as well.
>
> Thanks, Naveen
>




Re: Off-Heap Memory Management in MXNet Scala

2018-09-11 Thread Chris Olivier
wow, incredible document!

On Tue, Sep 11, 2018 at 2:37 PM Naveen Swamy  wrote:

> Hi All,
>
> I am working on managing Off-Heap Memory Management and have written a
> proposal here based on my prototype and research I did.
>
> Please review the doc and provide your feedback ?
>
> https://cwiki.apache.org/confluence/display/MXNET/JVM+Memory+Management
>
> I had offline discussion with a few people I work with and added their
> feedback to the doc as well.
>
> Thanks, Naveen
>


Regressions in NDArrayIter

2018-09-11 Thread Alfredo Luque
Looks like https://github.com/apache/incubator-mxnet/pull/12285 broke a ton
of our test cases iterating over 3D NDArray instances (eg; MNIST) by
creating an index out of range.

Stacktrace:

.com/airbnb/bighead/python/bighead/ml_frameworks/mxnet/gluon.py", line
434, in transform
for batch in data_iter:
  File "/anaconda3/envs/py36/lib/python3.6/site-packages/mxnet/io/io.py",
line 228, in __next__
return self.next()
  File "/anaconda3/envs/py36/lib/python3.6/site-packages/mxnet/io/io.py",
line 680, in next
label = self.getlabel()
  File "/anaconda3/envs/py36/lib/python3.6/site-packages/mxnet/io/io.py",
line 750, in getlabel
return self._batchify(self.label)
  File "/anaconda3/envs/py36/lib/python3.6/site-packages/mxnet/io/io.py",
line 732, in _batchify
first_data = self._getdata(data_source, start=self.cursor)
  File "/anaconda3/envs/py36/lib/python3.6/site-packages/mxnet/io/io.py",
line 694, in _getdata
end = end if end is not None else data_source[0][1].shape[0]
IndexError: list index out of range

I’ve created an issue at
https://github.com/apache/incubator-mxnet/issues/12526


We’ll be pinning to the previous build until it’s reverted/patched, but let
us know if we can help provide more regression tests here.

—
Alfredo Luque
Software Engineer
Machine Learning Infrastructure
Airbnb
San Francisco, CA


Re: Error Publishing OSX package to Maven

2018-09-11 Thread Naveen Swamy
Qing helped test by excluding the bundle/source. It still creates 2 repos
in Staging. I am not sure if Maven Version or some other maven-plugin
difference is causing this issue, there was no issue publishing to
Linux-CPU and Linux-GPU from the same code.
For now, I have manually uploaded the artifacts(generated through maven
release prepare/perform) to the Staging repo so we can make it available
for OSX users.

This issue needs to be investigated independently.

-Naveen

On Tue, Sep 11, 2018 at 2:03 PM, Naveen Swamy  wrote:

> that seems reasonable. I also asked this question on d...@maven.apache.org
> to see if there is anything obvious that we missed.
>
> https://lists.apache.org/thread.html/def6e5c6c47ab2f39592a1fe060b6c
> fd0008d303a2b5c814545d231b@%3Cdev.maven.apache.org%3E
>
> On Tue, Sep 11, 2018 at 11:25 AM, Carin Meier 
> wrote:
>
>> I would suggest trying
>>
>> * Remove bundle/source jar from 1.3.0 and attempt publishing OSX package.
>> and seeing if that solves the problem and allows publishing to staging.
>>
>>  If it does work then it helps to identity the cause.
>> If the effort does not take to long. I would recommend timeboxing the
>> effort and if it is going to take more effort to investigate, we can go
>> with announcing later for OSX.
>>
>>
>> - Carin
>>
>> On Tue, Sep 11, 2018 at 2:10 PM Naveen Swamy  wrote:
>>
>> > hey all,
>> >
>> > I am working on publishing the 1.3.0 Scala package to Maven and
>> > encountering a error when I am about to Close the Repo on Nexus. When I
>> > publish the OSX package to Staging, the artifacts gets split into 2
>> > repositories and when I close[1] the repo(to make it available for use)
>> it
>> > fails since the signature files are in a different repo. This is how the
>> > artifacts are getting split
>> >
>> > Repo1: orgapachemxnet-1018
>> > ===
>> > mxnet-full_2.11-osx-x86_64-cpu-bundle.jar.md5
>> > mxnet-full_2.11-osx-x86_64-cpu-bundle.jar.sha1
>> > mxnet-full_2.11-osx-x86_64-cpu-sources.jar.asc
>> > mxnet-full_2.11-osx-x86_64-cpu-sources.jar.md5
>> > mxnet-full_2.11-osx-x86_64-cpu-src.jar.md5
>> > mxnet-full_2.11-osx-x86_64-cpu-src.jar.sha1
>> > mxnet-full_2.11-osx-x86_64-cpu.jar
>> > mxnet-full_2.11-osx-x86_64-cpu.jar.asc
>> > mxnet-full_2.11-osx-x86_64-cpu.pom.md5
>> > mxnet-full_2.11-osx-x86_64-cpu.pom.sha1
>> >
>> >
>> > Repo2: orgapachemxnet-1019
>> > ===
>> > mxnet-full_2.11-osx-x86_64-cpu-bundle.jar
>> > mxnet-full_2.11-osx-x86_64-cpu-bundle.jar.asc
>> > mxnet-full_2.11-osx-x86_64-cpu-sources.jar
>> > mxnet-full_2.11-osx-x86_64-cpu-sources.jar.sha1
>> > mxnet-full_2.11-osx-x86_64-cpu-src.jar
>> > mxnet-full_2.11-osx-x86_64-cpu-src.jar.asc
>> > mxnet-full_2.11-osx-x86_64-cpu-jar.md5
>> > mxnet-full_2.11-osx-x86_64-cpu.jar.sha1
>> > mxnet-full_2.11-osx-x86_64-cpu.pom
>> > mxnet-full_2.11-osx-x86_64-cpu.pom.asc
>> >
>> > This was not an issue with Linux-CPU and Linux-GPU versions for 1.3.0
>> and
>> > suspicion is the new Source and documentation jar files being added as a
>> > part of 1.3.0 for OSX.
>> >
>> > There are 2 options
>> > * Continue to investigate the issue and announce later for OSX (after
>> the
>> > general announcement)
>> > * Remove bundle/source jar from 1.3.0 and attempt publishing OSX
>> package.
>> >
>> > What do you guys suggest?
>> >
>> >
>> > 1) Closing Staged Repo -
>> >
>> > https://central.sonatype.org/pages/releasing-the-deployment.
>> html#locate-and-examine-your-staging-repository
>> > 2) Scala Release Process:
>> >
>> > https://cwiki.apache.org/confluence/display/MXNET/MXNet-
>> Scala+Release+Process
>> >
>>
>
>


Off-Heap Memory Management in MXNet Scala

2018-09-11 Thread Naveen Swamy
Hi All,

I am working on managing Off-Heap Memory Management and have written a
proposal here based on my prototype and research I did.

Please review the doc and provide your feedback ?

https://cwiki.apache.org/confluence/display/MXNET/JVM+Memory+Management

I had offline discussion with a few people I work with and added their
feedback to the doc as well.

Thanks, Naveen


Re: Error Publishing OSX package to Maven

2018-09-11 Thread Naveen Swamy
that seems reasonable. I also asked this question on d...@maven.apache.org
to see if there is anything obvious that we missed.

https://lists.apache.org/thread.html/def6e5c6c47ab2f39592a1fe060b6cfd0008d303a2b5c814545d231b@%3Cdev.maven.apache.org%3E


On Tue, Sep 11, 2018 at 11:25 AM, Carin Meier  wrote:

> I would suggest trying
>
> * Remove bundle/source jar from 1.3.0 and attempt publishing OSX package.
> and seeing if that solves the problem and allows publishing to staging.
>
>  If it does work then it helps to identity the cause.
> If the effort does not take to long. I would recommend timeboxing the
> effort and if it is going to take more effort to investigate, we can go
> with announcing later for OSX.
>
>
> - Carin
>
> On Tue, Sep 11, 2018 at 2:10 PM Naveen Swamy  wrote:
>
> > hey all,
> >
> > I am working on publishing the 1.3.0 Scala package to Maven and
> > encountering a error when I am about to Close the Repo on Nexus. When I
> > publish the OSX package to Staging, the artifacts gets split into 2
> > repositories and when I close[1] the repo(to make it available for use)
> it
> > fails since the signature files are in a different repo. This is how the
> > artifacts are getting split
> >
> > Repo1: orgapachemxnet-1018
> > ===
> > mxnet-full_2.11-osx-x86_64-cpu-bundle.jar.md5
> > mxnet-full_2.11-osx-x86_64-cpu-bundle.jar.sha1
> > mxnet-full_2.11-osx-x86_64-cpu-sources.jar.asc
> > mxnet-full_2.11-osx-x86_64-cpu-sources.jar.md5
> > mxnet-full_2.11-osx-x86_64-cpu-src.jar.md5
> > mxnet-full_2.11-osx-x86_64-cpu-src.jar.sha1
> > mxnet-full_2.11-osx-x86_64-cpu.jar
> > mxnet-full_2.11-osx-x86_64-cpu.jar.asc
> > mxnet-full_2.11-osx-x86_64-cpu.pom.md5
> > mxnet-full_2.11-osx-x86_64-cpu.pom.sha1
> >
> >
> > Repo2: orgapachemxnet-1019
> > ===
> > mxnet-full_2.11-osx-x86_64-cpu-bundle.jar
> > mxnet-full_2.11-osx-x86_64-cpu-bundle.jar.asc
> > mxnet-full_2.11-osx-x86_64-cpu-sources.jar
> > mxnet-full_2.11-osx-x86_64-cpu-sources.jar.sha1
> > mxnet-full_2.11-osx-x86_64-cpu-src.jar
> > mxnet-full_2.11-osx-x86_64-cpu-src.jar.asc
> > mxnet-full_2.11-osx-x86_64-cpu-jar.md5
> > mxnet-full_2.11-osx-x86_64-cpu.jar.sha1
> > mxnet-full_2.11-osx-x86_64-cpu.pom
> > mxnet-full_2.11-osx-x86_64-cpu.pom.asc
> >
> > This was not an issue with Linux-CPU and Linux-GPU versions for 1.3.0 and
> > suspicion is the new Source and documentation jar files being added as a
> > part of 1.3.0 for OSX.
> >
> > There are 2 options
> > * Continue to investigate the issue and announce later for OSX (after the
> > general announcement)
> > * Remove bundle/source jar from 1.3.0 and attempt publishing OSX package.
> >
> > What do you guys suggest?
> >
> >
> > 1) Closing Staged Repo -
> >
> > https://central.sonatype.org/pages/releasing-the-
> deployment.html#locate-and-examine-your-staging-repository
> > 2) Scala Release Process:
> >
> > https://cwiki.apache.org/confluence/display/MXNET/
> MXNet-Scala+Release+Process
> >
>


Re: Error Publishing OSX package to Maven

2018-09-11 Thread Carin Meier
I would suggest trying

* Remove bundle/source jar from 1.3.0 and attempt publishing OSX package.
and seeing if that solves the problem and allows publishing to staging.

 If it does work then it helps to identity the cause.
If the effort does not take to long. I would recommend timeboxing the
effort and if it is going to take more effort to investigate, we can go
with announcing later for OSX.


- Carin

On Tue, Sep 11, 2018 at 2:10 PM Naveen Swamy  wrote:

> hey all,
>
> I am working on publishing the 1.3.0 Scala package to Maven and
> encountering a error when I am about to Close the Repo on Nexus. When I
> publish the OSX package to Staging, the artifacts gets split into 2
> repositories and when I close[1] the repo(to make it available for use) it
> fails since the signature files are in a different repo. This is how the
> artifacts are getting split
>
> Repo1: orgapachemxnet-1018
> ===
> mxnet-full_2.11-osx-x86_64-cpu-bundle.jar.md5
> mxnet-full_2.11-osx-x86_64-cpu-bundle.jar.sha1
> mxnet-full_2.11-osx-x86_64-cpu-sources.jar.asc
> mxnet-full_2.11-osx-x86_64-cpu-sources.jar.md5
> mxnet-full_2.11-osx-x86_64-cpu-src.jar.md5
> mxnet-full_2.11-osx-x86_64-cpu-src.jar.sha1
> mxnet-full_2.11-osx-x86_64-cpu.jar
> mxnet-full_2.11-osx-x86_64-cpu.jar.asc
> mxnet-full_2.11-osx-x86_64-cpu.pom.md5
> mxnet-full_2.11-osx-x86_64-cpu.pom.sha1
>
>
> Repo2: orgapachemxnet-1019
> ===
> mxnet-full_2.11-osx-x86_64-cpu-bundle.jar
> mxnet-full_2.11-osx-x86_64-cpu-bundle.jar.asc
> mxnet-full_2.11-osx-x86_64-cpu-sources.jar
> mxnet-full_2.11-osx-x86_64-cpu-sources.jar.sha1
> mxnet-full_2.11-osx-x86_64-cpu-src.jar
> mxnet-full_2.11-osx-x86_64-cpu-src.jar.asc
> mxnet-full_2.11-osx-x86_64-cpu-jar.md5
> mxnet-full_2.11-osx-x86_64-cpu.jar.sha1
> mxnet-full_2.11-osx-x86_64-cpu.pom
> mxnet-full_2.11-osx-x86_64-cpu.pom.asc
>
> This was not an issue with Linux-CPU and Linux-GPU versions for 1.3.0 and
> suspicion is the new Source and documentation jar files being added as a
> part of 1.3.0 for OSX.
>
> There are 2 options
> * Continue to investigate the issue and announce later for OSX (after the
> general announcement)
> * Remove bundle/source jar from 1.3.0 and attempt publishing OSX package.
>
> What do you guys suggest?
>
>
> 1) Closing Staged Repo -
>
> https://central.sonatype.org/pages/releasing-the-deployment.html#locate-and-examine-your-staging-repository
> 2) Scala Release Process:
>
> https://cwiki.apache.org/confluence/display/MXNET/MXNet-Scala+Release+Process
>


Error Publishing OSX package to Maven

2018-09-11 Thread Naveen Swamy
hey all,

I am working on publishing the 1.3.0 Scala package to Maven and
encountering a error when I am about to Close the Repo on Nexus. When I
publish the OSX package to Staging, the artifacts gets split into 2
repositories and when I close[1] the repo(to make it available for use) it
fails since the signature files are in a different repo. This is how the
artifacts are getting split

Repo1: orgapachemxnet-1018
===
mxnet-full_2.11-osx-x86_64-cpu-bundle.jar.md5
mxnet-full_2.11-osx-x86_64-cpu-bundle.jar.sha1
mxnet-full_2.11-osx-x86_64-cpu-sources.jar.asc
mxnet-full_2.11-osx-x86_64-cpu-sources.jar.md5
mxnet-full_2.11-osx-x86_64-cpu-src.jar.md5
mxnet-full_2.11-osx-x86_64-cpu-src.jar.sha1
mxnet-full_2.11-osx-x86_64-cpu.jar
mxnet-full_2.11-osx-x86_64-cpu.jar.asc
mxnet-full_2.11-osx-x86_64-cpu.pom.md5
mxnet-full_2.11-osx-x86_64-cpu.pom.sha1


Repo2: orgapachemxnet-1019
===
mxnet-full_2.11-osx-x86_64-cpu-bundle.jar
mxnet-full_2.11-osx-x86_64-cpu-bundle.jar.asc
mxnet-full_2.11-osx-x86_64-cpu-sources.jar
mxnet-full_2.11-osx-x86_64-cpu-sources.jar.sha1
mxnet-full_2.11-osx-x86_64-cpu-src.jar
mxnet-full_2.11-osx-x86_64-cpu-src.jar.asc
mxnet-full_2.11-osx-x86_64-cpu-jar.md5
mxnet-full_2.11-osx-x86_64-cpu.jar.sha1
mxnet-full_2.11-osx-x86_64-cpu.pom
mxnet-full_2.11-osx-x86_64-cpu.pom.asc

This was not an issue with Linux-CPU and Linux-GPU versions for 1.3.0 and
suspicion is the new Source and documentation jar files being added as a
part of 1.3.0 for OSX.

There are 2 options
* Continue to investigate the issue and announce later for OSX (after the
general announcement)
* Remove bundle/source jar from 1.3.0 and attempt publishing OSX package.

What do you guys suggest?


1) Closing Staged Repo -
https://central.sonatype.org/pages/releasing-the-deployment.html#locate-and-examine-your-staging-repository
2) Scala Release Process:
https://cwiki.apache.org/confluence/display/MXNET/MXNet-Scala+Release+Process


Re: Enable Travis CI for Apache MXNet

2018-09-11 Thread Sebastian

Done, here is the ticket:

https://issues.apache.org/jira/browse/INFRA-17004

On 10.09.2018 14:49, Marco de Abreu wrote:

Hello Sebastian,

could you please file a ticket with Apache Infra to enable Travis CI for
our main repository? The discussion thread is available at
https://lists.apache.org/thread.html/3871eb5a7bc31eb467f68935f4a56469292b03a33f1693cdb9a98f31@%3Cdev.mxnet.apache.org%3E
.

The settings should be branch- as well as PR-validation. The reported
status should be set to "not required", thus not blocking any PRs if that
validation pipeline fails.

Best regards,
Marco



GitHub having issues today

2018-09-11 Thread kellen sunderland
FYI GitHub is having issues today. It's severe enough that it's blocking my
ability to validate PRs.  I'm sure it will be resolved soon, but it might
be a good day to read a few papers ;-).

https://status.github.com/messages

-Kellen