[ 
https://issues.apache.org/jira/browse/SYSTEMML-1474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15961231#comment-15961231
 ] 

Glenn Weidner commented on SYSTEMML-1474:
-----------------------------------------

Note it appears to be a regression since the same test can be run successfully 
on same local system using latest release systemml-0.13.0-incubating-python.tgz.

> Index out of bounds error in test_naive_bayes1 of test_mllearn_numpy.py
> -----------------------------------------------------------------------
>
>                 Key: SYSTEMML-1474
>                 URL: https://issues.apache.org/jira/browse/SYSTEMML-1474
>             Project: SystemML
>          Issue Type: Bug
>            Reporter: Glenn Weidner
>            Priority: Minor
>
> The following error was observed running the python tests from command line 
> with spark-submit:
> {code}
> ======================================================================
> ERROR: test_naive_bayes1 (__main__.TestMLLearn)
> ----------------------------------------------------------------------
> Traceback (most recent call last):
>   File "/home/spark/test_mllearn_numpy.py", line 184, in test_naive_bayes1
>     mllearn_predicted = nb.fit(vectors, 
> newsgroups_train.target).predict(vectors_test)
>   File "/usr/lib/python2.7/site-packages/systemml/mllearn/estimators.py", 
> line 142, in fit
>     self.fit_numpy(X, y)
>   File "/usr/lib/python2.7/site-packages/systemml/mllearn/estimators.py", 
> line 95, in fit_numpy
>     self._fit_numpy()
>   File "/usr/lib/python2.7/site-packages/systemml/mllearn/estimators.py", 
> line 88, in _fit_numpy
>     self.model = self.estimator.fit(convertToMatrixBlock(self.sc, self.X), 
> y_mb)
>   File "/usr/lib/python2.7/site-packages/systemml/converters.py", line 106, 
> in convertToMatrixBlock
>     [ _copyRowBlock(i, sc, ret, src, numRowsPerBlock,  rlen, clen) for i in 
> range(0, src.shape[0], numRowsPerBlock) ]
>   File "/usr/lib/python2.7/site-packages/systemml/converters.py", line 83, in 
> _copyRowBlock
>     mb = _convertSPMatrixToMB(sc, src[i:i+numRowsPerBlock,]) if 
> isinstance(src, spmatrix) else _convertDenseMatrixToMB(sc, 
> src[i:i+numRowsPerBlock,])
>   File "/usr/lib64/python2.7/site-packages/scipy/sparse/csr.py", line 304, in 
> __getitem__
>     return self._get_submatrix(row, col)
>   File "/usr/lib64/python2.7/site-packages/scipy/sparse/csr.py", line 447, in 
> _get_submatrix
>     check_bounds(i0, i1, M)
>   File "/usr/lib64/python2.7/site-packages/scipy/sparse/csr.py", line 443, in 
> check_bounds
>     " %d <= %d" % (i0, num, i1, num, i0, i1))
> IndexError: index out of bounds: 0 <= 2030 <= 2034, 0 <= 2059 <= 2034, 2030 
> <= 2059
> {code}
> The IndexError was first observed when running the test under a Notebook 
> cloud environment with Spark 2.0.2, then reproduced at command line on local 
> system with Spark 2.1.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to