[ https://issues.apache.org/jira/browse/SYSTEMML-1474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15961231#comment-15961231 ]
Glenn Weidner commented on SYSTEMML-1474: ----------------------------------------- Note it appears to be a regression since the same test can be run successfully on same local system using latest release systemml-0.13.0-incubating-python.tgz. > Index out of bounds error in test_naive_bayes1 of test_mllearn_numpy.py > ----------------------------------------------------------------------- > > Key: SYSTEMML-1474 > URL: https://issues.apache.org/jira/browse/SYSTEMML-1474 > Project: SystemML > Issue Type: Bug > Reporter: Glenn Weidner > Priority: Minor > > The following error was observed running the python tests from command line > with spark-submit: > {code} > ====================================================================== > ERROR: test_naive_bayes1 (__main__.TestMLLearn) > ---------------------------------------------------------------------- > Traceback (most recent call last): > File "/home/spark/test_mllearn_numpy.py", line 184, in test_naive_bayes1 > mllearn_predicted = nb.fit(vectors, > newsgroups_train.target).predict(vectors_test) > File "/usr/lib/python2.7/site-packages/systemml/mllearn/estimators.py", > line 142, in fit > self.fit_numpy(X, y) > File "/usr/lib/python2.7/site-packages/systemml/mllearn/estimators.py", > line 95, in fit_numpy > self._fit_numpy() > File "/usr/lib/python2.7/site-packages/systemml/mllearn/estimators.py", > line 88, in _fit_numpy > self.model = self.estimator.fit(convertToMatrixBlock(self.sc, self.X), > y_mb) > File "/usr/lib/python2.7/site-packages/systemml/converters.py", line 106, > in convertToMatrixBlock > [ _copyRowBlock(i, sc, ret, src, numRowsPerBlock, rlen, clen) for i in > range(0, src.shape[0], numRowsPerBlock) ] > File "/usr/lib/python2.7/site-packages/systemml/converters.py", line 83, in > _copyRowBlock > mb = _convertSPMatrixToMB(sc, src[i:i+numRowsPerBlock,]) if > isinstance(src, spmatrix) else _convertDenseMatrixToMB(sc, > src[i:i+numRowsPerBlock,]) > File "/usr/lib64/python2.7/site-packages/scipy/sparse/csr.py", line 304, in > __getitem__ > return self._get_submatrix(row, col) > File "/usr/lib64/python2.7/site-packages/scipy/sparse/csr.py", line 447, in > _get_submatrix > check_bounds(i0, i1, M) > File "/usr/lib64/python2.7/site-packages/scipy/sparse/csr.py", line 443, in > check_bounds > " %d <= %d" % (i0, num, i1, num, i0, i1)) > IndexError: index out of bounds: 0 <= 2030 <= 2034, 0 <= 2059 <= 2034, 2030 > <= 2059 > {code} > The IndexError was first observed when running the test under a Notebook > cloud environment with Spark 2.0.2, then reproduced at command line on local > system with Spark 2.1. -- This message was sent by Atlassian JIRA (v6.3.15#6346)