Liang-Chi Hsieh created SPARK-26559:
---------------------------------------

             Summary: ML image can't work with numpy versions prior to 1.9
                 Key: SPARK-26559
                 URL: https://issues.apache.org/jira/browse/SPARK-26559
             Project: Spark
          Issue Type: Bug
          Components: ML, PySpark
    Affects Versions: 3.0.0
            Reporter: Liang-Chi Hsieh


ML image currently does not work with numpy versions prior to 1.9. The current PySpark tests show the failure:

{code:java}
test_read_images (pyspark.ml.tests.test_image.ImageReaderTest) ... ERROR
test_read_images_multiple_times (pyspark.ml.tests.test_image.ImageReaderTest2) ... ok

======================================================================
ERROR: test_read_images (pyspark.ml.tests.test_image.ImageReaderTest)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/Users/viirya/docker_tmp/repos/spark-1/python/pyspark/ml/tests/test_image.py", line 36, in test_read_images
    self.assertEqual(ImageSchema.toImage(array, origin=first_row[0]), first_row)
  File "/Users/viirya/docker_tmp/repos/spark-1/python/pyspark/ml/image.py", line 193, in toImage
    data = bytearray(array.astype(dtype=np.uint8).ravel().tobytes())
AttributeError: 'numpy.ndarray' object has no attribute 'tobytes'

----------------------------------------------------------------------
Ran 2 tests in 29.040s

FAILED (errors=1)
{code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
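
The traceback points at `ndarray.tobytes`, which was added in numpy 1.9, so older releases raise the `AttributeError` shown above. One possible way to stay compatible is sketched below. It is only an illustration under assumptions, not the patch applied to Spark: the helper name `to_image_bytes` is hypothetical, and the fallback relies on a contiguous `uint8` array exposing its buffer to `bytearray`.

```python
import numpy as np


def to_image_bytes(array):
    """Hypothetical helper: flatten an array to uint8 and return its raw bytes.

    Uses ndarray.tobytes() when the installed numpy provides it (1.9+),
    otherwise builds the bytearray directly from the array's buffer.
    """
    # ravel() yields a contiguous 1-D array, copying if necessary.
    flat = np.asarray(array).astype(dtype=np.uint8).ravel()
    if hasattr(flat, "tobytes"):
        return bytearray(flat.tobytes())
    # Assumed fallback for older numpy: bytearray() reads the uint8 buffer.
    return bytearray(flat)


if __name__ == "__main__":
    pixels = np.array([[0, 1], [254, 255]])
    print(list(to_image_bytes(pixels)))
```

Checking for the attribute rather than comparing version strings keeps the sketch independent of how `np.__version__` is formatted; a version check would be an equally reasonable choice.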