Amit Baghel created SPARK-25157: ----------------------------------- Summary: Streaming of image files from directory Key: SPARK-25157 URL: https://issues.apache.org/jira/browse/SPARK-25157 Project: Spark Issue Type: New Feature Components: ML, Structured Streaming Affects Versions: 2.3.1 Reporter: Amit Baghel
We are doing video analytics for video streams using Spark. At present there is no direct way to stream video frames or image files to Spark and process using Structured Streaming and Dataset. We are using Kafka to stream images and then doing processing at spark. We need a method in Spark to stream images from directory. Currently *{{DataStreamReader}}* doesn't support Images. With the introduction of *org.apache.spark.ml.image.ImageSchema* class, we think streaming capabilities can be added for images. It is fine if it won't support some of the structured streaming features as it is a binary file. Schema used in ImageSchema class for image can be used in Dataset. This feature could be similar to *mmlspark* *streamImages* method. ([https://github.com/Azure/mmlspark/blob/4413771a8830e4760f550084da60ea0616bf80b9/src/io/image/src/main/python/ImageReader.py] ) -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org