Amit Baghel created SPARK-25157:
-----------------------------------

             Summary: Streaming of image files from directory
                 Key: SPARK-25157
                 URL: https://issues.apache.org/jira/browse/SPARK-25157
             Project: Spark
          Issue Type: New Feature
          Components: ML, Structured Streaming
    Affects Versions: 2.3.1
            Reporter: Amit Baghel


We are doing video analytics for video streams using Spark. At present there is 
no direct way to stream video frames or image files to Spark and process using 
Structured Streaming and Dataset. We are using Kafka to stream images and then 
doing processing at spark. We need a method in Spark to stream images from 
directory. Currently *{{DataStreamReader}}* doesn't support Images. With the 
introduction of *org.apache.spark.ml.image.ImageSchema* class, we think 
streaming capabilities can be added for images. It is fine if it won't support 
some of the structured streaming features as it is a binary file. Schema used 
in ImageSchema class for image can be used in Dataset. This feature could be 
similar to *mmlspark* *streamImages* method. 
([https://github.com/Azure/mmlspark/blob/4413771a8830e4760f550084da60ea0616bf80b9/src/io/image/src/main/python/ImageReader.py]
 )



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to