Hi,
  How do one process for data sources other than text?
Lets say I have millions of mp3 (or jpeg) files and I want to use spark to
process them?
How does one go about it.


I have never been able to figure this out..
Lets say I have this library in python which works like following:

import audio

song = audio.read_mp3(filename)

Then most of the methods are attached to song or maybe there is another
function which takes "song" type as an input.

Maybe the above is just rambling.. but how do I use spark to process (say)
audiio files.
Thanks

Reply via email to