Re: PySpark on PyPi

2015-08-10 Thread Davies Liu
I think so, any contributions on this are welcome. On Mon, Aug 10, 2015 at 11:03 AM, Brian Granger elliso...@gmail.com wrote: Sorry, trying to follow the context here. Does it look like there is support for the idea of creating a setup.py file and pypi package for pyspark? Cheers, Brian

Pushing Spark to 10Gb/s

2015-08-10 Thread Starch, Michael D (398M)
All, I am trying to get data moving in and out of spark at 10Gb/s. I currently have a very powerful cluster to work on, offering 40Gb/s inifiniband links so I believe the network pipe should be fast enough. Has anyone gotten spark operating at high data rates before? Any advice would be

Re: PySpark on PyPi

2015-08-10 Thread Matt Goodman
I would tentatively suggest also conda packaging. http://conda.pydata.org/docs/ --Matthew Goodman = Check Out My Website: http://craneium.net Find me on LinkedIn: http://tinyurl.com/d6wlch On Mon, Aug 10, 2015 at 11:23 AM, Davies Liu dav...@databricks.com wrote: I think

Re: Package Release Annoucement: Spark SQL on HBase Astro

2015-08-10 Thread Ted Yu
Yan / Bing: Mind taking a look at HBASE-14181 https://issues.apache.org/jira/browse/HBASE-14181 'Add Spark DataFrame DataSource to HBase-Spark Module' ? Thanks On Wed, Jul 22, 2015 at 4:53 PM, Bing Xiao (Bing) bing.x...@huawei.com wrote: We are happy to announce the availability of the Spark

possible issues with listing objects in the HadoopFSrelation

2015-08-10 Thread Gil Vernik
Just some thoughts, hope i didn't missed something obvious. HadoopFSRelation calls directly FileSystem class to list files in the path. It looks like it implements basically the same logic as in the FileInputFormat.listStatus method ( located in hadoop-map-reduce-client-core) The point is