I think so. Any contributions on this are welcome.
On Mon, Aug 10, 2015 at 11:03 AM, Brian Granger elliso...@gmail.com wrote:
Sorry, trying to follow the context here. Does it look like there is
support for the idea of creating a setup.py file and PyPI package for
pyspark?
Cheers,
Brian
All,
I am trying to get data moving in and out of Spark at 10 Gb/s. I currently have
a very powerful cluster to work on, offering 40 Gb/s InfiniBand links, so I
believe the network pipe should be fast enough.
Has anyone gotten Spark operating at high data rates before? Any advice would
be
I would also tentatively suggest conda packaging.
http://conda.pydata.org/docs/
--Matthew Goodman
On Mon, Aug 10, 2015 at 11:23 AM, Davies Liu dav...@databricks.com wrote:
I think
Yan / Bing:
Mind taking a look at HBASE-14181
https://issues.apache.org/jira/browse/HBASE-14181 'Add Spark DataFrame
DataSource to HBase-Spark Module'?
Thanks
On Wed, Jul 22, 2015 at 4:53 PM, Bing Xiao (Bing) bing.x...@huawei.com
wrote:
We are happy to announce the availability of the Spark
Just some thoughts; I hope I didn't miss something obvious.
HadoopFSRelation calls the FileSystem class directly to list files in the
path.
It looks like it implements basically the same logic as the
FileInputFormat.listStatus method (located in
hadoop-mapreduce-client-core).
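To make the comparison concrete, here is a rough Python sketch of the kind of listing logic involved. It is only an illustration and not code from Spark or Hadoop: it walks the local filesystem with os.listdir as a stand-in for Hadoop's FileSystem API, and the names is_hidden and list_input_files are hypothetical. The filter mirrors FileInputFormat's default hidden-file filter, which skips names beginning with "_" or "."; always recursing into subdirectories is a simplifying assumption of this sketch.

```python
import os


def is_hidden(name):
    # Mirrors FileInputFormat's default hidden-file filter:
    # names starting with "_" or "." (e.g. _SUCCESS, .crc files) are skipped.
    return name.startswith("_") or name.startswith(".")


def list_input_files(root):
    """Recursively collect non-hidden files under root, in sorted order.

    Local-filesystem stand-in for listing through Hadoop's FileSystem;
    unconditional recursion is an assumption made for this sketch.
    """
    found = []
    for entry in sorted(os.listdir(root)):
        if is_hidden(entry):
            continue
        full = os.path.join(root, entry)
        if os.path.isdir(full):
            # Descend into subdirectories such as partition directories.
            found.extend(list_input_files(full))
        else:
            found.append(full)
    return found
```

Given a directory containing _SUCCESS, .hidden and year=2015/part-00000, list_input_files would return only the part-00000 path.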
The point is