[jira] [Commented] (SPARK-15487) Spark Master UI to reverse proxy Application and Workers UI

2016-10-24 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15601794#comment-15601794 ] Matthew Farrellee commented on SPARK-15487: --- well, unless you're putting another proxy in front

[jira] [Commented] (SPARK-15487) Spark Master UI to reverse proxy Application and Workers UI

2016-10-23 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15600710#comment-15600710 ] Matthew Farrellee commented on SPARK-15487: --- try just setting the proxy url to "/"

Re: [openstack-dev] [sahara] Proposing Vitaly Gridnev to core reviewer team

2015-10-13 Thread Matthew Farrellee
+1! On 10/12/2015 07:19 AM, Sergey Lukjanov wrote: Hi folks, I'd like to propose Vitaly Gridnev as a member of the Sahara core reviewer team. Vitaly contributing to Sahara for a long time and doing a great job on reviewing and improving Sahara. Here are the statistics for reviews [0][1][2]

[jira] [Created] (FLINK-2709) line editing in scala shell

2015-09-18 Thread Matthew Farrellee (JIRA)
Matthew Farrellee created FLINK-2709: Summary: line editing in scala shell Key: FLINK-2709 URL: https://issues.apache.org/jira/browse/FLINK-2709 Project: Flink Issue Type: New Feature

[jira] [Created] (FLINK-2709) line editing in scala shell

2015-09-18 Thread Matthew Farrellee (JIRA)
Matthew Farrellee created FLINK-2709: Summary: line editing in scala shell Key: FLINK-2709 URL: https://issues.apache.org/jira/browse/FLINK-2709 Project: Flink Issue Type: New Feature

Re: [openstack-dev] [sahara] Proposing Ethan Gafford for the core reviewer team

2015-08-13 Thread Matthew Farrellee
On 08/13/2015 10:56 AM, Sergey Lukjanov wrote: Hi folks, I'd like to propose Ethan Gafford as a member of the Sahara core reviewer team. Ethan contributing to Sahara for a long time and doing a great job on reviewing and improving Sahara. Here are the statistics for reviews [0][1][2] and

[jira] [Commented] (SPARK-5368) Spark should support NAT (via akka improvements)

2015-03-23 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14377096#comment-14377096 ] Matthew Farrellee commented on SPARK-5368: -- [~jayunit100] the relevant config

[jira] [Commented] (SPARK-5368) Spark should support NAT (via akka improvements)

2015-03-22 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14375157#comment-14375157 ] Matthew Farrellee commented on SPARK-5368: -- [~srowen] i was able to workaround my

[jira] [Commented] (SPARK-5113) Audit and document use of hostnames and IP addresses in Spark

2015-03-22 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14375158#comment-14375158 ] Matthew Farrellee commented on SPARK-5113: -- [~pwendell] would

[jira] [Commented] (SPARK-6245) jsonRDD() of empty RDD results in exception

2015-03-16 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14363290#comment-14363290 ] Matthew Farrellee commented on SPARK-6245: -- [~srowen] thanks for fixing

[jira] [Commented] (SPARK-6245) jsonRDD() of empty RDD results in exception

2015-03-10 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14355843#comment-14355843 ] Matthew Farrellee commented on SPARK-6245: -- this is an issue for the scala

[jira] [Commented] (SPARK-5368) Spark should support NAT (via akka improvements)

2015-03-09 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14353534#comment-14353534 ] Matthew Farrellee commented on SPARK-5368: -- [~srowen] will you take a look

[jira] [Commented] (SPARK-2313) PySpark should accept port via a command line argument rather than STDIN

2015-02-12 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14318221#comment-14318221 ] Matthew Farrellee commented on SPARK-2313: -- that'd work, also requires a py4j

[jira] [Commented] (SPARK-927) PySpark sample() doesn't work if numpy is installed on master but not on workers

2015-01-05 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14265290#comment-14265290 ] Matthew Farrellee commented on SPARK-927: - PR #2313 was subsumed by PR #3351, which

[jira] [Resolved] (SPARK-927) PySpark sample() doesn't work if numpy is installed on master but not on workers

2015-01-05 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee resolved SPARK-927. - Resolution: Fixed Fix Version/s: 1.2.0 PySpark sample() doesn't work if numpy

Re: [openstack-dev] [sahara] team meeting Nov 27 1800 UTC

2014-11-26 Thread Matthew Farrellee
On 11/26/2014 01:10 PM, Sergey Lukjanov wrote: Hi folks, We'll be having the Sahara team meeting as usual in #openstack-meeting-alt channel. Agenda: https://wiki.openstack.org/wiki/Meetings/SaharaAgenda#Next_meetings

Re: [Openstack] Sahara: No images available after registering UbuntuVanilla Image when launching cluster.‏

2014-11-24 Thread Matthew Farrellee
On 11/24/2014 02:28 PM, Edward HUANG wrote: Hi all, I'm setting up a local cloud environment on servers in my lab. I installed OpenStack with devstack, and i install it with sahara. Data processing appears in the dashboard, and i did add a ubuntu-vanilla qcow2 images according to

Re: [openstack-dev] [sahara] Nominate Sergey Reshetniak to sahara-core

2014-11-11 Thread Matthew Farrellee
On 11/11/2014 12:35 PM, Sergey Lukjanov wrote: Hi folks, I'd like to propose Sergey to sahara-core. He's made a lot of work on different parts of Sahara and he has a very good knowledge of codebase, especially in plugins area. Sergey has been consistently giving us very well thought out and

Re: [openstack-dev] [sahara] Nominate Michael McCune to sahara-core

2014-11-11 Thread Matthew Farrellee
On 11/11/2014 12:37 PM, Sergey Lukjanov wrote: Hi folks, I'd like to propose Michael McCune to sahara-core. He has a good knowledge of codebase and implemented important features such as Swift auth using trusts. Mike has been consistently giving us very well thought out and constructive reviews

[jira] [Closed] (SPARK-2256) pyspark: RDD.take doesn't work ... sometimes ...

2014-10-03 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee closed SPARK-2256. Resolution: Fixed Fix Version/s: 1.1.0 pyspark: RDD.take doesn't work ... sometimes

[jira] [Commented] (SPARK-3733) Support for programmatically submitting Spark jobs

2014-09-30 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14153142#comment-14153142 ] Matthew Farrellee commented on SPARK-3733: -- will you describe what you mean

[jira] [Commented] (SPARK-3685) Spark's local dir should accept only local paths

2014-09-29 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14151691#comment-14151691 ] Matthew Farrellee commented on SPARK-3685: -- [~andrewor] thanks for the info

[jira] [Commented] (SPARK-3685) Spark's local dir should accept only local paths

2014-09-29 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14152152#comment-14152152 ] Matthew Farrellee commented on SPARK-3685: -- the root of the resource problem

[jira] [Commented] (SPARK-3685) Spark's local dir should accept only local paths

2014-09-29 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14152526#comment-14152526 ] Matthew Farrellee commented on SPARK-3685: -- if you're going to go down this path

[jira] [Commented] (SPARK-3685) Spark's local dir scheme is not configurable

2014-09-28 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14151148#comment-14151148 ] Matthew Farrellee commented on SPARK-3685: -- i'm skeptical. what would

Re: [openstack-dev] [Sahara] Verbosity of Sahara overview image

2014-09-27 Thread Matthew Farrellee
On 09/26/2014 02:27 PM, Sharan Kumar M wrote: Hi all, I am trying to modify the diagram in http://docs.openstack.org/developer/sahara/overview.html so that it syncs with the contents. In the diagram, is it nice to mark the connections between the openstack components like, Nova with Cinder,

[jira] [Commented] (SPARK-3639) Kinesis examples set master as local

2014-09-24 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14146245#comment-14146245 ] Matthew Farrellee commented on SPARK-3639: -- seems reasonable to me Kinesis

[jira] [Resolved] (SPARK-1443) Unable to Access MongoDB GridFS data with Spark using mongo-hadoop API

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee resolved SPARK-1443. -- Resolution: Done Fix Version/s: (was: 0.9.0) Unable to Access MongoDB

[jira] [Commented] (SPARK-1443) Unable to Access MongoDB GridFS data with Spark using mongo-hadoop API

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142446#comment-14142446 ] Matthew Farrellee commented on SPARK-1443: -- [~PavanKumarVarma] i hope you've been

[jira] [Commented] (SPARK-1177) Allow SPARK_JAR to be set in system properties

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142447#comment-14142447 ] Matthew Farrellee commented on SPARK-1177: -- [~epakhomov] it looks like this has

[jira] [Closed] (SPARK-1177) Allow SPARK_JAR to be set in system properties

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee closed SPARK-1177. Resolution: Fixed Fix Version/s: (was: 0.9.0) Allow SPARK_JAR to be set

[jira] [Commented] (SPARK-1176) Adding port configuration for HttpBroadcast

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142454#comment-14142454 ] Matthew Farrellee commented on SPARK-1176: -- [~epakhomov] it looks like

[jira] [Resolved] (SPARK-1176) Adding port configuration for HttpBroadcast

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee resolved SPARK-1176. -- Resolution: Fixed Fix Version/s: (was: 0.9.0) 1.1.0

[jira] [Closed] (SPARK-1748) I installed the spark_standalone,but I did not know how to use sbt to compile the programme of spark?

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee closed SPARK-1748. Resolution: Done Fix Version/s: (was: 0.8.1) I installed the spark_standalone

[jira] [Commented] (SPARK-1748) I installed the spark_standalone,but I did not know how to use sbt to compile the programme of spark?

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142455#comment-14142455 ] Matthew Farrellee commented on SPARK-1748: -- thanks for the question. you'll get

[jira] [Closed] (SPARK-614) Make last 4 digits of framework id in standalone mode logging monotonically increasing

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee closed SPARK-614. --- Resolution: Unresolved Fix Version/s: (was: 0.7.1) Make last 4 digits of framework

[jira] [Commented] (SPARK-614) Make last 4 digits of framework id in standalone mode logging monotonically increasing

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142456#comment-14142456 ] Matthew Farrellee commented on SPARK-614: - it looks like nothing has happened

[jira] [Closed] (SPARK-719) Add FAQ page to documentation or webpage

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee closed SPARK-719. --- Resolution: Done Fix Version/s: (was: 0.7.1) Add FAQ page to documentation

[jira] [Commented] (SPARK-719) Add FAQ page to documentation or webpage

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142459#comment-14142459 ] Matthew Farrellee commented on SPARK-719: - it looks like this has some good content

[jira] [Closed] (SPARK-637) Create troubleshooting checklist

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee closed SPARK-637. --- Resolution: Later Create troubleshooting checklist

[jira] [Commented] (SPARK-637) Create troubleshooting checklist

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-637?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142463#comment-14142463 ] Matthew Farrellee commented on SPARK-637: - this is a good idea, and it will take

[jira] [Commented] (SPARK-3593) Support Sorting of Binary Type Data

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142468#comment-14142468 ] Matthew Farrellee commented on SPARK-3593: -- [~pmagid] will you provide some

[jira] [Commented] (SPARK-537) driver.run() returned with code DRIVER_ABORTED

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142474#comment-14142474 ] Matthew Farrellee commented on SPARK-537: - this should be resolved by a number

[jira] [Resolved] (SPARK-537) driver.run() returned with code DRIVER_ABORTED

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee resolved SPARK-537. - Resolution: Fixed Fix Version/s: 1.0.0 driver.run() returned with code

[jira] [Commented] (SPARK-538) INFO spark.MesosScheduler: Ignoring update from TID 9 because its job is gone

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142475#comment-14142475 ] Matthew Farrellee commented on SPARK-538: - this is a reasonable question

[jira] [Closed] (SPARK-538) INFO spark.MesosScheduler: Ignoring update from TID 9 because its job is gone

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee closed SPARK-538. --- Resolution: Done INFO spark.MesosScheduler: Ignoring update from TID 9 because its job

[jira] [Updated] (SPARK-542) Cache Miss when machine have multiple hostname

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee updated SPARK-542: Component/s: Mesos Priority: Blocker Cache Miss when machine have multiple hostname

[jira] [Closed] (SPARK-550) Hiding the default spark context in the spark shell creates serialization issues

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee closed SPARK-550. --- Resolution: Done Hiding the default spark context in the spark shell creates serialization

[jira] [Commented] (SPARK-550) Hiding the default spark context in the spark shell creates serialization issues

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142477#comment-14142477 ] Matthew Farrellee commented on SPARK-550: - a lot of code has changed in this space

[jira] [Closed] (SPARK-559) Automatically register all classes used in fields of a class with Kryo

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee closed SPARK-559. --- Resolution: Done Automatically register all classes used in fields of a class with Kryo

[jira] [Commented] (SPARK-559) Automatically register all classes used in fields of a class with Kryo

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142479#comment-14142479 ] Matthew Farrellee commented on SPARK-559: - the last comment on this, from 2 years

[jira] [Closed] (SPARK-567) Unified directory structure for temporary data

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee closed SPARK-567. --- Resolution: Incomplete please re-open with additional details for how this could

[jira] [Closed] (SPARK-718) NPE when performing action during transformation

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee closed SPARK-718. --- Resolution: Done NPE when performing action during transformation

[jira] [Commented] (SPARK-718) NPE when performing action during transformation

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142506#comment-14142506 ] Matthew Farrellee commented on SPARK-718: - Spark simply does not support nesting

[jira] [Commented] (SPARK-690) Stack overflow when running pagerank more than 10000 iterators

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142511#comment-14142511 ] Matthew Farrellee commented on SPARK-690: - [~andrew xia] this is reported against

[jira] [Closed] (SPARK-690) Stack overflow when running pagerank more than 10000 iterators

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee closed SPARK-690. --- Resolution: Unresolved Stack overflow when running pagerank more than 1 iterators

[jira] [Commented] (SPARK-610) Support master failover in standalone mode

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142528#comment-14142528 ] Matthew Farrellee commented on SPARK-610: - [~matei] given YARN and Mesos

[jira] [Updated] (SPARK-604) reconnect if mesos slaves dies

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee updated SPARK-604: Component/s: Mesos reconnect if mesos slaves dies

[jira] [Commented] (SPARK-584) Pass slave ip address when starting a cluster

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142545#comment-14142545 ] Matthew Farrellee commented on SPARK-584: - what's the use case for this? Pass

[jira] [Commented] (SPARK-575) Maintain a cache of JARs on each node to avoid unnecessary copying

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-575?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142553#comment-14142553 ] Matthew Farrellee commented on SPARK-575: - [~joshrosen] is quite correct

[jira] [Closed] (SPARK-575) Maintain a cache of JARs on each node to avoid unnecessary copying

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee closed SPARK-575. --- Resolution: Incomplete Maintain a cache of JARs on each node to avoid unnecessary copying

[jira] [Commented] (SPARK-578) Fix interpreter code generation to only capture needed dependencies

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14142558#comment-14142558 ] Matthew Farrellee commented on SPARK-578: - [~matei] is this related to slimming

[jira] [Updated] (SPARK-542) Cache Miss when machine have multiple hostname

2014-09-21 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee updated SPARK-542: Priority: Minor (was: Blocker) Cache Miss when machine have multiple hostname

Re: Spark + Mahout

2014-09-19 Thread Matthew Farrellee
On 09/19/2014 05:06 AM, Sean Owen wrote: No, it is actually a quite different 'alpha' project under the same name: linear algebra DSL on top of H2O and also Spark. It is not really about algorithm implementations now. On Sep 19, 2014 1:25 AM, Matthew Farrellee m...@redhat.com mailto:m

[jira] [Commented] (SPARK-3321) Defining a class within python main script

2014-09-18 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14138867#comment-14138867 ] Matthew Farrellee commented on SPARK-3321: -- [~guoxu1231] i think so too. ok if i

[jira] [Commented] (SPARK-3580) Add Consistent Method To Get Number of RDD Partitions Across Different Languages

2014-09-18 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139043#comment-14139043 ] Matthew Farrellee commented on SPARK-3580: -- what do you think about going

[jira] [Commented] (SPARK-3562) Periodic cleanup event logs

2014-09-18 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14139786#comment-14139786 ] Matthew Farrellee commented on SPARK-3562: -- is logrotate an option for you

[jira] [Closed] (SPARK-3581) RDD API(distinct/subtract) does not work for RDD of Dictionaries

2014-09-18 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee closed SPARK-3581. Resolution: Not a Problem RDD API(distinct/subtract) does not work for RDD of Dictionaries

[jira] [Closed] (SPARK-3321) Defining a class within python main script

2014-09-18 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee closed SPARK-3321. Resolution: Not a Problem Defining a class within python main script

[jira] [Closed] (SPARK-2022) Spark 1.0.0 is failing if mesos.coarse set to true

2014-09-17 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee closed SPARK-2022. Resolution: Fixed Spark 1.0.0 is failing if mesos.coarse set to true

[jira] [Commented] (SPARK-3508) annotate the Spark configs to indicate which ones are meant for the end user

2014-09-16 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14135631#comment-14135631 ] Matthew Farrellee commented on SPARK-3508: -- documented == public is a good metric

[jira] [Commented] (SPARK-2377) Create a Python API for Spark Streaming

2014-09-15 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14134225#comment-14134225 ] Matthew Farrellee commented on SPARK-2377: -- it's a little tricky. you need

[jira] [Created] (SPARK-3538) Provide way for workers to log messages to driver's out/err

2014-09-15 Thread Matthew Farrellee (JIRA)
Matthew Farrellee created SPARK-3538: Summary: Provide way for workers to log messages to driver's out/err Key: SPARK-3538 URL: https://issues.apache.org/jira/browse/SPARK-3538 Project: Spark

Re: yet another jenkins restart early thursday morning -- 730am PDT (and a brief update on our new jenkins infra)

2014-09-11 Thread Matthew Farrellee
shane, is there anything we should do for pull requests that failed, but for unrelated issues? best, matt On 09/11/2014 11:29 AM, shane knapp wrote: ...and the restart is done. On Thu, Sep 11, 2014 at 7:38 AM, shane knapp skn...@berkeley.edu wrote: jenkins is now in quiet mode, and a

Re: yet another jenkins restart early thursday morning -- 730am PDT (and a brief update on our new jenkins infra)

2014-09-11 Thread Matthew Farrellee
/Spark-Master-Maven-with-YARN/557/, which i just started a rebuild on) On Thu, Sep 11, 2014 at 9:15 AM, Matthew Farrellee m...@redhat.com mailto:m...@redhat.com wrote: shane, is there anything we should do for pull requests that failed, but for unrelated issues? best, matt

[jira] [Commented] (SPARK-3470) Have JavaSparkContext implement Closeable/AutoCloseable

2014-09-10 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14128751#comment-14128751 ] Matthew Farrellee commented on SPARK-3470: -- while you can implement Closeable

[jira] [Commented] (SPARK-2972) APPLICATION_COMPLETE not created in Python unless context explicitly stopped

2014-09-09 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14127016#comment-14127016 ] Matthew Farrellee commented on SPARK-2972: -- I suggest having context implement

[jira] [Created] (SPARK-3458) enable use of python's with statements for SparkContext management

2014-09-09 Thread Matthew Farrellee (JIRA)
Matthew Farrellee created SPARK-3458: Summary: enable use of python's with statements for SparkContext management Key: SPARK-3458 URL: https://issues.apache.org/jira/browse/SPARK-3458 Project

[jira] [Commented] (SPARK-2972) APPLICATION_COMPLETE not created in Python unless context explicitly stopped

2014-09-09 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14127187#comment-14127187 ] Matthew Farrellee commented on SPARK-2972: -- +1 close this and open 2 feature

[jira] [Commented] (SPARK-2972) APPLICATION_COMPLETE not created in Python unless context explicitly stopped

2014-09-08 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14125938#comment-14125938 ] Matthew Farrellee commented on SPARK-2972: -- Thanks for answering. I guess it's

[jira] [Commented] (SPARK-1087) Separate file for traceback and callsite related functions

2014-09-08 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14125957#comment-14125957 ] Matthew Farrellee commented on SPARK-1087: -- [~jyotiska] please do! Separate

[jira] [Commented] (SPARK-2972) APPLICATION_COMPLETE not created in Python unless context explicitly stopped

2014-09-07 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14124872#comment-14124872 ] Matthew Farrellee commented on SPARK-2972: -- [~roji] this was addressed

[jira] [Commented] (SPARK-1701) Inconsistent naming: slice or partition

2014-09-06 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14124458#comment-14124458 ] Matthew Farrellee commented on SPARK-1701: -- slice vs partition has also come up

[jira] [Created] (SPARK-3425) OpenJDK - when run with jvm 1.8, should not set MaxPermSize

2014-09-06 Thread Matthew Farrellee (JIRA)
Matthew Farrellee created SPARK-3425: Summary: OpenJDK - when run with jvm 1.8, should not set MaxPermSize Key: SPARK-3425 URL: https://issues.apache.org/jira/browse/SPARK-3425 Project: Spark

[jira] [Commented] (SPARK-3425) OpenJDK - when run with jvm 1.8, should not set MaxPermSize

2014-09-06 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14124467#comment-14124467 ] Matthew Farrellee commented on SPARK-3425: -- this is still an issue for openjdk

[jira] [Commented] (SPARK-1701) Inconsistent naming: slice or partition

2014-09-06 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14124510#comment-14124510 ] Matthew Farrellee commented on SPARK-1701: -- ok, and one more https://github.com

[jira] [Issue Comment Deleted] (SPARK-1701) Inconsistent naming: slice or partition

2014-09-06 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee updated SPARK-1701: - Comment: was deleted (was: ok, i also created 2 other PRs https://github.com/apache

[jira] [Issue Comment Deleted] (SPARK-1701) Inconsistent naming: slice or partition

2014-09-06 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee updated SPARK-1701: - Comment: was deleted (was: ok, and one more https://github.com/apache/spark/pull/2304

[jira] [Commented] (SPARK-3321) Defining a class within python main script

2014-09-06 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14124704#comment-14124704 ] Matthew Farrellee commented on SPARK-3321: -- this has come up a few times. it's

[jira] [Commented] (SPARK-3401) Wrong usage of tee command in python/run-tests

2014-09-06 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14124705#comment-14124705 ] Matthew Farrellee commented on SPARK-3401: -- nice catch Wrong usage of tee

[jira] [Updated] (SPARK-3401) Wrong usage of tee command in python/run-tests

2014-09-06 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee updated SPARK-3401: - Fix Version/s: 1.1.1 Wrong usage of tee command in python/run-tests

[jira] [Resolved] (SPARK-3401) Wrong usage of tee command in python/run-tests

2014-09-06 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Farrellee resolved SPARK-3401. -- Resolution: Fixed Wrong usage of tee command in python/run-tests

Re: How spark parallelize maps Slices to tasks/executors/workers

2014-09-06 Thread Matthew Farrellee
On 09/04/2014 09:55 PM, Mozumder, Monir wrote: I have this 2-node cluster setup, where each node has 4-cores. MASTER (Worker-on-master) (Worker-on-node1) (slaves(master,node1)) SPARK_WORKER_INSTANCES=1 I am trying to understand Spark's parallelize behavior.

Re: [VOTE] Release Apache Spark 1.1.0 (RC4)

2014-09-03 Thread Matthew Farrellee
+1 built from sha w/ make-distribution.sh tested basic examples (0 data) w/ local on fedora 20 (openjdk 1.7, python 2.7.5) tested detection and log processing (25GB data) w/ mesos (0.19.0) nfs on rhel 7 (openjdk 1.7, python 2.7.5) On 09/03/2014 03:24 AM, Patrick Wendell wrote: Please vote

Re: spark-ec2 depends on stuff in the Mesos repo

2014-09-03 Thread Matthew Farrellee
that's not a bad idea. it would also break the circular dep in versions that results in spark X's ec2 script installing spark X-1 by default. best, matt On 09/03/2014 01:17 PM, Shivaram Venkataraman wrote: The spark-ec2 repository isn't a part of Mesos. Back in the days, Spark used to be

Re: spark-ec2 depends on stuff in the Mesos repo

2014-09-03 Thread Matthew Farrellee
to many Spark versions and you can configure which one should be used. Shivaram On Wed, Sep 3, 2014 at 10:22 AM, Matthew Farrellee m...@redhat.com mailto:m...@redhat.com wrote: that's not a bad idea. it would also break the circular dep in versions that results in spark X's ec2 script

Re: Ask something about spark

2014-09-03 Thread Matthew Farrellee
reynold, would you folks be willing to put some creative commons license information on the site and its content? best, matt On 09/02/2014 06:32 PM, Reynold Xin wrote: I think in general that is fine. It would be great if your slides come with proper attribution. On Tue, Sep 2, 2014 at

Re: Ask something about spark

2014-09-03 Thread Matthew Farrellee
legal to chime in. On Wed, Sep 3, 2014 at 11:15 AM, Matthew Farrellee m...@redhat.com mailto:m...@redhat.com wrote: reynold, would you folks be willing to put some creative commons license information on the site and its content? best, matt On 09/02/2014 06:32 PM

[jira] [Commented] (SPARK-3181) Add Robust Regression Algorithm with Huber Estimator

2014-09-02 Thread Matthew Farrellee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14118566#comment-14118566 ] Matthew Farrellee commented on SPARK-3181: -- pls excuse my changes to this issue

Re: [PySpark] large # of partitions causes OOM

2014-09-02 Thread Matthew Farrellee
On 08/29/2014 06:05 PM, Nick Chammas wrote: Here’s a repro for PySpark: |a = sc.parallelize([Nick,John,Bob]) a = a.repartition(24000) a.keyBy(lambda x: len(x)).reduceByKey(lambda x,y: x + y).take(1) | When I try this on an EC2 cluster with 1.1.0-rc2 and Python 2.7, this is what I get: |a =

  1   2   3   >