[jira] [Resolved] (SPARK-19126) Join Documentation Improvements

2017-01-08 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-19126. -- Resolution: Fixed Assignee: Bill Chambers Fix Version/s: 2.2.0

[jira] [Commented] (SPARK-18011) SparkR serialize "NA" throws exception

2017-01-08 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15810660#comment-15810660 ] Felix Cheung commented on SPARK-18011: -- [~wangmiao1981]do you remember this one? I thought at one

[jira] [Created] (SPARK-19130) SparkR should support setting and adding new column with singular value implicitly

2017-01-08 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-19130: Summary: SparkR should support setting and adding new column with singular value implicitly Key: SPARK-19130 URL: https://issues.apache.org/jira/browse/SPARK-19130

[jira] [Commented] (SPARK-18823) Assignation by column name variable not available or bug?

2017-01-08 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15810634#comment-15810634 ] Felix Cheung commented on SPARK-18823: -- I think to Shivaram, this is a bit tricky since we

[jira] [Commented] (SPARK-18570) Consider supporting other R formula operators

2017-01-08 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15810250#comment-15810250 ] Felix Cheung commented on SPARK-18570: -- Hi - the code is here: https://github.com/apache/spark/blob

Spark checkpointing

2017-01-07 Thread Felix Cheung
From: Steve Loughran <ste...@hortonworks.com> Sent: Friday, January 6, 2017 9:57:05 AM To: Ankur Srivastava Cc: Felix Cheung; u...@spark.apache.org Subject: Re: Spark GraphFrame ConnectedComponents On 5 Jan 2017, at 21:10, Ankur Srivastava <ankur.srivast...@gmail.com<m

Re: Windows support ?

2017-01-05 Thread Felix Cheung
No, Travis does not support Windows https://github.com/travis-ci/travis-ci/issues/2104 AppVeyor does... https://www.appveyor.com/docs/installed-software/#operating-system From: Jeff Zhang Sent: Thursday, January 5, 2017 7:04:18 PM To:

Re: Spark GraphFrame ConnectedComponents

2017-01-05 Thread Felix Cheung
. From: Ankur Srivastava <ankur.srivast...@gmail.com> Sent: Thursday, January 5, 2017 3:45:59 PM To: Felix Cheung; dev@spark.apache.org Cc: u...@spark.apache.org Subject: Re: Spark GraphFrame ConnectedComponents Adding DEV mailing list to see if this is a defect with ConnectedComponent or if th

Re: Spark GraphFrame ConnectedComponents

2017-01-05 Thread Felix Cheung
. From: Ankur Srivastava <ankur.srivast...@gmail.com> Sent: Thursday, January 5, 2017 3:45:59 PM To: Felix Cheung; d...@spark.apache.org Cc: user@spark.apache.org Subject: Re: Spark GraphFrame ConnectedComponents Adding DEV mailing list to see if this is a defect with ConnectedCom

Re: Spark GraphFrame ConnectedComponents

2017-01-05 Thread Felix Cheung
uary 5, 2017 10:05:03 AM To: Felix Cheung Cc: user@spark.apache.org Subject: Re: Spark GraphFrame ConnectedComponents Yes it works to read the vertices and edges data from S3 location and is also able to write the checkpoint files to S3. It only fails when deleting the data and that is because it

Re: Spark GraphFrame ConnectedComponents

2017-01-05 Thread Felix Cheung
nkur.srivast...@gmail.com>> Sent: Wednesday, January 4, 2017 9:23 PM Subject: Re: Spark GraphFrame ConnectedComponents To: Felix Cheung <felixcheun...@hotmail.com<mailto:felixcheun...@hotmail.com>> Cc: <user@spark.apache.org<mailto:user@spark.apache.org>> This is the exact trace

Re: Spark GraphFrame ConnectedComponents

2017-01-04 Thread Felix Cheung
Do you have more of the exception stack? From: Ankur Srivastava Sent: Wednesday, January 4, 2017 4:40:02 PM To: user@spark.apache.org Subject: Spark GraphFrame ConnectedComponents Hi, I am trying to use the ConnectedComponent

Re: py4j doesn't work with python interpreter

2017-01-02 Thread Felix Cheung
You said you ran sudo pip install pyplot I think you need to pip install py4j instead? _ From: Jianfeng (Jeff) Zhang > Sent: Monday, January 2, 2017 4:24 PM Subject: Re: py4j doesn't work with python interpreter

Re: Issue with SparkR setup on RStudio

2017-01-02 Thread Felix Cheung
is not set in the Windows tests. _ From: Md. Rezaul Karim <rezaul.ka...@insight-centre.org<mailto:rezaul.ka...@insight-centre.org>> Sent: Monday, January 2, 2017 7:58 AM Subject: Re: Issue with SparkR setup on RStudio To: Felix Cheung <felixcheun...@hotm

Re: How to load a big csv to dataframe in Spark 1.6

2016-12-31 Thread Felix Cheung
ect: Re: How to load a big csv to dataframe in Spark 1.6 To: Felix Cheung <felixcheun...@hotmail.com<mailto:felixcheun...@hotmail.com>> Cc: <user@spark.apache.org<mailto:user@spark.apache.org>> Hello Felix, I followed the instruction and ran the command: >

Re: [ML] [GraphFrames] : Bayesian Network framework

2016-12-30 Thread Felix Cheung
GraphFrames has a Belief Propagation example Have you checked it out? graphframes.github.io/api/scala/index.html#org.graphframes.examples.BeliefPropagation$ From:

Re: ml word2vec finSynonyms return type

2016-12-30 Thread Felix Cheung
Could you link to the JIRA here? What you suggest makes sense to me. Though we might want to maintain compatibility and add a new method instead of changing the return type of the existing one. _ From: Asher Krim > Sent:

Re: Spark Graphx with Database

2016-12-30 Thread Felix Cheung
You might want to check out GraphFrames - to load database data (as Spark DataFrame) and build graphs with them https://github.com/graphframes/graphframes _ From: balaji9058 > Sent: Monday, December 26, 2016 9:27 PM

Re: Difference in R and Spark Output

2016-12-30 Thread Felix Cheung
Could you elaborate more on the huge difference you are seeing? From: Saroj C Sent: Friday, December 30, 2016 5:12:04 AM To: User Subject: Difference in R and Spark Output Dear All, For the attached input file, there is a huge difference

Re: How to load a big csv to dataframe in Spark 1.6

2016-12-30 Thread Felix Cheung
Have you tried the spark-csv package? https://spark-packages.org/package/databricks/spark-csv From: Raymond Xie Sent: Friday, December 30, 2016 6:46:11 PM To: user@spark.apache.org Subject: How to load a big csv to dataframe in Spark 1.6

[jira] [Commented] (SPARK-12757) Use reference counting to prevent blocks from being evicted during reads

2016-12-29 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15786726#comment-15786726 ] Felix Cheung commented on SPARK-12757: -- ping. Still seeing a lot of these messages on Spark 2.1

Re: Issue with SparkR setup on RStudio

2016-12-29 Thread Felix Cheung
Any reason you are setting HADOOP_HOME? >From the error it seems you are running into issue with Hive config likely >with trying to load hive-site.xml. Could you try not setting HADOOP_HOME From: Md. Rezaul Karim Sent:

[jira] [Resolved] (SPARK-18958) SparkR should support toJSON on DataFrame

2016-12-28 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-18958. -- Resolution: Fixed Target Version/s: 2.2.0 > SparkR should support toJ

[jira] [Resolved] (SPARK-18903) uiWebUrl is not accessible to SparkR

2016-12-21 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-18903. -- Resolution: Fixed Assignee: Felix Cheung Target Version/s: 2.2.0

[jira] [Commented] (SPARK-10523) SparkR formula syntax to turn strings/factors into numerics

2016-12-21 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15768165#comment-15768165 ] Felix Cheung commented on SPARK-10523: -- [~cantdutchthis]I'm curious, do you know why all your

[jira] [Created] (SPARK-18958) SparkR should support toJSON on DataFrame

2016-12-20 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18958: Summary: SparkR should support toJSON on DataFrame Key: SPARK-18958 URL: https://issues.apache.org/jira/browse/SPARK-18958 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-18924) Improve collect/createDataFrame performance in SparkR

2016-12-19 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15763165#comment-15763165 ] Felix Cheung commented on SPARK-18924: -- Thank you for bring this up. JVM<->Java performance ha

Re: GraphFrame not init vertices when load edges

2016-12-18 Thread Felix Cheung
There is not a GraphLoader for GraphFrames but you could load and convert from GraphX: http://graphframes.github.io/user-guide.html#graphx-to-graphframe From: zjp_j...@163.com <zjp_j...@163.com> Sent: Sunday, December 18, 2016 9:39:49 PM To: Felix Cheung

Re: GraphFrame not init vertices when load edges

2016-12-18 Thread Felix Cheung
Or this is a better link: http://graphframes.github.io/quick-start.html _ From: Felix Cheung <felixcheun...@hotmail.com<mailto:felixcheun...@hotmail.com>> Sent: Sunday, December 18, 2016 8:46 PM Subject: Re: GraphFrame not init vertices when load edge

Re: [VOTE] Apache Spark 2.1.0 (RC5)

2016-12-18 Thread Felix Cheung
nnot fit in memory. On Sat, Dec 17, 2016 at 10:23 AM, Joseph Bradley <jos...@databricks.com<mailto:jos...@databricks.com>> wrote: +1 On Fri, Dec 16, 2016 at 3:21 PM, Herman van Hövell tot Westerflier <hvanhov...@databricks.com<mailto:hvanhov...@databricks.com>> wrote: +1 O

Re: GraphFrame not init vertices when load edges

2016-12-18 Thread Felix Cheung
Can you clarify? Vertices should be another DataFrame as you can see in the example here: https://github.com/graphframes/graphframes/blob/master/docs/quick-start.md From: zjp_j...@163.com Sent: Sunday, December 18, 2016 6:25:50 PM To: user

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-18 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15759368#comment-15759368 ] Felix Cheung commented on SPARK-18817: -- testing fix, will open a PR shortly. > Ensure noth

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-17 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15758005#comment-15758005 ] Felix Cheung commented on SPARK-18817: -- And as a side note, I feel like spark-warehouse

[jira] [Comment Edited] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-17 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15757998#comment-15757998 ] Felix Cheung edited comment on SPARK-18817 at 12/18/16 2:04 AM: Aside

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-17 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15757998#comment-15757998 ] Felix Cheung commented on SPARK-18817: -- Aside from changing the existing shipped behavior

[jira] [Commented] (SPARK-18903) uiWebUrl is not accessible to SparkR

2016-12-16 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18903?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15756379#comment-15756379 ] Felix Cheung commented on SPARK-18903: -- this sounds like a reasonable ask, I'll take a look

[jira] [Updated] (SPARK-18903) uiWebUrl is not accessible to SparkR

2016-12-16 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18903: - Component/s: (was: Java API) > uiWebUrl is not accessible to Spa

[jira] [Resolved] (SPARK-18902) Include Apache License in R source Package

2016-12-16 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-18902. -- Resolution: Not A Problem > Include Apache License in R source Pack

[jira] [Commented] (SPARK-18902) Include Apache License in R source Package

2016-12-16 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15756369#comment-15756369 ] Felix Cheung commented on SPARK-18902: -- We have the license in DESCRIPTION file as required for R

[jira] [Closed] (SPARK-18902) Include Apache License in R source Package

2016-12-16 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung closed SPARK-18902. Assignee: Felix Cheung > Include Apache License in R source Pack

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753462#comment-15753462 ] Felix Cheung commented on SPARK-18817: -- I ran more of this but wasn't seeinng derby.log

Re: Spark Dataframe: Save to hdfs is taking long time

2016-12-15 Thread Felix Cheung
What is the format? From: KhajaAsmath Mohammed Sent: Thursday, December 15, 2016 7:54:27 PM To: user @spark Subject: Spark Dataframe: Save to hdfs is taking long time Hi, I am using issue while saving the dataframe back to HDFS. It's

Re: How to load edge with properties file useing GraphX

2016-12-15 Thread Felix Cheung
Have you checked out https://github.com/graphframes/graphframes? It might be easier to work with DataFrame. From: zjp_j...@163.com Sent: Thursday, December 15, 2016 7:23:57 PM To: user Subject: How to load edge with properties file useing

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753414#comment-15753414 ] Felix Cheung commented on SPARK-18817: -- It looks like javax.jdo.option.ConnectionURL can also be set

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753064#comment-15753064 ] Felix Cheung commented on SPARK-18817: -- Actually, I'm not seeing derby.log or metastore_db

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753048#comment-15753048 ] Felix Cheung commented on SPARK-18817: -- Tested this just now, I still see spark-warehouse when

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753040#comment-15753040 ] Felix Cheung commented on SPARK-18817: -- we could, but we did ship 2.0 with it enabled by default

[jira] [Issue Comment Deleted] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18817: - Comment: was deleted (was: we could, but we did ship 2.0 with it enabled by default though

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753041#comment-15753041 ] Felix Cheung commented on SPARK-18817: -- we could, but we did ship 2.0 with it enabled by default

[jira] [Comment Edited] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-15 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753041#comment-15753041 ] Felix Cheung edited comment on SPARK-18817 at 12/16/16 1:03 AM: we could

[jira] [Commented] (SPARK-18862) Split SparkR mllib.R into multiple files

2016-12-15 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15753032#comment-15753032 ] Felix Cheung commented on SPARK-18862: -- FYI I reorg the vignettes based on what's discussed here

[jira] [Commented] (SPARK-18862) Split SparkR mllib.R into multiple files

2016-12-15 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15750929#comment-15750929 ] Felix Cheung commented on SPARK-18862: -- Right I think if there are families that span multiple

[jira] [Commented] (SPARK-18862) Split SparkR mllib.R into multiple files

2016-12-14 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15750614#comment-15750614 ] Felix Cheung commented on SPARK-18862: -- ah :) Would we end up having mllib-gmm.R, mllib-als.R and so

[jira] [Resolved] (SPARK-18865) SparkR vignettes MLP and LDA updates

2016-12-14 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-18865. -- Resolution: Fixed Assignee: Miao Wang Target Version/s: 2.1.1, 2.2.0

[jira] [Commented] (SPARK-18849) Vignettes final checks for Spark 2.1

2016-12-14 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15749696#comment-15749696 ] Felix Cheung commented on SPARK-18849: -- working on this... > Vignettes final checks for Spark

[jira] [Commented] (SPARK-7839) Augment build environment to support native libraries with SparkR

2016-12-14 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15749626#comment-15749626 ] Felix Cheung commented on SPARK-7839: - This could be behavior that we turn on for R package

[jira] [Commented] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-14 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15749617#comment-15749617 ] Felix Cheung commented on SPARK-18817: -- this could block R package release > Ensure noth

[jira] [Updated] (SPARK-18817) Ensure nothing is written outside R's tempdir() by default

2016-12-14 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18817: - Priority: Critical (was: Major) > Ensure nothing is written outside R's tempdir() by defa

[jira] [Commented] (SPARK-18849) Vignettes final checks for Spark 2.1

2016-12-14 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15749591#comment-15749591 ] Felix Cheung commented on SPARK-18849: -- I could take this [~mengxr] as soon as https://github.com

[jira] [Comment Edited] (SPARK-18862) Split SparkR mllib.R into multiple files

2016-12-14 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15749559#comment-15749559 ] Felix Cheung edited comment on SPARK-18862 at 12/14/16 9:38 PM: AFAIK, R

[jira] [Comment Edited] (SPARK-18862) Split SparkR mllib.R into multiple files

2016-12-14 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15749559#comment-15749559 ] Felix Cheung edited comment on SPARK-18862 at 12/14/16 9:37 PM: AFAIK, R

[jira] [Commented] (SPARK-18862) Split SparkR mllib.R into multiple files

2016-12-14 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15749559#comment-15749559 ] Felix Cheung commented on SPARK-18862: -- AFAIK, R package has a constrain that it has to be a flat

[jira] [Commented] (SPARK-18823) Assignation by column name variable not available or bug?

2016-12-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15747433#comment-15747433 ] Felix Cheung commented on SPARK-18823: -- We will address both of your suggestions. As for x$y <-

[jira] [Commented] (SPARK-18823) Assignation by column name variable not available or bug?

2016-12-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15747429#comment-15747429 ] Felix Cheung commented on SPARK-18823: -- For #2, I do agree it could get messy, but I was thinking

[jira] [Commented] (SPARK-18849) Vignettes final checks for Spark 2.1

2016-12-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15747412#comment-15747412 ] Felix Cheung commented on SPARK-18849: -- probably would be good to check for warning or error

[jira] [Commented] (SPARK-18825) Eliminate duplicate links in SparkR API doc index

2016-12-13 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15747410#comment-15747410 ] Felix Cheung commented on SPARK-18825: -- I will see what I can do... > Eliminate duplicate li

[jira] [Commented] (SPARK-4591) Algorithm/model parity for spark.ml (Scala)

2016-12-12 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15744481#comment-15744481 ] Felix Cheung commented on SPARK-4591: - Is SVM part of this? > Algorithm/model parity for spark

[jira] [Commented] (SPARK-18823) Assignation by column name variable not available or bug?

2016-12-12 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15744415#comment-15744415 ] Felix Cheung commented on SPARK-18823: -- How important it is to support df[[myname]] <- c(1:nrow

[jira] [Created] (SPARK-18828) Refactor SparkR build and test scripts

2016-12-11 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18828: Summary: Refactor SparkR build and test scripts Key: SPARK-18828 URL: https://issues.apache.org/jira/browse/SPARK-18828 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-18570) Consider supporting other R formula operators

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18570: - Priority: Minor (was: Major) > Consider supporting other R formula operat

[jira] [Updated] (SPARK-18569) Support R formula arithmetic

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18569: - Affects Version/s: (was: 2.2.0) Target Version/s: 2.2.0 > Support R formula arithme

[jira] [Updated] (SPARK-18569) Support R formula arithmetic

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18569: - Affects Version/s: 2.2.0 > Support R formula arithme

[jira] [Updated] (SPARK-18570) Consider supporting other R formula operators

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18570: - Target Version/s: 2.2.0 > Consider supporting other R formula operat

[jira] [Updated] (SPARK-18348) Improve tree ensemble model summary

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18348: - Target Version/s: 2.2.0 > Improve tree ensemble model summ

[jira] [Commented] (SPARK-18813) MLlib 2.2 Roadmap

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740216#comment-15740216 ] Felix Cheung commented on SPARK-18813: -- This is great, Joseph. Thanks for putting down the framework

[jira] [Updated] (SPARK-18821) Bisecting k-means wrapper in SparkR

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18821: - Shepherd: Felix Cheung > Bisecting k-means wrapper in Spa

[jira] [Updated] (SPARK-18822) Support ML Pipeline in SparkR

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18822: - Shepherd: Felix Cheung > Support ML Pipeline in Spa

[jira] [Updated] (SPARK-15767) Decision Tree Regression wrapper in SparkR

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-15767: - Shepherd: Felix Cheung > Decision Tree Regression wrapper in Spa

[jira] [Updated] (SPARK-18822) Support ML Pipeline in SparkR

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18822: - Description: >From Joseph Bradley: " Supporting Pipelines and advanced use cases: Ther

[jira] [Updated] (SPARK-18822) Support ML Pipeline in SparkR

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18822: - Description: >From Joseph Bradley: " Supporting Pipelines and advanced use cases: Ther

[jira] [Comment Edited] (SPARK-18813) MLlib 2.2 Roadmap

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740184#comment-15740184 ] Felix Cheung edited comment on SPARK-18813 at 12/11/16 7:11 PM: I added

[jira] [Commented] (SPARK-15581) MLlib 2.1 Roadmap

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740185#comment-15740185 ] Felix Cheung commented on SPARK-15581: -- re: Pipeline in R - certainly. opened https

[jira] [Commented] (SPARK-18813) MLlib 2.2 Roadmap

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740184#comment-15740184 ] Felix Cheung commented on SPARK-18813: -- I added a couple of JIRAs for R that can be found

[jira] [Commented] (SPARK-18822) Support ML Pipeline in SparkR

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740181#comment-15740181 ] Felix Cheung commented on SPARK-18822: -- I'll take a shot at this. > Support ML Pipeline in Spa

[jira] [Created] (SPARK-18822) Support ML Pipeline in SparkR

2016-12-11 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18822: Summary: Support ML Pipeline in SparkR Key: SPARK-18822 URL: https://issues.apache.org/jira/browse/SPARK-18822 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-18821) Bisecting k-means wrapper in SparkR

2016-12-11 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18821: Summary: Bisecting k-means wrapper in SparkR Key: SPARK-18821 URL: https://issues.apache.org/jira/browse/SPARK-18821 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-18332) SparkR 2.1 QA: Programming guide, migration guide, vignettes updates

2016-12-11 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15740172#comment-15740172 ] Felix Cheung commented on SPARK-18332: -- [~josephkb] they are because of the {code}@aliases{code

[jira] [Assigned] (SPARK-18810) SparkR install.spark does not work for RCs, snapshots

2016-12-10 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung reassigned SPARK-18810: Assignee: Felix Cheung > SparkR install.spark does not work for RCs, snapsh

Re: Question about SPARK-11374 (skip.header.line.count)

2016-12-10 Thread Felix Cheung
+1 I think it's useful to always have a pure SQL way and skip header for plain text / csv that lots of companies have. From: Dongjoon Hyun Sent: Friday, December 9, 2016 9:42:58 AM To: Dongjin Lee; dev@spark.apache.org Subject: Re: Question

[jira] [Commented] (SPARK-18788) Add getNumPartitions() to SparkR

2016-12-09 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15737022#comment-15737022 ] Felix Cheung commented on SPARK-18788: -- I looked and didn't see that on DataFrame? I'm not sure what

[jira] [Comment Edited] (SPARK-15799) Release SparkR on CRAN

2016-12-09 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15736790#comment-15736790 ] Felix Cheung edited comment on SPARK-15799 at 12/10/16 12:23 AM

[jira] [Commented] (SPARK-15799) Release SparkR on CRAN

2016-12-09 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15736790#comment-15736790 ] Felix Cheung commented on SPARK-15799: -- That certainly would be the option. Typically when hive

[jira] [Commented] (SPARK-18810) SparkR install.spark does not work for RCs, snapshots

2016-12-09 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15736230#comment-15736230 ] Felix Cheung commented on SPARK-18810: -- Also to expand on the earlier note above, I think the main

[jira] [Commented] (SPARK-18810) SparkR install.spark does not work for RCs, snapshots

2016-12-09 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15736217#comment-15736217 ] Felix Cheung commented on SPARK-18810: -- For RC, it actually expects to have a subdirectory `spark

[jira] [Comment Edited] (SPARK-18810) SparkR install.spark does not work for RCs, snapshots

2016-12-09 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15736217#comment-15736217 ] Felix Cheung edited comment on SPARK-18810 at 12/9/16 8:06 PM: --- For RC

[jira] [Commented] (SPARK-18810) SparkR install.spark does not work for RCs, snapshots

2016-12-09 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15736161#comment-15736161 ] Felix Cheung commented on SPARK-18810: -- I've found the same issue while testing as well

[jira] [Created] (SPARK-18807) Should suppress output print for calls to JVM methods with void return values

2016-12-09 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18807: Summary: Should suppress output print for calls to JVM methods with void return values Key: SPARK-18807 URL: https://issues.apache.org/jira/browse/SPARK-18807

[jira] [Updated] (SPARK-18349) Update R API documentation on ml model summary

2016-12-08 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18349: - Fix Version/s: 2.1.1 > Update R API documentation on ml model summ

[jira] [Resolved] (SPARK-18349) Update R API documentation on ml model summary

2016-12-08 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-18349. -- Resolution: Fixed Assignee: Miao Wang Target Version/s: 2.1.1 > Updat

[jira] [Commented] (SPARK-18788) Add getNumPartitions() to SparkR

2016-12-08 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15734403#comment-15734403 ] Felix Cheung commented on SPARK-18788: -- In SparkR we don't officially support RDD - In which way

<    13   14   15   16   17   18   19   20   21   22   >