[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-08 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34179089 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundati

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-08 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34179618 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundati

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-08 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34183085 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundati

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-08 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34183255 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundati

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-08 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34184040 --- Diff: mllib/src/test/scala/org/apache/spark/mllib/fpm/PrefixSpanSuite.scala --- @@ -0,0 +1,69 @@ +/* + * Licensed to the Apache Software Foun

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-08 Thread feynmanliang
Github user feynmanliang commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-119696997 If you `collect` L112's `frequentLengthOnePatternAndCounts` (after renaming), L112-129 and L169-183 are basically identical (`frequentLengthOnePatternAndCounts` <=>

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-08 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-119761323 @zhangjiajin The issue with method 2 is projection before filtering. It may increase the shuffle size. After we generate possible prefixes, we should count their frequenc

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-08 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34214439 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-08 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34214527 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-08 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34215012 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-08 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34215036 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-08 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34215138 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-08 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34217018 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-08 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34217882 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-08 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34218011 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-08 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34218107 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-08 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34220995 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-08 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34221305 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34228787 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34228983 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34230322 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34230317 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34230381 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34230466 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34237882 --- Diff: mllib/src/test/scala/org/apache/spark/mllib/fpm/PrefixSpanSuite.scala --- @@ -0,0 +1,69 @@ +/* + * Licensed to the Apache Software Found

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34237868 --- Diff: mllib/src/test/scala/org/apache/spark/mllib/fpm/PrefixSpanSuite.scala --- @@ -0,0 +1,69 @@ +/* + * Licensed to the Apache Software Found

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34237940 --- Diff: mllib/src/test/scala/org/apache/spark/mllib/fpm/PrefixSpanSuite.scala --- @@ -0,0 +1,69 @@ +/* + * Licensed to the Apache Software Found

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34241848 --- Diff: mllib/src/test/scala/org/apache/spark/mllib/fpm/PrefixSpanSuite.scala --- @@ -0,0 +1,69 @@ +/* + * Licensed to the Apache Software Found

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34242017 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120008278 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120008317 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120009003 [Test build #36933 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36933/consoleFull) for PR 7258 at commit [`1dd33ad`](https://gith

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120009392 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120009384 [Test build #36933 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36933/console) for PR 7258 at commit [`1dd33ad`](https://github.

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120017949 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120017981 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120020087 [Test build #36935 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36935/consoleFull) for PR 7258 at commit [`4c60fb3`](https://gith

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120021942 [Test build #36935 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36935/console) for PR 7258 at commit [`4c60fb3`](https://github.

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120021960 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120026499 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120026390 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120028906 [Test build #36936 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36936/consoleFull) for PR 7258 at commit [`ba5df34`](https://gith

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34269643 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundation (AS

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34269635 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundation (AS

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34269706 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundation (AS

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34269648 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundation (AS

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34269691 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundation (AS

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34269685 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundation (AS

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34269678 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundation (AS

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34269665 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundation (AS

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34269682 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundation (AS

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34269688 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundation (AS

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34269658 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundation (AS

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34269672 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundation (AS

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34269652 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundation (AS

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120045016 @zhangjiajin Since you already collected the frequent items (length-1 patterns) to driver, you don't need to keep the RDD of length-1 patterns. When generating the final

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120045530 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120045224 [Test build #36936 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/36936/console) for PR 7258 at commit [`ba5df34`](https://github.

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34269699 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundation (AS

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-09 Thread zhangjiajin
Github user zhangjiajin commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120057018 @mengxr I don't know why method 2 is projection before filtering. I think the method two is exactly what you want. The only need to add functionality to the current

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34333429 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34333437 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34333492 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34333569 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34333634 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34333999 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34334197 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34343241 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34344040 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120394889 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120403966 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120404660 [Test build #37035 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37035/consoleFull) for PR 7258 at commit [`574e56c`](https://gith

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120411698 [Test build #37035 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37035/console) for PR 7258 at commit [`574e56c`](https://github.

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120411721 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34405125 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/LocalPrefixSpan.scala --- @@ -0,0 +1,129 @@ +/* + * Licensed to the Apache Software Fou

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34405150 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/LocalPrefixSpan.scala --- @@ -0,0 +1,129 @@ +/* + * Licensed to the Apache Software Fou

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34405364 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/LocalPrefixSpan.scala --- @@ -0,0 +1,129 @@ +/* + * Licensed to the Apache Software Fou

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34405371 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/LocalPrefixSpan.scala --- @@ -0,0 +1,129 @@ +/* + * Licensed to the Apache Software Fou

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34405622 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,179 @@ +/* + * Licensed to the Apache Software Foundati

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34408039 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,179 @@ +/* + * Licensed to the Apache Software Foundati

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34408050 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,179 @@ +/* + * Licensed to the Apache Software Foundati

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34408256 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/LocalPrefixSpan.scala --- @@ -0,0 +1,129 @@ +/* + * Licensed to the Apache Software Foun

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34408260 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/LocalPrefixSpan.scala --- @@ -0,0 +1,129 @@ +/* + * Licensed to the Apache Software Foun

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34408297 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,179 @@ +/* + * Licensed to the Apache Software Foundati

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread feynmanliang
Github user feynmanliang commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34408322 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,179 @@ +/* + * Licensed to the Apache Software Foundati

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34408636 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,179 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread zhangjiajin
Github user zhangjiajin commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120564465 @feynmanliang comments: Delete makePrefixProjectedDatabases, move the groupByKey() to the last call in this method (no need to include the two map()s on L161 and L16

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34408693 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,179 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34408696 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,179 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120565046 Merged build triggered. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not h

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120565049 Merged build started. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120565155 [Test build #37073 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37073/consoleFull) for PR 7258 at commit [`ca9c4c8`](https://gith

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread feynmanliang
Github user feynmanliang commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120565450 @zhangjiajin Yep, you're right. Thanks for pointing it out! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub a

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread feynmanliang
Github user feynmanliang commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120565530 LGTM pending tests --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have thi

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120566150 [Test build #37073 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/37073/console) for PR 7258 at commit [`ca9c4c8`](https://github.

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120566171 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread zhangjiajin
Github user zhangjiajin commented on a diff in the pull request: https://github.com/apache/spark/pull/7258#discussion_r34409302 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/fpm/PrefixSpan.scala --- @@ -0,0 +1,252 @@ +/* + * Licensed to the Apache Software Foundatio

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120571131 @zhangjiajin Let's merge this version and make improvements in follow-up PRs: 1. `LocalPrefixSpan` performance a. `run` should output `Iterator` instead of

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120571467 Merged into master. Thanks for contributing PrefixSpan! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If

[GitHub] spark pull request: [SPARK-6487][MLlib] Add sequential pattern min...

2015-07-10 Thread zhangjiajin
Github user zhangjiajin commented on the pull request: https://github.com/apache/spark/pull/7258#issuecomment-120572678 OK --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

  1   2   3   >